Qiang HE
Bochum, Germany.

I’m currently a fourth-year PhD student at Ruhr-University Bochum, supervised by Prof. Dr. Setareh Maghsudi. I earned my Master’s degree in Theory and Method of Artificial Intelligence from the Institute of Automation, Chinese Academy of Sciences, supervised by Prof. Dr. Xinwen Hou.
I am currently on the industry/academic job market. Please feel free to contact me if you have any available positions.
Research Interests: Large Language Models, RLHF, Human-AI Alignment, Game AI, Reinforcement Learning
I'm broadly interested in large language models, human-AI alignment, RLHF, and AI security. Currently, my research aims to i) develop controllable AI in both training and inference/adaptation; ii) advance the theory and real-world application of human-AI alignment; and iii) understand the structural information of LLMs, RLHF, and RL, and how to leverage it to improve agent performance. And yes, we are developing these methods for RL and LLMs. Our [technology](https://openreview.net/pdf?id=eN1T7I7OpZ) powers the world's most popular fighting game.
Contact information
Email: qianghe97 AT gmail DOT com, Qiang DOT He AT rub DOT de
WeChat ID: pposac
Professional Service
Reviewer for ICLR, NeurIPS, ICML, AAAI, DMLR, ICPR
news
May 28, 2025 | Our paper “Pareto Multi-Objective Alignment for Language Models” has been accepted to the ECML/PKDD 2025 research track (acceptance rate: 24%)! Looking forward to seeing you in Porto, Portugal! |
---|---|
May 6, 2024 | I am attending ICLR’24. Feel free to chat with me! Check out the poster. |
May 3, 2024 | I am attending AISTATS’24. Feel free to chat with me! |
May 2, 2024 | Our paper “Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human Alignment” has been accepted to ICML 2024! We introduce Shūkai, a game agent trained in the game Naruto Mobile. This work is the first example of a deep RL agent deployed in a commercial fighting game, and it has been deployed for a year. |
Jan 17, 2024 | Two papers accepted to ICLR 2024, one of them as a spotlight. Thanks to my supervisor and collaborators for their help! |
selected publications
- ICML’24: Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human Alignment. In Forty-first International Conference on Machine Learning, 2024