Qiang HE

Bochum, Germany.

qh.jpg

Bochum, Germany

I’m currently a first second third fourth-year PhD student at Ruhr-University Bochum, supervised by Prof. Dr. Setareh Maghsudi. I earned my Master’s degree in Theory and Method of Artificial Intelligence from Institute of Automation, Chinese Academy of Sciences, supervised by Prof. Dr. Xinwen Hou.

I am currently on the industry/academic job market. Please feel free to contact me if you have any available positions.

Research Interests: Large Language Models, RLHF, Human-AI Alignment, Game AI, Reinforcement Learning
I'm broadly interested in large language models, human-AI alignment, RLHF, and AI security. Currently, my research aims to i) develop controllable AI in both training and inference/adaptation; ii) theory and real-world application of Human-AI alignment; and iii) understand the structural information of LLMs, RLHF & RL and how to leverage it to enable agent performance. And yes, we are developing these methods for RL and LLMs. Our [technology](https://openreview.net/pdf?id=eN1T7I7OpZ) powers the world's most popular fighting game.


Contact information Email: qianghe97 AT gmail DOT com, Qiang DOT He AT rub DOT de.
WeChat ID: pposac


Professional Service
Reviewer for ICLR, NeurIPS, ICML, AAAI, DMLR, ICPR,


news

May 28, 2025 Our paper Pareto Multi-Objective Alignment for Language Models is accepted to ECML/PKDD 2025 research track (acceptance rate: 24%)! Looking forward to seeing you in Porto, Portugal!
May 6, 2024 I attent the ICLR’24. Feel free to chat with me! Check Poster.
May 3, 2024 I attent the AISTATS’24. Feel free to chat with me!
May 2, 2024 Our paper “Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human Alignment” is accepted by ICML 2024! We introduce Shūkai, a game agent trained in the game Naruto Mobile. This work is the first example of a deep RL agent deployed in a commercial fighting game, and has been deployed for a year.
Jan 17, 2024 2 papers accepted to ICLR 2024 and one of them is spotlight. Thank my supervisor and collaborators for their help!

selected publications

  1. ECML’25
    Pareto Multi-Objective Alignment for Language Models
    Qiang He ,  and  Setareh Maghsudi
    European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2025
  2. ICML’24
    Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human Alignment
    Chen Zhang ,  Qiang He ,  Yuan Zhou , and 4 more authors
    In Forty-first International Conference on Machine Learning , 2024
  3. ICLR24_beer.png
    Adaptive Regularization of Representation Rank as an Implicit Constraint of Bellman Equation
    Qiang He ,  Tianyi Zhou ,  Meng Fang , and 1 more author
    Twelfth International Conference on Learning Representations, 2024
  4. NIPS23_TEEN.png
    Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control
    Chao Li ,  Chen Gong ,  Qiang He , and 1 more author
    Thirty-seventh Conference on Neural Information Processing Systems, 2023
  5. ECML’23
    Eigensubspace of Temporal-Difference Dynamics and How It Improves Value Approximation in Reinforcement Learning
    Qiang He ,  Meng Fang ,  Tianyi Zhou , and 1 more author
    European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2023
  6. CVPR’23
    Frustratingly Easy Regularization on Representation Can Boost Deep Reinforcement Learning
    Qiang He ,  Huangyuan Su ,  Jieyu Zhang , and 1 more author
    The Thirty-Fourth IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023