Hi! I'm Hao Li

Researcher at ERNIE Team, Baidu
Specialized in |
Scroll Down ↓

Hao Li (李昊)


I‘m a Member of Technical Staff @ ERNIE Team, Baidu.

Prior to this, I was a Post-doc @ Imperial College London and Research Intern @ Microsoft Research

My research interests lie in Post-training and Reinforcement Learning. Recently, focusing on:

  • Agentic RL & Reward Modeling
  • Training-Inference Mismatch
  • On-policy Distillation

Email / LinkedIn / Google Scholar / GitHub

My Photo

Selected Work

ERNIE Team, Baidu
🏆 #1 in China (LMArena) 🌎 #8 Globally 🤗 2.4T(2400B) LLM
[Product] [Blog] [Hugging Face] [Technical Report]
ERNIE Team, Baidu
7.6k+ Stars 👥 430M+ Users🚀 1.5B Daily Calls
[GitHub] [Hugging Face] [Technical Report]

Selected Publications

Arg-LLaDA: Argument Summarization via Large Language Diffusion Models
Hao Li, Yizheng Sun, Viktor Schlegel, Kailai Yang, Riza Batista-Navarro, Goran Nenadic
ACL 2026
[Paper]
Data Mixing Agent: Learning to Re-weight Domains for Continual Pre-training
Kailai Yang, Xiao Liu, Lei Ji, Hao Li, Yeyun Gong, Peng Cheng, Mao Yang
ACL 2026
[Paper]
MIRA: Medical Time Series Foundation Model for Real-World Health Data
Hao Li, Bowen Deng, Chang Xu, Zhiyuan Feng, Viktor Schlegel, Yu-Hao Huang, Yizheng Sun, Jingyuan Sun, Kailai Yang, Yiyao Yu, Jiang Bian
NeurIPS 2025
[Paper] [Code] [Talk]
BRIDGE: Bootstrapping Text to Control Time-Series Generation via Multi-Agent Optimization
Hao Li, Yu-Hao Huang, Chang Xu, Viktor Schlegel, Renhe Jiang, Riza Batista-Navarro, Goran Nenadic, Jiang Bian
ICML 2025
[Paper] [Code] [Talk]
TarDiff: Target-Oriented Diffusion Guidance for Synthetic EHR Time Series Generation
Bowen Deng, Chang Xu, Hao Li, Yu-Hao Huang, Min Hou, Jiang Bian
KDD 2025
[Paper] [Code] [Talk]
Does Acceleration Cause Hidden Instability in Vision Language Models?
Yizheng Sun, Hao Li, Chang Xu, Hongpeng Zhou, Chenghua Lin, Riza Batista-Navarro, Jingyuan Sun
EMNLP 2025
[Paper]
LVPruning: Language-Guided Vision Token Pruning for MLLMs
Yizheng Sun, Yanze Xin, Hao Li, Jingyuan Sun, Chenghua Lin, Riza Batista-Navarro
NAACL 2025 Findings
[Paper]
Which Side Are You On? Multi-task Dataset for End-to-End Argument Summarisation
Hao Li, Yuping Wu, Viktor Schlegel, Riza Batista-Navarro, Tharindu Madusanka, Iqra Zahid, Jiayan Zeng, Xiaochi Wang, Xinran He, Yizhi Li, Goran Nenadic
ACL 2024 Findings
[Paper] [Code]
CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating LLMs
Yizhi Li, Ge Zhang, Xingwei Qu, Jiali Li, Zhaoqun Li, Zekun Wang, Hao Li, Ruibin Yuan, Yinghao Ma, Kai Zhang, Wangchunshu Zhou, Yiming Liang, Lei Zhang, Lei Ma, Jiajun Zhang, Zuowen Li, Stephen W Huang, Chenghua Lin, Wenhu Chen, Jie Fu
ACL 2024 Findings
[Paper] [Code]
Do You Hear the People Sing? Key Point Analysis via Iterative Clustering
Hao Li, Viktor Schlegel, Riza Theresa Batista-Navarro, Goran Nenadic
ACL 2023
[Paper] [Code]
Not All Quantifiers Are Equal: Probing Transformer-based Language Models
Tharindu Madusanka, Iqra Zahid, Hao Li, Ian Pratt-Hartmann, Riza Batista-Navarro
EMNLP 2023
[Paper]