Hao Li (李昊)
I‘m a Member of Technical Staff @ ERNIE Team, Baidu.
Prior to this, I was a Post-doc @ Imperial College London and Research Intern @ Microsoft Research
My research interests lie in Post-training and Reinforcement Learning. Recently, focusing on:
- Agentic RL & Reward Modeling
- Training-Inference Mismatch
- On-policy Distillation
Email / LinkedIn / Google Scholar / GitHub
Selected Work
ERNIE Team, Baidu
🏆 #1 in China (LMArena)
🌎 #8 Globally
🤗 2.4T(2400B) LLM
ERNIE Team, Baidu
7.6k+ Stars
👥 430M+ Users🚀 1.5B Daily Calls
Microsoft Research
1.1k+ Stars
Selected Publications
MIRA: Medical Time Series Foundation Model for Real-World Health Data
NeurIPS 2025
BRIDGE: Bootstrapping Text to Control Time-Series Generation via Multi-Agent Optimization
ICML 2025
TarDiff: Target-Oriented Diffusion Guidance for Synthetic EHR Time Series Generation
KDD 2025
Which Side Are You On? Multi-task Dataset for End-to-End Argument Summarisation
ACL 2024 Findings
CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating LLMs
ACL 2024 Findings