About Me
I am a 2026 Master of Engineering graduate from the Institute of Computing Technology, Chinese Academy of Sciences, advised by Prof. Liang Li. Before that, I received my Bachelor’s degree in Data Science from Yuanpei College, Peking University in 2023.
My research interests include large language models (LLMs), multimodal large language models (MLLMs), and test-time scaling (TTS), with a focus on reliable multimodal understanding and evaluation-driven scaling. If you are interested in my work, please feel free to contact me.
Selected Publications
- SpatialWorld: Benchmarking Interactive Spatial Reasoning of Multimodal Agents in Real-World Tasks, arXiv preprint, 2026.
- Evaluation-driven Scaling for Scientific Discovery, technical report, 2026.
- Pixels, Patterns, but No Poetry: To See the World like Humans, accepted to ICML 2026 Position Paper Track.
- Klear-AgentForge Technical Report, technical report, 2025.
- Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation, TACL submission, minor revision, 2025.
