Portfolio item number 1
Short description of portfolio item number 1
Short description of portfolio item number 1
Short description of portfolio item number 2 
TACL submission, minor revision, 2025
HAVEN is a benchmark and analysis suite for hallucination in large multimodal models for video understanding.
Technical report, 2025
A technical report on Klear-AgentForge, a guided perturbation learning framework for data-centric AI agents.
Technical report, 2026
SimpleTES is a framework for scaling evaluation-driven discovery loops across scientific problems.
arXiv preprint, 2026
SpatialWorld benchmarks interactive spatial understanding of multimodal agents in complex real-world tasks.
ICML 2026 Position Paper Track, accepted, 2026
A position paper proposing the Turing Eye Test for evaluating whether multimodal AI systems see the world like humans.
Published:
This is a description of your talk, which is a markdown files that can be all markdown-ified like any other post. Yay markdown!
Published:
This is a description of your conference proceedings talk, note the different field in type. You can put anything in this field.
Undergraduate course, University 1, Department, 2014
This is a description of a teaching experience. You can use markdown like any other post.
Workshop, University 1, Department, 2015
This is a description of a teaching experience. You can use markdown like any other post.