About Me

Hello! I’m Yihao HU (胡益豪), a Ph.D. student in Artificial Intelligence at Westlake University. My research focuses on World Models, Agentic RL, Video Reasoning, Multimodal AI, and AI Agents.

I am currently an intern at Ant Group (蚂蚁集团). Previously, I was an Algorithm Intern at Meituan (Core R&D Platform, Native Multimodal LLM - LongCat) and an AI Agent & LLM Alignment Intern at Alibaba Group, Amap (高德).

My recent work spans agent-environment co-evolution, teacher-free VLM agents, multimodal reasoning, visual multi-agent systems, and computer vision for agriculture. I have also received 7+ national/international competition awards, as well as the National Scholarship.

📰 News

2026.05: SEAL: Synergistic Co-Evolution of Agents and Learning Environments was released on arXiv.
2026.05: AtlasVA: Self-Evolving Visual Skill Memory for Teacher-Free VLM Agents was released on arXiv.
2026.05: Two papers, OmniVideo-R1 and Dual Latent Memory for Visual Multi-agent System, were accepted by ICML 2026.
2026.04: CFSR: Geometry-Conditioned Shadow Removal via Physical Disentanglement was released on arXiv.
2026.02: OmniVideo-R1: Reinforcing Audio-visual Reasoning with Query Intention and Modality Attention was released on arXiv.
2026.02: Dual Latent Memory for Visual Multi-agent System was released on arXiv.

🎓 Education

Present, Ph.D. student in Artificial Intelligence, Westlake University (西湖大学), Hangzhou, China.
2022.09 - 2026.06, B.Eng. in Computer Science, Hainan University (海南大学), Hainan, China.
- GPA: 3.93/4.0 (Top 1%, Rank 2/207)
- Key Courses: Data Structures (97), Data Mining (97), Machine Learning (95), Introduction to AI (95), Operating Systems (95), OOP (98), Database Design (96)

📜 Selected Publications

2026

F. A. Vasluianu, T. Seizinger, Z. Zhou, Z. Wu, R. Timofte, L. Beltrame, …, Y. Hu, et al., “Advances in Single-Image Shadow Removal: Results from the NTIRE 2026 Challenge”, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2026. [Paper]
Z. Chen, J. Tao, R. Li, Y. Hu, et al., “OmniVideo-R1: Reinforcing Audio-visual Reasoning with Query Intention and Modality Attention”, International Conference on Machine Learning (ICML), 2026. [Paper]
X. Yu, C. Xu, Z. Chen, B. Yin, C. Yang, Y. He, Y. Hu, et al., “Dual Latent Memory for Visual Multi-agent System”, International Conference on Machine Learning (ICML), 2026. [Paper]
P. Wang, Y. Hu, X. Liu, J. Yang, H. Wang, and Z. Wen, “AtlasVA: Self-Evolving Visual Skill Memory for Teacher-Free VLM Agents”, arXiv preprint, 2026. [Paper]
M. Gao, Z. Yue, W. Yan, Y. Hu, W. Ji, S. Tang, J. Xiao, T. S. Chua, Y. Zhuang, et al., “Counterfactual Evolution of Multimodal Datasets via Visual Programming”, Advances in Neural Information Processing Systems (NeurIPS), 2026. [Paper]
Y. Hu, P. Wang, X. Bai, S. Cai, H. Wang, H. Liu, A. Yang, X. Li, M. Ding, H. Liu, and J. Yao, “SDE-DET: A Precision Network for Shatian Pomelo Detection in Complex Orchard Environments”, Smart Agricultural Technology, 2026. [Paper]
Y. Hu, Z. Wen, X. Liu, P. Wang, X. Zhang, and W. Wu, “SEAL: Synergistic Co-Evolution of Agents and Learning Environments”, arXiv preprint, 2026. [Paper]
P. Wang, Y. Hu, X. Liu, and H. Wang, “CFSR: Geometry-Conditioned Shadow Removal via Physical Disentanglement”, arXiv preprint, 2026. [Paper]

2025

P. Wang, Y. Hu, X. Bai, J. Yang, L. Zhou, A. Yang, X. Li, M. Ding, and J. Yao, “A Multi-Strategy Framework for Enhancing Shatian Pomelo Detection in Real-World Orchards”, arXiv preprint, 2025. [Paper]

💼 Internships

Present, Intern, Ant Group (蚂蚁集团).
2025.10 - 2026, Algorithm Intern (Native Multimodal LLM - LongCat), Meituan (美团), Core R&D Platform, Shenzhen, China.
Previous, AI Agent & LLM Alignment Intern, Alibaba Group, Amap (高德), Shenzhen, China.

🔬 Research Experience

2023.06 - Present, Undergraduate Research Team Member (Group Leader), Prof. Xiaodong BAI’s Lab, Hainan University.
2024.10 - 2025.10, Group Leader, National-Level College Student Innovation & Entrepreneurship Training Program.

🌟 Honors & Awards

🏅 Scholarships & Honors

National Scholarship (国家奖学金, ranked 50th among all undergraduates university-wide), 2025.10
“WuXu” Scholarship (1/207), 2024.10
First-Class Academic Scholarship, Hainan University (Top 10 in Department), 2023.10
Merit Student, Hainan University (80th among all undergraduates), 2023.12

🥇 Competition Awards

International Finals – First Prize, 5th Global Campus AI Algorithm Elite Competition (Stable Diffusion Prompt Optimization Track), 2023.12
National First Prize, 15th National College Mathematics Competition, 2023.12
National Second Prize, 2023 China Collegiate Computer Programming Competition, 2024.01
International Finals – Second Prize, 6th Global Campus AI Algorithm Elite Competition (AI + New Discipline Track), 2024.12
International Finals – Third Prize, 6th Global Campus AI Algorithm Elite Competition (AI + New Medical Track), 2024.12
National Finals – Second Prize, 12th National College Digital Media Technology & Creativity Competition, 2024.12
National Finals – Second Prize, 27th China Robot & AI Competition, 2025.08
National Finals – Third Prize, 27th China Robot & AI Competition, 2025.08

🛠️ Skills

Programming: C/C++ (Proficient), Python (Proficient), MATLAB (Familiar)
Tools: Tableau, SPSS, AutoCAD, CATIA, SolidWorks, HyperWorks
Writing: LaTeX (Proficient), Microsoft Office (Familiar)
Research Areas: World Models, Agentic RL, Video Reasoning, Multimodal LLMs, Reasoning VLMs, AI Agents