Hi there! My name is Hang Wu (吴杭), you can also call me by my English name Laurent.
I am currently a first-year PhD student at the University of California, Merced, conducting research under the guidance of Prof. Yiwei Wang, with Prof. Ming-Hsuan Yang as my senior advisor. Additionally, I work closely with Prof. Yujun Cai. My main research interests are in vision-language models and large multimodal models, with a focus on improving their performance and specific applications. I received my bachelor’s degree from Tongji University, where I worked on image processing tasks in the low-level vision field.
You can find my CV here: Hang Wu’s Curriculum Vitae. If you are interested in my work, please feel free to drop me an email.
🎓 Educations
- 2025.08 - Present, PhD student, University of California, Merced.
- 2021.09 - 2025.06, Undergraduate student, Tongji University.
💻 Internships
- 2025.03 - 2025.07, Research Intern, vivo@Shenzhen, China.
- 2025.01 - 2025.07, Research Intern, UC Merced NLP Lab@University of California-Merced, Remote.
- 2023.09 - 2025.03, Research Intern, Ni’s Group@Tongji University, Shanghai, China.
📝 Selected Publications
CamReasoner: Reinforcing Camera Movement Understanding via Structured Spatial Reasoning [Paper] [Code]
Hang Wu, Yujun Cai$^{\dagger}$, Zehao Li, Haonan Ge, Bowen Sun, Junsong Yuan, Yiwei Wang
Arxiv 2026
-
Structured Reasoning Paradigm: We propose the Observation-Thinking-Answer (O-T-A) paradigm, reformulating camera movement understanding from black-box classification into a structured ego-motion inference task that decouples camera trajectories from dynamic scenes.
-
Inference Trajectory Suite: We construct a Large-scale Inference Trajectory Suite with 18k SFT and 38k RL samples, transforming static labels into dense reasoning chains to instill spatio-temporal cinematic logic into MLLMs.
-
RL-driven Logical Alignment: We are the first to employ Reinforcement Learning for logical alignment in this domain, achieving state-of-the-art 78.4% and 74.5% accuracy in binary classification and visual question answering tasks.
RefineShot: Rethinking Cinematography Understanding with Foundational Skill Evaluation [Paper] [Code]
Hang Wu, Yujun Cai, Haonan Ge, Hongkai Chen, Ming-Hsuan Yang, Yiwei Wang$^{\dagger}$
Arxiv 2025
- Benchmark refinement: Enforces consistent option granularity, unified evaluation dimensions, and mutual exclusivity for greater dataset reliability.
- Baseline analysis: Thoroughly evaluates ShotVL, revealing weaknesses in reasoning, prompt adherence, and output consistency.
- Evaluation expansion: Adds a protocol assessing both task-specific performance and core model competencies, enabling more balanced and robust comparisons.
DiMo-GUI: Advancing Test-time Scaling in GUI Grounding via Modality-Aware Visual Reasoning [Paper] [Code]
Hang Wu, Hongkai Chen$^{\dagger}$, Yujun Cai, Chang Liu, Qingwen Ye, Ming-Hsuan Yang, Yiwei Wang
EMNLP 2025 Main Conference
- We propose DiMo-GUI, a training-free framework that can be seamlessly integrated as a plug-and-play component into any GUI agent.
- Without requiring additional training or external data, DiMo-GUI effectively enhances grounding performance across various GUI tasks.
📚 Projects

Research on Perception-oriented High Dynamic Range Imaging Systems
National-level Innovation Project
- We treat artifacts in HDR images as detectable entities, explicitly detect and suppress them to enhance HDR quality.
- National-level innovation project at Tongji University, with a funding of 10,000 RMB.
📖 Services
Conference Reviewer
- International Conference on Machine Learning (ICML) 2026
- The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2026
- The IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2026
Teaching
- CSE 188: Natural Language Processing, Teaching Assistant (Spring 2026, UC Merced).
- CSE-022: Introduction to Programming, Teaching Assistant (Fall 2025, UC Merced).
🎨 About Me
- I’m ESFJ.
- I come from Maanshan, Anhui Province, China.
- I’m a huge sports fan and enjoy doing many kinds of sports in my spare time, including tennis🎾, basketball🏀, soccer⚽️, waterpolo🤽♂️, swimming🏊, badminton🏸…
- I love pop music and R&B, with Ed Sheeran and The Weeknd as my favorite English artists, David Tao and Khalil Fong as my favorite Chinese artists.