Hi there! My name is Hang Wu (吴杭), you can also call me by my English name Laurent.

I am currently a first-year PhD student at the University of California, Merced, conducting research under the guidance of Prof. Yiwei Wang, with Prof. Ming-Hsuan Yang as my senior advisor. Additionally, I work closely with Prof. Yujun Cai. My main research interests are in vision-language models and large multimodal models, with a focus on improving their performance and specific applications. I received my bachelor’s degree from Tongji University, where I worked on image processing tasks in the low-level vision field.

You can find my CV here: Hang Wu’s Curriculum Vitae. If you are interested in my work, please feel free to drop me an email.

🎓 Educations

  • 2025.08 - Present, PhD student, University of California, Merced.
  • 2021.09 - 2025.06, Undergraduate student, Tongji University.

💻 Internships

  • 2025.03 - 2025.07, Research Intern, vivo@Shenzhen, China.
  • 2025.01 - 2025.07, Research Intern, UC Merced NLP Lab@University of California-Merced, Remote.
  • 2023.09 - 2025.03, Research Intern, Ni’s Group@Tongji University, Shanghai, China.

📝 Selected Publications

sym

CamReasoner: Reinforcing Camera Movement Understanding via Structured Spatial Reasoning [Paper] [Code]

Hang Wu, Yujun Cai$^{\dagger}$, Zehao Li, Haonan Ge, Bowen Sun, Junsong Yuan, Yiwei Wang

Arxiv 2026

  • Structured Reasoning Paradigm: We propose the Observation-Thinking-Answer (O-T-A) paradigm, reformulating camera movement understanding from black-box classification into a structured ego-motion inference task that decouples camera trajectories from dynamic scenes.

  • Inference Trajectory Suite: We construct a Large-scale Inference Trajectory Suite with 18k SFT and 38k RL samples, transforming static labels into dense reasoning chains to instill spatio-temporal cinematic logic into MLLMs.

  • RL-driven Logical Alignment: We are the first to employ Reinforcement Learning for logical alignment in this domain, achieving state-of-the-art 78.4% and 74.5% accuracy in binary classification and visual question answering tasks.

sym

RefineShot: Rethinking Cinematography Understanding with Foundational Skill Evaluation [Paper] [Code]

Hang Wu, Yujun Cai, Haonan Ge, Hongkai Chen, Ming-Hsuan Yang, Yiwei Wang$^{\dagger}$

Arxiv 2025

  • Benchmark refinement: Enforces consistent option granularity, unified evaluation dimensions, and mutual exclusivity for greater dataset reliability.
  • Baseline analysis: Thoroughly evaluates ShotVL, revealing weaknesses in reasoning, prompt adherence, and output consistency.
  • Evaluation expansion: Adds a protocol assessing both task-specific performance and core model competencies, enabling more balanced and robust comparisons.
sym

DiMo-GUI: Advancing Test-time Scaling in GUI Grounding via Modality-Aware Visual Reasoning [Paper] [Code]

Hang Wu, Hongkai Chen$^{\dagger}$, Yujun Cai, Chang Liu, Qingwen Ye, Ming-Hsuan Yang, Yiwei Wang

EMNLP 2025 Main Conference

  • We propose DiMo-GUI, a training-free framework that can be seamlessly integrated as a plug-and-play component into any GUI agent.
  • Without requiring additional training or external data, DiMo-GUI effectively enhances grounding performance across various GUI tasks.

📚 Projects

sym

Research on Perception-oriented High Dynamic Range Imaging Systems

National-level Innovation Project

  • We treat artifacts in HDR images as detectable entities, explicitly detect and suppress them to enhance HDR quality.
  • National-level innovation project at Tongji University, with a funding of 10,000 RMB.

📖 Services

Conference Reviewer

  • International Conference on Machine Learning (ICML) 2026
  • The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2026
  • The IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2026

Teaching

  • CSE 188: Natural Language Processing, Teaching Assistant (Spring 2026, UC Merced).
  • CSE-022: Introduction to Programming, Teaching Assistant (Fall 2025, UC Merced).

🎨 About Me

  • I’m ESFJ.
  • I come from Maanshan, Anhui Province, China.
  • I’m a huge sports fan and enjoy doing many kinds of sports in my spare time, including tennis🎾, basketball🏀, soccer⚽️, waterpolo🤽‍♂️, swimming🏊, badminton🏸…
  • I love pop music and R&B, with Ed Sheeran and The Weeknd as my favorite English artists, David Tao and Khalil Fong as my favorite Chinese artists.