Several papers from our group were accepted at this year’s CVPR 2026 conference.

  • Rethinking UMM Visual Generation: Masked Modeling for Efficient Image-Only Pre-training
  • Dual-Granularity Memory for Efficient Video Generation
  • Exploring Spatial Intelligence from a Generative Perspective
  • Eliciting Complex Spatial Reasoning in MLLMs through Wide-Baseline Matching