Mingze Xu

徐 铭 泽


Senior Applied Scientist at Adobe Firefly


801 N 34th St

Seattle, WA 98103


xumingze0308 [at] gmail [dot] com

LinkedIn | Google Scholar | GitHub



News

  • 09/2025:   [New] Released AToken, a unified tokenizer for vision.

  • 09/2025:   [New] Two papers accepted to NeurIPS 2025!

  • 08/2025:   [New] Serving as Area Chair for CVPR 2026.

  • 07/2025:   Released Apple Foundation Models [Tech report].

  • 07/2025:   One paper accepted to COLM 2025!

  • 01/2025:   Serving as Area Chair for AAAI 2026.

We're hiring Applied Scientists in Multimodal LLM and GenAI, both full-time and interns!

Biography

Hi, I am Mingze. I am a Senior Applied Scientist at Adobe Firefly. Before joining Adobe, I worked or interned at Apple, Cruise, Amazon, and Microsoft Research. My research interests lie primarily in computer vision and machine learning, and my current focus is on developing unified encoders and LLMs that span multiple modalities (text, image, video, and 3D) across understanding and generative tasks.

I received my Ph.D. degree in Computer Science from Indiana University in 2020, advised by Prof. David Crandall. I was a visiting student researcher at Georgia Institute of Technology in 2018, working with Prof. Dhruv Batra and Prof. Devi Parikh. I have served or am serving as an Area Chair for CVPR (2024–2026), AAAI (2026), and WACV (2023–2024).

Selected Publications

(*Equal Contribution, Corresponding Author)

Last update: 09/2025