profile photo

Longfei Li 李龙飞

I am a first-year Ph.D. student at the MePro, Beijing Jiaotong University, advised by Prof. Yunchao Wei. Currently, I am also a research intern at ByteDance Seed.

My recent research focuses on:

  1. Defining and improving spatial abilities for multimodal large language models with 3D foundation models and principles of cognitive science.
  2. Creating spatial AI agents capable of perceiving, manipulating, and learning from the real physical world.
I am open for collaborations. Please feel free to contact me if you are interested in my research.

Email  /  Google Scholar  /  Github

profile photo
Research

(* indicates the equal contribution)

SpatialTree: How Spatial Abilities Branch Out in MLLMs
Yuxi Xiao*, Longfei Li*, Shen Yan, Xinhang Liu, Sida Peng, Yunchao Wei Xiaowei Zhou, Bingyi Kang
  Technical Report
Project / arXiv

StereoWorld
StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation
Ke Xing, Longfei Li, Yuyang Yin, Hanwen Liang, Guixun Luo, Chen Fang, Jue Wang, Konstantinos N. Plataniotis, Xiaojie Jin, Yao Zhao, Yunchao Wei
  arXiv 2025
Project / arXiv

Martian World Model Teaser
Martian World Model: Controllable Video Synthesis with Physically Accurate 3D Reconstructions
Longfei Li, Zhiwen Fan, Wenyan Cong, Xinhang Liu, Yuyang Yin, Matt Foutter, Panwang Pan, Chenyu You, Yue Wang, Zhangyang Wang, Yao Zhao, Marco Pavone, Yunchao Wei
  NeurIPS 2025
Project / arXiv / Code

AdGPT
AdGPT: Explore Meaningful Advertising with ChatGPT
Jiannan Huang, Mengxue Qu, Longfei Li, Yunchao Wei
  TOMM 2025
HTML / PDF / Code

Experiences
ByteDance Seed
Aug. 2025 - Present
Research Intern
Advised by Dr. Bingyi Kang
University of Texas at Austin
Sep. 2024 - Jul. 2025
Research Intern at VITA Group
Advised by Zhiwen Fan

This website template was borrowed from Jonathon Barron.