Yuqian Yuan 袁瑜谦
PhD Student
Zhejiang University
Email: yuanyuqian@zju.edu.cn
|
|
About me
I am currently a PhD student in Zhejiang University, advised by Prof. Wenqiao Zhang and Jun Xiao.
My recent research interests are multimodal large language models, image&video understanding and reasoning.
Before, I mainly focus on
the field of the techniques for object detection, image segmentaion under minimal human supervision, including label-efficient /weakly-supervised /un-supervised approaches.
News
-
[2025.6]: We released the EOC-Bench , an object-centric embodied cognition benchmark in dynamic egocentric scenarios.
-
[2025.5]: One paper, TokenPacker is accepted by IJCV 2025.
-
[2025.4]: Our VideoRefer and VideoRefer-Bench have been discussed and adopted by NVIDIA & UC Berkely in their DAM work.
-
[2025.2]: Two papers are accepted by CVPR 2025.
-
[2025.2]: We released the VideoRefer-700K dataset on HuggingFace. Please see the VideoRefer Suite for the details.
-
[2025.1]: We released VideoLLaMA3, frontier multimodal foundation models for both image and video understanding.
Publications&Preprints
EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?
Yuqian Yuan*, Ronghao Dang*, Long Li*, Wentong Li*, Diao Jiao, Xin Li, Deli Zhao, Fan Wang, Wenqiao Zhang, Jun Xiao, Yueting Zhuang
Arxiv, 2025
VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM
Yuqian Yuan, Hang Zhang, Wentong Li, Zesen Cheng, Boqiang Zhang, Long Li, Xin Li, Deli Zhao, Wenqiao Zhang, Yueting Zhuang, Jianke Zhu, Lidong Bing
CVPR, 2025
ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark
Ronghao Dang*, Yuqian Yuan*, Wenqi Zhang*, Yifei Xin, Boqiang Zhang, Long Li, Liuyi Wang, Qinyang Zeng, Xin Li, Lidong Bing
CVPR, 2025
TokenPacker: Efficient Visual Projector for Multimodal LLM
Wentong Li*, Yuqian Yuan*, Jian Liu, Dongqi Tang, Song Wang, Jianke Zhu, Lei Zhang
IJCV, 2025
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding
Boqiang Zhang*, Kehan Li*, Zesen Cheng*, Zhiqiang Hu*, Yuqian Yuan*, Guanzheng Chen*, Sicong Leng*, Yuming Jiang*, Hang Zhang*, Xin Li*, Peng Jin, Wenqi Zhang, Fan Wang, Lidong Bing, Deli Zhao
Technical Report, 2025
HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation
Tianwei Lin, Wenqiao Zhang, Sijing Li, Yuqian Yuan, Binhe Yu, Haoyuan Li, Wanggui He, Hao Jiang, Mengze Li, Xiaohui Song, Siliang Tang, Jun Xiao, Hui Lin, Yueting Zhuang, Beng Chin Ooi
ICML, 2025 (Spotlight)
Osprey: Pixel Understanding with Visual Instruction Tuning
Yuqian Yuan*, Wentong Li*, Jian Liu, Dongqi Tang, Xinjie Luo, Chi Qin, Lei Zhang, Jianke Zhu
CVPR, 2024
Label-efficient Segmentation via Affinity Propagation
Wentong Li*, Yuqian Yuan*, Song Wang, Wenyu Liu, Dongqi Tang, Jian Liu, Jianke Zhu, Lei Zhang
NeurIPS, 2023
Point2Mask: Point-supervised Panoptic Segmentation via Optimal Transport
Wentong Li, Yuqian Yuan, Song Wang, Jianke Zhu, Jianshu Li, Jian Liu, Lei Zhang
ICCV, 2023
Honors
-
National Scholarship, 2021
-
Silver Medal, China Collegiate Programming Contest for Girls, 2021, 2020
-
Honorable Mention, The 45th ICPC Asia Regional Contest, 2021
-
Best Girl's Team, Jiangsu Collegiate Programming Contest, 2021
-
The 17th place, China Collegiate Programming Contest for Girls, 2020
-
Second Prize, The 11th "Blue Bridge Cup" National Software Competition Final, 2020
© Yuqian Yuan | Last update: May 2025 |