Ruiyang Zhang

I am an first-year Ph.D. student at University of Macau, working with Prof. Zhedong Zheng. Previously, I obtained bachelor degree of computer science at Fudan Unversity in 2021.

I am instereted in computer vision and related topics. I believe the ultimate goal of computer vision is to make machine see like human. Currenty, my reseach focuses on Multi-modal Large Language Models (MLLMs), especically their inherent uncertainty.

Email  /  CV  /  Bio  /  Scholar  /  Github  /  X

profile photo

News

  • 🔥🔥🔥 2025.3: Check out our latest work: Uncertainty-o, unveiling uncertainty in Large Multimodal Models (LMMs) in a model-agnostic manner, supporting both Large Comprehension Models and Large Generation Models!
  • 🔥 2024.12: Check out our: VL-Uncertainty, leveraging MLLM uncertainty for hallucination detection!
  • 🔥 2024.12: We release the first curated list of reseach on MLLM uncertainty: Awesome-MLLM-Uncertainty!
  • 🔥 2024.7: Our work LiSe is accepted by ECCV'24!
  • 2023.6: Leave Meituan to seek my path into academic world.

Research

Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Models
Ruiyang Zhang, Hu Zhang, Fei Hao, Zhedong Zheng
Preprint, 2025
[Website] / [Code]

We propose a unified framework for uncertainty estimation of Large Multimodal Models, and harness uncertainty for hallucination detection, hallucination mitigation, and uncertainty-aware CoT.

VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation
Ruiyang Zhang, Hu Zhang, Zhedong Zheng
Preprint, 2024
[Website] / [Paper] / [Code]

We leverage semantic-equivalent prompt perturbation for refined LVLM uncertainty estimation, thereby faciliating more accurate hallucination detection.

Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection
Ruiyang Zhang, Hu Zhang, Hang Yu, Zhedong Zheng
Preprint, 2024
[Paper] / [Code]

We harness uncertainty learning to mitigate the negative influence of inaccurate pseudo labels in unsupervised 3D object detection. Specifically, we estimate coordinate level uncerainty and then utilize the learned uncertainty to regularize the self-learning process.

Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene
Ruiyang Zhang, Hu Zhang, Hang Yu, Zhedong Zheng
ECCV, 2024
[Paper] / [Code] / [Blog]

We enhance Unsupervised 3D Object Detection via LiDAR and 2D fusion to improve detecting ability of far and small objects.

Work Experiences

  • 2021.07 - 2023.06: Backend Development Engineer, Meituan, Shanghai

Education

  • 2024.09 - Now: Ph.D. student in Faculty of Science and Technology, University of Macau
  • 2017.09 - 2021.06: Undergraduate student in School of Computer Science, Fudan University

Academic Service

  • Conference Reviewer:ICLR 2025,CVPR 2025
  • Journal Reviewer:TOMM

Teaching

  • University of Macau CISC3021: Multimedia Forensics and Security (TA)

Honors and Awards

  • 2019: Second Prize of China Undergraduate Mathematical Contest in Modeling (CUMCM)
  • 2018&2019: Third Class Scholarship for Outstanding Students, Fudan University
  • 2016: Province First Price of Chinese Mathematical Olympiad (CMO)
  • 2014&2015: First Price of National Olympiad in Informatics in Provinces (NOIP)

Interests

  • I enjoy playing table tennis🏓, a member of school team in my primary school hhh
  • Like walking around parks🏞️

Template from Jon Barron's website.