I’m a final-year Ph.D. candidate at the Department of Computing, The Hong Kong Polytechnic University, advised by Prof. Chang Wen Chen. I also collaborate closely with Tencent ARC Lab. Before that, I obtained my B.S. degree in Geographical Information Science and B.E. degree in Computer Science from Wuhan University. I spent memorable time at Show Lab @ NUS, VCG @ Harvard, SUNY Buffalo, and CUHK-Shenzhen during my academic journey.
My research lies in computer vision and multi-modal learning, with a particular focus on large multi-modal models for video & language understanding. Please feel free to reach out if you are interested in related topics :)
News
- Oct 18, 2025
I received the PolyU Distinguished Postdoctoral Fellowship.
- Oct 12, 2025
I received the NeurIPS Scholar Award.
- Sep 23, 2025
Our …
- Sep 23, 2025
Our …
- Sep 18, 2025
Our new work …
- Jun 26, 2025
One paper got accepted by ICCV 2025.
- Mar 21, 2025
Check out VideoMind, our exploration of agentic long video reasoning.
- Sep 27, 2024
One paper got accepted by NeurIPS 2024.
- Jul 02, 2024
One paper and its demo got accepted by ECCV 2024.
- Mar 15, 2024
I'm joining …
- Nov 20, 2023
One paper got accepted by TNNLS.
- Aug 04, 2023
One paper got accepted by CIKM 2023.
- May 10, 2023
I will join …
- Jan 01, 2023
My startup on AI + Healthcare is granted by HKSTP for ideation.
- Nov 16, 2022
I received the Most Appreciated Teaching Assistant (MATA) award.
- Sep 16, 2022
I received …
- Mar 02, 2022
One paper got accepted by CVPR 2022.
- Jul 08, 2021
I started my internship at ARC Lab, Tencent PCG.
- Jul 29, 2020
One paper got accepted by ACM Multimedia 2020.
- Jun 30, 2020
I graduated from Wuhan University with honors.
- Jun 27, 2019
I started my internship at …