Gengyuan Zhang

pronouns: he/him
Hi, I am Gengyuan (张耕源). I am currently pursuing my PhD at Ludwig Maximilian University of Munich (aka LMU Munich or the University of Munich), supervised by Prof. Volker Tresp.
My research interests include Video Understanding and Multimodal Reasoning, at the intersection of Computer Vision and Natural Language Processing.
Prior to this, I received my bachelor's degree (2018) from Zhejiang University, China, and my master's degree (2021) from the Technical University of Munich, Germany.
I am originally from Hunan, China.
- uni email: zhang{at}dbs[dot]ifi[dot]lmu[dot]de
- personal email: gengyuanmax{at}gmail[dot]com
- hobbies: Plants, Crusader Kings III, Travelling, Cooking
- have a cute Dackel (dachshund)
I am open to any collaboration and full-time job opportunities.
news
Date | News |
---|---|
Apr 7, 2025 | I started my internship at Amazon London! |
Mar 5, 2025 | One paper accepted at the ICLR 2025 Workshop on World Models. |
Feb 26, 2025 | Two papers accepted at CVPR 2025! See you in Nashville. |
Feb 20, 2025 | Our new paper is now on arXiv: Memory Helps, but Confabulation Misleads: Understanding Streaming Events in Videos with MLLMs! |
Oct 28, 2024 | One new paper accepted at WACV 2025 in Tucson, Arizona! |
selected publications
- Memory Helps, but Confabulation Misleads: Understanding Streaming Events in Videos with MLLMs. 2025
- Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2025
- Time-dependent Entity Embedding is not All You Need: A Re-evaluation of Temporal Knowledge Graph Completion Models under a Unified Framework. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, Nov 2021