I am a PhD student in Computer Science at EPFL, VILAB, supervised by Prof. Amir Zamir. My research focuses on multi-modal foundation models. I am interested in how to build unified AI models that can effectively work with different types of data through pre-training, post-training, and reasoning approaches.
I received my M.Sc. degree from ETH Zurich, where I worked with Dr. Lei Ke and Dr. Martin Danelljan. I received my B.Sc. degree from Zhejiang University. Previously, I have interned at Adobe Research with Dr. Joon-Young Lee.
* denotes equal contribution