
About Me (Jiachen Zhu / 朱家晨)
I am a Research Scientist at Skild AI, where I work on building vision-language-action (VLA) and vision-action (VA) models with strong vision capabilities. If you are interested in an internship or full-time role at Skild AI, feel free to reach out!
Prior to joining Skild, I received my PhD from NYU in July 2025, advised by Yann LeCun. I also spent an extended period at FAIR, Meta AI, where I worked closely with Jing Li, Yubei Chen, and Zhuang Liu.
Education
- PhD, Computer Science, New York University, 2020 - 2025
- MSc, Computer Science, New York University, 2018 - 2020
- BSc, Computer Science, The Hong Kong Polytechnic University, 2010 - 2015
Research Interests
My current research focuses on leveraging in-the-wild videos to improve VLA models, particularly their vision capabilities. I am also interested in building an agentic layer on top of VLA models to accomplish long-horizon tasks that require memory and planning.
During my PhD, I worked on self-supervised learning for images and videos, as well as pretraining vision encoders for vision-language models (VLMs). I also maintain a broad interest in understanding the design principles behind various neural network architectures.
Papers
For the most up-to-date paper list, please see my Google Scholar.
- Transformers without Normalization
- MetaMorph: Multimodal Understanding and Generation via Instruction Tuning
- Scaling Language-Free Visual Representation Learning
- Variance-Covariance Regularization Improves Representation Learning
- VoLTA: Vision-Language Transformer with Weakly-Supervised Local-Feature Alignment
Contact
jiachen DOT zhu AT nyu DOT edu
Appendix
Two ideas that I find both shockingly simple and extremely clever: 1, 2