About Me

Hi 👋, I am a researcher at DeepSeek. I studied my PhD at ZIP Lab, Monash University, supervised by Prof. Bohan Zhuang and Prof. Jianfei Cai. Previously I was a master student at the University of Adelaide. Prior to that, I received my bachelors’ degree from Harbin Institute of Technology, Weihai, a beautiful coastal campus 🏖️, which left me with cherished memories. Here is my CV (usually outdated).

My research during PhD is all about efficiency in deep neural networks, including training, inference, and deployment. Some topics that I previously focused on:

  • Flexible model deployment: SN-Net, SN-Netv2
  • Transformer architcture optimization: LIT, LITv2
  • Efficient attention mechansims: HiLo, EcoFormer
  • Token pruning/merging for inference speedup: HVT
  • Memory-efficient training: Mesa

If you are working on the following topics and interested in internship/full-time opportunities at DeepSeek, please feel free to contact me via email. We are always looking for talents!

  • Multimodal LLMs
  • Visual Generative Model
  • Math/Code/LLM Alignment
  • Agent

News

  • 2024.09.26   One paper is accepted by NeurIPS 2024!
  • 2024.07.01   First day at DeepSeek & One paper is accepted by ECCV 2024!
  • 2023.12.29   One paper is accepted by CVPR 2024!
  • 2023.12.29   One paper is accepted by TPAMI!
  • 2023.10.25   I gave an online talk at University of Massachusetts Amherst.
  • 2023.04.20   One paper is accepted by IJCAI 2023!
  • 2023.04.11   I will be interning at NVIDIA Research this summer.
  • 2023.03.22   SN-Net was selected as a highlight at CVPR 2023!🔥
  • 2023.02.28   Two papers are accepted by CVPR 2023!
  • 2022.11.11   Both LITv2 and Ecoformer will be presented as Spotlight!
  • 2022.09.15   Our LITv2 and EcoFormer are accepted by NeurIPS 2022.
  • 2022.07.04   Our paper STPT is accepted by ECCV 2022.
  • 2021.12.01   Our paper LIT is accepted by AAAI 2022.
  • 2021.07.23   Our paper HVT and ORIST are accepted by ICCV 2021.

Education

  • Ph.D in Computer Science, Monash University, 2021 - 2024.
  • M.S. in Computer Science, The University of Adelaide, 2018 - 2020.
  • B.E. in Software Engineering, Harbin Institute of Technology, Weihai, 2015 - 2019.

Work Experience

  • Full-time researcher at DeepSeek, July, 2024 - Now.
  • Research intern at NVIDIA, AI Algorithm Group, July, 2023 - Oct, 2023

Research

Stitched ViTs are Flexible Vision Backbones
Zizheng Pan, Jing Liu, Haoyu He, Jianfei Cai, Bohan Zhuang
European Conference on Computer Vision (ECCV), 2024
[Paper] [Code] [Project Page]

PaRa: Personalizing Text-to-Image Diffusion via Parameter Rank Reduction
Shangyu Chen, Zizheng Pan, Jianfei Cai, Dinh Phung
Arxiv, 2024
[Paper]

MiniCache: KV Cache Compression in Depth Dimension for Large Language Models
Akide Liu, Jing Liu, Zizheng Pan, Yefei He, Gholamreza Haffari, Bohan Zhuang
Conference on Neural Information Processing Systems (NeurIPS), 2024
[Paper]

T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stitching
Zizheng Pan, Bohan Zhuang, De-An Huang, Weili Nie, Zhiding Yu, Chaowei Xiao, Jianfei Cai, Anima Anandkumar
ArXiv, 2024. [Paper] [Code] [Project Page]

Efficient Stitchable Task Adaptation
Haoyu He, Zizheng Pan, Jing Liu, Jianfei Cai, Bohan Zhuang
Conference on Computer Vision and Pattern Recognition (CVPR), 2024
[Paper] [Code]

Pruning Self-attentions into Convolutional Layers in Single Path
Haoyu He, Jing Liu, Zizheng Pan, Jianfei Cai, Jing Zhang, Dacheng Tao, Bohan Zhuang
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023.
[Code], [Paper]

A Survey on Efficient Training of Transformers
Bohan Zhuang, Jing Liu, Zizheng Pan, Haoyu He, Yuetian Weng, Chunhua Shen
International Joint Conference on Artificial Intelligence (IJCAI), 2023.
[Paper]

Stitchable Neural Networks
Zizheng Pan, Jianfei Cai, Bohan Zhuang
Conference on Computer Vision and Pattern Recognition (CVPR), 2023 (Highlight)
[Code], [Paper], [Project Page]

Dynamic Focus-aware Positional Queries for Semantic Segmentation
Haoyu He, Jianfei Cai, Zizheng Pan, Jing liu, Jing Zhang, Dacheng Tao and Bohan Zhuang.
Conference on Computer Vision and Pattern Recognition (CVPR), 2023
[Code], [Paper]

Fast Vision Transformers with HiLo Attention
Zizheng Pan, Jianfei Cai, Bohan Zhuang
Conference on Neural Information Processing Systems (NeurIPS), 2022 (Spotlight)
[Code], [Paper], [OpenReview]

EcoFormer: Energy-Saving Attention with Linear Complexity
Jing Liu*, Zizheng Pan*, Haoyu He, Jianfei Cai, Bohan Zhuang (*equal contribution)
Conference on Neural Information Processing Systems (NeurIPS), 2022 (Spotlight)
[Code], [Paper], [OpenReview]

An Efficient Spatio-Temporal Pyramid Transformer for Action Detection
Yuetian Weng, Zizheng Pan, Mingfei Han, Xiaojun Chang, Bohan Zhuang
European Conference on Computer Vision (ECCV), 2022
[Code], [Paper]

Mesa: A Memory-saving Training Framework for Transformers
Zizheng Pan, Peng Chen, Haoyu He, Jing Liu, Jianfei Cai, Bohan Zhuang
Arxiv, 2021
[Code], [Paper]

Less is More: Pay Less Attention in Vision Transformers
Zizheng Pan, Bohan Zhuang, Haoyu He, Jing Liu, Jianfei Cai
AAAI Conference on Artificial Intelligence (AAAI), 2022
[Code], [Paper]

Scalable Visual Transformers with Hierarchical Pooling
Zizheng Pan, Bohan Zhuang, Jing Liu, Haoyu He, Jianfei Cai
International Conference on Computer Vision (ICCV), 2021
[Code], [Paper]

The Road to Know-Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation
Yuankai Qi, Zizheng Pan, Yicong Hong, Ming-Hsuan Yang, Anton van den Hengel, Qi Wu
International Conference on Computer Vision (ICCV), 2021
[Code], [Paper]

Object-and-Action Aware Model for Visual Language Navigation
Yuankai Qi, Zizheng Pan, Shengping Zhang, Anton van den Hengel, Qi Wu
European Conference on Computer Vision (ECCV), 2020
[Paper]

Teaching

  • FIT5201 - Machine learning, 2022, TA

Talk

  • 2023.10.25   “Optimizing Vision Transformers for Efficient Training, Inference and Deployment”, invited online talk at University of Massachusetts Amherst.

Professional Activities

Reviewer: CVPR, ICCV, ECCV, NeurIPS, ICML, ICLR, IJCV

Awards

  • Google Travel and Conference Grants, 2023
  • Monash Graduate Scholarship, 2020
  • Adelaide Summer Research Scholarship, 2019
  • Outstanding Graduate in Harbin Institute of Technology, 2019