Shijie Ma

I'm currently a fourth-year Ph.D. student at the Institute of Automation, Chinese Academy of Sciences, supervised by Prof. Cheng-Lin Liu.

Before that, I obtained my B.Eng. degree from Tsinghua University in 2021, where I was supervised by Prof. Guoqi Li and Prof. Luping Shi.

My previous research interests lie in open-world machine learning, encompassing novel class discovery, continual learning and data-centric learning. Currently, I focus on multimodal understanding and generation, particularly the relationship and synergy between them.

Email  /  Google Scholar  /  GitHub  /  DBLP  /  Twitter

News

  • [2025.05] I have been selected as a Top Reviewer for ICML 2025.
  • [2025.03] Excited to release GenHancer, in which we systematically explore how generative models enhance multimodal understanding and provide several key insights.
  • [2025.03] One paper is accepted to TPAMI.
  • [2025.01] One paper is accepted to ICLR 2025.
  • [2024.12] I have been selected as a Top Reviewer for NeurIPS 2024.
  • [2024.09] Two papers are accepted to NeurIPS 2024.
  • [2024.07] One paper is accepted to ECCV 2024 as an oral presentation.
  • [2024.02] Two papers are accepted to CVPR 2024.
  • [2023.09] One paper is accepted to NeurIPS 2023.

Preprints

* indicates equal contribution

GenHancer: Imperfect Generative Models are Secretly Strong Vision-Centric Enhancers
Shijie Ma, Yuying Ge, Teng Wang, Yuxin Guo, Yixiao Ge, Ying Shan
arXiv / Project Page / Code / Model

Open-world Machine Learning: A Review and New Outlooks
Fei Zhu*, Shijie Ma*, Zhen Cheng, Xu-Yao Zhang, Zhaoxiang Zhang, Cheng-Lin Liu
arXiv

Publications

ProtoGCD: Unified and Unbiased Prototype Learning for Generalized Category Discovery
Shijie Ma, Fei Zhu, Xu-Yao Zhang, Cheng-Lin Liu
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2025
Paper / arXiv / Code

Aligned Better, Listen Better for Audio-Visual Large Language Models
Yuxin Guo, Shuailei Ma, Shijie Ma, Xiaoyi Bao, Chen-Wei Xie, Kecheng Zheng, Tingyu Weng, Siyang Sun, Yun Zheng, Wei Zou
International Conference on Learning Representations (ICLR), 2025
Paper / arXiv

Towards Trustworthy Dataset Distillation
Shijie Ma, Fei Zhu, Zhen Cheng, Xu-Yao Zhang
Pattern Recognition (PR), 2025
Paper / arXiv / Code

MSPE: Multi-Scale Patch Embedding Prompts Vision Transformers to Any Resolution
Wenzhuo Liu, Fei Zhu, Shijie Ma, Cheng-Lin Liu
Advances in Neural Information Processing Systems (NeurIPS), 2024
Paper / arXiv / Code

Happy: A Debiased Learning Framework for Continual Generalized Category Discovery
Shijie Ma, Fei Zhu, Zhun Zhong, Wenzhuo Liu, Xu-Yao Zhang, Cheng-Lin Liu
Advances in Neural Information Processing Systems (NeurIPS), 2024
Paper / arXiv / Code

WPS-SAM: Towards Weakly-Supervised Part Segmentation with Foundation Models
Xin-Jian Wu, Ruisong Zhang, Jie Qin, Shijie Ma, Cheng-Lin Liu
European Conference on Computer Vision (ECCV), 2024 (Oral)
Paper / arXiv / Code

CrossMAE: Cross-Modality Masked Autoencoders for Region-Aware Audio-Visual Pre-Training
Yuxin Guo, Siyang Sun, Shuailei Ma, Kecheng Zheng, Xiaoyi Bao, Shijie Ma, Wei Zou, Yun Zheng
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
Paper

Active Generalized Category Discovery
Shijie Ma, Fei Zhu, Zhun Zhong, Xu-Yao Zhang, Cheng-Lin Liu
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
Paper / arXiv / Code

Cross Pseudo-Labeling for Semi-Supervised Audio-Visual Source Localization
Yuxin Guo, Shijie Ma, Yuhao Zhao, Wei Zou
International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2024
Paper / arXiv

Dual Mean-Teacher: An Unbiased Semi-Supervised Framework for Audio-Visual Source Localization
Yuxin Guo, Shijie Ma, Hu Su, Zhiqing Wang, Yuhao Zhao, Wei Zou, Siyang Sun, Yun Zheng
Advances in Neural Information Processing Systems (NeurIPS), 2023
Paper / arXiv / Code

Rethinking Pretraining as a Bridge From ANNs to SNNs
Yihan Lin, Yifan Hu, Shijie Ma, Dongjie Yu, Guoqi Li
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2022
Paper / arXiv / Code

Honors and Awards

  • ICML Top Reviewer, 2025
  • National Scholarship, 2024
  • NeurIPS Top Reviewer, 2024
  • Merit Student, Chinese Academy of Sciences, 2022
  • Freshmen Scholarship, Chinese Academy of Sciences, 2021
  • Comprehensive Scholarship, Tsinghua University, 2020
  • Academic Scholarship, Tsinghua University, 2019

Academic Services

  • Conference Reviewer: NeurIPS, NeurIPS D&B Track, ICML, ICLR, CVPR, ICCV, ECCV, AISTATS, WACV, ICPR
  • Journal Reviewer: IJCV, IEEE TKDE, IEEE TCSVT, TMLR

© Shijie Ma | Last updated: May 2025