Jingwen Leng

SQZ PI (July 2020-present)
SJTU Professor


Shanghai Qi Zhi Institute PI, Professor at Shanghai Jiao Tong University.

He received his Ph.D. from the Department of Electrical and Computer Engineering at The University of Texas at Austin in December 2016, and his Bachelor's degree from Shanghai Jiao Tong University in July 2010. His doctoral research focused on the architectural optimization of GPU processors. He currently leads a National Natural Science Foundation for Young Scientists project (2017) and several collaborative research projects, and was selected for the Microsoft Research Asia Young Scholars Star Program in 2018.

Research Direction

Computer Architecture

Software and hardware design for next-generation computer systems

Hardware/Software Co-design for Large Language Models

Reducing the memory and computation cost of large language models



1. Cong Guo, Yuxian Qiu, Jingwen Leng, Xiaotian Gao, Chen Zhang, Yunxin Liu, Fan Yang, Yuhao Zhu, Minyi Guo, SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian Approximation, 2022 International Conference on Learning Representations (ICLR), 2022

2. Yue Guan, Zhengyi Li, Jingwen Leng, Zhouhan Lin, Minyi Guo, Transkimmer: Transformer Learns to Layer-wise Skim, 2022 Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (CCF-A), 2022

3. Yue Guan, Zhengyi Li, Jingwen Leng, Zhouhan Lin, Minyi Guo, Yuhao Zhu, Block-Skim: Efficient Question Answering for Transformer, 2022 The Thirty-Sixth AAAI Conference on Artificial Intelligence (CCF-A), 2022

4. Zihan Liu, Jingwen Leng, Zhihui Zhang, Quan Chen, Chao Li, Minyi Guo, VELTAIR: Towards High-Performance Multi-Tenant Deep Learning Services via Adaptive Compilation and Scheduling, 2022 Conference on Architectural Support for Programming Languages and Operating Systems (CCF-A), 2022

5. Cong Guo, Yuxian Qiu, Jingwen Leng, Chen Zhang, Ying Cao, Quanlu Zhang, Yunxin Liu, Fan Yang, Minyi Guo, Nesting Forward Automatic Differentiation for Memory-Efficient Deep Neural Network Training, 2022 IEEE 40th International Conference on Computer Design

6. Cong Guo, Chen Zhang, Jingwen Leng, Zihan Liu, Fan Yang, Yunxin Liu, Minyi Guo, Yuhao Zhu, ANT: Exploiting Adaptive Numerical Data Type for Low-bit Deep Neural Network Quantization, 2022 IEEE/ACM International Symposium on Microarchitecture

7. Weihao Cui, Han Zhao, Quan Chen, Ningxin Zheng, Jingwen Leng, Jieru Zhao, Zhuo Song, Tao Ma, Yong Yang, Chao Li, Minyi Guo, Enable Simultaneous DNN Services Based on Deterministic Operator Overlap and Precise Latency Prediction, 2021 Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis

8. Yangjie Zhou, Mengtian Yang, Cong Guo, Jingwen Leng, Yun Liang, Quan Chen, Minyi Guo, Yuhao Zhu, Characterizing and Demystifying the Implicit Convolution Algorithm on Commercial Matrix-Multiplication Accelerators, 2021 IEEE International Symposium on Workload Characterization (IISWC)

9. Yangjie Zhou, Jingwen Leng, Mengtian Yang, Minyi Guo, Image Data Processing Method, Apparatus, Computer Device, and Storage Medium, patent, 2021

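10. Jingwen Leng, Yuhao Zhu, Cong Guo, Bin Yao, Minyi Guo, Reconfigurable Single-Instruction Multiple-Data Systolic Array Structure, Processor, and Electronic Terminal, patent, 2021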

11. Cong Guo, Bo Yang Hsueh, Jingwen Leng, Yuxian Qiu, Yue Guan, Zehuan Wang, Xiaoying Jia, Xipeng Li, Minyi Guo, Yuhao Zhu, Accelerating Sparse DNN Models without Hardware-Support via Tile-Wise Sparsity, 2020 Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis

12. Yue Guan, Jingwen Leng, Chao Li, Quan Chen, Minyi Guo, How Far Does BERT Look At: Distance-based Clustering and Analysis of BERT’s Attention, 2020 Proceedings of the 28th International Conference on Computational Linguistics

13. Wei Zhang, Ningxin Zheng, Quan Chen, Yong Yang, Zhuo Song, Tao Ma, Jingwen Leng, Minyi Guo, Precise Capacity Planning and Fair Scheduling based on Low-level Statistics for Public Clouds, 2020 49th International Conference on Parallel Processing

14. Zihan Liu, Jingwen Leng, Guandong Lu, Minyi Guo, Quan Chen, Chao Li, A Resource Allocation Method, Medium, and Server, patent, 2020

15. Zihan Liu, Jingwen Leng, Guandong Lu, Minyi Guo, Quan Chen, Chao Li, Neural Network Compilation Method, System, Computer Storage Medium, and Compilation Device, patent, 2020