About

I am now machine learning infrastructure researcher at ByteDance. I am TopSeed researcher (less than 20 in China) in ByteDance. I am also going to take the ByteDance-Tsinghua postdoc program by the end of 2024.

I completed my Ph.D. in the School of CS at Peking University, where I was advised by Prof. Yun Liang. I also worked with Professor Luis Ceze on LLM serving and optimization from September 2023 to January 2024 as visiting Ph.D. in SAMPL at the University of Washington. My research interest is at LLM inference/serving optimization, high-performance computing for machine learning on emerging hardware accelerators, optimizing compiler design, and code generation. My recent publications investigate new algorithms, abstractions, and frameworks for efficient code generation on CPU and GPU. My research has been recognized with MICRO, ASPLOS, ISCA, HPCA, TPDS, DAC, and MLSys. I received my B.S. degree in the department of Computer Intelligence Science at Peking University. I am reviewer of TPDS and sub-reviewer of MICRO, PPoPP, MLSys, ICS, and ICCAD.