Zhichen Zeng

409, Paul Allen Center
185 E Stevens Way NE, Seattle, WA
I am Zhichen Zeng 「曾郅琛」, a first-year PhD student at the University of Washington, advised by Prof. Ang Li and Prof. Banghua Zhu. My research focuses on developing efficient system support for LLMs.
Before joining UW, I got the bachelor degree of Physics from USTC, where I was honored to receive the Guo Moruo Scholarship—the highest honor for USTC undergrads.
Previously, I had a enjoyable internship at Microsoft Research Asia, where I worked with Dr. Shijie Cao on efficient systems for long-context LLMs. I worked with Prof. Zhiru Zhang at Cornell on domain-specific compilers for accelerator design.
Feel free to connect with me!
news
Jul 16, 2025 | Our paper MHE-TPE micro-architecture has been accepted to MICRO’25! Congrats to all the coauthors! |
---|---|
May 23, 2025 | Excited to join ByteDance Seed-Infra-Training, working with Ziheng and Haibin! ![]() |
Nov 03, 2024 | Our Tensor Processing Engines paper has been accepted to HPCA’25 ![]() |
Sep 05, 2024 | Thrilled to share that I’ve completed my six-month intern at MSRA with an amazing team and honored with the Stars of Tomorrow award! ![]() ![]() |
Aug 01, 2024 | Our EN-Tensorcore paper has been accepted to ICCD’24 ![]() |
selected publications
- MICRO’25MHE-TPE: Multi-Operand High-Radix Encoder for Mixed-Precision Fixed-Point Tensor Processing EnginesIEEE/ACM International Symposium on Microarchitecture, 2025
- Under Review
- Under ReviewTactic: Adaptive Sparse Attention with Clustering and Distribution Fitting for Long-Context LLMsarXiv, 2025
- HPCA’25Exploring the Performance Improvement of Tensor Processing Engines through Transformation in the Bit-weight Dimension of MACsIEEE International Symposium on High-Performance Computer Architecture, 2025
- ICCD’24EN-T: Optimizing Tensor Computing Engines Performance via Encoder-Based MethodologyIEEE 42nd International Conference on Computer Design, 2024
- ISCA’25LUT Tensor Core: Lookup Table Enables Efficient Low-Bit LLM Inference AccelerationIEEE/ACM Annual International Symposium on Computer Architecture, 2025
service
- Artifact Evaluation Committee - MLSys 2025, ASPLOS 2025, HPCA 2025, MICRO 2024
- Conference Reviewer - ICLR 2025, ACL 2025, NeurIPS 2024
- Teaching Assistent - CSE 469: Computer Architecture, Spring 2025, UW