Zhichen Zeng

409, Paul Allen Center
185 E Stevens Way NE, Seattle, WA
This is Zhichen Zeng 「曾郅琛」, a first-year PhD student at the University of Washington, advised by Prof. Ang Li and working closely with Prof. Baris Kasikci. Before joining UW, I got the bachelor degree of Physics from USTC, where I was honored to receive the Guo Moruo Scholarship—the highest honor for USTC undergrads.
Previously, I had a enjoyable internship at Microsoft Research Asia, where I worked with Dr. Shijie Cao on efficient systems for long-context LLMs. I worked with Prof. Zhiru Zhang at Cornell on domain-specific compilers for accelerator design.
Feel free to connect with me!
news
Feb 23, 2025 | Excited to join ByteDance Seed MLSys Team as a Research Scientist Intern ![]() |
---|---|
Nov 03, 2024 | Our Tensor Processing Engines paper has been accepted to HPCA’25 ![]() |
Sep 05, 2024 | Thrilled to share that I’ve completed my six-month intern at MSRA with an amazing team and honored with the Stars of Tomorrow award! ![]() ![]() |
Aug 01, 2024 | Our EN-Tensorcore paper has been accepted to ICCD’24 ![]() |
Apr 20, 2024 | Awarded the 43rd Guo Moruo Scholarship (highest honor of USTC undergrads) ![]() ![]() |
selected publications
- Under Review
- Under ReviewTactic: Adaptive Sparse Attention with Clustering and Distribution Fitting for Long-Context LLMsarXiv, 2025
- HPCAExploring the Performance Improvement of Tensor Processing Engines through Transformation in the Bit-weight Dimension of MACs2025 IEEE International Symposium on High-Performance Computer Architecture, 2025
- ICCDEN-T: Optimizing Tensor Computing Engines Performance via Encoder-Based MethodologyIEEE 42nd International Conference on Computer Design, 2024
- ISCALUT Tensor Core: Lookup Table Enables Efficient Low-Bit LLM Inference Acceleration2025 IEEE/ACM Annual International Symposium on Computer Architecture, 2024
service
- Artifact Evaluation Committee - MLSys 2025, ASPLOS 2025, HPCA 2025
- Conference Reviewer - ICLR 2025, NeurIPS 2024