publications

\* indicates equal contribution

2025

2025

  1. MICRO’25
    MHE-TPE: Multi-Operand High-Radix Encoder for Mixed-Precision Fixed-Point Tensor Processing Engines
    Qizhe Wu , Jinyi Zhou , Zhanhe Hu , Zhichen Zeng, and 9 more authors
    IEEE/ACM International Symposium on Microarchitecture, 2025
  2. Under Review
    Tactic: Adaptive Sparse Attention with Clustering and Distribution Fitting for Long-Context LLMs
    Kan Zhu* , Tian Tang* , Qinyu Xu* , Yile Gu , and 6 more authors
    arXiv, 2025
  3. HPCA’25
    Exploring the Performance Improvement of Tensor Processing Engines through Transformation in the Bit-weight Dimension of MACs
    Qizhe Wu , Huawen Liang , Yuchen Gui , Zhichen Zeng, and 6 more authors
    IEEE International Symposium on High-Performance Computer Architecture, 2025
  4. ISCA’25
    LUT Tensor Core: Lookup Table Enables Efficient Low-Bit LLM Inference Acceleration
    Zhiwen Mo , Lei Wang , Jianyu Wei , Zhichen Zeng, and 7 more authors
    IEEE/ACM Annual International Symposium on Computer Architecture, 2025

2024

2024

  1. Under Review
    SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs
    Yizhao Gao* , Zhichen Zeng*, Dayou Du , Shijie Cao , and 4 more authors
    arXiv, 2024
  2. PLDI’24
    Allo: A Programming Model for Composable Accelerator Design
    Hongzheng Chen* , Niansong Zhang* , Shaojie Xiang , Zhichen Zeng, and 2 more authors
    ACM SIGPLAN Conference on Programming Language Design and Implementation, 2024
  3. ICCD’24
    EN-T: Optimizing Tensor Computing Engines Performance via Encoder-Based Methodology
    Qizhe Wu , Yuchen Gui , Zhichen Zeng, Xiaotian Wang , and 2 more authors
    IEEE 42nd International Conference on Computer Design, 2024
  4. J. Phys. D
    Highly stable and fast response photodetector based on double perovskite Cs2AgBiCl6 crystals
    Zhengyu Han* , Mengjia Dai* , Zhichen Zeng*, Chunhui Ye , and 4 more authors
    Journal of Physics D: Applied Physics, Feb 2024