publications
\* indicates equal contribution
2025
2025
- MICRO’25MHE-TPE: Multi-Operand High-Radix Encoder for Mixed-Precision Fixed-Point Tensor Processing EnginesIEEE/ACM International Symposium on Microarchitecture, 2025
- Under ReviewTactic: Adaptive Sparse Attention with Clustering and Distribution Fitting for Long-Context LLMsarXiv, 2025
- HPCA’25Exploring the Performance Improvement of Tensor Processing Engines through Transformation in the Bit-weight Dimension of MACsIEEE International Symposium on High-Performance Computer Architecture, 2025
- ISCA’25LUT Tensor Core: Lookup Table Enables Efficient Low-Bit LLM Inference AccelerationIEEE/ACM Annual International Symposium on Computer Architecture, 2025
2024
2024
- Under Review
- ICCD’24EN-T: Optimizing Tensor Computing Engines Performance via Encoder-Based MethodologyIEEE 42nd International Conference on Computer Design, 2024
- J. Phys. DHighly stable and fast response photodetector based on double perovskite Cs2AgBiCl6 crystalsJournal of Physics D: Applied Physics, Feb 2024