publications
\* indicates equal contribution
2025
2025
- Under ReviewTactic: Adaptive Sparse Attention with Clustering and Distribution Fitting for Long-Context LLMsarXiv, 2025
- HPCAExploring the Performance Improvement of Tensor Processing Engines through Transformation in the Bit-weight Dimension of MACs2025 IEEE International Symposium on High-Performance Computer Architecture, 2025
2024
2024
- Under Review
- ICCDEN-T: Optimizing Tensor Computing Engines Performance via Encoder-Based MethodologyIEEE 42nd International Conference on Computer Design, 2024
- ISCALUT Tensor Core: Lookup Table Enables Efficient Low-Bit LLM Inference Acceleration2025 IEEE/ACM Annual International Symposium on Computer Architecture, 2024
- J. Phys. DHighly stable and fast response photodetector based on double perovskite Cs2AgBiCl6 crystalsJournal of Physics D: Applied Physics, Feb 2024