Skip to content

Publications

1
Rethinking KV Cache Eviction via a Unified Information-Theoretic Objective
Jiaming Yang, Chenwei Tang, Liangli Zhen, Jiancheng Lv
International Conference on Machine Learning (ICML 2026)
2
GWQ: Group-wise quantization framework for neural networks
Jiaming Yang, Chenwei Tang, Caiyang Yu, Jiancheng Lv
Asian Conference on Machine Learning (ACML 2024)
3
MPQ-YOLO: Ultra low mixed-precision quantization of YOLO for edge devices deployment
Xinyu Liu, Tao Wang, Jiaming Yang, Chenwei Tang, Jiancheng Lv
Neurocomputing
4
ASQ & POST: A synergistic framework for adaptive and non-uniform quantization
Wenqiang Zhou, Zhendong Yu, Xinyu Liu, Jiaming Yang, Rong Xiao, Tao Wang, Chenwei Tang, Jiancheng Lv
Neurocomputing
5
BOB-YOLO: Balancing Optimization Binarized YOLO via Module-Wise Latency
Xinyu Liu, Wenqiang Zhou, Zhendong Yu, Jiaming Yang, Tao Wang, Chenwei Tang, Jiancheng Lv
ECAI

最新更新: