UniQL: Unified Quantization and Low-rank Compression for Adaptive Edge LLMs
Paper
•
2512.03383
•
Published
•
3
Energy-aware Computing, Low Power Design, EDA, Dark Silicon, Efficient Deep Learning
UniQL: Unified Quantization and Low-rank Compression for Adaptive Edge LLMs
Quamba2: A Robust and Scalable Post-training Quantization Framework for Selective State Space Models