Quantization-Induced Degradation Scaling Laws: A Unified Mathematical Framework
Mohammad Ammar Hawwari1*, Yusuf Süha Yılmazer2, Kıvanç Efe İnan3, Gökdeniz Ayberk Türkmen1,4, Zehra Çetintaş1,3, Joohyun Lee1, İlayda Babüroğlu1,4
1 Department of Computer Engineering
2 Department of Software Engineering
3 Department of Electrical and Electronics Engineering
4 Department of Industrial Engineering
* Contact Author
** All authors are students at Bahçeşehir University, Istanbul, TR
Abstract
Quantization is a primary technique for efficient deployment of large language models (LLMs), yet the performance degradation it introduces—quantization-induced degradation (QiD)—is understood only through empirically fitted scaling laws. No first-principles derivation connecting bit-width, model size, and training tokens to QiD currently exists. In this paper, we provide the first analytical derivation of QiD scaling laws from first principles. By modeling uniform quantization as an additive perturbation on the trained weight tensor and expanding the Chinchilla loss surface to second order, we derive a closed-form QiD Master Equation: δL(b, N, D) = Γ · 2⁻²ᵇ · Nμ · D⁻ᵝ, where b is the bit-width, N the parameter count, D the training tokens, and Γ, μ, β are analytically determined constants. We prove that QiD converges to zero at rate O(4⁻ᵇ) per additional bit, establish an information-theoretic lower bound on achievable QiD via rate-distortion arguments, and demonstrate that the empirical scaling laws of Kumar et al. (2024), Ouyang et al. (2024), and Frantar et al. (2025) emerge as special cases of our unified framework. Numerical validation against published data confirms the analytical predictions. The framework provides design rules for minimum bit-width selection and reveals fundamental limits on quantization efficiency.
Keywords
How to Cite
APA:
Hawwari, M.A., Yilmazer, Y.S., Inan, K.E., Turkmen, G.A., Cetintas, Z., Lee, J., and Baburoglu, I. (2026). Quantization-Induced Degradation Scaling Laws: A Unified Mathematical Framework. AIR Journal of Engineering and Technology, Vol. 2026, AIRJET2026717.
https://doi.org/10.65737/AIRJET2026717
Indexed & Discoverable In
Plus automatic indexing in CORE, Scilit, and other DOI-triggered discovery services
Copyright & Open Access
© 2026 Mohammad Ammar Hawwari, Yusuf Süha Yılmazer, Kıvanç Efe İnan, Gökdeniz Ayberk Türkmen, Zehra Çetintaş, Joohyun Lee, İlayda Babüroğlu. This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author(s) and source are credited. Authors retain full copyright to their work.