Showing posts from April 20, 2025

Pushing the Limits of LLM Quantization via the Linearity Theorem

Comments from Hacker News: https://ift.tt/LetV2g5