EfficientQAT: LLM Quantization, gets a 2-bit llama2-70B outperform regular 13B

📅 July 18, 2024 ⏱️ 1 min read

"\n

Article URL: <a href="https://old.reddit.com/r/LocalLLaMA/comments/1e5x2k4/new_llms_quantization_algorithm_efficientqat/">https://old.reddit.com/r/LocalLLaMA/comments/1e5x2k4/new_llms_quantization_algorithm_efficientqat/

Comments URL: <a href="https://news.ycombinator.com/item?id=40991588">https://news.ycombinator.com/item?id=40991588

Points: 7

# Comments: 0

\n" # Description used for search engine.