LLM Compressor: compress models for efficient deployment

A Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM. GitHub repository: vllm-project/llm-compressor
