Repositories list
1.3k repositories
- Advanced quantization toolkit for LLMs and VLMs. Support for WOQ, MXFP4, NVFP4, GGUF, Adaptive Schemes and seamless integration with Transformers, vLLM, SGLang, and llm-compressor
- SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime
- A Python package that extends the official PyTorch to easily obtain better performance on Intel platforms
- A ROHD-based framework for connectivity and assembly of hardware designs.
- Collection of Intel device plugins for Kubernetes
pcm