ModelCloud.ai
Pinned Loading
Repositories
-           GPTQModel PublicLLM model quantization (compression) toolkit with hw acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang. ModelCloud/GPTQModel’s past year of commit activity 
-           PyPcre PublicModelCloud/PyPcre’s past year of commit activity 
-           Device-SMI PublicSelf-contained Python lib with zero-dependencies that give you a unified device properties for gpu, cpu, and npu. No more calling separate tools such as nvidia-smi or /proc/cpuinfo and parsing it yourself. ModelCloud/Device-SMI’s past year of commit activity 
-           Tokenicer PublicA (nicer) tokenizer you want to use for model inference and training: with all known peventable gotchas normalized or auto-fixed. ModelCloud/Tokenicer’s past year of commit activity 
-           lm-evaluation-harness Public Forked from EleutherAI/lm-evaluation-harnessA framework for few-shot evaluation of language models. ModelCloud/lm-evaluation-harness’s past year of commit activity 
-           vllm Public Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs ModelCloud/vllm’s past year of commit activity 
-           rockthem PublicModelCloud/rockthem’s past year of commit activity 
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Most used topics
Loading…