🚀 3.3B Mamba-MoE running on Mac Mini M4 16GB! Breakthrough deployment of Concept models on consumer Apple Silicon via NF4 quantization & QLoRA. Pushing limits of on-device AI.
-
Updated
Jan 27, 2026 - Swift
🚀 3.3B Mamba-MoE running on Mac Mini M4 16GB! Breakthrough deployment of Concept models on consumer Apple Silicon via NF4 quantization & QLoRA. Pushing limits of on-device AI.
Action-conditioned game world model: fine-tunes HunyuanVideo (8.3B DiT) with LoRA on NitroGen gameplay data to generate future video frames from gamepad action sequences. Single-GPU training via NF4 quantization + flow matching.
Fine tuned qwen llm using qloRa for commentary generation and generating audio using edge-tts.
Add a description, image, and links to the nf4-quantization topic page so that developers can more easily learn about it.
To associate your repository with the nf4-quantization topic, visit your repo's landing page and select "manage topics."