chatml
Here are 16 public repositories matching this topic...
Chat data cleaning, filtering and deduplication pipeline.
-
Updated
Jul 25, 2023 - Python
Dolphin 3.0 🐬: Versatile AI for coding, math, and more
-
Updated
Mar 12, 2025 - Python
About working Propmting in OpenAI models, it is also used with deffrent pettren Alpaca prompt, INST prompt
-
Updated
May 31, 2025 - Python
Standardized spec and vendor-specific transforms for ChatML
-
Updated
May 27, 2024 - Python
A Python-based interactive CLI interface for chatting with Hugging Face language models, optimized for casual, Discord-style conversation using ChatML. Supports both quantized and full-precision models, live token streaming with color formatting, and dynamic generation parameter adjustment.
-
Updated
Oct 1, 2025 - Python
LLM Scribe is a toolkit for creating handwritten datasets quickly and easily for LLM fine-tuning. Automatically outputs into multiple common finetuning formats such as chatml, alpaca, and more.
-
Updated
Oct 5, 2025
Upload data to PostHog-LLM
-
Updated
May 17, 2024
A dataset toolbox for preparing and analyzing conversational datasets, including CSV splitting, CSV → Parquet conversion, dataset statistics, Parquet cleaning and sorting, HuggingFace–style metadata generation, and batched chain insertion into PostgreSQL — with Rich progress, multiprocessing, and 32 GB-RAM-friendly batching.
-
Updated
Oct 2, 2025 - Python
Upload data to PostHog-LLM
-
Updated
May 22, 2024
Paste your function, hit convert, and get a clean summary ready for use in LLM-based systems.
-
Updated
Apr 14, 2025 - HTML
Deepseek-Dataset-Generator creates conversational datasets for LLM fine-tuning via DeepSeek API. Supports various formats (ChatML, ShareGPT, Alpaca, JSON, CSV), easy configuration via YAML and detailed logs. Ideal for generating realistic and customized data quickly.
-
Updated
Jun 2, 2025 - Python
SmolLM2 🤗: Family of lightweight language models, performs diverse tasks on-device
-
Updated
Feb 11, 2025 - Python
Qwen2.5-Coder: Family of LLMs excels in code, debugging, etc
-
Updated
Feb 6, 2025 - Python
Week 5 project: build a hybrid retriever that fuses FAISS dense vectors with SQLite FTS5/BM25 keyword search (RRF/weighted fusion), plus a Supervised Fine-Tuning (SFT) pipeline (Full FT vs LoRA/QLoRA) using TRL/PEFT/DeepSpeed.
-
Updated
Oct 8, 2025 - Python
A flexible TypeScript framework for ingesting, processing, and exporting chat/conversation data for LLM training and analysis.
-
Updated
Apr 20, 2025 - TypeScript
Improve this page
Add a description, image, and links to the chatml topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the chatml topic, visit your repo's landing page and select "manage topics."