Experience the next evolution of dataset creation with Caption Creator v7.3. This fast, fully portable GUI tool is designed to generate exceptional image captions and tags with unparalleled ease. It's the ultimate assistant for creating high-quality datasets for AI models like Pony, SDXL, and Illustrious, perfect for both LoRA training and advanced image prompting.
The application runs entirely on your local machine, ensuring privacy and uncensored output.
Support the developer on Patreon: patreon.com/MM744
- Dual Generation Modes: Seamlessly switch between generating detailed Captions or concise, comma-separated Tags.
- Intelligent Tag Formatting: Automatically cleans AI output for tags into a perfect, single-line, comma-separated list.
- Powerful Batch Processing: Process entire folders of images with a clear, gallery-style progress view.
- Portable & Self-Contained: No installation needed. Runs from a single folder with its own embedded Python.
- Uncensored Local AI: Utilizes locally run models for full creative freedom without content filters.
- Complete UI Overhaul: A sleek, modern, and responsive dark-theme interface.
- LM Studio Integration: Connect directly to a running LM Studio instance to use any compatible model.
- Direct Image Pasting: Instantly process an image by simply pasting it from your clipboard (Ctrl+V).
- Interactive Model Management: Download, delete, and manage models directly from within the application.
- Prompt Enrichment: Add extra context or instructions to the AI on the fly.
- Built-in ZIP Archiving: Save your entire generation run into a single ZIP archive with one click.
- VRAM Optimization: Choose from models tailored for different GPU VRAM capacities (5GB, 8GB, 10GB, 20GB).
- Low-VRAM Mode: A dedicated checkbox to further reduce VRAM usage.
- Keep Model Loaded: An option to keep the AI model in VRAM, dramatically speeding up subsequent generations.
- Full Kohya_SS Export: Configure and export in a folder structure fully compatible for training.
- Flexible Formatting: Use Trigger Words, define a Max Word count, and format captions as a single paragraph.
- Convenient Actions: Instantly copy text to the clipboard or open the output folder from the UI.
-
Download and Unpack
Download the program and unpack the.ziparchive into a folder. -
Launch the Application
Double-clickCaption Creator.exeto launch the program. -
Manage Your Model
- Click the "Model / VRAM Configuration" button to open the model selection panel.
- To use a built-in model: If a model is not "Available," click the download icon (📥) next to it.
- To use LM Studio: Select the "Custom (LM Studio)" option and click "Connect".
- Select your desired model from the list to make it active.
-
Load Image(s)
- Single Mode: Drag & drop an image, click to browse, or paste an image from your clipboard.
- Batch Mode: Drag & drop multiple images or click to select a batch.
-
Configure and Generate
- Choose your generation type (Captions or Tags).
- Adjust settings like Max Words, Trigger Words, or enable options like Kohya_SS export.
- Click Generate.
-
Get Results
Processed images and their corresponding.txtfiles are saved in theoutputfolder, neatly organized by run.
- Automating image captioning and tagging.
- Extracting detailed prompts from existing images.
- Creating high-quality training datasets for AI models.
- Archiving and organizing image collections with descriptive metadata.
Captions (Format as Single Paragraph enabled):
The image is a digital illustration of a female character from the video game "Street Fighter II." She has blonde hair styled in two braids, each tied with red ribbons. Her skin tone is fair, and she has blue eyes that are focused intently forward. She wears a red beret hat with a white button on the front center, a green sleeveless tank top, and red fingerless gloves. Her right arm is extended forward, her fist clenched as if preparing for a punch or throwing a punch... The entire image conveys a sense of strength and readiness for combat.
Tags:
digital art, female character, muscular build, green tank top, red beret with white button, red fingerless gloves, blonde hair in braid, intense expression, right arm extended forward, clenched teeth, dark blue gradient background, vibrant colors, anime style, strong pose, upper body, dynamic lighting, high contrast, Illustrious quality, fighting game character, Camilla (Street Fighter), serious demeanor, confident stance, athletic physique, determined look...
#caption-creator #dataset #tagging #portable #uncensored #batch-processing #gui #ui-ux #kohya-ss #lm-studio #local-ai

