Implementation of cycle GAN-based model capable transforming a person’s clothing style based on user-specified attribute Paper: https://arxiv.org/pdf/1710.07346
WardrobeWizard is an AI-powered fashion manipulation system that uses advanced deep learning techniques to modify clothing styles in images based on text descriptions. The project implements a two-stage GAN (Generative Adversarial Network) architecture for precise clothing manipulation while preserving the person's identity.
- Text-Guided Image Manipulation: Transform clothing styles based on natural language descriptions
- Two-Stage Generation:
- Shape Generation: Creates segmentation masks for desired clothing
- Image Generation: Generates realistic clothing while preserving identity
- Identity Preservation: Maintains facial features and hair during clothing transformation
- Conditional Generation: Supports multiple clothing types and styles
- Reimplement G_shape GAN
- Create Data loader for training loop
- Implement GAN structure (generator / discriminator)
- Train
- Test
- Reimplement G_image
- Shape Generator: Creates semantic segmentation masks from text descriptions
- Image Generator: Transforms segmentation masks into realistic clothing
- Discriminators: Ensure realistic and accurate generation for both stages
- Conditional Inputs:
- Text embeddings for style description
- Segmentation masks for clothing regions
- Noise vectors for style variation
G_shape.py: Shape generation pipelineG_image.py: Image generation pipelineG_shape_model.py: Shape generator architectureG_image_model.py: Image generator architecturedataloader.py: Data preprocessing and loading utilitiessegmentation.py: Image segmentation utilities
plot_G_shape_results.py: Visualize shape generation resultsplot_G_image_results.py: Visualize final image generation resultsnet_graph_sr1.py: Network architecture visualization
- Custom loss functions for realistic generation
- Identity preservation metrics
- Style transfer accuracy evaluation
- Segmentation accuracy metrics
- PyTorch
- CUDA for GPU acceleration
- NumPy
- Matplotlib
- H5py for data management
- Enhanced text understanding capabilities
- More diverse clothing style options
- Improved resolution and detail generation
- Real-time processing capabilities
- Mobile deployment support
- Clone the repository
- Install dependencies:
pip install torch numpy h5py matplotlib- Prepare your data following the format in
dataloader.py - Run the main script:
python main.py- jak-weston
- Sean-Fuhrman
This project is open source and available under the MIT License.