Image Puzzle Solver

A tool for detecting and labeling objects in images, using YOLO for object detection.

A work-in-progress solver for image-based puzzles in which the user is given the name of an object and must select the images that contain it. The images are usually arranged in a grid of nine.
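
The selection step described above can be sketched as follows. This is a minimal illustration, not the project's actual code: it assumes the nine images form a 3x3 grid, and the helper names `grid_tiles` and `matching_tiles` are hypothetical.

```python
from typing import List, Sequence, Tuple

Box = Tuple[int, int, int, int]  # (left, top, right, bottom) in pixels


def grid_tiles(width: int, height: int, rows: int = 3, cols: int = 3) -> List[Box]:
    """Split a width x height image into rows x cols tile boxes, in row-major order."""
    boxes = []
    for r in range(rows):
        for c in range(cols):
            left = c * width // cols
            top = r * height // rows
            right = (c + 1) * width // cols
            bottom = (r + 1) * height // rows
            boxes.append((left, top, right, bottom))
    return boxes


def matching_tiles(labels_per_tile: Sequence[Sequence[str]], target: str) -> List[int]:
    """Return the indices of tiles whose detected labels include the target name."""
    wanted = target.strip().lower()
    return [
        i
        for i, labels in enumerate(labels_per_tile)
        if wanted in (label.lower() for label in labels)
    ]
```

Here `labels_per_tile` stands in for whatever the detector reports for each tile; the comparison is case-insensitive so that a prompt such as "Car" matches a detected label "car".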

Project Structure

image_puzzle_solver/
├── api/                        # FastAPI backend API
│   ├── core/
│   │   └── services/           # Service logic (detection, image handling)
│   ├── endpoints/              # API endpoint definitions
│   ├── static/                 # Static files (if any)
│   ├── main.py                 # FastAPI app entrypoint
│   └── run.py                  # API server runner
├── backend/                    # Core detection and processing logic
│   ├── core/
│   │   └── detector.py         # Object detection implementation (YOLO, etc.)
│   └── utils/                  # Utility functions
├── data/                       # Data storage
│   └── images/
│       ├── annotated/          # Annotated images (output)
│       ├── unprocessed/        # Raw/unprocessed images (input)
│       ├── train/              # Training images and labels
│       ├── test/               # Test images
│       └── val/                # Validation images
├── frontend/                   # React frontend
│   ├── public/                 # Static frontend files
│   └── src/
│       ├── components/         # React components
│       ├── App.js, index.js    # Main frontend logic
│       └── styles.css          # Frontend styles
├── tests/                      # Test suite
│   └── api/
│       ├── endpoints/          # Endpoint tests
│       └── core/               # Core service tests
├── training/                   # Model training scripts and utilities
│   ├── image_annotation.py     # Annotation and processing for training
│   ├── train_puzzle_model.py   # Model training script
│   ├── split_dataset.py        # Dataset splitting utility
│   └── download_training_images.py # Download images for training
├── requirements.txt            # Python dependencies
├── README.md                   # Project documentation
├── .gitignore
├── pytest.ini
├── yolov8n.pt                  # YOLOv8 model weights
├── test_api.py, test_endpoint.py, test_solver.py, test_rotated_bbox.py, test_rotated_detection.py
└── venv/                       # Python virtual environment (should be in .gitignore)
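
As a rough illustration of what a detector built on the `yolov8n.pt` weights listed above might do, the sketch below runs the Ultralytics `YOLO` class on a single image and returns plain dictionaries. It is an assumption-laden example, not the contents of `backend/core/detector.py`: the function names, the dictionary keys, and the 0.5 confidence threshold are all hypothetical.

```python
from typing import Dict, List


def detect_objects(image_path: str, weights: str = "yolov8n.pt") -> List[Dict]:
    """Run YOLOv8 on one image and return its detections as plain dicts."""
    # Imported lazily so filter_detections() works without Ultralytics installed.
    from ultralytics import YOLO

    model = YOLO(weights)
    result = model(image_path)[0]  # one Results object per input image
    detections = []
    for box in result.boxes:
        detections.append({
            "label": result.names[int(box.cls[0])],  # class id -> class name
            "confidence": float(box.conf[0]),
            "xyxy": box.xyxy[0].tolist(),  # [left, top, right, bottom] in pixels
        })
    return detections


def filter_detections(detections: List[Dict], min_confidence: float = 0.5) -> List[Dict]:
    """Keep only detections at or above the given confidence (threshold is an assumption)."""
    return [d for d in detections if d["confidence"] >= min_confidence]
```

A call such as `filter_detections(detect_objects("data/images/unprocessed/example.jpg"))` would then yield the confident detections for one input image; the file name is a placeholder.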

Features

Object detection using YOLOv8
RESTful API for image processing
Interactive web interface for image annotation
YOLO format label output
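
For reference, a standard YOLO detection label is one text line per object: a class id followed by the box center, width, and height, each normalized by the image size. The sketch below converts a pixel box to that form. It assumes axis-aligned boxes (the project may also handle rotated boxes, which use a different layout), and `to_yolo_line` is a hypothetical helper name.

```python
from typing import Tuple


def to_yolo_line(class_id: int, box: Tuple[float, float, float, float],
                 img_w: int, img_h: int) -> str:
    """Convert a pixel box (left, top, right, bottom) into a YOLO label line."""
    left, top, right, bottom = box
    x_center = (left + right) / 2 / img_w   # normalized to [0, 1]
    y_center = (top + bottom) / 2 / img_h
    width = (right - left) / img_w
    height = (bottom - top) / img_h
    return f"{class_id} {x_center:.6f} {y_center:.6f} {width:.6f} {height:.6f}"


print(to_yolo_line(0, (100, 50, 300, 250), 400, 500))
# prints: 0 0.500000 0.300000 0.500000 0.400000
```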

Requirements

Backend

Python 3.x
OpenCV
Ultralytics (YOLOv8)
FastAPI
NumPy

Frontend

Node.js
React

Installation

Clone the repository:

git clone <repository-url>
cd image_puzzle_solver

Set up the backend:

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
pip install -r requirements.txt

Set up the frontend:

cd frontend
npm install

Usage

Start the API server:

cd api
python run.py

Start the frontend development server (in a separate terminal, from the repository root):

cd frontend
npm start

The web interface will be available at http://localhost:3000, and the API at http://localhost:8000.

API Endpoints

GET /api/v1/images - List all available images
GET /api/v1/images/{image_name} - Get detections for a specific image
PUT /api/v1/images/{image_name}/annotations - Update (replace) annotations for a specific image
POST /api/v1/detection/upload - Upload an image for processing
POST /api/v1/detection/process/{image_name} - Process an image and return the annotated image along with its detection data
GET /api/v1/detection/classes - Get available object classes
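
The read-only endpoints can be exercised from Python with the standard library alone. This is a minimal sketch, assuming the API is running at the default http://localhost:8000 address given under Usage. The shape of the JSON responses is not documented here, so the code only parses whatever comes back; the helper names are hypothetical.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

BASE_URL = "http://localhost:8000/api/v1"


def images_url(base: str = BASE_URL) -> str:
    """URL for GET /api/v1/images."""
    return f"{base}/images"


def image_url(image_name: str, base: str = BASE_URL) -> str:
    """URL for GET /api/v1/images/{image_name}, with the name percent-encoded."""
    return f"{base}/images/{quote(image_name, safe='')}"


def classes_url(base: str = BASE_URL) -> str:
    """URL for GET /api/v1/detection/classes."""
    return f"{base}/detection/classes"


def get_json(url: str, timeout: float = 10.0):
    """Send a GET request and parse the response body as JSON."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

With the server running, `get_json(images_url())` should return the image listing and `get_json(image_url("example.jpg"))` the detections for one image, where `example.jpg` is a placeholder name.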

Repository: JThomasDevs/image_puzzle_solver
