Digital Inspector

A document analysis tool that automatically detects and annotates stamps, signatures, and QR codes in scanned PDF documents. It is aimed at building and construction documentation and other structured documents.

Features

- Multi-Object Detection: Detects stamps, signatures, and QR codes in PDF documents
- PDF Processing: Converts PDF pages to images and processes them automatically
- YOLO-Based Detection: Uses trained YOLO models for accurate stamp and signature detection
- Robust QR Detection: Multiple QR detection methods (OpenCV, pyzbar, qreader, qrdet) for maximum accuracy
- JSON Output: Generates structured JSON annotations matching hackathon requirements
- Visual Annotations: Automatically draws bounding boxes with color-coded labels on images
- Parallel Processing: Optimized for speed with configurable worker threads
- Flexible Input: Supports both single PDF files and directories of PDFs

Quick Start

Installation

Clone the repository

```
git clone <repository-url>
cd InnovateX_Hack
```

Create virtual environment (recommended)

```
python3 -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
```

Install dependencies
```
pip install -r requirements.txt
```
Install system dependencies

macOS:
```
brew install poppler zbar
```
Ubuntu/Debian:
```
sudo apt-get install poppler-utils libzbar0
```
Windows:
- Download a Poppler build for Windows (for example, poppler-windows)
- Extract it to a folder (e.g., `C:\poppler`)
- Set the `POPPLER_PATH` variable in `pdf_to_image_converter.py` to `C:\poppler\Library\bin` (or your installation path)
- The converter uses this hardcoded path on Windows
- ZBar is usually bundled with the pyzbar wheel, so it normally needs no separate install
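
The platform-specific Poppler setup can be pictured with a minimal sketch like the one below. It is not the project's actual code: `resolve_poppler_path` and the `POPPLER_PATH` environment-variable override are hypothetical names introduced for illustration; only the `C:\poppler\Library\bin` path comes from the instructions above.

```python
import os
import platform
from typing import Optional

# Path taken from the Windows instructions; adjust to your installation.
WINDOWS_POPPLER_PATH = r"C:\poppler\Library\bin"


def resolve_poppler_path(system: Optional[str] = None) -> Optional[str]:
    """Return the Poppler bin directory to use, or None to rely on PATH."""
    system = system or platform.system()
    # Hypothetical override (not part of the project): avoids editing the source.
    override = os.environ.get("POPPLER_PATH")
    if override:
        return override
    # On macOS and Linux the package manager puts Poppler on PATH already.
    return WINDOWS_POPPLER_PATH if system == "Windows" else None
```

If the converter is built on a library such as pdf2image (an assumption; this README does not name the library), the returned value could be passed as the `poppler_path` argument of `convert_from_path`.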

Basic Usage

Process a directory of PDFs:

```
python digital_inspector.py test_pdfs/
```

Process a single PDF file:

```
python digital_inspector.py test_pdfs/document.pdf
```

With custom options:

```
python digital_inspector.py test_pdfs/ \
  --workers 8 \
  --conf-threshold 0.1 \
  --json-output results.json \
  --no-fast
```
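
Because the positional argument may be either a directory or a single PDF, the input has to be normalized into a list of files at some point. The sketch below shows one way that could look; `collect_pdfs` is a hypothetical helper, and the non-recursive scan is an assumption rather than documented behavior.

```python
from pathlib import Path
from typing import List


def collect_pdfs(input_path: str) -> List[Path]:
    """Return the PDF files named by input_path (a file or a directory)."""
    path = Path(input_path)
    if path.is_file():
        # A single file is accepted only if it has a .pdf extension.
        return [path] if path.suffix.lower() == ".pdf" else []
    if path.is_dir():
        # Non-recursive scan (assumption); sorted for a stable processing order.
        return sorted(
            p for p in path.iterdir()
            if p.is_file() and p.suffix.lower() == ".pdf"
        )
    raise FileNotFoundError(f"No such file or directory: {input_path}")
```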

Usage Examples

Process PDFs with Default Settings

```
python digital_inspector.py test_pdfs/
```

- Converts PDFs to images (200 DPI)
- Detects all objects using the default YOLO model
- Generates annotations.json in the output directory
- Draws bounding boxes on converted images

Optimize for Speed

```
python digital_inspector.py test_pdfs/ --workers 8 --fast
```

- Uses 8 parallel workers (adjust based on your CPU)
- Fast QR detection mode (recommended for most cases)
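
To illustrate what a worker count means in practice, here is a minimal sketch of fanning pages out over a thread pool. It assumes thread-based workers, as the feature list suggests; `process_pages` and `fake_detect` are hypothetical stand-ins, not functions from this repository.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List, Sequence


def process_pages(
    pages: Sequence[str],
    worker: Callable[[str], dict],
    max_workers: int = 8,
) -> List[dict]:
    """Run worker over every page with a thread pool, keeping input order."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Executor.map yields results in the same order as the inputs.
        return list(pool.map(worker, pages))


def fake_detect(page: str) -> dict:
    """Placeholder for the real per-page detection step."""
    return {"page": page, "annotations": []}
```

Calling `process_pages(["page_1.png", "page_2.png"], fake_detect, max_workers=8)` returns one result dictionary per page, in page order.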

Optimize for Accuracy

```
python digital_inspector.py test_pdfs/ --no-fast --conf-threshold 0.1
```

- Exhaustive QR code search
- Lower confidence threshold for more detections
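
The effect of the confidence threshold can be seen with a small, self-contained example. The detections below are made-up values for illustration only, and `filter_by_confidence` is a hypothetical helper; the real tool passes the threshold to the YOLO model rather than filtering afterwards.

```python
from typing import Dict, List

# Hypothetical raw detections: category plus model confidence.
DETECTIONS: List[Dict] = [
    {"category": "stamp", "confidence": 0.82},
    {"category": "signature", "confidence": 0.31},
    {"category": "signature", "confidence": 0.14},
    {"category": "stamp", "confidence": 0.07},
]


def filter_by_confidence(items: List[Dict], threshold: float) -> List[Dict]:
    """Keep only detections whose confidence is at or above the threshold."""
    return [d for d in items if d["confidence"] >= threshold]


default_hits = filter_by_confidence(DETECTIONS, 0.25)  # default threshold
relaxed_hits = filter_by_confidence(DETECTIONS, 0.1)   # --conf-threshold 0.1
```

With these sample values the default threshold keeps two detections and the relaxed one keeps three. The extra detection may be a faint signature or a false positive, which is the trade-off of lowering the threshold.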

Custom Output Location

```
python digital_inspector.py test_pdfs/ \
  --output my_results \
  --json-output custom_annotations.json
```

Command-Line Options

| Option | Description | Default |
| --- | --- | --- |
| `pdf_directory` | Path to PDF directory or single PDF file | Required |
| `--output` | Output directory name | `<input>_converted` |
| `--dpi` | Resolution for PDF conversion | 200 |
| `--workers` | Number of parallel workers | 4 |
| `--yolo-model` | Path to YOLO model file | `runs/detect/train8/weights/best.pt` |
| `--conf-threshold` | YOLO confidence threshold | 0.25 |
| `--json-output` | Path to save JSON file | `annotations.json` |
| `--fast` | Fast QR detection mode | Enabled |
| `--no-fast` | Exhaustive QR detection | Disabled |
| `--json` | Output results as JSON | False |
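
For readers who want to see how these options fit together, the following is a sketch of an `argparse` parser that mirrors the table. It is an approximation written from the table alone, not the parser in `digital_inspector.py`; in particular, treating `--fast` and `--no-fast` as two switches for one `fast` value is an assumption.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser whose options and defaults follow the table above."""
    parser = argparse.ArgumentParser(prog="digital_inspector.py")
    parser.add_argument("pdf_directory",
                        help="Path to PDF directory or single PDF file")
    # None means "derive <input>_converted from the input path later".
    parser.add_argument("--output", default=None, help="Output directory name")
    parser.add_argument("--dpi", type=int, default=200)
    parser.add_argument("--workers", type=int, default=4)
    parser.add_argument("--yolo-model",
                        default="runs/detect/train8/weights/best.pt")
    parser.add_argument("--conf-threshold", type=float, default=0.25)
    parser.add_argument("--json-output", default="annotations.json")
    # Assumption: both switches write to the same destination, fast by default.
    parser.add_argument("--fast", dest="fast", action="store_true", default=True)
    parser.add_argument("--no-fast", dest="fast", action="store_false")
    parser.add_argument("--json", action="store_true",
                        help="Output results as JSON")
    return parser


args = build_parser().parse_args(["test_pdfs/", "--workers", "8", "--no-fast"])
```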

Project Structure

```
InnovateX_Hack/
├── digital_inspector.py      # Main entry point
├── pdf_to_image_converter.py # PDF to image conversion
├── qr_detector.py            # QR code detection module
├── json_generator.py         # JSON output generation
├── draw_annotations.py       # Visual annotation drawing
├── requirements.txt          # Python dependencies
└── README.md                 # This file
```

Output Format

The tool generates JSON annotations in the following format:

```
{
  "document.pdf": {
    "page_1": {
      "annotations": [
        {
          "annotation_1": {
            "category": "signature",
            "bbox": {
              "x": 510,
              "y": 146,
              "width": 250,
              "height": 98.89
            },
            "area": 24722.5
          }
        }
      ],
      "page_size": {
        "width": 1684,
        "height": 1190
      }
    }
  }
}
```
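
The nesting (file, then page, then a list of single-key annotation objects) takes a moment to unpack, so here is a short sketch of walking it in Python. `iter_annotations` is a hypothetical helper, and the embedded sample simply repeats the structure shown above.

```python
import json
from typing import Dict, Iterator, Tuple

SAMPLE = """
{
  "document.pdf": {
    "page_1": {
      "annotations": [
        {"annotation_1": {
          "category": "signature",
          "bbox": {"x": 510, "y": 146, "width": 250, "height": 98.89},
          "area": 24722.5
        }}
      ],
      "page_size": {"width": 1684, "height": 1190}
    }
  }
}
"""


def iter_annotations(data: Dict) -> Iterator[Tuple[str, str, str, Dict]]:
    """Yield (pdf_name, page_name, annotation_id, annotation) tuples."""
    for pdf_name, pages in data.items():
        for page_name, page in pages.items():
            # Each list entry is a one-key object: {annotation_id: annotation}.
            for entry in page["annotations"]:
                for annotation_id, annotation in entry.items():
                    yield pdf_name, page_name, annotation_id, annotation


rows = list(iter_annotations(json.loads(SAMPLE)))
```

In this sample, `area` equals `width * height` of the bounding box (250 × 98.89 = 24722.5), which is a handy sanity check when validating output.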

Visual Annotations

The tool also draws bounding boxes on converted images:

- Blue boxes for signatures
- Pink boxes for stamps
- Green boxes for QR codes

Each box includes a label at the top edge.
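
As a rough sketch of how the color coding and box placement could be derived from a `bbox` entry, consider the following. The exact RGB shades and the `qr_code` category key are assumptions (this README names the colors but not their values or the QR category string), and `box_geometry` is a hypothetical helper rather than code from `draw_annotations.py`.

```python
from typing import Dict, Tuple

Point = Tuple[int, int]

# Assumed RGB values; only the color names come from the README.
CATEGORY_COLORS: Dict[str, Tuple[int, int, int]] = {
    "signature": (0, 0, 255),   # blue
    "stamp": (255, 105, 180),   # pink
    "qr_code": (0, 255, 0),     # green (category key is an assumption)
}


def box_geometry(bbox: Dict[str, float]) -> Tuple[Point, Point]:
    """Convert an x/y/width/height bbox into integer corner points."""
    top_left = (round(bbox["x"]), round(bbox["y"]))
    bottom_right = (
        round(bbox["x"] + bbox["width"]),
        round(bbox["y"] + bbox["height"]),
    )
    return top_left, bottom_right


top_left, bottom_right = box_geometry(
    {"x": 510, "y": 146, "width": 250, "height": 98.89}
)
```

The two corner points are what a drawing call such as OpenCV's `cv2.rectangle` expects, and the label would be anchored at `top_left`. Note that OpenCV uses BGR channel order, so the tuples above would need reversing there.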

System Requirements

Hardware

- CPU: Multi-core recommended (4+ cores for best performance)
- RAM: 8GB+ recommended
- GPU: Optional (speeds up YOLO detection, but not required)

Software

- Python: 3.8 or higher
- Operating System: macOS, Linux, or Windows

Optimal Worker Count

- Apple M1: 8-10 workers
- AMD Ryzen 5 7535HS: 12-16 workers
- General: Start from your CPU thread count and add 20-30% to cover I/O overlap
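
The general rule can be turned into a small calculation. The sketch below is illustrative only: `suggest_workers` is a hypothetical helper, and the 25% default headroom is simply the midpoint of the 20-30% range given above.

```python
import math
import os
from typing import Optional


def suggest_workers(threads: Optional[int] = None, headroom: float = 0.25) -> int:
    """Suggest a --workers value: CPU thread count plus I/O headroom."""
    # os.cpu_count() reports logical CPUs and may return None on some systems.
    threads = threads or os.cpu_count() or 1
    return max(1, math.ceil(threads * (1 + headroom)))


m1_workers = suggest_workers(8)      # 8 threads, as on an Apple M1
ryzen_workers = suggest_workers(12)  # 12 threads, as on a Ryzen 5 7535HS
```

For 8 and 12 threads this gives 10 and 15 workers, which fall inside the ranges listed for the two example CPUs.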

Advanced Usage

Training Your Own YOLO Model

Prepare dataset:

```
python prepare_dataset.py create
python prepare_dataset.py split --source test_images
```

Annotate images using LabelImg (install with `pip install labelImg`).

Train model:

```
python train_yolo.py --epochs 50 --model yolov8s.pt
```

Use custom model:

```
python digital_inspector.py test_pdfs/ \
  --yolo-model runs/detect/train/weights/best.pt
```

Performance

- Processing Speed: roughly 3-8 seconds per page on CPU, 1-3 seconds per page on GPU
- Parallelization: Processes multiple pages simultaneously
- Optimization: Cached YOLO model loading, optimized image I/O
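
Cached model loading means the weights are read once and reused for every page. One common way to get that behavior in Python is `functools.lru_cache`, sketched below. This is an illustration of the idea, not the project's implementation; `FakeModel` and `get_model` are hypothetical, with `FakeModel` standing in for a real model object such as one created with Ultralytics' `YOLO(weights_path)`.

```python
from functools import lru_cache


class FakeModel:
    """Stand-in for an expensive-to-load detection model."""

    def __init__(self, weights_path: str) -> None:
        self.weights_path = weights_path


@lru_cache(maxsize=None)
def get_model(weights_path: str) -> FakeModel:
    """Load the model for a weights path once; later calls reuse it."""
    return FakeModel(weights_path)


first = get_model("runs/detect/train8/weights/best.pt")
second = get_model("runs/detect/train8/weights/best.pt")
```

Both calls return the same object, so the loading cost is paid once per weights path rather than once per page.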

Troubleshooting

YOLO Model Not Found

```
# Use default model or specify custom path
python digital_inspector.py test_pdfs/ \
  --yolo-model yolov8s.pt
```

No Detections Found

- Lower confidence threshold: `--conf-threshold 0.1`
- Try exhaustive search: `--no-fast`
- Check that images were converted successfully

Poppler Not Found (PDF conversion fails)

- macOS: `brew install poppler`
- Linux: `sudo apt-get install poppler-utils`
- Windows:
  - Download from poppler-windows
  - Extract and hardcode the path (`C:\poppler\Library\bin`) in `pdf_to_image_converter.py`

Import Errors

```
# Reinstall dependencies
pip install -r requirements.txt --upgrade
```

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Acknowledgments

- YOLO: Ultralytics YOLO for object detection
- OpenCV: Computer vision operations
- Poppler: PDF rendering library

Made for InnovateX Hackathon
