🧠 OCR Keyword Detection API

This project is a FastAPI-based OCR (Optical Character Recognition) service that detects whether a specific keyword appears in an uploaded image.
It uses Tesseract OCR, OpenCV, and RapidFuzz to read and analyze text content from images with automatic rotation and preprocessing for better accuracy.

🚀 Features

🖼️ Image upload API with keyword checking
🔄 Automatic image rotation (fixes portrait/landscape orientation)
🎨 Multiple OCR preprocessing strategies (contrast enhancement, grayscale, etc.)
📊 String similarity matching using RapidFuzz
🧾 Outputs OCR results and confidence scores
⚡ Built with FastAPI + Uvicorn for high performance

📁 Project Structure

├── main.py               # Main FastAPI application
├── gambar/               # Folder where uploaded images are saved
├── hasil/                # Folder for OCR output and processed images
└── requirements.txt      # Python dependencies

🧩 Dependencies

This project uses the following main libraries:

fastapi
uvicorn
opencv-python
pytesseract
pillow
numpy
rapidfuzz

⚙️ Installation

Clone this repository

git clone https://github.com/yourusername/ocr-api.git
cd ocr-api

Create a virtual environment

python -m venv venv
source venv/bin/activate  # On Linux/Mac
venv\Scripts\activate     # On Windows

Install dependencies
```
pip install -r requirements.txt
```
Install Tesseract OCR
- Ubuntu/Debian
```
sudo apt update
sudo apt install tesseract-ocr
```
- Windows
  - Download and install from: https://github.com/UB-Mannheim/tesseract/wiki

▶️ Running the Server

Start the FastAPI app using Uvicorn:

uvicorn main:app --host 0.0.0.0 --port 8124

or simply run:

python main.py

The server will start on:
👉 http://localhost:8124

🧠 API Usage

Endpoint: `/ocr`

Method: POST
Content-Type: multipart/form-data

Form Data:

Field	Type	Description
file	File	The image file to scan
keyword	String	The text keyword to search for

Example using `curl`

curl -X POST "http://localhost:8124/ocr" \
  -F "file=@sample.jpg" \
  -F "keyword=DIGITAL"

Example Response

{
  "found": true,
  "match": "DIGITAL",
  "score": 91
}

🧮 Processing Steps

Fix image orientation using EXIF metadata
Run raw OCR directly
Apply preprocessing techniques (grayscale, contrast adjustment)
If no match, rotate the image 180° and retry
Compare OCR results against the target keyword using fuzzy matching

📂 Output Files

All results are saved in the hasil/ directory:

step0_raw.jpg → The processed image
hasil_ocr.txt → The OCR extracted text

🧑‍💻 Example Workflow

Upload an image via /ocr endpoint
The server processes it, runs OCR, and checks similarity
Returns whether the keyword was found and the similarity score

🛠️ Customization

You can adjust:

OCR configuration in ocr_raw() and ocr_preprocessed()
Similarity threshold in check_similarity() (default: 85%)
Output directory names for results and uploads

📜 License

This project is open-source under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
gambar		gambar
README.md		README.md
client.py		client.py
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🧠 OCR Keyword Detection API

🚀 Features

📁 Project Structure

🧩 Dependencies

⚙️ Installation

▶️ Running the Server

🧠 API Usage

Endpoint: `/ocr`

Form Data:

Example using `curl`

Example Response

🧮 Processing Steps

📂 Output Files

🧑‍💻 Example Workflow

🛠️ Customization

📜 License

About

Uh oh!

Releases

Packages

Languages

fckveza/ocr-api

Folders and files

Latest commit

History

Repository files navigation

🧠 OCR Keyword Detection API

🚀 Features

📁 Project Structure

🧩 Dependencies

⚙️ Installation

▶️ Running the Server

🧠 API Usage

Endpoint: /ocr

Form Data:

Example using curl

Example Response

🧮 Processing Steps

📂 Output Files

🧑‍💻 Example Workflow

🛠️ Customization

📜 License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Endpoint: `/ocr`

Example using `curl`

Packages