Mistral OCR Document Processor

A command-line tool to process documents using the Mistral AI OCR API.

Installation

Option 1: Install from Source

git clone https://github.com/aburkard/mistral-ocr.git
cd mistral-ocr
pip install .

Option 2: Install in Development Mode

git clone https://github.com/aburkard/mistral-ocr.git
cd mistral-ocr
pip install -e .

Configuration

Create a .env file in your working directory with your Mistral API key:

MISTRAL_API_KEY="YOUR_MISTRAL_API_KEY_HERE"
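For readers curious how a key in a .env file can reach a Python program, here is a minimal standard-library sketch. It is not taken from mistral_ocr.py, which may well use a library such as python-dotenv instead; the helper names load_env_file and get_api_key are hypothetical, and the rule that an already-exported variable wins over the file is an assumption.

```python
import os
from pathlib import Path


def load_env_file(path=".env"):
    """Parse KEY=VALUE lines from a .env file into a dict (hypothetical helper)."""
    values = {}
    env_path = Path(path)
    if not env_path.is_file():
        return values
    for raw_line in env_path.read_text(encoding="utf-8").splitlines():
        line = raw_line.strip()
        # Skip blank lines, comments, and lines without an assignment.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        value = value.strip()
        # Drop one layer of matching single or double quotes.
        if len(value) >= 2 and value[0] == value[-1] and value[0] in "\"'":
            value = value[1:-1]
        values[key.strip()] = value
    return values


def get_api_key(path=".env"):
    """Prefer an exported MISTRAL_API_KEY, then fall back to the .env file."""
    key = os.environ.get("MISTRAL_API_KEY") or load_env_file(path).get("MISTRAL_API_KEY")
    if not key:
        raise RuntimeError(f"MISTRAL_API_KEY is not set in the environment or in {path}")
    return key
```

Failing early with a clear message when the key is missing tends to be friendlier than letting the first API request fail with an authentication error.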

Usage

After installation, you can run the tool from anywhere:

mistral-ocr <document_source> [-v/--verbose]

Replace <document_source> with either:

A URL to the document (e.g., https://example.com/document.pdf)
A local file path (e.g., ./my_document.png)
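Because a single argument can be either a URL or a path, the tool has to tell the two apart somehow. The sketch below shows one plausible way to do that with the standard library; classify_source is a hypothetical name, and treating only http and https as URLs is an assumption rather than the tool's documented behavior.

```python
from pathlib import Path
from urllib.parse import urlparse


def classify_source(document_source):
    """Return ("url", source) or ("file", absolute_path) (hypothetical helper)."""
    parsed = urlparse(document_source)
    # Assumption: only http(s) addresses with a host count as URLs.
    if parsed.scheme in ("http", "https") and parsed.netloc:
        return "url", document_source
    # Everything else is treated as a local path and made absolute.
    return "file", str(Path(document_source).expanduser().resolve())
```

Checking the scheme explicitly, rather than looking for a colon, keeps Windows paths such as C:\scans\page.png from being mistaken for URLs.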

Examples

Process a document from a URL:

mistral-ocr https://www.w3.org/WAI/ER/tests/xhtml/testfiles/resources/pdf/dummy.pdf

Process a local file with verbose logging:

mistral-ocr ./invoices/invoice_march.pdf --verbose
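The command-line shape used in these examples (one positional source plus an optional -v/--verbose flag) maps naturally onto argparse. The following is an illustrative sketch, not the parser from mistral_ocr.py; build_parser and log_level_for are hypothetical names, and the choice of DEBUG versus WARNING log levels is an assumption.

```python
import argparse
import logging


def build_parser():
    """Build a parser mirroring the documented usage line (hypothetical helper)."""
    parser = argparse.ArgumentParser(
        prog="mistral-ocr",
        description="Process a document with the Mistral AI OCR API.",
    )
    parser.add_argument("document_source", help="URL or local file path of the document")
    parser.add_argument("-v", "--verbose", action="store_true", help="enable verbose logging")
    return parser


def log_level_for(args):
    """Assumed mapping: verbose runs log at DEBUG, others at WARNING."""
    return logging.DEBUG if args.verbose else logging.WARNING


# Parse the second example above from an explicit argument list.
args = build_parser().parse_args(["./invoices/invoice_march.pdf", "--verbose"])
```

Passing an explicit list to parse_args makes the parser easy to exercise in tests without touching sys.argv.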

Output

The tool prints the extracted text from the document in Markdown format.
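Since the result is printed rather than written to a file, a small wrapper can capture it for later use. This sketch assumes the tool writes the Markdown to standard output and exits with status zero on success; save_markdown is a hypothetical helper and the output file name is only an example.

```python
import subprocess
from pathlib import Path


def save_markdown(command, output_path):
    """Run a command, write its standard output to output_path, and return the text."""
    # check=True raises CalledProcessError if the command exits non-zero.
    result = subprocess.run(command, capture_output=True, text=True, check=True)
    Path(output_path).write_text(result.stdout, encoding="utf-8")
    return result.stdout


# Example call (requires mistral-ocr to be installed and configured):
# save_markdown(["mistral-ocr", "./invoices/invoice_march.pdf"], "invoice_march.md")
```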
