Skip to content
@NanoNets

Nanonets

Popular repositories Loading

  1. docext docext Public

    An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)

    Python 1.8k 133

  2. docstrange docstrange Public

    Extract and convert data from any document, images, pdfs, word doc, ppt or URL into multiple formats (Markdown, JSON, CSV, HTML) with intelligent structured data extraction and advanced OCR.

    Python 980 90

  3. nanonets-ocr-sample-python nanonets-ocr-sample-python Public

    NanoNets OCR API Example for Python

    Python 204 53

  4. RaspberryPi-ObjectDetection-TensorFlow RaspberryPi-ObjectDetection-TensorFlow Public

    Object Detection using TensorFlow on a Raspberry Pi

    Python 171 38

  5. ocr-with-tesseract ocr-with-tesseract Public

    A comprehensive tutorial for OCR in python using Tesseract-OCR and OpenCV

    Jupyter Notebook 126 73

  6. ocr-python ocr-python Public

    OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.

    Jupyter Notebook 120 18

Repositories

Showing 10 of 57 repositories
  • docstrange Public

    Extract and convert data from any document, images, pdfs, word doc, ppt or URL into multiple formats (Markdown, JSON, CSV, HTML) with intelligent structured data extraction and advanced OCR.

    NanoNets/docstrange’s past year of commit activity
    Python 980 MIT 90 21 2 Updated Oct 31, 2025
  • Nanonets-OCR2 Public

    Evaluations for Nanonets-OCR-1.5

    NanoNets/Nanonets-OCR2’s past year of commit activity
    Jupyter Notebook 9 1 0 0 Updated Oct 16, 2025
  • docext Public

    An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)

    NanoNets/docext’s past year of commit activity
    Python 1,793 Apache-2.0 133 18 (1 issue needs help) 3 Updated Aug 25, 2025
  • llm-data-converter Public

    Convert any document format into LLM-ready data format (markdown) with advanced intelligent document processing capabilities powered by pre-trained models.

    NanoNets/llm-data-converter’s past year of commit activity
    Python 5 MIT 1 0 0 Updated Aug 14, 2025
  • nanonets-go Public

    Code samples in golang for nanonets API

    NanoNets/nanonets-go’s past year of commit activity
    Go 1 MIT 1 0 0 Updated May 29, 2025
  • DocAIAgent Public

    This code is part of a workshop conducted on how to build your own Document AI Agent using Open Source LLMs

    NanoNets/DocAIAgent’s past year of commit activity
    Jupyter Notebook 15 7 0 0 Updated May 8, 2025
  • table-metrics Public

    A repo with all metrics related to table extraction accuracy computation

    NanoNets/table-metrics’s past year of commit activity
    0 MIT 0 0 0 Updated Apr 24, 2025
  • nn-auto-bench Public

    AutoBench: Benchmarking Automation for Intelligent Document Processing (IDP) with confidence

    NanoNets/nn-auto-bench’s past year of commit activity
    Python 10 4 0 0 Updated Mar 18, 2025
  • search-kb Public
    NanoNets/search-kb’s past year of commit activity
    Python 0 0 0 0 Updated Feb 2, 2025
  • NanoNets/hands-on-vision-language-models’s past year of commit activity
    Jupyter Notebook 8 2 0 0 Updated Nov 15, 2024