You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Demonstration for the zai-org/GLM-OCR multimodal OCR model. Supports text, formula, and table recognition from uploaded images, with outputs in plain text and markdown formats.
Upload PDFs, extract text via OCR/PyMuPDF RAG Pipeline, and chat with your documents using a local LLM — fully offline via Ollama, with a Streamlit dashboard and MCP server for AI assistant integration.
AI-powered video hardcoded (burned-in) subtitle extraction using GLM-OCR. Features a Vue 3 interface, Electron desktop app, and Python CLI. Built for speed and accuracy.