This is a live OCR application that captures a region of your screen, recognizes the words in it, and displays their translations in real time.
- Live screen capture from any monitor (ish)
- Real-time OCR using PaddleOCR
- Instant translation of recognized words (German → English)
- Responsive UI with word list prioritized by mouse position (X11 only)
- Details panel with a breakdown of each word's information
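
To illustrate the mouse-based prioritization mentioned in the feature list, here is a minimal sketch that orders recognized words by their distance from the cursor. The `Word` type, its fields, and the `prioritize` function are hypothetical names chosen for this example; the application's actual implementation may differ.

```python
from dataclasses import dataclass
from math import hypot


@dataclass
class Word:
    # Hypothetical record: recognized text plus the center of its bounding box.
    text: str
    x: float
    y: float


def prioritize(words: list[Word], mouse_x: float, mouse_y: float) -> list[Word]:
    """Return words ordered from nearest to farthest from the mouse position."""
    return sorted(words, key=lambda w: hypot(w.x - mouse_x, w.y - mouse_y))


# Made-up sample data: three words at different screen positions.
words = [Word("Haus", 400, 300), Word("Baum", 120, 80), Word("Hund", 90, 100)]
nearest_first = prioritize(words, mouse_x=100, mouse_y=100)
print([w.text for w in nearest_first])
```

Sorting by Euclidean distance is only one plausible choice; a real word list might also weigh recognition confidence or word size.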
- Clone the repository:
    git clone <repository-url>
    cd screen-cap-translate
- Install dependencies:
    pip install -r requirements.txt
Run the application with:
python src/main.py

- pipewire
- gstreamer
- gst-plugin-pipewire
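
Before the first run, it may help to check that the system packages listed above are present. The sketch below looks for executables on `PATH`; the executable names (`pipewire`, `gst-inspect-1.0`) are assumptions about how distributions typically package these tools, and the check does not cover `gst-plugin-pipewire`, which ships as a plugin rather than a command.

```python
import shutil
from typing import Callable, Optional

# Assumed executable names; some distributions may install them differently.
TOOLS = {"pipewire": "pipewire", "gstreamer": "gst-inspect-1.0"}


def missing_packages(which: Callable[[str], Optional[str]] = shutil.which) -> list[str]:
    """Return the package names whose assumed executable is not on PATH."""
    return [pkg for pkg, exe in TOOLS.items() if which(exe) is None]


print("Missing:", missing_packages() or "none")
```

The `which` parameter exists only so the lookup can be swapped out in tests; calling `missing_packages()` with no arguments uses `shutil.which`.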
Currently, only Python 3.11 is supported.
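
Because of the Python version restriction, a quick interpreter check can save a confusing failure later. This is a small sketch, not part of the application; `REQUIRED` and `is_supported` are hypothetical names introduced here.

```python
import sys

REQUIRED = (3, 11)  # the only version this README lists as supported


def is_supported(version_info=sys.version_info) -> bool:
    """True when the interpreter's major.minor matches the required version."""
    return tuple(version_info[:2]) == REQUIRED


if not is_supported():
    print(
        f"Warning: Python {REQUIRED[0]}.{REQUIRED[1]} is required, "
        f"found {sys.version_info.major}.{sys.version_info.minor}"
    )
```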
On GNOME, it should work out of the box.
On KDE, it requires a virtual output when choosing the screen-share option.