516 results for “topic:ocr-python”
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown
结束和新的开始
Lightweight & fast OCR models for license plate text recognition.
Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition
OCR, Archive, Index and Search: Implementation agnostic OCR framework.
Perform text detection in a variety of languages with your computer webcam using Google Tesseract OCR and OpenCV. This script achieves a real-time OCR effect via multi-threading.
A carefully-designed OCR pipeline for universal boarded table recognition and reconstruction.
Collection of PDF parsing libraries like AI based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, pymupdf, pdfplumber etc for efficient snapshot, text, table, and metadata extraction.
Manga OCR snipping application for desktop
Anansi is a computer vision (cv2 and FFmpeg) + OCR (EasyOCR and tesseract) python-based crawler for finding and extracting questions and correct answers from video files of popular TV game shows in the Balkan region.
Python3 package for Chinese/English OCR,use paddleocr-v5 onnx model(~20MB), with ultra-fast inference speed. 基于ppocr-v5-onnx模型推理,中英文OCR开源SOTA,推理速度超快。
A powerful LaTeX formula recognition tool powered by pix2tex and pix2text. Features real-time MathJax preview, multi-format export (LaTeX, Markdown, MathML, HTML, OMML, SVG), and one-click copy to Word/Office. Offline-first, privacy-focused portable executable.
PDF text data extraction web app with OCR for scanned documents
Multimodal document parser for high quality data understanding and extraction
A FLOSS software for Persian Optical Character Recognition
OCR Tamil is a powerful tool that can detect and recognize text in Tamil images with high accuracy on Natural Scenes
Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION
A program for extracting hard coded (burned in) subtitle from a video and generating an external subtitle.
Custom C++ implementation of deep learning based OCR
OCR Script CLI Tool for Extracting Text from Screenshots (images) using bash, and python scripts only
Turn any OCR models into online inference API endpoint 🚀 🌖
Official Python client library for Nutrient Document Web Services API - PDF processing, OCR, watermarking, and document manipulation with automatic Office format conversion
MyLittleOCR 是一个统一的 OCR 库包装器,提供一致的 API,便于集成和切换多个 OCR 引擎。 MyLittleOCR is a unified OCR wrapper providing a consistent API for seamless integration and switching between multiple OCR engines.
A project to bring high accuracy OCR to Persian language.
Optical Character Recognition in Python.
PDF Table Extractor is an innovative Python project designed to tackle the challenge of extracting tables from scanned PDF documents. Leveraging advanced optical character recognition (OCR) and image processing techniques.
Zefoy OCR captcha solver | 99% accurate
Deep Learning Individual Project - March 03, 2022.
dev repo for article