"topic:ocr" — Search

7,407 results for “topic:ocr”

Tesseract Open Source OCR Engine (main repository)

hacktoberfestlstmmachine-learningocrocr-enginetesseracttesseract-ocr

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python71.9k9.9kUpdated just now

ai4sciencechineseocrdocument-parsingdocument-translationkieocrpaddleocr-vlpdf-extractor-ragpdf-parserpdf2markdownpp-ocrpp-structurerag

opendatalab/MinerU

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python55.8k4.6kUpdated just now

ai4sciencedocument-analysisextract-datalayout-analysisocrparserpdfpdf-converterpdf-extractor-llmpdf-extractor-pretrainpdf-extractor-ragpdf-parserpython

hiroi-sora/Umi-OCR

OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片，PDF文档识别，排除水印/页眉页脚，扫描/生成二维码。内置多国语言库。

Python42.5k4.2kUpdated just now

ocrocr-pythonpaddleocrqmlqtscreenshotumi-ocr

siyuan-note/siyuan

A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.

TypeScript41.8k2.6kUpdated just now

ankichatgptdeepseekelectronevernoteknowledge-baselocal-firstmarkdownnote-takingnotes-appnotionobsidianocrollamaopenaipdfs3self-hostedwebdav

naptha/tesseract.js

Pure Javascript OCR for more than 100 Languages 📖🎉🖥

JavaScript37.9k2.4kUpdated 5 hours ago

deep-learningjavascriptocrtesseractwebassembly

paperless-ngx/paperless-ngx

A community-supported supercharged document management system: scan, index and archive all your documents

Python37.2k2.4kUpdated just now

angulararchivingdjangodmsdocument-managementdocument-management-systemhacktoberfestmachine-learningocroptical-character-recognitionpdf

ShareX/ShareX

ShareX is a free and open-source application that enables users to capture or record any area of their screen with a single keystroke. It also supports uploading images, text, and various file types to a wide range of destinations.

C#35.8k3.6kUpdated just now

capturecolor-pickercsharpdropboxfile-sharingfile-uploadftpgifgif-recorderimage-annotationimgurocrproductivityregion-capturescreen-capturescreen-recorderscreenshotsharesharexurl-shortener

ocrmypdf/OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Python32.9k2.3kUpdated just now

image-processingocrpdfpythontesseract

JaidedAI/EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Python29.1k3.5kUpdated 1 hour ago

cnncrnndata-miningdeep-learningeasyocrimage-processinginformation-retrievallstmmachine-learningocroptical-character-recognitionpythonpytorchscene-textscene-text-recognition

pot-app/pot-desktop

🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.

JavaScript17.3k827Updated just now

linuxmacosocrpotpot-apprecognizetauritranslatetranslationttswindows

lukas-blecher/LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python16.2k1.3kUpdated 5 hours ago

datasetdeep-learningim2latexim2markupim2textimage-processingimage2textlatexlatex-ocrmachine-learningmath-ocrocrpythonpytorchtransformervision-transformervit

Unstructured-IO/unstructured

Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.

HTML14.2k1.2kUpdated just now

data-pipelinesdeep-learningdocument-image-analysisdocument-image-processingdocument-parserdocument-parsingdocxdonutinformation-retrievallangchainllmmachine-learningmlnatural-language-processingnlpocrpdfpdf-to-jsonpdf-to-textpreprocessing

sml2h3/ddddocr

带带弟弟通用验证码识别OCR pypi版

Python13.7k2.2kUpdated just now

captchaddddocrocr

tisfeng/Easydict

一个简洁优雅的词典翻译 macOS App。开箱即用，支持离线 OCR 识别，支持有道词典，🍎 苹果系统词典，🍎 苹果系统翻译，OpenAI，Gemini，DeepL，Google，Bing，腾讯，百度，阿里，小牛，彩云和火山翻译。A concise and elegant Dictionary and Translator macOS App for looking up words and translating text.

Swift12.4k606Updated 5 hours ago

appbaidubingdeepldictionarygeminigooglemacosocropenaishortcutstencenttranslatetranslatoryoudao

DayBreak-u/chineseocr_lite

超轻量级中文ocr，支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M

C++12.3k2.3kUpdated 9 hours ago

ncnnocrpytorch

getomni-ai/zerox

OCR & Document Extraction using vision models

TypeScript12.2k833Updated 3 hours ago

ocrpdf

dataelement/bisheng

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.

TypeScript11.2k1.8kUpdated 9 hours ago

agentaichatbotenterprisefinetunegenaigptlangchianllamallmllmdevopsllmopsocropenaiorchestrationpythonragreactsftworkflow

HIllya51/LunaTranslator

视觉小说翻译器 / Visual Novel Translator

C++10.8k1.0kUpdated 2 hours ago

galgameocrreverse-engineeringtranslatorvisual-novelwin32

yusufkaraaslan/Skill_Seekers

Convert documentation websites, GitHub repositories, and PDFs into Claude AI skills with automatic conflict detection

Python10.5k1.0kUpdated just now

ai-toolsast-parserautomationclaude-aiclaude-skillscode-analysisconflict-detectiondocumentationdocumentation-generatorgithubgithub-scrapermcpmcp-servermulti-sourceocrpdfpythonweb-scraping

ripperhe/Bob

Bob 是一款 macOS 平台的翻译和 OCR 软件。

9.6k523Updated 6 hours ago

bobappchatgptdeepseekdoubaoerniegeminigroqhunyuankimimacosocropenaiqwentranslatetranslationtranslatorzhipuai

zyddnys/manga-image-translator

Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/ (no longer working)

Python9.5k930Updated 3 hours ago

animeauto-translationchinese-translationdeep-learningimage-processinginpaintingjapanese-translationsmachine-translationmanganeural-networkocrpytorch-implementationtext-detectiontext-detection-recognitiontransformer

pymupdf/PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Python9.2k696Updated 7 hours ago

data-scienceepubextract-datafontmupdfocrpdfpdf-documentspymupdfpythontable-extractiontesseracttext-processingtext-shapingxps

bytedance/Dolphin

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

Python8.9k747Updated 7 hours ago

document-analysislayout-analysisocrparserpdfpdf-converterpdf-parserpythonvlm-ocr

YaoFANGUK/video-subtitle-extractor

视频硬字幕提取，生成srt文件。无需申请第三方API，本地实现文本识别。基于深度学习的视频字幕提取框架，包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.

Python8.5k872Updated 2 hours ago

deep-learningextracthardsubocrrippersrtsubripsubtitles

CVHub520/X-AnyLabeling

Effortless data labeling with AI support from Segment Anything and other awesome models.

Python8.3k906Updated just now