"topic:document-analysis" — Search

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

C++1.8k199Updated 2 days ago

artificial-intelligencecomputer-visiondocumentdocument-analysisdocument-intelligencedocument-recognitiondocument-understandingdocumentaiend-to-end-ocrmultimodalmultimodal-deep-learningocrscene-text-detectionscene-text-detection-recognitionscene-text-recognitiontext-detectiontext-recognitionvision-languagevision-language-modelvision-language-transformer

tstanislawek/awesome-document-understanding

A curated list of resources for Document Understanding (DU) topic

1.5k166Updated 20 hours ago

awesomeawesome-listdeep-learningdocument-aidocument-analysisdocument-intelligencedocument-layout-analysisdocument-understandinginformation-extractionintelligent-processingkey-information-extractionmachine-learningnatural-language-processingnlpocrpdfpdf-documentsrobotic-process-automationrpaunstructured-data

DocumindHQ/documind

Open-source platform for extracting structured data from documents using AI.

JavaScript1.5k59Updated 12 hours ago

aideveloper-toolsdocument-analysisdocument-extractionextract-datallmsocropen-sourceparserpdfpdf-converterpdf-extractorpdf-extractor-llm

Topdu/OpenOCR

OpenOCR: An Open-Source Toolkit for General-OCR Research and Applications, integrates a unified training and evaluation benchmark, commercial-grade OCR and Document Parsing systems, and faithful reproductions of the core implementations from a wide range of academic papers.

Python1.3k111Updated 1 day ago

chineseocrdocument-analysisdocument-parsingdocument-processingocrocr-pytorchscene-text-detectionscene-text-recognition

Deodat-Lawson/PDR_AI_v2

AI-powered StartUp Accelerator Engine built with Next.js, LangChain, PostgreSQL + pgvector. Upload, organize, and chat with documents. Includes predictive missing-document detection, role-based workflows, and page-level insight extraction.

JavaScript788111Updated 3 days ago

ai-chatbotdocument-aidocument-analysisdrizzle-ormfull-stacklangchainllm-appnextjsocropenaipgvectorpostgresqlragrag-chatbottypescriptvector-search

Yuliang-Liu/Curve-Text-Detector

This repository provides train＆test code, dataset, det.&rec. annotation, evaluation script, annotation tool, and ranking.

Jupyter Notebook652157Updated 1 month ago

deep-learningdocument-analysisobject-detectionscene-text

ispras/dedoc

Dedoc is a library (service) for automate documents parsing and bringing to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser

Python64951Updated 1 day ago

docdocument-analysisdocument-content-extractiondocumentsdocxdocx-parserexcelhtmlhtml-parserlogical-structure-extractionocrodtpdfpdf-parserscanned-documentstable-of-contentstable-recognitiontxt

wenwenyu/PICK-pytorch

Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICPR 2020)

Python570191Updated 2 weeks ago

document-analysisdocument-understandinggraph-convolutional-networkgraph-learninggraph-neural-networkskey-information-extraction

CybercentreCanada/assemblyline

AssemblyLine 4: File triage and malware analysis

Python45233Updated 2 hours ago

assemblylineautomation-frameworkcertcyber-securitycybersecuritydocument-analysisfile-analysisframeworkincident-responseinfosecmalwaremalware-analysismalware-analyzermalware-detectionmalware-researchpython3security-automationsecurity-automation-frameworksecurity-tools

jpWang/LiLT

Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)

Python36241Updated 1 week ago

document-aidocument-analysisdocument-understandinginformation-extractionmultilingual-modelsmultimodal-pre-trained-modelnlp

pandora-analysis/pandora

Pandora is an analysis framework to discover if a file is suspicious and conveniently show the results

Python27642Updated 16 hours ago

document-analysisdocument-analyzinginfosecmalware-detection

lazyFrogLOL/llmdocparser

A package for parsing PDFs and analyzing their content using LLMs.

Python2698Updated 2 weeks ago

chunkingdocument-analysisllmnlpocrpdf-parserpdfparserragtext-chunking

HackingLZ/IndicatorOfCanary

Canary Detection

Python19215Updated 2 days ago

canary-detectiondocument-analysishoneytokensopsecpythonredteamsecurity-research

masyagin1998/robin

RObust document image BINarization

Python18440Updated 5 months ago

computer-visiondeep-learningdocument-analysisdocument-binarizationkerasneural-networksocropencvpythonu-net

FreeOCR-AI/yolo-doclaynet

YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis

Python15220Updated 3 days ago

doclaynetdocument-analysislayout-analysisultralyticsyoloyolov8

AdemBoukhris457/Doctra

📄🔍 Parse, extract, and analyze documents with ease 📄🔍

Jupyter Notebook14721Updated 1 day ago

aidocument-analysisdocumentparsingextract-datageminiimage-restorationocropenaipdf-parserpdf2markdownpythonvlm

anisha2102/docvqa

Document Visual Question Answering

Python13025Updated 3 days ago

computer-visiondeep-learningdocument-analysisvisual-question-answering

mirabdullahyaser/Retrieval-Augmented-Generation-Engine-with-LangChain-and-Streamlit

Powerful web application that combines Streamlit, LangChain, and Pinecone to simplify document analysis. Powered by OpenAI's GPT-3, RAG enables dynamic, interactive document conversations, making it ideal for efficient document retrieval and summarization.

Python13066Updated 2 weeks ago

artificial-intelligencechat-applicationdocument-analysisgenerative-aigpt-3langchainlarge-language-modelsnatural-language-processingopenai-chatgptquestion-answeringretrieval-augmented-generationstreamlit

chriswolfvision/local_adaptive_binarization

Local adaptive image binarization

C++12625Updated 1 year ago

computer-visiondocument-analysisdocument-binarization

yogthos/Matryoshka

MCP server for token-efficient large document analysis via the use of REPL state

TypeScript11112Updated 11 hours ago

ai-assistantdocument-analysisllmllm-toolsmcpmcp-servermodel-context-protocol

aws-samples/amazon-textract-transformer-pipeline

Post-process Amazon Textract results with Hugging Face transformer models for document understanding

Python10223Updated 4 months ago

amazon-textractdocument-analysishuggingface-transformersocr

BjornMelin/docmind-ai-llm

DocMind AI is a powerful, open-source Streamlit application leveraging LlamaIndex, LangGraph, and local Large Language Models (LLMs) via Ollama, LMStudio, llama.cpp, or vLLM for advanced document analysis. Analyze, summarize, and extract insights from a wide array of file formats, securely and privately, all offline.

Python10014Updated 23 hours ago

ai-agentsdocument-analysishybrid-searchlangchainlanggraph-supervisor-pyllama-cppllamacpplmstudiolocal-llmmultimodal-embeddingsollamaprivate-ai-agentspythonqdrantsentence-transformersstreamlittorchtransformersvllm

monniert/docExtractor

(ICFHR 2020 oral) Code for "docExtractor: An off-the-shelf historical document element extraction" paper

Python8811Updated 1 year ago

document-analysishistorical-datapytorchsegmentation

Xyntopia/pydoxtools

Effortlessly extract information from unstructured data with this library, utilizing advanced AI techniques. Compose AI in customizable pipelines and diverse sources for your projects.

Python8714Updated 2 months ago

chatgptdocument-analysisdocument-extractionextractioninformation-retrievalllmnlppdfpython

Cross2pro/DeepSeek-OCR-Dashboard

An out-of-the-box local Web UI for DeepSeek-OCR. Built with FastAPI + Vue.js, it supports PDF/Image uploads, progress tracking, and result visualization with bounding boxes. Easily experience the power of a top-tier OCR model.

Python870Updated 15 hours ago

computer-visiondeepseekdeepseek-ocrdocument-analysisimage-to-textlarge-language-modellatex-ocrllmmath-ocrmultimodalocrocr-webserviceoptical-character-recognitionpdf-ocrresearch-tooltext-recognition

Page 1 of 12