339 results for “topic:document-analysis”
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
A system for agentic LLM-powered data processing and ETL
Read and extract text and other content from PDFs in C# (port of PDFBox)
An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
A curated list of resources for Document Understanding (DU) topic
Open-source platform for extracting structured data from documents using AI.
OpenOCR: An Open-Source Toolkit for General-OCR Research and Applications, integrates a unified training and evaluation benchmark, commercial-grade OCR and Document Parsing systems, and faithful reproductions of the core implementations from a wide range of academic papers.
AI-powered StartUp Accelerator Engine built with Next.js, LangChain, PostgreSQL + pgvector. Upload, organize, and chat with documents. Includes predictive missing-document detection, role-based workflows, and page-level insight extraction.
This repository provides train&test code, dataset, det.&rec. annotation, evaluation script, annotation tool, and ranking.
Dedoc is a library (service) for automate documents parsing and bringing to a uniform format. It automatically extracts content, logical structure, tables, and meta information from textual electronic documents. (Parse document; Document content extraction; Logical structure extraction; PDF parser; Scanned document parser; DOCX parser; HTML parser
Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICPR 2020)
AssemblyLine 4: File triage and malware analysis
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)
Pandora is an analysis framework to discover if a file is suspicious and conveniently show the results
A package for parsing PDFs and analyzing their content using LLMs.
Canary Detection
RObust document image BINarization
YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis
📄🔍 Parse, extract, and analyze documents with ease 📄🔍
Document Visual Question Answering
Powerful web application that combines Streamlit, LangChain, and Pinecone to simplify document analysis. Powered by OpenAI's GPT-3, RAG enables dynamic, interactive document conversations, making it ideal for efficient document retrieval and summarization.
Local adaptive image binarization
MCP server for token-efficient large document analysis via the use of REPL state
Post-process Amazon Textract results with Hugging Face transformer models for document understanding
DocMind AI is a powerful, open-source Streamlit application leveraging LlamaIndex, LangGraph, and local Large Language Models (LLMs) via Ollama, LMStudio, llama.cpp, or vLLM for advanced document analysis. Analyze, summarize, and extract insights from a wide array of file formats, securely and privately, all offline.
(ICFHR 2020 oral) Code for "docExtractor: An off-the-shelf historical document element extraction" paper
Effortlessly extract information from unstructured data with this library, utilizing advanced AI techniques. Compose AI in customizable pipelines and diverse sources for your projects.
An out-of-the-box local Web UI for DeepSeek-OCR. Built with FastAPI + Vue.js, it supports PDF/Image uploads, progress tracking, and result visualization with bounding boxes. Easily experience the power of a top-tier OCR model.