173 results for “topic:html-to-markdown”
🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data
🛏 An HTML to Markdown converter written in JavaScript
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
⚙️ Convert HTML to Markdown. Even works with entire websites and can be extended through rules.
AnyCrawl 🚀: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts structured SERP results from Google/Bing/Baidu/etc. Native multi-threading for bulk processing.
CommonMark/Markdown Java parser with source level AST. CommonMark 0.28, emulation of: pegdown, kramdown, markdown.pl, MultiMarkdown. With HTML to MD, MD to PDF, MD to DOCX conversion modules.
helloworld 开发者社区开源的一个轻量级,强大的 html 一键转 md 工具,支持多平台文章一键转换,并保存下载到本地。
🔥 This repository contains complete application examples, including websites and other projects, developed using Firecrawl.
HTML to Markdown converter and crawler.
100% free and full open-source edge Firecrawl alternative with better links extraction for agents - that you can deploy to cloudflare or vercel by yourself.
It's time for your markup to get down! HTML to markdown converter. Breakdance is a highly pluggable, flexible and easy to use.
Open-source, production-grade web scraping engine built for LLMs. Scrape and crawl the entire web, clean markdown, ready for your agents.
A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file for each page, designed for LLM RAG
reader is for your command line what the “readability” view is for modern browsers: A lightweight tool offering better readability of web pages (and EML files!) on the CLI. (https://codeberg.org/mrus/reader)
📋 Browser extension to copy text as Markdown (with GFM and MathML support)
Export Atlassian Confluence pages as markdown files.
Slurps webpages and saves them as clean, uncluttered Markdown. Think Pocket, but better.
Claude Chat Exporter is a JavaScript tool that allows you to export your conversations with Claude AI into a well-formatted Markdown file.
High-performance web crawler API optimized for LLMs. Turn any search or website into clean Markdown using remote browsers. Firecrawl alternative
Firefox add-on to copy selection as Markdown
Full-content web fetcher for AI agents — Chrome TLS fingerprinting, browser impersonation, and multi-strategy article extraction
A CLI tool that converts exported Medium posts (html) to Jekyll/Hugo compatible markdown with front matter.
Multimodal document parser for high quality data understanding and extraction
:smirk_cat: Dependency-free and lean DOM parser that outputs Markdown
Transform your HTML into clean, easy-to-read markdown with html2md.
The best HTML to Markdown library, A esm-native & Useful Utilities with simple, lightweight and epic quality.
HTML-to-Markdown converter that adaptively preserves HTML when needed (eg. when center-aligning, or resizing images)
Using LLMs and AI browser automation to robustly extract web data
:pencil: XK-Editor | 一个支持富文本和Markdown的编辑器
A simple Swift package that converts HTML into Markdown