43 results for “topic:ai-scraping”
🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data
🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!
Python scraper based on AI
A powerful web scraper powered by LLMs | OpenAI, Gemini & Ollama
AnyCrawl 🚀: A Node.js/TypeScript crawler that turns websites into LLM-ready data and extracts structured SERP results from Google/Bing/Baidu/etc. Native multi-threading for bulk processing.
Crawl a website starting from a URL, find relevant pages, and extract data – all guided by your natural language prompt.
Structured data gathering from any website using an AI-powered scraper, crawler, and browser automation. Scraping and crawling with natural language prompts. Equip your LLM agents with fresh data. AI Studio Python SDK for intelligent web data gathering.
Lightweight library for scraping websites with LLMs
🔥 This repository contains complete application examples, including websites and other projects, developed using Firecrawl.
AI Scraper is a powerful scraping tool and scrape agent built to automate data extraction with unmatched precision. Ideal for scalable AI scraping tasks across diverse web sources, this tool simplifies complex scraping operations into efficient, intelligent workflows.
High-performance web crawler API optimized for LLMs. Turn any search or website into clean Markdown using remote browsers. A Firecrawl alternative.
[Mirror] Self-hosted abuse detection and rule enforcement against low-effort mass AI scraping and bots.
⬇️ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). 🎭 Uses headless Chrome to get HTML, JS, CSS, images/video/audio/subtitles, PDFs, screenshots, article text, git repos, and more...
AI web scraper built with Crawl4AI for extracting structured lead data from websites.
Structured data gathering from any website using an AI-powered scraper, crawler, and browser automation. Scraping and crawling with natural language prompts. Equip your LLM agents with fresh data. AI Studio JS SDK for intelligent web data gathering.
How-to guides on web crawling or scraping
Python, JavaScript, and Rust libraries for the Spider Cloud API.
Extract Google Maps business leads and enrich contact details using AI & web scraping
Fastest and cheapest distributed residential proxy network.
AI Scraper: scrape and extract data from websites in any format (CSV, JSON, HTML...) using Selenium or Crawl4AI with Ollama or the SambaNova API, and Streamlit for a chatbot UI
A CLI tool and REST API that converts web content to clean Markdown, bypassing anti-scraping measures using headless browsers. Perfect for AI/LLM applications
Integrating OpenAI Agents SDK with Bright Data Web Unlocker, enabling AI agents to access, extract, and process structured data from protected web pages
Use LLaMA 3 and Python to extract structured data from websites like Amazon, leveraging LLM-powered parsing for resilient, AI-driven web scraping.
AI-powered web scraper using JavaScript/TypeScript.
The definitive list of the latest libraries, tools, APIs and providers for web scraping. The only daily-updated collection of web scraping resources.
AI tools to enhance productivity and automate web-scraping
🎧 Download audio from YouTube and more with ease using this simple command-line tool that simplifies common audio extraction tasks.
Python web scraper powered by Gemini AI.
🚀 Analyze your website's AI readiness and optimize for performance with real-time scoring, recommendations, and detailed metrics.
Disneyland Resorts Hotels Investigation Study