Automation & Bots

Workflow automation, bots, and scripting tools

automation bot scraping web-scraping chatbot workflow

845 repositories found

Collection of scripts corresponding to LucidProgramming YouTube tutorials

ctci-solutionslucidprogrammingpythonpython-tutorialpython3python3-tutorialtechnical-interviewweb-scrapingyoutube-tutorial

scrapfly/scrapfly-scrapers

Scalable Python web scraping scripts for +40 popular domains

Python899187Updated 10 hours ago

antibotautomationcaptcha-bypasscrawlercrawlingcrawling-pythondatascrapingproxiespythonpython-scraperscraperscrapingscraping-pythonspidertwitter-scraperweb-crawlerweb-scrapingweb-scraping-pythonwebscraperwebscraping

je-suis-tm/web-scraping

Detailed web scraping tutorials for dummies with financial data crawlers on Reddit WallStreetBets, CME (both options and futures), US Treasury, CFTC, LME, MacroTrends, SHFE and alternative data crawlers on Tomtom, BBC, Wall Street Journal, Al Jazeera, Reuters, Financial Times, Bloomberg, CNN, Fortune, The Economist

Python867189Updated 3 days ago

bloombergdata-scraperdata-scrapingfinancial-datafinancial-timesfuturesfutures-historical-datanews-scrapernews-websitesnewsletteroptions-datapython-web-scraperreutersscrappersrapingwall-street-journalwallstreetbetsweb-scraperweb-scrapersweb-scraping

ttlns/Selenium-Driverless

a stealthy browser automation framework

Python84984Updated 18 hours ago

automationdetection-evasiondriverless-chromepythonpython3reverse-engineeringscraping-pythontestingvulnerability-researchweb-scrapingwebdriver

oxylabs/how-to-scrape-google-finance

Use Web Scraper API to extract data from Google Finance, including stock titles, pricing, and price changes in percentages.

Python8451Updated 3 days ago

finance-apifinancial-data-extractiongoogle-financegoogle-finance-apigoogle-finance-scrapergoogle-scraperscrape-googlestock-scraperstocks-apistocks-datastocks-pricesweb-scraping

postmodern/spidr

A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.

Ruby834108Updated 2 weeks ago

crawlerrubyscraperspiderspider-linkswebweb-crawlerweb-scraperweb-scrapingweb-spider

sardanioss/httpcloak

Go HTTP client with browser-identical TLS/HTTP2 fingerprinting. Bypass bot detection by perfectly mimicking Chrome, Firefox, and Safari at the cryptographic level (JA3/JA4, Akamai fingerprint, header order). Supports HTTP/1.1, HTTP/2, HTTP/3, sessions, cookies, and proxies.

Go82362Updated 13 hours ago

anti-botbot-detectionbrowser-fingerprintbrowser-fingerprintingcloudflaregogolanghttp-clienthttp2http3ja3-fingerprintja4-fingerprintjsnodejspythonpython3quictls-fingerprinttls-fingerprintingweb-scraping

DataHenHQ/till

DataHen Till is a companion tool to your existing web scraper that instantly makes it scalable, maintainable, and more unblockable, with minimal code changes on your scraper. Integrates with any scraper in 5 minutes.

Go81523Updated 1 month ago

crawlerman-in-the-middlemitmproxy-serverscraperscrapingweb-scraping

z0m31en7/Uscrapper

Uscrapper Vanta: Dive deeper into the web with this powerful open-source tool. Extract valuable insights with ease and efficiency, from both surface and deep web sources. Empower your data mining and analysis with Vanta's advanced capabilities. Fast, reliable, and user-friendly, Uscrapper Vanta is the ultimate choice for researchers and analysts.

Python77284Updated 1 day ago

darkwebdarkweb-crawlerinformation-extractioninformation-gatheringosintosint-pythonosint-toolpythonreconnaissanceseleniumselenium-webscrapertorweb-scrapingwebcrawebcrawlerwebscrapingwebsite-scraperwebsites

serpapi/google-search-results-python

Google Search Results via SERP API pip Python Package

Python732117Updated 1 week ago

bing-imagegoogle-crawlergoogle-imagespythonscrapingserp-apiserpapiweb-scraping

alecxe/scrapy-fake-useragent

Random User-Agent middleware based on fake-useragent

Python68994Updated 1 week ago

pythonscrapyweb-scraping

achuthasubhash/Complete-Life-Cycle-of-a-Data-Science-Project

Complete-Life-Cycle-of-a-Data-Science-Project

638253Updated 3 days ago

analysisdata-analysisdata-sciencedatasetdeep-learningedaexploratory-data-analysisfeature-engineeringfederated-learningmachine-learningnlp-modelspythonpython-librarypytorchreinforcement-learningscrapersupervised-learningtransfer-learningunsupervised-learningweb-scraping

CloakHQ/CloakBrowser

Stealth Chromium that passes every bot detection test. Drop-in Playwright replacement with source-level fingerprint patches. 30/30 tests passed.

Python63946Updated just now

ai-agentsanti-detectantidetect-browserbot-detectionbrowser-automationcaptcha-bypasschromiumcloudflarecloudflare-bypassfingerprintheadless-browserplaywrightpuppeteerpythonrecaptchaseleniumstealth-browserundetectedweb-scrapingwebscraping

dinubs/coolqlcool

Nextjs server to query websites with GraphQL

JavaScript63046Updated 3 weeks ago

graphqljavascriptnextjsschemaweb-scraping

Kaliiiiiiiiii-Vinyzu/patchright-nodejs

Undetected NodeJS version of the Playwright testing and automation library.

JavaScript61437Updated 19 hours ago

automationbotbotsbottingbrowserchromechromedriverchromiumcloudflarecloudflare-bypassplaywrightstealthundetectableundetectedweb-autoweb-scrapingwebautomationwebdriverwebscraping

ScrapeGraphAI/scrapecraft

🤖 AI-powered web scraping editor with visual workflow builder. Build, test & deploy web scrapers using natural language. Powered by ScrapeGraphAI & LangGraph.

Python61097Updated 1 week ago

aiautomationdata-extractiondockerfastapihacktoberfestlanggraphpythonreactscrapegraphaitypescriptweb-scrapingwebscraping

4ier/neo

Turn any web app into an API. Chrome extension captures browser traffic, auto-generates schemas, lets AI replay APIs directly. No official API needed.

JavaScript58945Updated just now

ai-agentapi-discoverybrowser-automationchrome-extensionweb-scraping

spekulatius/PHPScraper

A universal web-util for PHP.

PHP58075Updated 1 week ago

beautifulsoupchromiumheadless-chromephpphp-crawlerphp-scraperphp-spiderphp-spiderspuppeteerpyppeteerscraperscrapingscraping-websitesscrapyweb-scraperweb-scraping

web-agent-master/google-search

A Playwright-based Node.js tool that bypasses search engine anti-scraping mechanisms to execute Google searches. Local alternative to SERP APIs with MCP server integration.

TypeScript56293Updated 1 hour ago

aigoogle-searchllmmcp-serverweb-scraping

lumpinif/deepcrawl

100% free and full open-source edge Firecrawl alternative with better links extraction for agents - that you can deploy to cloudflare or vercel by yourself.

TypeScript55863Updated 2 days ago

ai-agent-toolsai-sdkbetter-authcloudflare-workerscrawlingdeepcrawlhonohtml-cleanerhtml-to-markdownlinks-extractionlinks-treenextjsnextjs16orpctypescriptweb-scraperweb-scraping

orangecoding/fredy

❤️ Fredy - [F]ind [R]eal [E]state [D]amn Eas[y] - Fredy keeps searching for new apartments, houses, and flats in Germany on platforms like ImmoScout24, Immowelt, Immonet, eBay Kleinanzeigen, and WG-Gesucht and instantly delivers the results to you via Slack, Telegram, Email, Discord or ntfy, so you can focus on the more important things in life ;)

JavaScript553132Updated 12 hours ago