GitHunt

Internet Archive

internetarchive

The Internet Archive is "the library of the Internet", and a big supporter of Free Software.

Languages

Python 38%, JavaScript 28%, TypeScript 10%, HTML 7%, Go 7%, Kotlin 3%, PHP 3%, Java 3%

Repos

271

Stars

14.6k

Forks

4.0k

Top Language

Python


Top Repositories

internetarchive/wayback-machine-android

No description provided.

Kotlin · 258 · Updated 3 months ago
internetarchive/openlibrary

One webpage for every book ever published!

Python · 6.3k · 1.8k · Updated 3 hours ago
books, hacktoberfest, internet-archive, library-catalogue, open-source
internetarchive/openlibrary-client

Python Client Library for the Archive.org OpenLibrary API

Python · 476104 · Updated 2 months ago
internetarchive/bookreader

The Internet Archive BookReader

JavaScript · 1.1k · 483 · Updated 4 days ago
bookreader, ebooks, hacktoberfest, internetarchive
internetarchive/wiki-references-extractor

Extracts references from Wikipedia articles

Python · 72 · Updated 1 week ago
internetarchive/brozzler

brozzler - distributed browser-based web crawler

Python · 792115 · Updated 5 days ago
internetarchive/tracey

Tracey Jaquith, Internet Archive 🏛️, talks and slides

HTML · 40 · Updated 1 month ago
cicd, devops, internet-archive, javascript, markdown, slides
internetarchive/wayback-diff

React components to render differences between captures at the Wayback Machine

JavaScript · 4118 · Updated 1 day ago
internetarchive/openlibrary-bots

A repository of cleanup bots implementing the openlibrary-client

Python · 7662 · Updated 6 days ago
internetarchive/Zeno

State-of-the-art web crawler 🔱

Go · 39455 · Updated 3 hours ago
archiving, web-crawler, zeno
internetarchive/iaux-monthly-giving-circle

No description provided.

TypeScript · 10 · Updated 2 days ago
internetarchive/internetarchivebot

No description provided.

PHP · 15240 · Updated 3 days ago
bot, composer, php, tor, webserver
internetarchive/archive-pdf-tools

Fast PDF generation and compression. Deals with millions of pages daily.

Python · 13717 · Updated 2 weeks ago
compression, ocr, pdf, pdf-compression, pdf-compressor, pdf-generation, pdf-generator, pdf-to-image, python
internetarchive/cdx-summary

Summarize web archive capture index (CDX) files.

Python · 8726 · Updated 1 month ago
archive, cdx, collection, nodejs, python, report, statistics, summary, warc, web-archive, webcomponents
internetarchive/heritrix3

Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

Java · 3.2k · 781 · Updated 1 week ago
heritrix, java, warc, webcrawling
internetarchive/scholar

IA Scholar

HTML · 30 · Updated 4 days ago
internetarchive/internet-archive-voice-apps

Voice Apps (Actions on Google, Alexa Skill) of Internet Archive. Just say: "Ok Google, Ask Internet Archive to Play Jazz" or "Alexa, Ask Internet Archive to play Instrumental Music"

JavaScript · 5042 · Updated 3 days ago
actions-on-google, alexa-skill, dialog-flow, internet-archive, voice-assistant
internetarchive/crawling-for-nomore404

No description provided.

Python · 3113 · Updated 4 days ago
internetarchive/wayback-radial-tree

No description provided.

JavaScript · 98 · Updated 3 days ago
internetarchive/wayback-machine-webextension

A web browser extension for Chrome, Firefox, Edge, and Safari 14.

JavaScript · 782227 · Updated 1 day ago
internetarchive/infogami (Fork)

No description provided.

Python · 4928 · Updated 2 days ago
internetarchive/tapestry-project

A Tapestry is a digital format describing an endless canvas that hosts a variety of interconnected multimedia items.

TypeScript · 37 · Updated 1 month ago
internetarchive/gifcities

gifcities.org web app

Go · 40 · Updated 1 year ago
internetarchive/dweb-mirror

Offline Internet Archive project

JavaScript · 31234 · Updated 2 years ago
internetarchive/ads-common

Common components and utilities for the Archiving & Data Services (ADS) team at the Internet Archive

TypeScript · 30 · Updated 1 week ago
internetarchive/trough

Trough: Big data, small databases.

Python · 427 · Updated 1 year ago
database, python, python3, sqlite
internetarchive/warcprox

WARC writing MITM HTTP/S proxy

Python · 44766 · Updated 1 month ago
internetarchive/emularity-config

archive.org software emulation

JavaScript · 61 · Updated 1 week ago
internetarchive/wmd (Fork)

Open Library branch of WMD.

JavaScript · 7221 · Updated 3 years ago
internetarchive/internet-archive-skills (Fork)

Claude Code skill for uploading to, downloading from, and searching the Internet Archive (archive.org)

101 · Updated 1 month ago
