Skip to content
@internetarchive

Internet Archive

The Internet Archive is "the library of the Internet", and a big supporter of Free Software.

Pinned Loading

  1. openlibrary openlibrary Public

    One webpage for every book ever published!

    Python 5.7k 1.6k

  2. bookreader bookreader Public

    The Internet Archive BookReader

    JavaScript 1.1k 438

  3. heritrix3 heritrix3 Public

    Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

    Java 3k 762

  4. cicd cicd Public

    build & test using github registry; deploy to nomad clusters

    19 2

Repositories

Showing 10 of 261 repositories
  • doppelganger Public

    URL-agnostic WARC dedupe server

    internetarchive/doppelganger’s past year of commit activity
    Go 10 AGPL-3.0 0 3 0 Updated Jun 18, 2025
  • Zeno Public

    State-of-the-art web crawler 🔱

    internetarchive/Zeno’s past year of commit activity
    Go 179 AGPL-3.0 34 24 (3 issues need help) 7 Updated Jun 17, 2025
  • openlibrary Public

    One webpage for every book ever published!

    internetarchive/openlibrary’s past year of commit activity
    Python 5,700 AGPL-3.0 1,567 775 (19 issues need help) 111 Updated Jun 18, 2025
  • bookreader Public

    The Internet Archive BookReader

    internetarchive/bookreader’s past year of commit activity
    JavaScript 1,054 AGPL-3.0 438 129 (3 issues need help) 99 Updated Jun 18, 2025
  • rclone Public Forked from rclone/rclone

    [vault fork] of "rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Yandex Files

    internetarchive/rclone’s past year of commit activity
    Go 3 MIT 4,745 0 0 Updated Jun 17, 2025
  • iaux-reviews Public

    Web component for displaying and editing Internet Archive reviews

    internetarchive/iaux-reviews’s past year of commit activity
    TypeScript 1 AGPL-3.0 0 1 6 Updated Jun 17, 2025
  • internetarchive/openlibrary-api’s past year of commit activity
    HTML 7 2 1 0 Updated Jun 17, 2025
  • snakebite-py3 Public

    Pure python HDFS client: python3.x version

    internetarchive/snakebite-py3’s past year of commit activity
    Python 23 Apache-2.0 24 4 6 Updated Jun 17, 2025
  • internetarchive/internetarchivebot’s past year of commit activity
    PHP 140 AGPL-3.0 34 0 2 Updated Jun 16, 2025
  • internetarchive/iaux-collection-browser’s past year of commit activity
    TypeScript 7 AGPL-3.0 1 2 15 Updated Jun 15, 2025