Skip to content

Pinned Loading

  1. ArchiveBot ArchiveBot Public

    ArchiveBot, an IRC bot for archiving websites

    Python 400 77

  2. wpull wpull Public

    Wget-compatible web downloader and crawler.

    HTML 593 84

  3. seesaw-kit seesaw-kit Public

    Making a reusable toolkit for writing seesaw scripts

    Python 72 34

  4. grab-site grab-site Public

    The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns

    Python 1.5k 150

  5. warrior-dockerfile warrior-dockerfile Public

    A Dockerfile for the ArchiveTeam Warrior

    Shell 406 64

  6. warrior4-vm warrior4-vm Public

    Warrior virtual machine appliance (version 4)

    Rust 49 1

Repositories

Showing 10 of 753 repositories
  • oshietegoo-items Public

    Managing items for oshietegoo-grab.

    ArchiveTeam/oshietegoo-items’s past year of commit activity
    0 0 0 0 Updated Sep 16, 2025
  • oshietegoo-grab Public

    Archiving 教えて!goo.

    ArchiveTeam/oshietegoo-grab’s past year of commit activity
    Lua 0 Unlicense 2 0 0 Updated Sep 16, 2025
  • urls-grab Public

    Archiving URLs (outlinks) from a variety of sources.

    ArchiveTeam/urls-grab’s past year of commit activity
    Lua 22 Unlicense 8 2 0 Updated Sep 15, 2025
  • typepad-grab Public

    Archiving Typepad.

    ArchiveTeam/typepad-grab’s past year of commit activity
    Lua 0 Unlicense 0 0 0 Updated Sep 12, 2025
  • ludios_wpull Public

    wpull fork with fixes and faster parsing using html5-parser; used by grab-site; should go away when wpull is similarly improved

    ArchiveTeam/ludios_wpull’s past year of commit activity
    HTML 30 GPL-3.0 8 11 0 Updated Sep 6, 2025
  • terroroftinytown Public

    URLTeam's second generation of URL shortener archiving tools

    ArchiveTeam/terroroftinytown’s past year of commit activity
    Python 80 MIT 15 16 2 Updated Sep 5, 2025
  • typepad-items Public

    Managing items for typepad-grab.

    ArchiveTeam/typepad-items’s past year of commit activity
    0 0 0 0 Updated Sep 3, 2025
  • megawarc Public Forked from alard/megawarc

    Nondestructive warc-in-tar to warc conversion

    ArchiveTeam/megawarc’s past year of commit activity
    Python 8 7 5 1 Updated Aug 31, 2025
  • grab-base-df Public

    Base Dockerfile for warrior project grab scripts

    ArchiveTeam/grab-base-df’s past year of commit activity
    Dockerfile 5 MIT 5 2 1 Updated Aug 31, 2025
  • telegram-grab Public

    Archiving public telegram messages.

    ArchiveTeam/telegram-grab’s past year of commit activity
    Lua 14 Unlicense 9 5 1 Updated Aug 29, 2025