Skip to content
@ArchiveBox

ArchiveBox

The self-hosted internet archiving solution maintained by @pirate. #webarchiving #internetarchiving #digipres

Pinned Loading

  1. ArchiveBox ArchiveBox Public

    🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

    Python 22.4k 1.2k

  2. abx-spec-behaviors abx-spec-behaviors Public

    🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser environments, puppeteer, playwright, extensions, AI tools, and …

    JavaScript 9

  3. abx-dl abx-dl Public

    ⬇️ A CLI tool to download all discovered content from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl). 🎭 Uses headless Chrome to get HTML, JS, CSS, images/video/audio/subtitles, PDFs, screensh…

    Python 23 2

  4. archivebox-browser-extension archivebox-browser-extension Public

    Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.

    TypeScript 249 23

  5. abx-pkg abx-pkg Public

    📦 Modern strongly typed Python library for managing system dependencies with package managers like apt, brew, pip, npm, etc.

    Python 14

  6. good-karma-kit good-karma-kit Public

    😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, BOINC, and more...

    320 9

Repositories

Showing 10 of 18 repositories
  • ArchiveBox Public

    🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

    ArchiveBox/ArchiveBox’s past year of commit activity
  • abx-pkg Public

    📦 Modern strongly typed Python library for managing system dependencies with package managers like apt, brew, pip, npm, etc.

    ArchiveBox/abx-pkg’s past year of commit activity
    Python 14 MIT 0 0 0 Updated Nov 19, 2024
  • docs Public

    Source for the Github Wiki / ReadTheDocs documentation for AchiveBox, the self-hosted internet archiving solution.

    ArchiveBox/docs’s past year of commit activity
    CSS 14 4 0 0 Updated Nov 17, 2024
  • abx-spec-behaviors Public

    🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser environments, puppeteer, playwright, extensions, AI tools, and many other contexts with minimal adjustment.

    ArchiveBox/abx-spec-behaviors’s past year of commit activity
    JavaScript 9 MIT 0 2 0 Updated Nov 13, 2024
  • abx-dl Public

    ⬇️ A CLI tool to download all discovered content from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl). 🎭 Uses headless Chrome to get HTML, JS, CSS, images/video/audio/subtitles, PDFs, screenshots, article text, git srcs, and more...

    ArchiveBox/abx-dl’s past year of commit activity
    Python 23 MIT 2 0 0 Updated Oct 21, 2024
  • docker-archivebox Public

    Home of the official docker image for ArchiveBox

    ArchiveBox/docker-archivebox’s past year of commit activity
    47 GPL-3.0 12 1 1 Updated Oct 15, 2024
  • pip-archivebox Public archive

    Official Python package for ArchiveBox, the self-hosted internet archiving solution.

    ArchiveBox/pip-archivebox’s past year of commit activity
    13 GPL-3.0 2 0 7 Updated Oct 5, 2024
  • homebrew-archivebox Public archive

    Homebrew formula for the ArchiveBox self-hosted internet archiving solution.

    ArchiveBox/homebrew-archivebox’s past year of commit activity
    Ruby 26 GPL-3.0 3 0 0 Updated Oct 5, 2024
  • debian-archivebox Public archive

    Home of the official apt/deb package for Ubuntu/Debian-based systems.

    ArchiveBox/debian-archivebox’s past year of commit activity
    Python 17 GPL-3.0 5 0 1 Updated Oct 5, 2024
  • readability-extractor Public

    Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page's article text.

    ArchiveBox/readability-extractor’s past year of commit activity
    JavaScript 37 14 0 2 Updated Sep 16, 2024