Chub Archive Takeout Part 1

The Chub Archive is operated by Cyberes - chub-archive@evulid.cc - @cyberes:evulid.cc - chub-archive.evulid.cc

SNAPSHOT DATE: November 18, 2023
SNAPSHOT SIZE: 321G
ARCHIVE SIZE: 132G

This is the first part of the chub.ai archive from chub-archive.evulid.cc. Subsequent parts will only include the modified files.

Magnet

magnet:?xt=urn:btih:4BV7G6AUWJTLUGSSPNZH5TAKZFCHFW23&dn=chub-archive_part-1&tr=udp%3A%2F%2Ftracker.opentrackr.org%3A1337%2Fannounce

Torrent File Download

chub-archive_part-1.torrent

Contents

Chub

A near-complete archive of chub.ai. Cards and lorebooks are versioned using git.

Characters:

.card.json: The interpreted Tavern JSON card. This is formed from the .card_raw.json file to match what the site downloads for the JSON card.
.card.png: Tavern PNG card from the frontend.
.card.webp: SillyTavern WEBP card from the frontend.
.chub.json: Chub card metadata.
.node.json: Card metadata returned by the Chub API when running the search query.

Files in api/ are pulled from the Chub backend API (rather than the frontend) and saved for archival purposes. Sometimes these files are different than those from the frontend (probably older, cached versions).

.card_api.png: Tavern card returned by the API.
.card_cai_api.json: CAI card returned by the API.
.card_spec_v2_api.png: V2 spec card returned by the API.
.card_raw.json: What the API returns when asked for the Tavern JSON card. Includes extra data that isn't needed for the Tavern JSON card.

Lorebooks:
.lorebook_sillytavern.json: The original SillyTavern lorebook.
.lorebook_sillytavern_inferred.json: The SillyTavern lorebook formed from other sources.
.lorebook_agnai.json: The original AgnAI lorebook.
.lorebook_agnai_inferred.json: The AgnAI lorebook formed from other sources.
.chub.json: Same as above.
.node.json: Same as above.

Files in api/ were pulled from the backend API.

Users:
.chub.json: Chub profile data scraped from the frontend.

Catbox

Cards hosted on catbox.moe, scraped from 4chan.org/g. If a card contained data that identified its author (presumably due to a re-upload from chub.ai) it is organized into that author's folder.

Historical

Miscellaneous original compressed archives.

JanitorAI

Cards scraped from JanitorAI before they made card definitions private.

Other

Miscellaneous data related to character cards and /aicg/.

proxy_stats.db

The SQLite3 database from the Chub Archive oai-reverse-proxy tracker.

Third-Party

Archives that were donated to and not created by the Chub Archive.

Webring

Authors'/botmakers' cards from the chatbot webring. Some botmakers opted out via the meta tag <meta name="chub-archive-exclude" content="true">.

Archived Tree

.
├── [1.5G]  chub-archive_part-1_catbox.7z
├── [ 25G]  chub-archive_part-1_chub-ai.7z.001
├── [ 25G]  chub-archive_part-1_chub-ai.7z.002
├── [ 25G]  chub-archive_part-1_chub-ai.7z.003
├── [341M]  chub-archive_part-1_chub-ai.7z.004
├── [ 25G]  chub-archive_part-1_janitorai.7z.001
├── [ 19G]  chub-archive_part-1_janitorai.7z.002
├── [ 88M]  chub-archive_part-1_other.7z
├── [3.8G]  chub-archive_part-1_third-party.7z
├── [155M]  chub-archive_part-1_webring.7z
├── [4.0K]  historical
│   ├── [2.9G]  chub_ai-mega-nz_scrape.7z
│   ├── [511M]  Pygmalion_Discord_Server_04-18-2023.7z
│   ├── [ 78M]  Rokos_Basilisk_Archive.7z
│   ├── [1.6G]  venusai-chat_05-27-2023.7z
│   └── [1.6G]  Venusai-Official-Discord-Server.7z
└── [304M]  proxy_stats.db.7z

Extracted Tree

.
└── chub-archive_part-1
    ├── catbox
    │   ├── cards
    │   └── catbox-latest.txt
    ├── chub.ai
    │   ├── characters
    │   ├── July 2023 Leak
    │   ├── latest.txt
    │   ├── lorebooks
    │   ├── mega.nz scrape
    │   └── users
    ├── historical
    │   ├── chub_ai-mega-nz_scrape.7z
    │   ├── Pygmalion_Discord_Server_04-18-2023.7z
    │   ├── Rokos_Basilisk_Archive.7z
    │   ├── venusai-chat_05-27-2023.7z
    │   └── Venusai-Official-Discord-Server.7z
    ├── janitorai
    ├── other
    │   └── cute logs - Oct 14 2023.xlsx
    ├── proxy_stats.db.7z
    ├── third-party
    │   ├── Pygmalion Discord Server 04-18-2023
    │   ├── Roko's Basilisk Archive
    │   ├── Venusai 05-27-2023
    │   └── Venusai Official Discord Server
    └── webring
        ├── cards
        └── webring-latest.txt

sourced from https://chub-archive.evulid.cc/api/file/download?path=/other/takeout/chub-archive_part-1.md

Edit Report
Pub: 27 Nov 2023 23:58 UTC
Edit: 29 Nov 2023 02:19 UTC
Views: 343