Downloads posts onto your computer from cohost.org, which is shutting down.
- Post pages are downloaded exactly as they appear on Cohost, including shared posts and comments, and with your display settings applied (silenced tags, etc.)
- Downloads all of your own posts and all of your liked posts
- If you have a data portability archive: also downloads all posts you’ve commented on
- Legal: using this software does not somehow grant you a license to re-publish posts and comments from other people
See also: cohost-dl 2, which is easier to use and handles large amounts of data much better
Downloaded data will be placed in an `out` directory.
Detailed breakdown
- HTML files openable in a web browser
    - `out/index.html`: a simple overview page
    - `out/~all/index.html`: The Cohost Archive Global Feed
    - `out/{handle}/index.html`: page that shows all posts from {handle}
    - `out/{handle}/post/12345-example.html`: page that shows just that post, as it appeared on cohost.org
- Page resources
    - `out/static/`: files from cohost.org/static, such as CSS files
    - `out/rc/attachment/`: post images and audio files
    - `out/rc/attachment-redirect/`: honestly, no idea. ostensibly also post attachments
    - `out/rc/avatar/`, `out/rc/default-avatar/`: user avatars
    - `out/rc/header/`: user header images
    - `out/rc/external/`: external images not hosted on cohost.org but included in posts
    - `out/{handle}/cdl-index.js`: full-text search index
    - `out/{handle}/cdl-chunk~{handle}~{n}.js`: post data used in the list of all posts
    - `out/~cohost-dl/`: Javascript for all generated pages
- Data files
    - `out/{your-handle}/liked.json`: data for all posts you liked
    - `out/{your-handle}/posts.json`: data for all posts you made
    - `out/{handle}/post/12345-example` (without `.html`): original data for that post from cohost.org
    - `out/~src/{site-version}/`: unpacked source code for the Cohost frontend (used to create cohost-dl Javascript)
    - `out/~headers.json`: stores content type headers for some URLs that don’t have a good file extension
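If you want to script against the archive, the layout above can be turned into a small path helper. This is only a sketch: `outputPaths` and its field names are made up for illustration and are not part of cohost-dl, and it assumes the default `out` directory.

```typescript
// Hypothetical helper (not part of cohost-dl): builds the per-post paths
// described in the breakdown above.
interface PostPaths {
  profilePage: string; // page that shows all posts from the handle
  postPage: string;    // HTML page for a single post
  postData: string;    // original post data (same name, without .html)
  searchIndex: string; // full-text search index for the handle
}

function outputPaths(handle: string, postFile: string, outDir = "out"): PostPaths {
  const base = `${outDir}/${handle}`;
  return {
    profilePage: `${base}/index.html`,
    postPage: `${base}/post/${postFile}.html`,
    postData: `${base}/post/${postFile}`,
    searchIndex: `${base}/cdl-index.js`,
  };
}

const paths = outputPaths("example-handle", "12345-example");
console.log(paths.postPage); // out/example-handle/post/12345-example.html
```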
For file size, expect something around 1 GB for 1000 posts.
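As a back-of-the-envelope check before starting a download, that rule of thumb can be written as a one-line estimate. The function name is hypothetical, and actual sizes will vary a lot depending on how many images and audio files your posts include.

```typescript
// Rough estimate only: assumes about 1 GB per 1000 posts, as stated above.
const POSTS_PER_GB = 1000;

function estimateArchiveSizeGB(postCount: number): number {
  if (!Number.isFinite(postCount) || postCount < 0) {
    throw new RangeError("postCount must be a non-negative number");
  }
  return postCount / POSTS_PER_GB;
}

console.log(estimateArchiveSizeGB(2500)); // 2.5
```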
Files you can probably safely rehost online:
- `out/{your-handle}/index.html`
- `out/{your-handle}/cdl-index.js`
- `out/{your-handle}/cdl-chunk~{...}.js`
- `out/~cohost-dl/`
- files in `out/rc/` required for the above page(s) to work
Why other files may not be safe to rehost online:
- `out/{your-handle}/post/12345-example.html`: is a very faithful Cohost page and hence contains all of your settings (sideblogs, muted tags, etc.)
    - The `GENERIC_OBSERVER` setting attempts to mitigate this, but it breaks a bunch of other things
- `out/{not-your-handle}/`: not yours
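If you plan to copy files to a web host with a script, the two lists above can be encoded as a conservative filter. This is a sketch, not something cohost-dl provides: `rehostAdvice` and its return values are hypothetical, and anything the lists do not explicitly call safe is flagged for manual review as an assumption on my part.

```typescript
// Hypothetical classifier based on the lists above (not part of cohost-dl).
type RehostAdvice = "probably-safe" | "only-if-required" | "review-first";

function rehostAdvice(path: string, yourHandle: string): RehostAdvice {
  const own = `out/${yourHandle}/`;
  if (path.startsWith("out/~cohost-dl/")) return "probably-safe";
  // Resources are fine only when the rehosted pages actually need them.
  if (path.startsWith("out/rc/")) return "only-if-required";
  if (path.startsWith(own)) {
    const rest = path.slice(own.length);
    if (rest === "index.html" || rest === "cdl-index.js") return "probably-safe";
    if (rest.startsWith("cdl-chunk~") && rest.endsWith(".js")) return "probably-safe";
  }
  // Individual post pages, other people's directories, and anything unlisted.
  return "review-first";
}

console.log(rehostAdvice("out/me/index.html", "me"));              // probably-safe
console.log(rehostAdvice("out/me/post/12345-example.html", "me")); // review-first
```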
- Copy `CONFIG.example.ts` to `CONFIG.ts`
- Edit `CONFIG.ts` appropriately
- Install Deno
- `./run.sh`
    - if you’re using a system that doesn’t support Bash, such as Windows, you can just copy the `deno run ...` command from this file and run it directly.
It's safe to interrupt and re-start the script at any time. Things that have already been downloaded will not be downloaded again, and any changes in configuration will be taken into account upon restart.
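The general idea behind this kind of resumable download is to check whether something is already stored before fetching it. The sketch below illustrates that pattern only; it is not cohost-dl's actual implementation, and `Store`, `MemoryStore`, `Fetcher`, and `downloadAll` are hypothetical names. An in-memory store and a fake fetcher stand in for the file system and network so the example runs anywhere.

```typescript
// Hypothetical storage abstraction; cohost-dl's real code may differ entirely.
interface Store {
  has(key: string): Promise<boolean>;
  put(key: string, data: string): Promise<void>;
}

class MemoryStore implements Store {
  private items = new Map<string, string>();
  async has(key: string): Promise<boolean> { return this.items.has(key); }
  async put(key: string, data: string): Promise<void> { this.items.set(key, data); }
}

type Fetcher = (url: string) => Promise<string>;

// Fetches only URLs the store does not have yet; returns how many were fetched.
async function downloadAll(urls: string[], store: Store, fetcher: Fetcher): Promise<number> {
  let fetched = 0;
  for (const url of urls) {
    if (await store.has(url)) continue; // already downloaded on an earlier run
    await store.put(url, await fetcher(url));
    fetched++;
  }
  return fetched;
}

async function demo(): Promise<void> {
  const store = new MemoryStore();
  const fakeFetch: Fetcher = async (url) => `contents of ${url}`;
  const urls = ["https://cohost.org/example/post/12345-example"];
  console.log(await downloadAll(urls, store, fakeFetch)); // 1 (first run fetches)
  console.log(await downloadAll(urls, store, fakeFetch)); // 0 (second run skips)
}
demo();
```

Because each item is stored as soon as it is fetched, interrupting the loop loses at most the item in progress.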