Migrating from old!SneerClub to New

@blakestacey · 1 year ago

Migrating from old!SneerClub to New

@blakestacey · 1 year ago

Sounds like a good plan.

@self · 1 year ago

some work in progress on this is available here. the SneerClub directory is the output of the bulk downloader for all 1000 (deduplicated) posts it could grab from each of SneerClub’s hot, top, new, rising, and controversial tabs, and the jsonl files are just the ones you posted decompressed for convenience. so far I’m just using jq to process the data sets

SneerClub has 1940 posts with nested comments and attached media where the downloader could parse it; the archive team files have 3851 posts and 100149 comments in a (much less convenient) flattened format without media. both sets have a few posts from 2015, so I’ll need to do more looking to see how much we’ve salvaged overall