Migrating from old!SneerClub to New

@blakestacey · 1 year ago

Migrating from old!SneerClub to New

@self · 1 year ago

Now that the software is running with (at least for me) a low level of jank

this is actually a big relief. I’m still monitoring in the background to see if anything’s silently broken but other than lemmy really wanting access to a mail server, everything seems good on this end too

Unfortunately, since I’m a physicist by trade, anything I do with webstuff tends to involve starting from scratch with compass, straightedge and wget. There’s got to be a better method of archiving.

that’s not a bad way to do it. one thing that’d be cool is if we could archive sneers in a form that could be cited, which seemed like a pretty common ask back on reddit. some options for that are:

if we want a prebuilt automated system, this thing from the internet archive seems promising, but is fairly vague on how it actually works and usage is by invitation only (though I’d argue we’ve got a valid use case that the archive might be interested in)
we write a script (or modify an existing one) to scrape reddit and output comments to something like a set of JSON files. then a static site generator could reconstruct a sneerclub archive from that JSON into a rendered site, which could be hosted someplace free and permanent like github
same as above, but have the bot output to rationalwiki. not sure if this would flood the wiki or particularly match up with its formatting style though

for accessibility, I imagine having a sneerclub archive here could be a good thing too. that might be fairly easy to do; we’d need to set up a dedicated community and account for it so main sneerclub doesn’t get flooded, but then we’d just have an existing crosspost bot run and grab everything in sneerclub and post it here

this may become much harder once reddit’s API closes, which gives me some anxiety. it might make sense to speedrun an archival script before that happens

@blakestacey · 1 year ago

RationalWiki runs on MediaWiki, which is kind of awful for discussion threads.

I will try to have more thoughts about this later (and do a bit more research into pre-existing scraping tools and such).

@self · 1 year ago

https://github.com/toonvandeputte/reddit_archive this might be adaptable into something that archives a subreddit instead of a single user’s posts. there may be a slight complication if we’re dealing with more than 1000 posts though. but since we’ve got an archive already that has everything up to december, we really only need this year’s posts

David Gerard · 1 year ago

gawd, definitely not suitable to dump on RW