db0@lemmy.dbzer0.com to TechTakesEnglish · 2 年前The Google AI isn’t hallucinating about glue in pizza, it’s just over indexing an 11 year old Reddit post by a dude named fucksmith.message-squaremessage-square258linkfedilinkarrow-up1954arrow-down10file-text
arrow-up1954arrow-down1message-squareThe Google AI isn’t hallucinating about glue in pizza, it’s just over indexing an 11 year old Reddit post by a dude named fucksmith.db0@lemmy.dbzer0.com to TechTakesEnglish · 2 年前message-square258linkfedilinkfile-text
minus-squareJonathan Hendry@iosdev.spacelinkfedilinkarrow-up6·2 年前@harrys_balzac Posts there are expired and deleted over time, so unless someone’s made an effort to archive them, they’re gone. Of course, the AI people could hoover up new horrible posts.
minus-squarenickwitha_k (he/him)@lemmy.sdf.orglinkfedilinkEnglisharrow-up7·2 年前I would be surprised if someone hasn’t been scraping it for years.
minus-squaremoving to lemme.zip. @lemm.eeBannedlinkfedilinkEnglisharrow-up9·2 年前**Moe.archive and 4chan archive have entered the chat. **
minus-squareirelephant [he/him]🍭@lemm.eelinkfedilinkEnglisharrow-up2·9 个月前There is dozens of 4chan data archives.
…yet
@harrys_balzac
Posts there are expired and deleted over time, so unless someone’s made an effort to archive them, they’re gone.
Of course, the AI people could hoover up new horrible posts.
I would be surprised if someone hasn’t been scraping it for years.
**Moe.archive and 4chan archive have entered the chat. **
There is dozens of 4chan data archives.
Yea there are multiple 4chan archives…