[Meta] Don’t Provide Reddit With AI Training Data … by Speaking Like a Pirate

BleakBluets@lemmy.world · 1 year ago

tallwookie@lemmy.world · 1 year ago

moreover, if machine learning is occurring on reddit now, it’d be absolutely hilarious if it developed a wee bit o’ the speech impediment.

Marxine@lemmy.world · 1 year ago

Either matey or uwufication sound like fun malicious compliance.

Kichae@kbin.social · 1 year ago

Uwuify thuwwwwungs. Dsmvwl thngs. Jubmle up splleing so thet ist still reedabel to poeple.

Jmp btwene thuwum in de sme puwust.

Make your data unclassifiable.
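For anyone who would rather script it than type it by hand, here is a rough Python sketch of two of the tricks above: dropping vowels and jumbling the inside of each word while leaving the first and last letters alone. The function names `disemvowel` and `jumble` are made up for illustration, and keeping the first letter of each word is an assumption to keep the text guessable for people.

```python
import random
import re

VOWELS = set("aeiouAEIOU")


def disemvowel(text: str) -> str:
    """Drop vowels from each word, keeping the word's first letter."""
    def strip(match: re.Match) -> str:
        word = match.group(0)
        return word[0] + "".join(c for c in word[1:] if c not in VOWELS)

    return re.sub(r"[A-Za-z]+", strip, text)


def jumble(text: str, seed=None) -> str:
    """Shuffle the interior letters of each word; first and last stay put."""
    rng = random.Random(seed)  # pass a seed for repeatable output

    def scramble(match: re.Match) -> str:
        word = match.group(0)
        if len(word) <= 3:
            return word  # nothing to shuffle in very short words
        middle = list(word[1:-1])
        rng.shuffle(middle)
        return word[0] + "".join(middle) + word[-1]

    return re.sub(r"[A-Za-z]+", scramble, text)


if __name__ == "__main__":
    print(disemvowel("Make your data unclassifiable"))
    print(jumble("Jumble up spelling so that it is still readable"))
```

Mixing the two within one post, as suggested, is just a matter of calling a different function per sentence.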

BleakBluets@lemmy.world · 1 year ago

Yar, this one right here, Captain. Make them walk the plank! /j

Kichae@kbin.social · 1 year ago

* Sad pirate noises *

🐱TheCat@sh.itjust.works · 1 year ago

Naw matey, we were agreein’ wi ye! Am I doing teh jumbel thing corretlcy?

Techlos@lemmy.dbzer0.com · 1 year ago

Yarr, ye should also use homoglyph attacks, mayhaps with this confangled tool https://onlinetools.com/unicode/spoof-unicode-text

The scurvy dogs will have a mighty hard time parsing the heavy sea o’ randomized data obfuscation
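A minimal sketch of the homoglyph idea in Python, for the curious. This is not how the linked web tool works internally (that is unknown here); it simply swaps some Latin letters for Cyrillic look-alikes at random. The `HOMOGLYPHS` table, the `spoof` function, and the `rate` parameter are all assumptions made up for this example, and the table covers only a handful of lowercase letters.

```python
import random

# Latin letter -> visually similar Cyrillic letter (small, hand-picked table).
HOMOGLYPHS = {
    "a": "\u0430",  # CYRILLIC SMALL LETTER A
    "c": "\u0441",  # CYRILLIC SMALL LETTER ES
    "e": "\u0435",  # CYRILLIC SMALL LETTER IE
    "i": "\u0456",  # CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I
    "o": "\u043e",  # CYRILLIC SMALL LETTER O
    "p": "\u0440",  # CYRILLIC SMALL LETTER ER
    "x": "\u0445",  # CYRILLIC SMALL LETTER HA
    "y": "\u0443",  # CYRILLIC SMALL LETTER U
}


def spoof(text: str, rate: float = 0.5, seed=None) -> str:
    """Replace each mappable character with its look-alike with probability `rate`."""
    rng = random.Random(seed)  # pass a seed for repeatable output
    return "".join(
        HOMOGLYPHS[ch] if ch in HOMOGLYPHS and rng.random() < rate else ch
        for ch in text
    )


if __name__ == "__main__":
    disguised = spoof("scurvy dogs parse no more", rate=0.5)
    print(disguised)                                 # looks nearly identical on screen
    print(disguised == "scurvy dogs parse no more")  # usually False
```

The output reads the same to a person but no longer matches the original string byte for byte. Worth noting that it also breaks text search for everyone else.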

nate3D@kbin.social · 1 year ago

Any way to integrate this with shreddit to replace and save comments instead of deleting?

Techlos@lemmy.dbzer0.com · 1 year ago

Shouldn’t be too hard to implement https://github.com/picatz/homoglyphr in the shreddit code, I don’t know how it compares to the web tool I linked but it should be enough to make data cleaning a real pain in the arse for reddit

DarkThoughts@kbin.social · 1 year ago

If ye find writin’ in Corsair speak too difficult, probably maybe also fer non native speakers, then ye can use online tools t’ convert yer text fer ye!

https://pirate-speech-translator.netlify.app/
https://pirate.monkeyness.com/translate
https://funtranslations.com/pirate
https://lingojam.com/PirateSpeak

etc.

CapnAssHolo@lemmy.dbzer0.com · 1 year ago

The insult generator on monkeyness is hilarious

https://pirate.monkeyness.com/insult

ram@lemmy.ca · 1 year ago

Just fill the sub with magnet links to torrents of pics of john oliver

EuphoricPenguin@normalcity.life · edited 1 year ago

Well, me matey, ChatGPT be speakin’ like a seadog already. All ye must do is ask of it to be speakin’ this way, and it will. Arrgh. Ye could write a fancy userscript to interface wit ChatGPT and be speakin’ like a seadog without an ounce of effort!

BleakBluets@lemmy.world · edited 1 year ago

True, but it might consider it “regular talk”. I don’t really believe that (realistically) a handful of users speaking pirate would taint ChatGPT. It’s more so for anyone who wants to (temporarily) contribute to the discussion about how their favorite protesting subreddits should maliciously comply with forced reopening. I personally would not like Reddit to sell access to my comments for AI training, so if I hadn’t already deleted my accounts, I would taint them, knowing that I’m not providing Reddit anything of value.

Ideally users would leave Reddit ASAP, but in the interim, while promoting the fediverse alternatives on Reddit, I think coded language would be the most consistent with the maliciously compliant John Oliver pictures posted to various forced-to-open subreddits.

Edit: I might have misunderstood what you were trying to say. I thought you meant that pirate speak would still be useful for training AI models. My bad! I blame the pirate speak!

EuphoricPenguin@normalcity.life · 1 year ago

I was saying that: A) ChatGPT is already fluent in good-enough pirate speak, and B) It would be possible to have ChatGPT convert modern English speech into said good-enough pirate speak using a userscript. Even if it didn’t affect their data collection or AI models trained on the text, it would still be distracting and annoying for users, which might push some people away from Reddit.

BleakBluets@lemmy.world · 1 year ago

That makes sense, I didn’t even think about that aspect of it. Obviously users seeing posts of (Sexy Captain) John Oliver will know that those users are protesting, but seeing pirate-speak comments in non-protesting posts would also cause users to be reminded of the protest (if pirate-speak caught on large-scale and was associated with the protest).

I was thinking more small-scale individual-level and not the large-scale that (Sexy Captain) John Oliver posts have become. It would be nice if it caught on to a large scale like you suggest. My biggest gripe with these protest discussions being on Reddit is that Reddit is still benefiting from that activity and this would be my way of mitigating that.

EuphoricPenguin@normalcity.life · 1 year ago

Honestly, anything to fuck with Reddit is a W in my book.

count0@lemmy.dbzer0.com · edited 1 year ago

Maybe also a couple of key phrases?

That one thing has certainly ‘worked’, FWIW as of now.

Be creative! What if AI got trained to always answer with subtle innuendo… the thought makes me all shivery.

spicy_biscuits@kbin.social · 1 year ago

Yarr matey, I already done mutiny on that foul vessel, else I’d take part in showing these scurvy dogs what fer

Alatarius@lemmy.world · edited 1 year ago

atwhay ifyay allyay o’yay usyay alktay inyay orsaircay igpay atinlay

Translation: What if we all speak in pirate pig latin
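The pig latin above follows the usual rule: move the leading consonants to the end and add "ay", or just add "yay" when the word starts with a vowel. A quick Python sketch of that rule, assuming lowercase output and treating only a, e, i, o, u as vowels (the names `pig_latin` and `pig_latin_word` are made up for illustration):

```python
import re

VOWELS = "aeiou"


def pig_latin_word(word: str) -> str:
    """Convert one word: vowel-initial words get 'yay', others rotate and get 'ay'."""
    lower = word.lower()
    if lower[0] in VOWELS:
        return lower + "yay"
    for i, ch in enumerate(lower):
        if ch in VOWELS:
            # Move the leading consonant cluster to the end.
            return lower[i:] + lower[:i] + "ay"
    return lower + "ay"  # no vowel found; just append the suffix


def pig_latin(text: str) -> str:
    """Apply the word rule to every run of letters, leaving other characters alone."""
    return re.sub(r"[A-Za-z]+", lambda m: pig_latin_word(m.group(0)), text)


if __name__ == "__main__":
    print(pig_latin("what if all us talk in corsair pig latin"))
```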

lemonadebunny@lemmy.ca · 1 year ago

What about those who use screenreaders? It would be unfair to them

BleakBluets@lemmy.world · 1 year ago

Ultimately, the goal of the protest should be to get as many users off of Reddit as possible.

It’s all about harm reduction (or maximization, in this case) and minimizing the amount of traffic and useful data going to Reddit. There are going to be situations where giving screenreader users the information about Lemmy/kbin will transition users off of Reddit. In that case, the number of users leaving Reddit probably outweighs the cost of the minuscule amount of data provided to Reddit in the couple of comments it takes to advertise transitioning to Lemmy/kbin to such users.

It’s up to the individual to make that evaluation for themselves. If you want to propose a Lemmy/kbin alternative to Redditors on r/screenreader, then yeah, probably don’t use encoded text.