(Feel free to remove this as off-topic, but this relates to the post about the r/Piracy poll regarding what content will be permitted upon reopening. The body of this post wouldn’t get the same reach as a comment on that post.)
Ahoy hearties! Here what I be thinkin’. Reddit be chargin’ tens of millions of doubloons for third-mates to access the API, aye? They be claimin’ to deserve a share of the booty for providin’ trainin’ data for AI (and obviously to kill competition with third-mate apps to boot).
Methinks if yee MUST chatter with those landlubbers (such as for the purpose of recruitin’ new mates or cussing out mutinous scabs), then yee ought to make any text data yee provide unappealing and unusable to potential AI-training-customers.
Paintings of (Sexy) Captain John Oliver will only sully the attention of the human users. But (pirate) coded language mayhaps be an obstruction for bots? For those who find pirate speak to be too much effort, an alternative be to speak “sdrawkcaB”.
I can no longer cast my bottled messages to Reddit’s shore, so any of you seadogs are free to pass it along.
moreover, if machine learning is occurring on reddit now, it’d be absolutely hilarious if it developed a wee bit o’ the speech impediment.
Either matey or uwufication sound like fun malicious compliance.
Uwuify thuwwwwungs. Dsmvwl thngs. Jubmle up splleing so thet ist still reedabel to poeple.
Jmp btwene thuwum in de sme puwust.
Make your data unclassifiable.
Yar, this one right here, Captain. Make them walk the plank! /j
* Sad pirate noises *
Naw matey, we were agreein’ wi ye! Am I doing teh jumbel thing corretlcy?
Yarr, ye should also use homoglyph attacks, mayhaps with this confangled tool https://onlinetools.com/unicode/spoof-unicode-text
The scurvy dogs will have a mighty hard time parsing the heavy sea o’ randomized data obfuscation
Any way to integrate this with shreddit to replace and save comments instead of deleting?
Shouldn’t be too hard to implement https://github.com/picatz/homoglyphr in the shreddit code, I don’t know how it compares to the web tool I linked but it should be enough to make data cleaning a real pain in the arse for reddit
If ye find writin’ in Corsair speak too difficult, probably maybe also fer non native speakers, then ye can use online tools t’ convert yer text fer ye!
https://pirate-speech-translator.netlify.app/
https://pirate.monkeyness.com/translate
https://funtranslations.com/pirate
https://lingojam.com/PirateSpeaketc.
The insult generator on monkeyness is hilarious
Just fill the sub with magnet links to torrents of pics of john oliver
Well, me matey, ChatGPT be speakin’ like a seadog already. All ye must do is ask of it to be speakin’ this way, and it will. Arrgh. Ye could write a fancy userscript to interface wit ChatGPT and be speakin’ like a seadog without an ounce of effort!
True, but it might consider it “regular talk”. I don’t really believe that (realistically) a handful of users speaking pirate would taint chat GPT. It’s more-so for anyone who wants to (temporarily) contribute to the discussion about how their favorite protesting subreddits should maliciously comply with forced reopening. I personally feel that I would not like Reddit to sell access to my comments to be used for AI training, so if I hadn’t already deleted my accounts, I would taint them knowing that I’m not providing Reddit anything of value.
Ideally users would leave Reddit ASAP, but in the interim, while promoting the fediverse alternatives on Reddit, I think coded language would be the most consistent with the maliciously complaint John Oliver pictures posted to various forced-to-open subreddits.
Edit: I might have misunderstood what you were trying to say. I thought you meant that pirate speak would still be useful for training AI models. My bad! I blame the pirate speak!
I was saying that: A) ChatGPT is already fluent in good-enough pirate speak, and B) It would be possible to have ChatGPT convert modern English speech into said good-enough pirate speak using a userscript. Even if it didn’t affect their data collection or AI models trained on the text, it would still be distracting and annoying for users, which might push some people away from Reddit.
That makes sense, I didn’t even think about that aspect of it. Obviously users seeing posts of (Sexy Captain) John Oliver will know that those users are protesting, but seeing pirate-speak comments in non-protesting posts would also cause users to be reminded of the protest (if pirate-speak caught on large-scale and was associated with the protest).
I was thinking more small-scale individual-level and not the large-scale that (Sexy Captain) John Oliver posts have become. It would be nice if it caught on to a large scale like you suggest. My biggest gripe with these protest discussions being on Reddit is that Reddit is still benefiting from that activity and this would be my way of mitigating that.
Honestly, anything to fuck with Reddit is a W in my book.
Yarr matey, I already done mutiny on that foul vessel, else I’d take part in showing these scurvy dogs what fer
atwhay ifyay allyay o’yay usyay alktay inyay orsaircay igpay atinlay
Translation: What if we all speak in pirate pig latin
What about those who use screenreaders? It would be unfair to them
Ultimately, the goal of the protest should be to get as many users off of Reddit as possible.
It’s all about harm reduction (or maximization, in this case) and minimizing the amount of traffic and useful data to Reddit. There are going to be situations where giving screenreader users the information about Lemmy/kbin will transition users off of Reddit. In that case, the amount of users leaving Reddit probably outweighs the cost of the minuscule amount data provided to Reddit in the couple of comments it takes to advertise transitioning to Lemmy/kbin to such users.
It’s up to the individual to make that evaluation for themselves. If you want to propose a Lemmy/kbin alternative to Redditors on r/screenreader, then yeah, probably don’t use encoded text.