AI-assisted moderation in the fediverse is happening.

sugar@snug.moe

@mjdxp @piefedadmin they claim the instance in question is lemmy.dbzer0.com, according to piefed.world/modlog?mod_action=ban_user&suspect_user_name=&communities=&user_name=flatworm7591%40lemmy.dbzer0.com&submit=Search

the problematic reason is "Instance rule 8. For evidence log, see: s.faf-pb.xyz/lXxek (expires in 30 days)"

and looking at the link, they use the following llm prompt, using gpt-5.3-mini model:

I'D LIKE YOU TO ANALYSE THIS CONTENT FOR EVIDENCE OF PRO-ZIONIST OR ANTI-PALESTINIAN SENTIMENT. ALSO IDENTIFY ANY COMMON HASBARA TROPES

(no idea why it's all-caps, posting as they wrote it)

mjdxp@labyrinth.zone

@sugar @piefedadmin well, at least they're anti-genocide of palestinians, except they're using a product made by a company that's presumably pro-genocide of palestinians to try to prevent it on their platform?

db0@hachyderm.io

@piefedadmin since rimu named our instance, I have to point out that they're deliberately misrepresenting what happened and I strongly urge people to look at the discussions in lemmy about it to get the whole picture.

To be clear, our instance does not utilize any GenAI tools in moderation. Rimu is referring to a single manual action by one admin, using the same user access as any user on the fediverse. The action was likewise completely public.

jackemled@furry.engineer

@piefedadmin I would consider collecting everyone's posts & sending complete transcripts to a sketchy company, even if it's "totally for moderation purposes, we promise", to be malicious scraping behavior.

sirtao@social.sirtao.it

(sigh) so now I am wary to use #Fediverse at all now not knowing which of whatever I may have 'politically' said would be routed to ICE.

No offense but... malicious actors(or anybody with a grudge against you) were always been able to do that, as you are posting publicly(same as me).
Posting on public-facing social networks, including the #fediverse, always was talking loud in a public place.

I'm more worried\irritated by the LLM training scraping.

theriac@plasmatrap.com

@piefedadmin@join.piefed.social
I'd prefer to know which instances are involved. I am not ok with anything AI.

soy@ffuent.es

I remember Reddit automated moderation triggers lots of false positives, especially in other languages.

risc@wetdry.world

@piefedadmin

Once the user’s comments are sent to OpenAI, is it used to train their models?

i highly doubt it. from https://developers.openai.com/api/docs/guides/your-data: "As of March 1, 2023, data sent to the OpenAI API is not used to train or improve OpenAI models (unless you explicitly opt in to share data with us)."

opting in to sharing data would seem silly

lexyeen@plush.city

@piefedadmin >Today these instances are using it and maybe we’re ok with that because it’s being used by communities we agree with

I sure as fuck ain't okay with it. There is nothing excusable about feeding anyone's posts into The Plagiarism Engine That Lies.

felipeb@hachyderm.io

@piefedadmin
idk they search for the zionists, hey found them and ban them, I don't really see the problem here.
They aren't "feeding the comments to an llm" like some comments are saying

Piero Bosio Social Web Site Personale

AI-assisted moderation in the fediverse is happening.

AI-assisted moderation in the fediverse is happening. Now what?

AI-assisted moderation in the fediverse is happening. Now what?

AI-assisted moderation in the fediverse is happening. Now what?

AI-assisted moderation in the fediverse is happening. Now what?

AI-assisted moderation in the fediverse is happening. Now what?

AI-assisted moderation in the fediverse is happening. Now what?

AI-assisted moderation in the fediverse is happening. Now what?

Feed RSS