Reddit is using AI to determine users beliefs, values, stances and more based on their activity (posts and comments) summarizing it to Subreddit Mods.

Pro@programming.dev · 4 months ago

Reddit is using AI to determine users beliefs, values, stances and more based on their activity (posts and comments) summarizing it to Subreddit Mods.

ViatorOmnium@piefed.social · edit-2 4 months ago

That’s probably a massive GDPR violation. Automated processing of extra sensitive data like political beliefs and religion is not outright forbidden but it’s subject to extra protections.

bridgeenjoyer@sh.itjust.works · 4 months ago

Yeah this seems illegal as fuck

Ex Nummis@lemmy.world · 4 months ago

I doubt it, since all it ostensibly does is summarize info the user has released freely. How that info is stored and retained exactly might be up for debate though.

NoneOfUrBusiness@fedia.io · 4 months ago

The title is likely inaccurate. The post only contains a summary of the user’s posting history. It makes no statements regarding the user’s beliefs.

basiclemmon98@lemmy.dbzer0.com · 4 months ago

Nah, I think all of it is literally just public data offered up by users themselves. If you didn’t want those opinions shared, you shouldn’t have posted them on Reddit.

ViatorOmnium@piefed.social · 4 months ago

GDPR also applies to data you get from public sources.

GamingChairModel@lemmy.world · 4 months ago

I don’t understand.

If someone writes a reddit post and says “I’m fasting for Ramadan,” can I not infer from that public post that the user is probably Muslim?

ViatorOmnium@piefed.social · edit-2 4 months ago

You cannot use an algorithm to correlate it with other data without express consent.

GamingChairModel@lemmy.world · 4 months ago

What counts as an algorithm? Surely it can’t be the actual definition of algorithm.

Because in most forum software (even the older stuff that predates reddit or social media) if I just click on a username, that fetches from the database every comment that the user has ever made, usually sorted in reverse chronological order. That technically fits the definition of an algorithm, and presents that user’s authored content in a manner that correlates the comments with the same user, regardless of where it originally appeared (in specific threads).

So if it generates a webpage that shows the person once made a comment in a cooking subreddit that says “I’m a Muslim and I love the halal version” next to a comment posted to a college admissions subreddit that says “I graduated from Harvard in 2019” next to a comment posted to a gardening subreddit that says “I live in Berlin,” does reddit violate the GDPR by assembling this information all in one place?

bridgeenjoyer@sh.itjust.works · 4 months ago

They will get your data from everywhere not just reddit. There needs to be many more laws and punishments for doing this.

ayyy@sh.itjust.works · 4 months ago

Have there been any enforcement actions against big companies yet?

ViatorOmnium@piefed.social · 4 months ago

Meta got a fine of over a billion euros. Google got a bunch of smaller fines, but it’s probably way above everyone else in terms of fines. Microsoft got half a billion. Even Apple got an 8 million euro fine, but that was more a tap in the wrist to make them think twice about some data collection.

And besides this, large companies are constantly in contact with the authorities and in smaller violations the general policy is to give a warning and let companies stop the illegal data processing voluntarily.

axEl7fB5@lemmy.cafe · 4 months ago

All of them are a slap on a wrist, even Meta

https://proton.me/tech-fines-tracker

ayyy@sh.itjust.works · 4 months ago

I’m so jealous.

friend_of_satan@lemmy.world · 4 months ago

yepowertrippinbastards++

sk1nnym1ke@piefed.social · edit-2 13 days ago

deleted by creator

chunes@lemmy.world · 4 months ago

That’s a useless summary that describes 99% of reddit and lemmy users

😈MedicPig🐷BabySaver😈@lemmy.world · 4 months ago

Fuck Reddit and Fuck Spez.

Pro@programming.dev · 4 months ago

What does “Fuck” mean in this use case?

I know that people generally use it to express dissatisfaction or disrespect, As intensifier generally.

But Why the Fuck do we have to bring sex into degradation?

What does this serve exactly?

Am I the only one here thinking about this

TeddE@lemmy.world · 4 months ago

Yeah; I just chalk it up to the fact that humans for thousands of years have been obsessed with sex and poop on a level more basic than language - it’s not as surprising then to think our language is riddled with allusions to it.

crapton_america@lemmy.world · 4 months ago

Fuck you

😈MedicPig🐷BabySaver😈@lemmy.world · 4 months ago

Fuck that.

NotASharkInAManSuit@lemmy.world · 4 months ago

Fuck off.

embMaster@lemmy.world · 4 months ago

fuck Fuck

unphazed@lemmy.world · 4 months ago

I always read into as a Vlad the Impaler situation. Sex is not involved whatsoever.

WizardofFrobozz@lemmy.ca · 4 months ago

Spock, is that you?

Echo Dot@feddit.uk · 4 months ago

Am I the only one here thinking about this

Probably.

Most insults are scatterological or sexual in nature. But you must understand that the word isn’t the thing itself. I don’t think anyone has sexual aspirations of Spez

Bennyboybumberchums@lemmy.world · edit-2 4 months ago

Am I the only one here thinking about this

Yup

ronigami@lemmy.world · 4 months ago

So Reddit is the scumbag steve.

cosmicrookie@lemmy.world · 4 months ago

Just dont use Reddit already

StinkyFingerItchyBum@lemmy.ca · 4 months ago

Is Lemmy somehow invulnerable to this? Can’t it just be scraped the same way.

octopus_ink@slrpnk.net · edit-2 4 months ago

100% it can, and worse yet, it can be scraped and analyzed like this by Meta, or Reddit, or your nearest fascist dictator.

But that’s also pretty much the entire internet in any forum or social media platform, so although I don’t like it, I figure that’s more or less just part of using the internet.

To me it’s much like the whole “There’s no expectation of privacy in public.”

I hope it doesn’t take AI to figure out my attitudes when I regularly proclaim that there are no maga in my life more than required by various pre-existing obligation, Luigi Mangione will be remembered as a folk hero, Trump will one day be shown to be ALL through the Epstein files in all the worst ways (as will many Dems and they should all be prosecuted to the fullest extent of the law), and most Democratic leadership is 100% owned by the oligarchy too, as evidenced by things such as their support of Israeli genocide and how much more urgently they are fighting Mamdani than they are Trump in many cases.

Michael@slrpnk.net · 4 months ago

“There’s no expectation of privacy in public.”

There’s no expectation of privacy anywhere. This has all been taken way too far.

WhatAmLemmy@lemmy.world · edit-2 4 months ago

Most importantly, by using Lemmy you aren’t enriching a techno-fascist platform. They can scrape lemmy the same way they used to scrape reddit, but at least my content does not benefit a greedy little pig boy fascist fucknut.

Ex Nummis@lemmy.world · 4 months ago

Lemmy isn’t monolithic. They’d have to hit every instance separately. The big ones like lemmy.world are a prime target though.

scintilla@crust.piefed.social · 4 months ago

They can even scrape likes/dislikes and make profiles for people that don’t comment.

hisao@ani.social · 4 months ago

It can, but it’s not built-in into the system and shown to every moderator in their UI.

jaybone@lemmy.zip · 4 months ago

Also your email and IP address is only available to the instance owner. So the only PII is what you share.

panda_abyss@lemmy.ca · 4 months ago

Nope, but at least I’m not supporting a company that actively does this.

JollyG@lemmy.world · 4 months ago

The screenshot shows an llm summary of a users posting history. Is that what you mean by “determine belief values stance and more” ? Is there more to this? How is that summary different from scrolling through someone’s posting history to see what they post about?

Passerby6497@lemmy.world · 4 months ago

How is that summary different from scrolling through someone’s posting history to see what they post about?

How is reading the Clif notes/summary different from reading the book? Time and effort taken, as well as a much shallower understanding of the material (assuming your summary is even relatively accurate).

It’s an easy way to get an instant opinion of someone so you can make a determination on whether you like it without having to tax your poor brain into actually thinking, and you can let something decide your opinion before you even know what you want to know. A summary provided by a product that is notoriously frequently wrong or lies and makes shit up out of whole cloth.

breakingcups@lemmy.world · 4 months ago

It’s made by a machine and can be biased by its prompt, training, and owners political beliefs (see Elon’s Grok).

JollyG@lemmy.world · 4 months ago

The post title makes it sound like Reddit is doing some sort of automated classification of user politics with some sort of ml technique. But the screenshot does not show that. It shows an llm summary of a users posting history . If the tool was run on a user that posted exclusively to a cat subreddit, the summary would have been about how the user likes cats. Despite the utility or accuracy of llm summaries, what the screenshot shows is far more anodyne than what this post’s title implies is happening.

fluxixx@piefed.social · 4 months ago

this is absolutely fucking terrifying

einlander@lemmy.world · 4 months ago

Thought/ Pre Crime.

TeddE@lemmy.world · 4 months ago

Crime by statistical association

NoodlePoint@lemmy.world · 4 months ago

Also known as profiling.

danc4498@lemmy.world · 4 months ago

What’s that utility that helps you delete your profile the right way?

Grostleton@lemmy.dbzer0.com · 4 months ago

Powerdeletesuite?

ansiz@lemmy.world · 4 months ago

I’m sure it’s as accurate and expensive as most A.I. slop.

FenderStratocaster@lemmy.world · 4 months ago

Good thing we’re not on reddit

Perspectivist@feddit.uk · 4 months ago

How exactly is it a good thing in this particular case? All this information is only more accessible on Lemmy.

ronigami@lemmy.world · 4 months ago

It’s easier to use a pseudonym on Lemmy.

FenderStratocaster@lemmy.world · edit-2 4 months ago

Lemmy instances aren’t summarizing their users. You can do it on ChatGPT, but Lemmy isn’t doing it.

Perspectivist@feddit.uk · 4 months ago

But what’s the good thing? Yeah Lemmy might not be doing it but I can do it, and Elon can, and Zuck, and Putin and your grandmom. Whatever you post on Lemmy is as public as it can get.

FenderStratocaster@lemmy.world · 4 months ago

The good thing is that we aren’t sucking reddit’s teat. If you are so worried, then why are you here? Why do you keep posting comments?

Perspectivist@feddit.uk · 4 months ago

I’m just trying to pick your brain and figure out what exactly is good about Lemmy in this case, but you seem reluctant to give me a cohesive answer.

FenderStratocaster@lemmy.world · 4 months ago

Lemmy doesn’t use AI to track their user base’s opinions and summarize it to mods. Is that clear enough? I’m not sure what you don’t understand.

xthexder@l.sw0.com · 4 months ago

I think their point is there’s nothing stopping Lemmy mods from using the exact same type of AI summarization tools. It may not be built-in, but many instances have their own addons for improving moderation

peaceful_world_view@lemmy.world · 4 months ago

No wonder I got banned.

r00ty@kbin.life · 4 months ago

Maybe they can check people like me that deleted on their reddit posts and comments… See if the AI can see all that “removed” content :P

octopus_ink@slrpnk.net · 4 months ago

I can tell you with near certainty there’s a backup somewhere of the pre-exodus database with all those deleted accounts and comments intact.

r00ty@kbin.life · 4 months ago

Oh I am sure there’s a backup. I guess I’m interested in whether they’re actually using it or not.

Dyskolos@lemmy.zip · 4 months ago

Guess that’d be another safe bet. They had be silly not to.

octopus_ink@slrpnk.net · edit-2 4 months ago

I know I’m in super-cynic mode this morning, but my guess is they finished training on all the data to that point already, so it’s probably not online anywhere currently, but also probably 100% a part of whatever training they are doing. Again, IMO only, and rampant speculation.

Though frankly, all that is worrying me a lot less now that I realize Doge took all the info that any identity thief could possibly want about every citizen in the US (and more), plus whatever classified info they have, and Putin is probably months into Russia’s analysis and training on every last bit of that data. (or there was a recent dead drop in Alaska 🤔 )