That’s probably a massive GDPR violation. Automated processing of extra sensitive data like political beliefs and religion is not outright forbidden but it’s subject to extra protections.
Yeah this seems illegal as fuck
I doubt it, since all it ostensibly does is summarize info the user has released freely. How that info is stored and retained exactly might be up for debate though.
The title is likely inaccurate. The post only contains a summary of the user’s posting history. It makes no statements regarding the user’s beliefs.
Nah, I think all of it is literally just public data offered up by users themselves. If you didn’t want those opinions shared, you shouldn’t have posted them on Reddit.
GDPR also applies to data you get from public sources.
I don’t understand.
If someone writes a reddit post and says “I’m fasting for Ramadan,” can I not infer from that public post that the user is probably Muslim?
You cannot use an algorithm to correlate it with other data without express consent.
What counts as an algorithm? Surely it can’t be the actual definition of algorithm.
Because in most forum software (even the older stuff that predates reddit or social media) if I just click on a username, that fetches from the database every comment that the user has ever made, usually sorted in reverse chronological order. That technically fits the definition of an algorithm, and presents that user’s authored content in a manner that correlates the comments with the same user, regardless of where it originally appeared (in specific threads).
So if it generates a webpage that shows the person once made a comment in a cooking subreddit that says “I’m a Muslim and I love the halal version” next to a comment posted to a college admissions subreddit that says “I graduated from Harvard in 2019” next to a comment posted to a gardening subreddit that says “I live in Berlin,” does reddit violate the GDPR by assembling this information all in one place?
They will get your data from everywhere not just reddit. There needs to be many more laws and punishments for doing this.
Have there been any enforcement actions against big companies yet?
Meta got a fine of over a billion euros. Google got a bunch of smaller fines, but it’s probably way above everyone else in terms of fines. Microsoft got half a billion. Even Apple got an 8 million euro fine, but that was more a tap in the wrist to make them think twice about some data collection.
And besides this, large companies are constantly in contact with the authorities and in smaller violations the general policy is to give a warning and let companies stop the illegal data processing voluntarily.
All of them are a slap on a wrist, even Meta
I’m so jealous.
yepowertrippinbastards++
deleted by creator
That’s a useless summary that describes 99% of reddit and lemmy users
Fuck Reddit and Fuck Spez.
What does “Fuck” mean in this use case?
I know that people generally use it to express dissatisfaction or disrespect, As intensifier generally.
But Why the Fuck do we have to bring sex into degradation?
What does this serve exactly?
Am I the only one here thinking about this
Yeah; I just chalk it up to the fact that humans for thousands of years have been obsessed with sex and poop on a level more basic than language - it’s not as surprising then to think our language is riddled with allusions to it.
Fuck you
Fuck that.
Fuck off.
fuck Fuck
I always read into as a Vlad the Impaler situation. Sex is not involved whatsoever.
Spock, is that you?
Am I the only one here thinking about this
Probably.
Most insults are scatterological or sexual in nature. But you must understand that the word isn’t the thing itself. I don’t think anyone has sexual aspirations of Spez
Am I the only one here thinking about this
Yup
So Reddit is the scumbag steve.
Just dont use Reddit already
Is Lemmy somehow invulnerable to this? Can’t it just be scraped the same way.
100% it can, and worse yet, it can be scraped and analyzed like this by Meta, or Reddit, or your nearest fascist dictator.
But that’s also pretty much the entire internet in any forum or social media platform, so although I don’t like it, I figure that’s more or less just part of using the internet.
To me it’s much like the whole “There’s no expectation of privacy in public.”
I hope it doesn’t take AI to figure out my attitudes when I regularly proclaim that there are no maga in my life more than required by various pre-existing obligation, Luigi Mangione will be remembered as a folk hero, Trump will one day be shown to be ALL through the Epstein files in all the worst ways (as will many Dems and they should all be prosecuted to the fullest extent of the law), and most Democratic leadership is 100% owned by the oligarchy too, as evidenced by things such as their support of Israeli genocide and how much more urgently they are fighting Mamdani than they are Trump in many cases.
“There’s no expectation of privacy in public.”
There’s no expectation of privacy anywhere. This has all been taken way too far.
Most importantly, by using Lemmy you aren’t enriching a techno-fascist platform. They can scrape lemmy the same way they used to scrape reddit, but at least my content does not benefit a greedy little pig boy fascist fucknut.
Lemmy isn’t monolithic. They’d have to hit every instance separately. The big ones like lemmy.world are a prime target though.
They can even scrape likes/dislikes and make profiles for people that don’t comment.
It can, but it’s not built-in into the system and shown to every moderator in their UI.
Also your email and IP address is only available to the instance owner. So the only PII is what you share.
Nope, but at least I’m not supporting a company that actively does this.
The screenshot shows an llm summary of a users posting history. Is that what you mean by “determine belief values stance and more” ? Is there more to this? How is that summary different from scrolling through someone’s posting history to see what they post about?
How is that summary different from scrolling through someone’s posting history to see what they post about?
How is reading the Clif notes/summary different from reading the book? Time and effort taken, as well as a much shallower understanding of the material (assuming your summary is even relatively accurate).
It’s an easy way to get an instant opinion of someone so you can make a determination on whether you like it without having to tax your poor brain into actually thinking, and you can let something decide your opinion before you even know what you want to know. A summary provided by a product that is notoriously frequently wrong or lies and makes shit up out of whole cloth.
It’s made by a machine and can be biased by its prompt, training, and owners political beliefs (see Elon’s Grok).
The post title makes it sound like Reddit is doing some sort of automated classification of user politics with some sort of ml technique. But the screenshot does not show that. It shows an llm summary of a users posting history . If the tool was run on a user that posted exclusively to a cat subreddit, the summary would have been about how the user likes cats. Despite the utility or accuracy of llm summaries, what the screenshot shows is far more anodyne than what this post’s title implies is happening.
this is absolutely fucking terrifying
Thought/ Pre Crime.
Crime by statistical association
Also known as profiling.
What’s that utility that helps you delete your profile the right way?
Powerdeletesuite?
I’m sure it’s as accurate and expensive as most A.I. slop.
Good thing we’re not on reddit
How exactly is it a good thing in this particular case? All this information is only more accessible on Lemmy.
It’s easier to use a pseudonym on Lemmy.
Lemmy instances aren’t summarizing their users. You can do it on ChatGPT, but Lemmy isn’t doing it.
But what’s the good thing? Yeah Lemmy might not be doing it but I can do it, and Elon can, and Zuck, and Putin and your grandmom. Whatever you post on Lemmy is as public as it can get.
The good thing is that we aren’t sucking reddit’s teat. If you are so worried, then why are you here? Why do you keep posting comments?
I’m just trying to pick your brain and figure out what exactly is good about Lemmy in this case, but you seem reluctant to give me a cohesive answer.
Lemmy doesn’t use AI to track their user base’s opinions and summarize it to mods. Is that clear enough? I’m not sure what you don’t understand.
I think their point is there’s nothing stopping Lemmy mods from using the exact same type of AI summarization tools. It may not be built-in, but many instances have their own addons for improving moderation
No wonder I got banned.
Maybe they can check people like me that deleted on their reddit posts and comments… See if the AI can see all that “removed” content :P
I can tell you with near certainty there’s a backup somewhere of the pre-exodus database with all those deleted accounts and comments intact.
Oh I am sure there’s a backup. I guess I’m interested in whether they’re actually using it or not.
Guess that’d be another safe bet. They had be silly not to.
I know I’m in super-cynic mode this morning, but my guess is they finished training on all the data to that point already, so it’s probably not online anywhere currently, but also probably 100% a part of whatever training they are doing. Again, IMO only, and rampant speculation.
Though frankly, all that is worrying me a lot less now that I realize Doge took all the info that any identity thief could possibly want about every citizen in the US (and more), plus whatever classified info they have, and Putin is probably months into Russia’s analysis and training on every last bit of that data. (or there was a recent dead drop in Alaska 🤔 )













