👏 Poison 👏 your 👏 data ☠️
-
@alice NULL is also a good answer for when you don't want to give out a particular personal detail.
Aside from phone, date of birth, and email, most of the time the front end form fields will accept NULL as an answer.
https://en.wikipedia.org/wiki/Null_(SQL)@aj@gts.sadauskas.id.au @alice@lgbtqia.space Mind you, a well designed application should not interpret a string saying null as a null value.
You probably won't pull a Bobby Tables off on Facebook. -
The goal is to make corporate data less profitable.
Even stuff as simple as setting your birthdate to 1970-01-01 everywhere, adding [TEST] or [DELETED] as your name or account notes anywhere you don't need them to know your name.
Using plugins like AdNauseam to poison ad trackers (and cost them marketing dollars).
Using VPNs set to different locations.
Signing into data broker sites to "correct" outdated info (they'll often let you do that with little-to-no proof of identity, but will require your passport or state ID in order to delete your info). Bonus points if you correct it to someone else's info on their site that's similar to yours.
Only fill in required fields when you sign up for anything, but only provide correct info if it matters for you to use the service, otherwise provide plausible, but incorrect, data.
If you use LLMs anywhere, use the free tier and always vote thumbs up for bad answers and down for good ones. It wastes their resources and drives up their costs while making their training data worse.
@alice given that the alternative approach is to complain to them that collecting my postcode violates GDPR as they don't need it, just to have them say they'll fix it then they don't
I think I'm going to keep entering ZZ9 2ZA for postcodes
-
The goal is to make corporate data less profitable.
Even stuff as simple as setting your birthdate to 1970-01-01 everywhere, adding [TEST] or [DELETED] as your name or account notes anywhere you don't need them to know your name.
Using plugins like AdNauseam to poison ad trackers (and cost them marketing dollars).
Using VPNs set to different locations.
Signing into data broker sites to "correct" outdated info (they'll often let you do that with little-to-no proof of identity, but will require your passport or state ID in order to delete your info). Bonus points if you correct it to someone else's info on their site that's similar to yours.
Only fill in required fields when you sign up for anything, but only provide correct info if it matters for you to use the service, otherwise provide plausible, but incorrect, data.
If you use LLMs anywhere, use the free tier and always vote thumbs up for bad answers and down for good ones. It wastes their resources and drives up their costs while making their training data worse.
@alice This sounds like a job for Little Bobby Tables.
-
👏 Poison 👏 your 👏 data ☠️
@alice my first name is
"' or 1=1" -
The goal is to make corporate data less profitable.
Even stuff as simple as setting your birthdate to 1970-01-01 everywhere, adding [TEST] or [DELETED] as your name or account notes anywhere you don't need them to know your name.
Using plugins like AdNauseam to poison ad trackers (and cost them marketing dollars).
Using VPNs set to different locations.
Signing into data broker sites to "correct" outdated info (they'll often let you do that with little-to-no proof of identity, but will require your passport or state ID in order to delete your info). Bonus points if you correct it to someone else's info on their site that's similar to yours.
Only fill in required fields when you sign up for anything, but only provide correct info if it matters for you to use the service, otherwise provide plausible, but incorrect, data.
If you use LLMs anywhere, use the free tier and always vote thumbs up for bad answers and down for good ones. It wastes their resources and drives up their costs while making their training data worse.
@alice I like to select wrong answers on captchas until I get bored.
-
@alice This sounds like a job for Little Bobby Tables.
-
-
The goal is to make corporate data less profitable.
Even stuff as simple as setting your birthdate to 1970-01-01 everywhere, adding [TEST] or [DELETED] as your name or account notes anywhere you don't need them to know your name.
Using plugins like AdNauseam to poison ad trackers (and cost them marketing dollars).
Using VPNs set to different locations.
Signing into data broker sites to "correct" outdated info (they'll often let you do that with little-to-no proof of identity, but will require your passport or state ID in order to delete your info). Bonus points if you correct it to someone else's info on their site that's similar to yours.
Only fill in required fields when you sign up for anything, but only provide correct info if it matters for you to use the service, otherwise provide plausible, but incorrect, data.
If you use LLMs anywhere, use the free tier and always vote thumbs up for bad answers and down for good ones. It wastes their resources and drives up their costs while making their training data worse.
@alice I thought everyone had a standard "birthdate" that they used when asked on the internet.
I was clearly just using the wrong one.
-
The goal is to make corporate data less profitable.
Even stuff as simple as setting your birthdate to 1970-01-01 everywhere, adding [TEST] or [DELETED] as your name or account notes anywhere you don't need them to know your name.
Using plugins like AdNauseam to poison ad trackers (and cost them marketing dollars).
Using VPNs set to different locations.
Signing into data broker sites to "correct" outdated info (they'll often let you do that with little-to-no proof of identity, but will require your passport or state ID in order to delete your info). Bonus points if you correct it to someone else's info on their site that's similar to yours.
Only fill in required fields when you sign up for anything, but only provide correct info if it matters for you to use the service, otherwise provide plausible, but incorrect, data.
If you use LLMs anywhere, use the free tier and always vote thumbs up for bad answers and down for good ones. It wastes their resources and drives up their costs while making their training data worse.
I have been so many John Smiths along with First Last.
-
@alice when i have to use a web app to order food, e.g. CoolBurgz (fictional) i will always put my email as e.g.
coolburgz@coolburgz.coolburgz
usually counts as valid.
-
@aj@gts.sadauskas.id.au @alice@lgbtqia.space Mind you, a well designed application should not interpret a string saying null as a null value.
You probably won't pull a Bobby Tables off on Facebook. -
Wrt #PII, It might be a good idea to avoid entering data easily identifiable as trash, and use generators instead. E.g.:
@penguinrebellion that's why I said plausible, but fake.
Generators are good though.
There are, however, reasons to enter something wildly off every so often, like "test@example.com", because it tells companies that field is obviously fake. This both makes the plausible fakes more likely to slip by if they do use your data, but also makes them more likely to discard your data for marketing and analytics purposes in general.
-
The goal is to make corporate data less profitable.
Even stuff as simple as setting your birthdate to 1970-01-01 everywhere, adding [TEST] or [DELETED] as your name or account notes anywhere you don't need them to know your name.
Using plugins like AdNauseam to poison ad trackers (and cost them marketing dollars).
Using VPNs set to different locations.
Signing into data broker sites to "correct" outdated info (they'll often let you do that with little-to-no proof of identity, but will require your passport or state ID in order to delete your info). Bonus points if you correct it to someone else's info on their site that's similar to yours.
Only fill in required fields when you sign up for anything, but only provide correct info if it matters for you to use the service, otherwise provide plausible, but incorrect, data.
If you use LLMs anywhere, use the free tier and always vote thumbs up for bad answers and down for good ones. It wastes their resources and drives up their costs while making their training data worse.
@alice Even better if we can find a way to sneak data into databases that, if dumped into JSON or CSV or what have you, would match common antivirus signatures.
-
Haha, you'd like my mother, the guerilla witch. She makes customer cards in every shop and switches them then with other people, bonus points if both have a strongly different consumer profile.
When she's bored, she responds maliciously questionnaires of evil corporations.
She studied psychology and statistics and says "it is anyway horribly difficult to get useful answers out of these marketing datasets, why not make it a bit harder for them?" 😈.
@earthworm TIL I have a second kid.
My education is in psychology and statistics, and I do shit like that whenever I can.
-
@alice got to show my ignorance here, but how do I find which brokers have my info?!
@apriloq that's easy! They all do!
But they're also very incestuous, so poisoning one will often cause a trickle down their pantleg to the next one who is thirsty for your data.
An easy way to start finding targets is to google your name, then explore the links you didn't sign up for.
Sites like:
- Spokeo
- BeenVerified
- Intelius
- TruthFinder
- MyLife
- Whitepages -
@djtoebeans @isol I use the nearby pizza place's number for my loyalty cards—as do a lot of other people.
-
@theorangetheme I'd love to make a bot that just hits all the big LLMs and corporate "help" bots, and simply shuffles their answers to each other randomly, then randomly up or downvotes the replies.
Distant memories of hooking together two ELIZA instances...
-
Hi, this is relevant to my interests. Is there a full set of instructions available for the data broker part of it or is that something I should just go look up?
Thanks for your efforts so far...
@resister check out the list of brokers here: https://github.com/optery/optery-data-brokers-directory
Removal is a game of whack-a-mole (as they regularly just reimport or purchase data from each other if there's a gap). That's why poisoning is more effective.
-
Thanks @alice and everybody else for the info shared!
@zavaj you're welcome!