Hacker News new | past | comments | ask | show | jobs | submit login

I think the issue right now is that private user information is viewed as an asset, not a liability. If we could find a way to make it more of a liability, companies would be less likely to collect it just for the sake of having it, and they would be more proactive in securing it.



Alternatively, if it's truly an asset, can it be taxed as an asset? If I give a company a car, that is taxed. If I give a company my data which is worth more than a car, it isn't.

Is it possible that current accounting/tax law can be interpreted so that these are viewed similarly?


Using the black market as a standard, your identity-related information isn't worth enough to be taxable.[0][1][2]

The more common data you give away is worth even less. Your "gift" is akin to giving away a few grains of sand to a glassmaker who provides a free grain counting service.

Now let's say you dumped a lot sand that we could value at $10K. Any smart sand-counting glassmaker will claim his once "free" sand counting costs $10K, which amounts to an equal, zero-profit trade.

[0]: http://www.bankrate.com/finance/credit/what-your-identity-is...

[1]: https://qz.com/460482/heres-what-your-stolen-identity-goes-f...

[2]: http://www.businessinsider.com/heres-how-much-your-personal-...


Value is derived from user data when its used to target ads. Black market data is never used for that purpose, so its value is much lower. (A company would never take the risk of using black market data)


You (and every other responder) miss larger the point of my comment. Let's use Google as an example. Your clicks throughout the internet, like sand, don't amount to much of value. It's a very unrefined, raw material, with limited quantity. Even if Google were forced to value that raw material, they can argue they're trading it in equal exchange for whatever service they offer you, so there would still be no tax.

In any situation, potentially derived value is not taxable. A car is worth whatever a car was bought/sold for, not including some hypothetical such as whatever I could make by driving it for Uber/Lyft. What matter's here is what will actually occur in the transaction. If Equifax chooses to sell its data, that income will be taxed at whatever price Equifax chooses to sell the data.

Note this doesn't change that your "gift" of peanuts of data is not taxable because (a) your data alone isn't worth squat, (b) even if it was, you got something in exchange for it.


I'll go one step further in saying the discussion framed around clicks being interpreted as a product is incorrect altogether. I think user metadata is part of a users identity and the friction we run up against is whether it ought to be legally protected. It's currently not illegal to sit outside a restaurant and records information about all of its patrons. You'd certainly be in hot water if you tried to do that at any federal building. At some level we know collecting that data is wrong because it can be used against us. Even the judicial branch knows this and requires the storage of user data to be encrypted by security agencies. That's not conclusive proof but evidence of our general outlook on the legality of tracking people.

If we agree in a truly free society then collecting and monetizing metadata should be illegal. If we don't mind giving up that freedom then there's nothing wrong with companies creating a profile on you and tracking you no matter where you go and what you do. But the internet has spoken and we're gladly, albeit unknowingly, giving up any right of protection. I find it worrisome to think of what society will be like in another 50 years if nothing is done to curtail the fleecing of user data.


Aren't you describing the market research industry? Intent is the value, predicting intent is the goal.


Like data in general, user data is essentially worthless until aggregated en masse and refined into insights. But when you consider how much data companies are hoarding, it doesn't take much of an assessed value to create a nontrivial taxable asset.

Also, it doesn't matter that there's an exchange happening. Sales tax and income tax are assessed on fair exchanges of goods, services, and currency.


Sales tax isn't paid on trades. When you trade in a car for $2k off a $10k car, you only pay taxes on the difference: $8k.

And if you grow oranges on your property that you never sell, you never pay taxes on those oranges.


Not in California.

If you accept a trade-in on the sale of a vehicle, the allowance for the trade-in cannot be excluded from the amount on which tax is based.

For example, if you sell a car for $20,000 and accept a trade-in valued at $4,000 as partial payment, tax is based on the $20,000 selling price.

Source: http://www.boe.ca.gov/pdf/pub34.pdf


I shouldn't have used vehicles since they are a special case in some states. At the federal level, taxes on barter income match my description. [0] Taxes are still zero if the company can claim the fair market value of the services they offer you equal the value of your clicks.

[0]: https://www.irs.gov/taxtopics/tc400/tc420


Never heard of this, probably because in my state it doesn't apply. In 42 out of the 50 states (excluding California, Hawaii, Kentucky, Maryland, Michigan, Montana and Virginia [Oregon has no sales Tax]) however, this does apply.

Still though, the example is a good one. You're trading your data for a service. There isn't anything to tax.


It isn't clear where the data comes from. A business in a non-US friendly jurisdiction can make an online business selling black market data in a way that looks legal to data warehouse groups who in turn sell it to the company using it. Once you get to the actual company using the data, there is no indication of it being black market data. The middle company, if it is buying enough data, only takes a few employees who care more about getting the data than verifying the ethics.

It's kind of like arguing a bank would never create fake accounts because the risk of doing so is too large.


Value is derived from the potential application of data. Ads as an application isn't worth much since you can still be shown ads just fine without any personal data targeting.

Black market data is worth way more because it's often more personal than just demographic markers and interests, and can potentially lead to large sums of money.


Personal information is less valuable on the black market due to the difficulty in monetising and extracting the cash. If I have your bank account login details I can move cash out of your bank, but almost no hope of sending cash from a U.K. Domestic savings account to my friendly philapines bank over the web UI. That's why Nigerian Princes still send out emails - the find the one idiot willing to walk into his bank and move the 5million that just arrived in his account

Alternatively you need real criminal gangs - dozens of people willing to walk up and down a London street withdrawing 5k at a time from a 1000 Pre-prepared cards and put the money in their rucksacks. They don't come cheap. And its still cash and still in the U.K.

Get Amazon to send you two dozen laptops to the same address with two dozen cards. All as "gifts". Yeah right. Now you gotta sell them - fences run at 10% if you are lucky.

The lowest effort are simple impersonation for loans, but still you have to take the money and move it somewhere. Into cash? Into the phillipines? See above problems. Open a credit card account? How to intercept it and the PIN number sent by post?

All in all, it's actually pretty darn hard to take personal details and monetise at the "real money" level. These things stop being scalable. You could probably fund a student lifestyle off any combination of the above but millions - not really.

Cf interesting Microsoft paper on this a few years back


I guess we have nothing to worry about with all these data breaches then, right? Might as well tell equifax that it’s no big deal too.

These criminals trade data because it makes them money, otherwise there wouldn’t be much of a makrket.


You're correct, of course, but missing the implications raised by the parent poster, and they are important.

Sensitive personal data is necessary but not sufficient to rip someone off. And if you want to try to make a living ripping people off, there is even more business overhead, making the cost of sensitive personal data an even smaller portion of overall operational costs.

From the point of view of the thief, our personal data is a vital but cheap input into an operation that tends to have very high security costs, viciously expensive liquidity issues and terrible personnel problems, among other more quotidian business headaches.[1]

I suggest trying to think like a crook now and then. Trying to try on other people's lives is a useful way of shaking up one's thinking habits, empathy (don't confuse with sympathy) is always useful, and it can help you keep yourself more secure.

[1] I am leaving out things like several potential fates far worse than bankruptcy and related issues because they aren't opex-related, but they probably do effect retirement planning.


The black market value is like 10-$15 dollars for a basic credit card number I believe. But gold cards and plat cards can be several times more. But drivers ID and ss# that's worth way more for what I believe are obvious reasons. My prices might be a little off, haven't checked in ages. You get the idea. Say 130 million cards at $10 each. Isn't that almost one and a half billion. Not chump change. Ad


Well no that's incorrect. A targeted ad is worth significantly more than one without targeting. I buy ads at a $0.25 cpm and a $40 cpm, the only difference is targeting data.


It still holds the data alone isn't worth much. If you've built an ad platform with customers, reach, and the ability to target people given data, then sure, you can convert that raw material into something more valuable. And once you sell the derived product (ads), you'll be taxed on your income.

I feel like you're arguing that dirt is worth as much as the farm that one could build with it.


You can still show the ads and there are a lot of other signals and context to use. Also other than Facebook or google with strong identity, 3rd party data on the open web is next to useless. If you’re paying $40cpm for data, you’re getting ripped off.


Sorry to be that guy, but: I spend over $5m a year on rtb ads. I literally spend 50 hours a week doing this. If the money I spend doesn't produce verifiable results, I lose it.

For example, that 40cpm is to reach a pool of <1000 users who are in charge of purchasing for networks of hospitals, and my ads are for MRI machines. 3rd party data is unbelievably valuable, probably $1.5 million of my budget goes to data costs alone.


That doesn't add up. I've managed budgets a magnitude higher, know the founders of every major SSP and DMP, and now specialize in B2B marketing for F1000 companies with long sales cycles. If you're really trying to reach a pool that small, open web advertising is incredibly inefficient.

This is well understood by the adtech community and even the flashy new "ABM" companies will tell you the same. 3rd party data is universally terrible. At best, it'll work at scale (of millions) on general demographic details but will definitely not recognize 1000 people on the open web.

That kind of list might work on Linkedin or Facebook with email targeting but it would be easier to focus on niche trade sites without any data, or just use a direct sales team. That $1.5M in data you're paying for would have much better ROI with a good VP of sales.


They're swayed by ads? I'm surprised they don't need to do a formal RFP among the handful of companies who can make an MRI certified for medical use.


The targeted viewer may see the ad and start thinking "we need a better modern MRI with whatever fancy feature I read about here" or "we could hire a new MRI from xyz cheaper than our current contract!".

Sometimes you can prod your potential customers into action.


Who do you send the RFP to?


That anyone would be influenced in what MR scanner they buy due to Facebook advert is amazing. It would also explain things I hear from radiographers overseas. Wow. Are you able to state which vendors buy Facebook adverts (I assume you can’t)?


Might be better off spending £5k+ on personal gifts for each decision maker than bothering with Web advertising if it's that few people you're targeting :-)


I may not be a lawyer, but gifts of $5k to induce someone to purchase a thing for their workplace feels like it should count as corruption and bribery.

If someone tried to do that to me, I’d report the attempt to the company lawyer, and I’d doubt the quality of the thing they were selling was as good as the quality of the thing the other poster was advertising.


Welcome to the advertising industry. You'd be surprised what goes on when media buyers control so much money. There are 20 year old planners with control of 7+ figure budgets for major brands - you can bet they're getting plenty of gifts.


5k+ per person would be bribery. I hope he meant 5k in total..., 1000 gifts of 5$ would be ok


I'm sure he meant what he said. Do the math. He was trying to point out that it would cost the same either way. But if you give the mark the money, it would have a much greater effect than 5k on silly web adds.


what about a $5k rebate or discount?


Difficult to say. My mental framing has, as a result of my previous post, become “be suspicious”.


[flagged]


That honestly sounds borderline mafia or cartel style... "We're going to bribe you, you're going to take it, and since we all know it's illegal, you're on the hook with us all. Welcome to the game kid."


If I’m ever in a position to be worth anyone trying to bribe me, I’ll try to make sure nobody is in a position to threaten me like that.

Fortunately for me, I don’t control any budgets.


In the old days, you would do that.

Now that’s not allowed in most places, plus you need to hire fancy salespeople to deliver those gifts.


A company would never take the risk of using black market data

I bet LinkedIn does.


"None of us is as [valuable] as all of us," is a saying that has been around for decades, surely there are business rules that have cropped up to support a valuation of this scenario in the meantime.


I'm not saying anything different. Whenever a business/individual sells information at a value, they'll be taxed on the sale, just like anything else. OP mentioned selling a cars worth of his own data, which doesn't exist.


It is when you have 3B records


If one company sells anything to another company, that sale is taxed as income like any other sale. So yes, if Yahoo sold 3B records, that would be taxed, and the recipient company can choose to book that purchase as an asset on their balance sheet and will likely expense the purchase. Hell they could even choose to depreciate the value too for as long as they follow GAAP.

OP mentioned his data alone, which isn't worth squat unless the transaction says otherwise. Meaning, if OP sold his data to a company for a taxable amount, he would be taxed on that income.


I think you have it backwards...

Wouldn't black market identities be worth MORE if they weren't so easy to get?

So the more we tax / regulate it, the harder it is, the more valuable they get. Win-win for everyone.


> Wouldn't black market identities be worth MORE if they weren't so easy to get?

No. It's already illegal to buy and sell identities, so black market demand for identities is likely at a maximum already. I'm just using that number as a proxy for what your clicks on the internet must be worth. I'm basically making the assumption that Value(Clicks) < Value(Black Market Identity), which I'd say is a fair bet.

> So the more we tax / regulate it, the harder it is, the more valuable they get. Win-win for everyone.

Again, this wouldn't be true for black market identities, but let's look at clicks.

What I'm saying is that the click you give a way is worth too little to be taxed at all.

Say 1M click data points is worth $1K (which I think is still very generous given the amount of noise) that means each click is worth 1/10¢. Any company that sells the 1M clicks to another company will pay taxes on the $1K of income. So if you increased taxes, you would discourage them from selling your data to another company. This doesn't change your behavior though as a consumer. You still give away an untaxable 1 click at a time (1/10¢): which is not taxable as a gift because of the size of the amount (even 1K clicks is only $1) and because the company can easily argue they provided you with a service in exchange for that 1/10¢.

What you're looking for is a way to penalize companies for receiving data (i.e. for every data point you gain, you owe some $x in taxes.) This would need to be legislated since that's not currently how tax law works.


> Is it possible that current accounting/tax law can be interpreted so that these are viewed similarly?

Yes, that actually happens to be the status quo.

A collection of data is an intellectual property asset just like a patent, or movie rights, or your brand.

If you buy a database, you will, depending on the costs, have to deprecate it over its useful lifetime. That means your tax burden in the first year will be higher than if you blew the money on the company Christmas party. That's the same as if you bought a software license, or Coca Cola Co.

If you collect the data yourself, that mechanism doesn't kick in. The reason is that it's difficult to value intellectual properties' value unless they're traded, and it would allow for too much manipulation of a company's profits.

Now these assets aren't taxed on an ongoing basis in the way you imply. That's because no assets are, except real estate in some jurisdictions.


Does that mean that your ID number (social security for U.S. readers) is taxable upon receipt (birth or immigration)? Also we will need a birthday tax as your age (a key demographic data point) changes then. A marriage tax, moving (address change) tax, employment change tax etc. Tax law will have a concept of taxable data event much like a liquidity event.

Obviously this is a silly thought exercise but it is fun to think about.


Careful there tiger. You're starting to sound like a Sovereign man with your corporate federal account.


I do not know what any of that means. Googling it led to a bunch of conspiracy sites and equally incomprehensible shady semi-legal advice and advocacy sites


late2part was saying that your chain of thought mirrors the conspiracy sites that you mention.


The problem is you aren't "giving" a company anything. The company is observing how you interact with their products.

This is like saying by walking into a store you are "giving" the company your image on their security camera. It would take a very odd definition of "gift" to make that claim.


Sure there is some data companies are collecting of that form but typically it isn't Personally Identifiable Information and even when it is that isn't what people are worried about with the Yahoo breach.

Yahoo was "gifted" data. People explicitly gave them names, email addresses and passwords. That is what Yahoo failed to protect.

> The stolen information included names, email addresses, phone numbers, birthdates and security questions and answers.


The asset concept is interesting. If you introduce taxes into the mix then you will also need to value your asset. If you sell your asset then you need to record a fair value price for which you bought it and when... that way you can record a short or long term capital gain. The problem with digital assets is that you can easily copy them. So what does it mean to sell an asset which you actually still own/have a copy of. Its a bit tough to conceptualize - but I think there's something there... requires a bit more brainstorming.


> actually still own/have a copy of

Record companies forced plenty of DRM related BS down our throats to drag us into a "license not own" rental model. I suggest we return the favor.


> So what does it mean to sell an asset which you actually still own/have a copy of

Yet we manage to sell IP (or just some intangible right to use it)


That’s a fascinating premise. Revenues generated from the use of ones data is taxed like anything else (unless routed through Ireland ;-) ) but I don’t think assets are taxed at rest. I could be wrong.


Indirectly, they are. Governments don't let you blow all your profits on assets that are as-good-as-cash, and then claim you didn't make any taxable profits.

So if you make $X in profit and then use it to buy a tractor, then (from the government's perspective), you've just swapped $X for an asset worth $X. No change in book value, no reduction in profit, no reduction in tax liability.

You are, however, allowed to treat the tractor as an expense that's distributed over several years of its useful life, which is called "depreciating" it.

So yes, to the extent that your cash is exchanged for assets, that counts as a higher book value and higher tax liability (than if it were a pure expense). I don't know if you'd have to treat a "data purchase" more like a tractor or more like buying electricity (a pure expense) though.

My previous, longer comment on the constrains of the tax code and how it results in needing the concept of depreciation: https://news.ycombinator.com/item?id=15060604#15061439


> Indirectly, they are. Governments don't let you blow all your profits on assets that are as-good-as-cash, and then claim you didn't make any taxable profits.

Similar experience here:

In an earlier career my company reinvested all profits back into growth, only to learn that the taxman didn't care about such silly things. The IRS demanded the tax from the profits that had been reinvested and were no longer available.

Plus they wanted the tax from the profits of the growth that had only happened from reinvesting the earlier profits that they wanted tax from. Their demands were in excess of the actual realized profit that had been made by the company.


That sounds like a bug. File a report and maybe a pull request. If only it were that easy.


Stunning... How badly do you really want to be a tax-serf?


I reached this same conclusion from a very different angle. If you're seriously worried about the unchecked power of monopolies, and understand the effects of Metcalfe's law, we should measure the degree of monopolization of tech companies differently than we do traditional industries. Businesses are locked in to Facebook and Google the same way that businesses were locked in to doing business with Standard Oil in the gilded age. The impossibility for regular users to leave the network makes competition de facto impossible, even if the company does not actively engage in anti-competitive behavior (in many cases, they do anyway).

A tax on social software companies proportional to their network size would be an interesting proposal to solve both of these issues. It would also greatly increase the ability for 100,000 - 1,000,000 person "decentralized" social networks (like Mastodon or other competing networks) to thrive.


This is how the civil legal system is supposed to function. There needs to be some very large class action lawsuits brought against these companies, and huge awards need to be extracted in order increase the financial risk of having shitty infosec.


For PCI Compliance purposes, at least, holding user credit card information is already seen as a liability because maintaining compliance is a cost center. That's why there's been some shift towards tokenizing transactions on the fly and directly submitting to the CC company via javascript so that shopping websites never see your CC number even when you enter it on their website - even if you're scheduling future payments.

Maybe we should include other data elements under PCI or similar regulations. SSN?

I imagine HIPAA has similar requirements and associated costs.


Even just a fragment of someone's personal information can be combined with other data points in order to commit identity theft.

Other impacts may not have direct monetary damages, but could be even more devastating (ie Ashley Madison).

Until the courts start clamping down on negligent handling of personal data, firms will continue to cheap out on infosec.


User accounts? Really? This is Yahoo we’re talking about. You really do need user accounts to run an email service


> You really do need user accounts to run an email service

Exactly, regardless of that companies keep asking users for a whole collection of personal data, not always making it obvious which fields are actually required because it's good business for them to get as much personal data as possible.

Average users are usually unsure about a lot of this stuff and naive enough to enter their real data for fear of getting caught "lying".

This happens because companies see this data as an asset instead of a liability, from the companies view not asking for that data/tricking users into giving it away means missing out on assets.

But if you instead make the personal data a liability, by enforcing standards for keeping/sharing it with hefty fines, then fewer companies will go out of their way asking users for personal information they have no business asking for in the first place because it would put them in a position of liability for what happens with said data.


>> User accounts? Really? This is Yahoo we’re talking about. You really do need user accounts to run an email service

> Exactly, regardless of that companies keep asking users for a whole collection of personal data, not always making it obvious which fields are actually required

You literally don't need any user information to run an email service. You only need a means to identify them which could just amount to giving them a long, randomly generated password. Even the username is only necessary for the purpose of being able to identify them as a recipient, not for login itself.


> You literally don't need any user information to run an email service.

I know that and you know that the average user does NOT know that and is too good-natured to enter fake information.

There are plenty of email services out there, among them many of largest and most established ones, where the real name is a required field during registration.

Sure you can always argue "Well just enter fake details" but that's missing the point. The point being that once personal information becomes a liability, instead of something you can just haphazardly hoard as an asset, companies would be much more careful about what kind of information they are asking from the users in the very first place.

Companies abuse the goodwill of the average users by asking for more information than they should because it comes at no cost to them while at the same time being a very big asset. Even if they fail to secure these assets and a breach happens, most of the costs of that are externalized onto the users whose data actually got leaked, the consequences for the company are often only cosmetical, some bad PR/stock prices take a little downturn. But the brunt of that will be over after a couple of weeks and after that, it's back to business as usual.

That needs to change, companies need to be held liable for:

A) Needlessly asking for and hoarding personal information B) Sloppy treatment of information resulting in a leak

Yes, this could very well be opening Pandora's box, but something about the current state of things really needs to change.


Wouldn't the emails themselves count as user information?


No. User information here means information denoting a user not information from a user.


That's dangerous schematic games along the same lines of "Metadata is harmless and can't identify anybody".

Emails can sometimes contain very detailed and very denoting user information. Trying to differentiate between users "personal information" and users "personal content" is imho a rather dangerous thing to do because who decides where to draw the lines between the two?

As a user, I expect my data, regardless of which data, to stay private unless I explicitly intent to publish it to the public or somebody else. I most certainly do not expect some employees reading through my private emails for their lunch-break entertainment.


> That's dangerous schematic games along the same lines of "Metadata is harmless and can't identify anybody".

That's a complete strawman argument that has nothing to do with what I wrote. The distinction is correct and factual in this exact situation. You are attempting to redefine terms for apparently no reason other than to argue.

Whether emails contain detailed information or not is irrelevant to the term "user information" in this context, meaning information about a user. The discussion is about whether an email service requires personal information to operate.

> As a user, I expect my data, regardless of which data, to stay private unless I explicitly intent to publish it to the public or somebody else. I most certainly do not expect some employees reading through my private emails for their lunch-break entertainment.

In the real-world, you either need to change your expectations or encrypt your data.


Sure, but you don't need first name, last name, phone number, birth date or gender. All of which are asked on the signup and of which only Gender is specified as optional: https://login.yahoo.com/account/create

On my small business we ask only for an email address, password and confirm password. Everything else is excessive.

Tax obligations can be another problem which may require an address, but often have a simpler way to resolve them by simply picking the appropriate country and state off a list or even with just a checkbox for "are you in X jurisdiction which I am required to tax?". I believe Tarsnap handles it that way.


Tax obligations can be another problem which may require an address, but often have a simpler way to resolve them by simply picking the appropriate country and state off a list or even with just a checkbox for "are you in X jurisdiction which I am required to tax?". I believe Tarsnap handles it that way.

Tarsnap has a "are you Canadian" checkbox. Unfortunately if you are Canadian I have to collect your name and address because I have to provide[0] invoices/receipts which contain this information.

Mind you, there's no requirement that you give me truthful information. If you claim to be John Smith living at 123 Main Street, you'll get an invoice which says that at the top of it. You won't be able to use it to claim a tax rebate; but if you're not running a business it's not useful for that purpose anyway.

[0] IIRC I technically don't have to provide those such invoices to everybody; merely to anyone who asks for one. But collecting the information up front and emailing PDFs to all the Canadians is much easier than handling individual requests later.


You need to collect date of birth for COPPA compliance


Where in COPPA does it say the DOB needs to be collected? I don't think that statement is true as you word it.


My memory of implementing COPPA compliance a decade ago was that DOB was an implicit requirement, the explicit requirement being “confirm they’re over 13; a checkbox isn’t good enough because they’ll clearly just lie.” (paraphrased, not quoted).


Thank you for explaining that for me.

That's what I had thought. However, users will lie about a DOB if they will lie about an age.


Does that mean you have to retain it beyond the initial check?


No idea. UK games company, so UK data protection act also applied, which I think actually forbids retaining any more than strictly necessary.


First and last name at least needed for meet the email protocol. Emails shouldn't be addressed to handles/nicknames


"username <username@yahoo.com>" is perfectly valid in SMTP last I checked.

If it's a user experience thing, fine, but at least make it an optional field.


How would you handle all the people with identical names? Even if you used DoB, there is a chance for a duplicate.


The irony of that comment being made from an account with a username that doesn't clearly identify a person..


Deliberate sarcasm?


How much do you actually need to know about someone to serve up email to them?


For a consumer mail service, you to need to know enough to let them recover their account, possibly with decades of un-backed-up correspondence with and photos of since-deceased friends and relatives, when they’ve forgotten their password, and without letting someone else recover their account. This is a hard problem.

(I’m expecting some idealized “solutions” from people with idealized beliefs about mass market tech skills.)


Why don't we just <incredibly naive solution that does nothing to help, but somehow also involves the blockchain>?


This is such a nasty area, with such crappy solutions (name of your first pet), that it should become a service all of its own.

i.e. when I sign up with you, I can choose my own preferred vendor to handle identity recovery.


You also need a process for resolving ownership disputes. Facebook takes the tactic of having the person claiming ownership upload government-issued ID, which seems like it would be the only foolproof way to do so, yet they're constantly maligned for it.


Most sites solve this problem with a combination of 2fA and at least one security question.

Having my phone number isn't even good 2fA.


You almost certainly need a password hash, and that’s (as I understand it) the main privacy worry related to the Yahoo breach.


A free email service.


My personal data is an asset. And it belongs to me.

Anyone who has my data for any purpose owes me my cut.

Making this a property rights issue solves all the privacy & identity issues.


> My personal data is an asset. And it belongs to me.

> Anyone who has my data for any purpose owes me my cut.

It's well established in the US that you do not in fact own your data. You don't own your school records or employment records. You don't own your medical records or your credit records. In general the best you have is a right to view those records and that's only in certain cases.


Because Freedom Markets™?


Pretty sure it's the same across the world, really. Try getting your medical records expunged in Cuba or your search records expunged in China. I bet you have the same success as in the US.


Maybe you should look at free people in mostly free countries, instead of dictatorships.

For instance, the EU and Switzerland.


Have you successfully gotten Google or Facebook or your doctor to purge your records in Switzerland?


Are you suggesting some corporations have become more powerful than our lawfully elected governments?


I’m suggesting that “data about you” is not the same as “your data” and never has been.


I honestly don't know how to distinguish between "me" and "about me".

These discussions always go full meta. Makes my head hurt.

Being a simple bear, I try to distill these paradoxes (freewill, love, death, what is art) down to something actionable. Hence my conclusion, after much thought and effort (eg securing medical records), that "I am my data, my data is me." and therefore I own it.

If privacy is the ability to control what is publicly known about yourself, the best (practical, prescriptive) way I can think to do that is via property rights.

---

I appreciate your reply. I'm going to revisit my beliefs, conclusions. Starting with the currently generally accepted definitions.

https://en.wikipedia.org/wiki/Privacy

https://en.wikipedia.org/wiki/Personal_identity

https://en.wikipedia.org/wiki/Authentication


> I honestly don't know how to distinguish between "me" and "about me".

> These discussions always go full meta. Makes my head hurt.

You go to the store to buy a carton of eggs. The store now has data about you and your purchase. If you pay with credit card, they have a record tied to your identity. If you pay with cash, they still have a record of what you bought with your eggs, and nothing stops them from scribbling your name on the copy of the receipt they keep.

You have no right to demand that the store cease possession of this data. They might use this data (in aggregate) to determine when they need to restock eggs. They might use this data (along with other purchase records) to determine that butter should be stocked next to the eggs. They might discard this data as soon as books are reconciled or they might retain this data in perpetuity. This was the case is 1920 and it's the case now. We like to talk about "big data" as if it changed the fundamentals, but all it actually changed was the scale.


Well. This is where we'll disagree then.

No entity has any right whatsoever to retain any data about me, for any purpose.

Your rights end where mine begin. My right to privacy trumps everyone else's profit motive.

Edit: Scratch that. Our disagreement is more fundamental. I believe humans have a fundamental right to privacy. You don't.


I absolutely believe in the right to privacy. But I don’t think the right to privacy extends that far. I think it’s kind of unreasonable that everyone else loses their rights to record data to protect your right to privacy. This runs counter to the first amendment and makes journalism impossible. It also makes it impossible to do things like monitor the police.


Good points.


I'm torn between liking this view of personal data as property and also liking the view of Richard Stallman and the FSF that "intellectual property" is a legal fiction that we ought to resist. What does it actually mean to "own" data, and is "property" the best metaphor to represent a set of personal data control rights?


I hear ya. Pinko commie liberal me abhors the idea.

Legal fictions like "property", "money", and "rights" are practical innovations that make society work better (eg more moral, greater public good). Kinda like the tech tree in games like Civilization.

The books "The Mystery of Capital" and "Nonzero" influenced me a lot. Good starting points, optimistic, and more right than wrong.

https://en.wikipedia.org/wiki/Hernando_de_Soto_Polar

https://en.wikipedia.org/wiki/Robert_Wright_(journalist)

Until something better comes along...


RMS' objection to "intellectual property" isn't an objection to the concept of property in general, but rather an objection to conflating the rights that are associated with tangible property with the rights that are granted by copyright law and patent law. The canonical example is the language of "theft" used by organisations like the MPAA to refer to copyright infringement - RMS would say that illegally copying a copyrighted work doesn't deprive anyone else of their copy of said work, unlike stealing a sandwich from someone.

RMS might disagree with me here, but I think the same thing can be said in this case - we need more precise terminology that accurately describes the types of infringement when it comes to misuse of PII.


> My personal data is an asset. And it belongs to me.

Hm. How would this work for (say) a social security number? Does it belong to you? Does every number anyone chooses to identify you belong to you?


Any non government entity storing data about me (PII and beyond) can only do so with my (revokable) permission and owes me my cut.


In Switzerland, anyone collecting data about other people must make a public declaration of that collection, and may not keep such records about people who disagree with being thus documented ("fiché").


There is a similar law in the UK: https://en.m.wikipedia.org/wiki/Data_Protection_Act_1998

Which itself is based on an EU directive: https://en.m.wikipedia.org/wiki/Data_Protection_Directive

This is to be replaced with an updated version between now and Brexit (that’s just going to add to the fun!): https://en.m.wikipedia.org/wiki/General_Data_Protection_Regu...


For the curious: There's an English translation of the Federal Act on Data Protection.

https://www.admin.ch/opc/en/classified-compilation/19920153/...


Okay? People signed up for Yahoo accounts so that Yahoo could provide them email, messaging, fantasy sports, and other account-based webapps. How is their possession of the records of those accounts some kind of property crime? This isn't Equifax.

I'd buy a negligence argument, but there's no much to find fault with regarding possession.


You agree to give up your data in return for services. Yahoo mail, or gmail for that matter aren't actually free. You are trading your data for a service.


No contract may take away a person's rights.


I don't understand. You believe you have the right to have no records of yourself written anywhere. You believe it is impossible to contract away this right. You've given HN an individual identifier for yourself, and also furnished your political views (a specially protected category under the GDPR) to its database.

Aren't all your comments proof of Y Combinator's human rights violation against you? Shall we have HN shut down and its operators jailed? Obviously this isn't the world we live in, but isn't it the one you're arguing for?

Maybe you can't give permission to have data treated carelessly, but it seems absurd to say you can't give permission to have data collected at all. Opening an account with Yahoo is surely consent to let Yahoo have a record of that account.


Guessing you don't support the right to be forgotten.


No, the "right" to edit other people's memories and disappear information from the public domain is the dystopian hellscape on the first page of 1984.


On the contrary, it is commonplace for contracts to take away your rights.

A common example is an arbitration clause, where you sign away your rights to use the courts to resolve disputes.


Some rights, but not all rights. You cannot (in any country I am aware of) contract away your right to life, nor turn yourself into a slave in exchange for your debts being forgiven.

The latter used to be possible, if I understand serfdom correctly.

Question is, should data rights be alienable or inalienable?


Thank you for articulating, explaining stuff better than I can.


If you're talking about what rights you should have, that's fine. Currently, however, there is no defined right in the US such that service providers like google, Facebook, etc can't make use of the data they acquire from you. If you consider data on you to be more valuable than the service provided, don't use the service.


Does it really belong to you if you have no control over it?


So you do understand.


I believe EU's GDPR made some efforts in that direction, but I'm not sure it went far enough.

We need laws that give companies incentive to store very little data on us outside of what's absolutely required for the functioning of the service. And if they do store additional info, and their servers are breached, then automatic hefty fines should be paid (right after the mandatory notification to authorities and the public).

That should encourage companies to either minimize data collection or use end-to-end encryption, where most of that additional data would be stored on the client's device. This would have to exempt them from liability, and it should since the data wouldn't be on their servers if breached.


I was about to mention the GDPR - it definitely is a step in the right direction. I don't think that it doesn't go far enough - compared to previous regulations, it is quite severe, and it already is a pain to implement as it is. If it went any further, many companies would probably not even bother and somehow do their business outside the EU, or just prepare to be fined

> if they do store additional info, and their servers are breached, then automatic hefty fines should be paid (right after the mandatory notification to authorities and the public).

This is already in the GDPR - you have to notify everyone affected about breaches, and the fines can go up to 20 million or 4% of annual turnover, whichever is greater.


GDPR already does more than any existing law in either US or Europe (not sure about other countries). As every law it will be reviewed and can be made stricter. Changes are likely needed anyway as companies try to circumvent the directive with "creative" ways or incentives for users to give up privacy.

But it's a huge step forward compared to the existing situation.


This could be a voluntary insurance that companies purchase on behalf of their users. If the company suffers a breach, they will be bound to pay X amount to their users depending on the data lost.

Dress it up with a fancy badge to slap on the front of their site. Maybe a silver badge means user data is insured up to $10 each; a gold badge is up to $100; platinum up to $1000.


So Yahoo would have been insured for somewhere between $30 Billion and $3 Trillion in this scheme? That seems untenable. Good luck collecting from the bankrupt insurer.


Good point, although the report states 3 billion user accounts were breached but this doesn't mean 3 billion people. I am guessing the vast majority of accounts did not contain any sensitive information.

And maybe insurance isn't the right word; the risk should probably fall to the company holding the data, not a third party who would never be able to audit every single step to ensure there is no weak link.


The first step towards this is having useful industry standards for auditing and certification that actually work... then you can think about an insurance market where insurers force certification.


True, it wouldn't be 3 billion individuals claiming the benefit. Still the scale is so large that it would utterly bankrupt most companies to pay out for a single breach.


If the cost of disclosure was a dollar a user there's pretty much no way we'd see them voluntarily tell us they were hacked. We'd have to wait until the information got out some other way.


Not a perfect solution, but sure. "You're the weakest link - goodbye!" <everyone else flinches and reviews infosec>


I think hehheh is saying that a policy like this would strongly encourage hiding breaches. No one would openly admit a breach if they knew it would kill the company. The net effect would be less transparency, not better security.


Sure, I like the thinking. In theory some of these costs will hit the errors and omissions insurance, which will drive up their costs in the long run (I know they are being absorbed by Verizon, but typically...). In turn part of the insurance evaluation would they assess the collection of the data as a risk as well as their track records in keeping it secure.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: