Someone else can answer better than me, but https://panopticlick.eff.org claims the canvas fingerprint provides 17 bits of identifying information (click detailed results after testing)
On mine, the System Fonts give 17 bits of info (1 in roughly 200,000 computers). User-Agent is next with only 8 bits.
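The "17 bits" and "1 in roughly 200,000" figures are the same number in two units: bits of identifying information are just log2 of the anonymity-set size (log2(200,000) ≈ 17.6, which Panopticlick rounds). A quick sketch of the conversion (helper names are mine, not Panopticlick's code):

```javascript
// Convert between "one in N browsers share this value" and bits of
// identifying information (surprisal). Illustrative helpers only.
function bitsFromOneInN(n) {
  return Math.log2(n); // 1-in-131072 -> exactly 17 bits
}

function oneInNFromBits(bits) {
  return 2 ** bits;
}

console.log(bitsFromOneInN(200000).toFixed(2)); // ~17.61 bits
console.log(oneInNFromBits(17)); // 131072
```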
Based on that alone, it seems that just replying back with either a blank font list or the minimal standard font list (e.g. only Times & Arial) would solve most of this problem.
A blank font list where? There's no way to get a direct list of fonts: you just try rendering text with a given font and compare its metrics against the fallback font's. Font enumeration is done via side channels (which also means you need a list of candidate fonts to sniff for in the first place).
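The side channel looks roughly like this (names are illustrative, not any real library's API). In a real page you'd render a probe string twice, once as `"CandidateFont, monospace"` and once as plain `"monospace"`, then read each span's `offsetWidth`/`offsetHeight`; the decision logic on the measured metrics is the only part shown runnable here:

```javascript
// If the candidate font's metrics differ from the fallback's, the
// candidate font is installed and overrode the fallback.
function fontLikelyInstalled(candidateMetrics, fallbackMetrics) {
  return candidateMetrics.width !== fallbackMetrics.width ||
         candidateMetrics.height !== fallbackMetrics.height;
}

// Note: you can only probe fonts you already have names for, e.g.:
const FONTS_TO_SNIFF = ['Calibri', 'Helvetica Neue', 'Ubuntu'];
```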
The only way to stop font-based side channels is to limit the web to a fixed set of fonts, and that would horribly break the web for some linguistic communities: there's a fair amount of web content that relies on specific fonts which map old Windows codepages to other characters to support a language, often dating from before Unicode covered those characters.
You also need identical fonts for a given user agent, and that's very hard to guarantee short of shipping your own fonts (e.g., consider an OS update that changes a font!), and that becomes expensive fast.
Take two block-level elements, render some text between them, and calculate how far apart the two elements are: you can already determine the height of the glyphs.
So, yeah, to disable that you'd have to entirely disable the CSSOM, which would cause ridiculous amounts of breakage.
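A sketch of that measurement (element IDs and names are made up for illustration). In the browser you'd do something like `below.getBoundingClientRect().top - above.getBoundingClientRect().bottom`, since the text rendered between the two blocks pushes the lower one down by exactly its rendered line height; the arithmetic itself is trivial:

```javascript
// Infer the rendered height of text sitting between two block-level
// elements from their CSSOM-reported positions (illustrative helper).
function inferredTextHeight(belowTop, aboveBottom) {
  return belowTop - aboveBottom;
}

// e.g. if the upper block ends at y=120 and the lower begins at y=140,
// the text between them rendered 20px tall:
inferredTextHeight(140, 120); // -> 20
```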
Unless every browser in the world adopts the same list, replying with a fixed list of fonts would make users of a given browser immediately recognizable (especially for low-marketshare browsers like Tor). Seems like you'd want a system where the response to a list-of-fonts query would be semi-random and likely to overlap with the lists that are naturally produced by other browsers.
Generally speaking, there are two approaches (that I'm aware of) for addressing fingerprinting: one is to "hide in the crowd", i.e., return values that are common across the browser population. The other is to create a unique value for each separate session (analogous to how incognito mode discards cookies).
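The "semi-random but overlapping" idea from a couple of comments up could combine both approaches; a sketch, where all names and font lists are made up for illustration: always admit to a common baseline, then report only a random subset of the rest, so the answer varies per session yet still resembles ordinary browsers.

```javascript
// Fonts every response admits to, to blend in with common lists:
const BASELINE = ['Arial', 'Times New Roman', 'Courier New'];

// Per-session randomized font-list response (illustrative only).
function reportedFonts(actuallyInstalled, rng = Math.random) {
  const extras = actuallyInstalled
    .filter(f => !BASELINE.includes(f))
    .filter(() => rng() < 0.5); // drop each extra font half the time
  return [...BASELINE, ...extras];
}
```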
See: https://www.microsoft.com/en-us/research/wp-content/uploads/... [PDF!]
But user agents already identify the browser, right?
I agree that implementing this first in Tor is probably not a good idea, but if Firefox were to do it first, then I don't see the problem. "They're a Firefox user" isn't nearly as specific information.
User agent gives the browser version and platform version. Two macs with the same OS version and the latest version of Chrome will have the same user agent.
It would reduce the variability. 1 in 200,000 is reasonably unique. But if all Firefox browsers reported the same result for fonts, then it would provide no more information than the spying website already has (i.e. the user is using Firefox).
I'd bet that Chrome would follow quickly, which would put pressure on Apple to do the same. If that happened, we'd have a minor victory.
All I'm trying to do is reduce information that is needlessly leaked out by a browser. True privacy still requires more.
This would also have the positive side effect of reducing rendering differences across browsers. At the moment there's a risk that the browser a webpage is viewed in doesn't have the right fonts.
There's no reason for browsers to make a large number of fonts available if websites aren't able to use them because not all browsers make them available.
However, there may be an issue with internationalisation.
I think this overestimates the amount of entropy. If canvas hardware acceleration is disabled, the only things that can really affect the output of the Panopticlick canvas fingerprint are the OS version, the user agent, and the CPU's available vectorization instructions.
> Presumably they are unprivileged instructions on x86?
They're all unprivileged; having to go to the kernel would defeat the purpose most of the time.
Also, trapping them wouldn't make a difference. Masking the CPUID feature flags, on the other hand (so that those code paths are never taken in the first place), would actually help.
The 2012 UCSD paper [1] claims they observed 5.73 bits of entropy in their admittedly non-representative population.
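For reference, an entropy figure like "5.73 bits" is the Shannon entropy of the distribution of fingerprint values observed in the study population. A sketch of the computation (the counts below are made up for illustration, not the paper's data):

```javascript
// Shannon entropy in bits of a distribution given as raw counts.
function shannonEntropyBits(counts) {
  const total = counts.reduce((a, b) => a + b, 0);
  return counts.reduce((h, c) => {
    const p = c / total;
    return c === 0 ? h : h - p * Math.log2(p);
  }, 0);
}

// A uniform distribution over 4 values carries exactly 2 bits:
shannonEntropyBits([10, 10, 10, 10]); // -> 2
```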
As with everything, it depends on the user's threat model. In a court setting, it'd depend on how individual pieces of evidence stack up against a user to make them look bad, and whether there is enough reasonable doubt.