Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Hashing is better than storing it in the clear, but many email addresses have too low entropy to make hashing an effective anonymization.


Someone did a pretty good analysis of that problem here: https://medium.com/@matthew.bajorek/using-hashcat-to-recover...


I ran such an attack on stackoverflow's data dump in 2011 and recovered about 28% of emails, using only a simple laptop CPU.

https://meta.stackexchange.com/questions/44717/is-gravatar-a...

Somebody else ran a similar attack on a different dataset using a GPU and recovered around 45%.

https://arstechnica.com/information-technology/2013/12/crypt...




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: