Cyberabad cops nab man who stole personal info of 66.9 cr people in 24 states

mnw21cam · on April 2, 2023

For reference, "cr" is an abbreviation for the Indian word crore, which is 10^7, so this is 669 million people.

throw0101c · on April 2, 2023

Interesting notation / digit grouping as well:

> It is written as 1,00,00,000 with the local 2,2,3 style of digit group separators (one lakh is equal to one hundred thousand, and is written as 1,00,000).[1]

* https://en.wikipedia.org/wiki/Crore

* https://en.wikipedia.org/wiki/Indian_numbering_system#Use_of...

> In the Indian system, the next powers of ten are called one lakh, ten lakh, one crore, ten crore, one arab (or one hundred crore), and so on; there are new words for every second power of ten (10^(5 + 2n)): lakh (10^5), crore (10^7), arab (10^9), kharab (10^11), etc. In the Western system, the next powers of ten are called one hundred thousand, one million, ten million, one hundred million, one billion (short scale)/one thousand million (long scale), and so on; in the short scale, there are new words for every third power of ten (10^3n): million (10^6), billion (10^9), trillion (10^12), etc.

* https://en.wikipedia.org/wiki/Indian_numbering_system

sva_ · on April 2, 2023

> In the Western system,

Surely they mean "in the English system". In German, we have "Milliarde" between "Millionen" and "Billionen", and so on.

In fact, they will end on "-illion" if it's a power of 10^(6n) and on "-illiarde" if it's a power of 10^(6n+3).

adhesive_wombat · on April 2, 2023

It used to be like this in British English (the so-called long scale, where milliard was 10^9 and billion was a million millions, 10^12).

Now, indeed, the split is roughly along the line of English (and Russian, Turkics and Arabic) on the short scale side and everyone else, especially Continental European language speakers on the other. It's not a universal rule: Brazil is maybe the largest defector from the Continental European group to the short scale, but Canada reinforces the rule by using both, with the choice being dictated by whether you're speaking English or French.

throw0101c · on April 2, 2023

> Surely they mean "in the English system".

It's Wikipedia: feel free to click "Edit" and correct/update it.

ChoGGi · on April 2, 2023

And then have it reverted soon after.

midasuni · on April 2, 2023

In the olden days English used have billion as a million million.

For international purposes (america) this “long scale” fall out of use, with he “short scale” being recommended in the U.K. since 1974

mnw21cam · on April 2, 2023

I'm British, and I'm still sore about this one, even though that "recommendation" was before I was born. The long scale is the right one, darnit. Just because the government says otherwise doesn't make it so.

The best thing I can offer is that the word "billion" and greater should be regarded as cursed, ambiguous, and shouldn't be used any more.

chimeracoder · on April 2, 2023

> For international purposes (america) this “long scale” fall out of use, with he “short scale” being recommended in the U.K. since 1974

I know a lot of people who didn't get that memo, because I still see the long scale with some regularity.

That said, the short scale is much more common overall.

masfuerte · on April 2, 2023

Where? I'm British and old and, apart from in reference books much older than me, I've never seen the long scale used in any context.

chimeracoder · on April 2, 2023

> Where? I'm British and old and, apart from in reference books much older than me, I've never seen the long scale used in any context.

Vestiges of it pop up from time to time. It's not really something you can Google directly for, but as an indication, literally the second hit on Google for "thousand million" - a term that only exists due to the long scale, and which is obsolete in the short scale - is an article about a Brexit-related bus ad:

https://www.independent.ie/world-news/and-finally/brexit-bus...

andreareina · on April 2, 2023

c.f. https://en.wikipedia.org/wiki/Long_and_short_scales

grishka · on April 2, 2023

Between? That's odd. In Russian, it goes миллион (million, 10^6), миллиард (milliard, 10^9), триллион (trillion, 10^12), квадриллион (kvadrillion, 10^15), etc in 10^3 increments. "Биллион" isn't a word. It does exist in the macOS dictionary (I just checked) but I've never ever seen or heard it used.

aix1 · on April 2, 2023

In Russian, the word "биллион" is mainly of historic interest: https://ru.wikipedia.org/wiki/%D0%91%D0%B8%D0%BB%D0%BB%D0%B8... (see the note about "Арифметика" Магницкого.)

Since Russia uses a short-scale system, the word is no longer in use. https://en.wikipedia.org/wiki/Long_and_short_scales

fabianhjr · on April 2, 2023

That's called a long scale (also in use in LATAM) in contrast with a short scale (in use in the US and Canada)

unwind · on April 2, 2023

Thanks.

Meta: the title should be changed, in my opinion.

vijaybritto · on April 2, 2023

I expect trouble for activists talking about it cause when Aadhar data was available for sale on the darknet the govt threatened to jail anyone publishing info about it

https://www.moneylife.in/article/aadhaar-data-breach-largest...

bvsrinivasan · on April 2, 2023

The net number (669 million) looks wrong. India's population is around 1.4 billion, so this would mean a data leak of nearly 1 in 2 Indians. If we further remove children below 14 (30% of India) who are unlikely to have data of their own and others who are completely off any of the data leak sources, the number given here would mean everyone in India has had their data leaked!

The data distribution given in the article seems to add to approximately 7 crores (70 million). I think there is a misplaced decimal somewhere. In all probability it is 6.69 crores (66.9 million). Still very significant, though.

notimetorelax · on April 2, 2023

They show 21 crore from Uttar Pradesh. This state has enough people, but it’s almost everyone. So yeah, hard to believe the original statement.

bvsrinivasan · on April 2, 2023

Thanks! I didn't read the data in the pictures, only read the text. Time for GPT4 I guess :-D

As you said, this also looks wrong. Maybe double-counting? Same people or IDs getting leaked across multiple platforms?

KennyBlanken · on April 2, 2023

> India's population is around 1.4 billion

1.4b currently living people.

Just because people pass away doesn't mean their data stops existing in government records.

I'm not saying the figure is necessarily correct, just that your reason for dismissing it is faulty.

zeckalpha · on April 2, 2023

66.9 crore is 669 million for those unfamiliar with the unit.

avi_vallarapu · on April 2, 2023

Interesting to see the discussion going towards converting 66.9 cr to Millions.

However, what is the type of data is my curiosity. Is this sensitive data such as PCI or some details such as : Names, Phone Number, Location etc. If it is sensitive data, how he got that data is the biggest worry.

Hope there is enough clarity on the type of data.

tianqi · on April 2, 2023

Is Cyberabad the name of the place? What can people expect to happen in a place called cyber-bad?

vrc · on April 2, 2023

Hyderabad city (specifically the area called HITEC City) in Telangana is often called Cyberabad[0] because of all of the tech co’s there.

[0] https://en.m.wikipedia.org/wiki/HITEC_City

phanimahesh · on April 2, 2023

Cyber-abad, not cyber-bad. Abad means place, a somewhat common suffix. See https://en.m.wikipedia.org/wiki/Abad

chsreekar · on April 2, 2023

The region is called Hyderabad. The tech township it hosts has been dubbed Cyberabad

cubefox · on April 2, 2023

I thought the headline sounded like from an 80s cyberpunk short story!

antibasilisk · on April 3, 2023

that sounds awesome

notRobot · on April 2, 2023

Cyberabad is a cool name.

choxi · on April 2, 2023

I was curious because it doesn’t sound like a typical Indian name, apparently it’s a nickname for Hyderabad because there’s a large tech industry there

samarthr1 · on April 2, 2023

Correction, it is a region of Hyderabad which plays host to a lot of tech... So the financial district, Gachibowli, hi tech city, madhapur and the other tech areas come under cyberabad limits.

v7engine · on April 2, 2023

How did he get all this data? Did he hack websites?

harrymit907 · on April 2, 2023

More than half the data there is available on Breached and other similar forums and telegram groups freely. These clueless cops do not know this and just have to make some headlines to keep the budget flowing.

sciencesama · on April 2, 2023

Any links of the websites you are referring to ?

plq · on April 2, 2023

So that's what he was wearing while waiting for the db dump to finish, I guess?

Man, tech reporters are killing me sometimes.

zeeshanmh215 · on April 3, 2023

would be interesting to see who was his clients.