I read the entire thing, and honestly the heap grooming is very interesting, but really that's the boring part -- lots of trial and error, padding memory, etc. Also interesting that linked-lists aren't used by Apple† (and Ian Beer's suggestion that they ought to use them), but that's neither here nor there. Getting kernel memory read/write is also very interesting, albeit (again) a bit tedious. At the end of the day, it all started with this:
> Using two MacOS laptops and enabling AirDrop on both of them I used a kernel debugger to edit the SyncTree TLV sent by one of the laptops, which caused the other one to kernel panic due to an out-of-bounds memmove.
How did this even pass the _smell_ test? How did it get through code reviews and auditing? You're allocating from an untrusted source. It's like memory management 101. I mean, my goodness, it's from a wireless source, at that.
† In this specific scenario, namely the list of `IO80211AWDLPeer`s.
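To make the class of bug in that quoted panic concrete, here is the general shape of it -- purely illustrative C++ with hypothetical names and sizes, not the actual AWDL code: a length taken from an attacker-controlled TLV gets fed to memmove against a fixed-size buffer.

    #include <cstddef>
    #include <cstdint>
    #include <cstring>

    struct SyncTreeState {
        uint8_t tree[60];   // fixed-size buffer inside a larger heap-allocated peer object (size is a placeholder)
    };

    // Hypothetical parser: 'len' comes straight off the air from the TLV header.
    void parse_sync_tree_tlv(SyncTreeState* st, const uint8_t* payload, size_t len) {
        // BUG: nothing checks that len <= sizeof(st->tree), so a malicious
        // peer picks len and the copy runs off the end of the buffer into
        // whatever happens to sit behind it on the heap.
        std::memmove(st->tree, payload, len);
        // The missing check: if (len > sizeof(st->tree)) drop the frame.
    }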
Because attackers only have to find one place that was unlucky in implementation, and hence defenders are burdened with eliminating every last one of them.
This is why implementing your network protocols in unsafe languages is bad. Testing can only find some bugs; it cannot ensure the absence of bugs.
Now I know it's deeply comforting to think if you just had "safety" you could write all the code you want with abandon and the computer would tell you if you did it wrong, but this is a sophomoric attitude that you will either abandon when you have the right experiences, or you will go into management where the abject truth in this statement will be used to keep programmer salaries in the gutter, and piss-poor managers in a job. Meanwhile, these "safe" languages will give you nothing but shadows you'll mistake for your own limitations.
My suggestion is just learn how to write secure code in C. It's an unknown-unknown for you at the moment, so you're going to have to learn how to tackle that sort of thing, but the good news is that (with the right strategy) many unknown-unknowns can be attacked using the same tricks. That means if you do learn how to write secure code in C, then the skills you develop will be transferable to other languages and other domains, and if you still like management, those skills will even be useful there.
You can’t just build a better developer when a single mistake is end game. Even if you do everything right you can still run into problems.
The reason is that large projects can’t have only one developer. As soon as you have multiple developers you have a problem. What happens when two developers begin working from the same base commit? Developer A makes a change that removes a contractual behavior that is not relied upon. Developer B makes a change that relies on this contractual behavior. Both changes are correct on their own, could very well pass code review simultaneously, and then both merge without conflicts (a concrete sketch follows below). At that point your last lifeline is whatever guarantees you have via static analysis, etc. (Notably, this could still fail in a memory-safe language if there aren’t any safeguards for this particular logic bug. Nothing is a panacea. Having more tools to write safer code, though, can at least help prevent some of these cases.)
That’s assuming everyone is perfect and has unlimited time to write perfectly sound code always. And it still fails.
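A contrived C++ sketch of the two-developer scenario above (all names hypothetical): A relaxes a contract that nobody currently relies on, B writes a new caller against the old contract on the same base commit, and both changes merge cleanly.

    #include <cstdint>
    #include <unordered_map>

    struct Frame {};
    struct Peer { void update_stats(const Frame&) {} };

    std::unordered_map<uint64_t, Peer*> peers;

    // Developer A: drop the documented "never returns nullptr" guarantee
    // (it used to return a static sentinel for unknown peers). No caller
    // relies on that guarantee at the time of review.
    Peer* lookup_peer(uint64_t mac) {
        auto it = peers.find(mac);
        return it == peers.end() ? nullptr : it->second;
    }

    // Developer B, branched from the same base commit: a new caller written
    // against the old contract, so it dereferences without a null check.
    void handle_frame(uint64_t mac, const Frame& f) {
        lookup_peer(mac)->update_stats(f);   // null dereference once both land
    }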
You point to Rust but nobody said it had to be Rust. Still, just because Rust is not a panacea does not mean it has no value. On the contrary, while there have been decades to hone practices for secure C, Rust is a relative newcomer and obviously shows a ton of promise. It and other new memory-safe languages are very likely to take a bite out of C usage where security is important. You can embrace this or deny it... but if you think it’s not happening, you should definitely take a look at the writing on the wall, because it’s certainly there. On the other hand, there are also other approaches. I believe seL4 is doing C code with proofs of correct operation. (Admittedly, I do not fully understand what guarantees this gives you and how, but it sounds promising based on descriptions. There could still be bugs in the proofs, but it certainly raises the bar.)
Doctors benefit from better tools and processes too, but this is all wildly beside the point, because my point was not that we can’t build a better programmer, it’s that we can’t just build a better programmer, for the reason that I then went on to outline.
You bring up a great point, though. Historically, C is often not trusted for software where people’s lives are on the line. Thus bringing up doctors is a great example of how building a better programmer is not good enough. There’s an entire class of “safety critical” programming practices and standards, and it was common to prefer a language like Ada that turned more bugs and logic errors into compiler errors.
> it’s that we can’t just build a better programmer, for the reason that I then went on to outline.
Apologies for missing your point. I thought you meant by this:
> The reason for that is because large projects can’t have only one developer.
... that we would have to change more things in our business and software cultures rather than just making programmers smarter (something I could agree with), not that you believed this was some kind of truism.
I don’t believe this is true. Why are you convinced it is impossible to do “big” things without big teams of idiots?
> Historically, C is often not trusted for software where people’s lives are on the line. Thus bringing up doctors is a great example of how building a better programmer is not good enough.
I don’t see how one of these things has to do with the other. Can you explain the link?
> My suggestion is just learn how to write secure code in C.
That is a good suggestion to an individual developer. What is your suggestion to a lead developer of a big organisation? Let’s say to the CTO of Apple.
You can see at that level of abstraction the “make sure every one of your developers knows how to write secure code in C and they never slip up” manifestly doesn’t work.
You can fault individuals for bugs up to a certain point, but if we want to make secure systems we have to change how we are making them. To make the whole process resistant to oopsies.
I don't think this is a case of "both sides have a point" when it comes to blaming individuals for vulnerabilities or suggesting they learn to write secure C. We're way past these things.
> What is your suggestion to a lead developer of a big organisation? Let’s say to the CTO of Apple.
He can pay for my advice if he really wants to hear it, but this isn't really about programming at this point because we're talking about business goals which can have all sorts of priorities besides making quality software.
Do we want software quality? Then we want better programmers.
> if we want to make secure systems we have to change how we are making them.
On this we agree, but thicker training wheels just get more people on bikes; it doesn't make the roads any safer.
Better programmers on C can't effectively eliminate whole classes of the most common fatal security vulnerabilities.
Of course after we do eliminate this low-hanging fruit, we will be left with a pie of the remaining classes of vulnerabilities that looks different; it's like Amdahl's law. But that's no excuse to skip past "step 1".
> Better programmers on C can't effectively eliminate whole classes of the most common fatal security vulnerabilities.
Sure they can, it just requires discipline. Most of djb's code (in C) has a lower defect count than most other implementations you'll find in any language, and the mistakes he does make come from relaxing his discipline when he thinks it doesn't matter -- because of privilege isolation (something he later admitted was a mistake[1]), or because nobody would put that much memory in a machine (times change!).
Zeno would like a word. I'm arguing a different metaphor, not "try harder".
If it is true that programs get too big to maintain the level of discipline the language requires, and regardless of the language you're going to be confronted with defects, then the solution (in my mind) is smaller programs because only the small program has a chance of being correct in the first place.
I agree with your main point, just a nitpick about "at that level of abstraction the “make sure every one of your developers knows how to write secure code in C and they never slip up” manifestly doesn't work": it kinda worked with Windows post-XP SP2; the number of security holes fell pretty dramatically in subsequent releases.
When a company puts security first, they can get results. Unfortunately, security doesn't really sell software like features do, so a true hardened-by-default mindset is impossible in practice. Hence, we need better tools and processes to build features, as you say.
Apple employees would have access to these symbols and much more debugging info than the OP, who "got lucky due to an accidental share", so the process would be a lot easier for them.
EDIT: also, required reading on how vulnerabilities are discovered and the important types of bugs is not unreasonable, nor is it a long read.
Another C advocate talking about the mythical safe C code that no one has managed to write in 50 years of CVE database entries.
The whole point of a safe systems language is not to write code that is 100% free of exploits, but rather to minimize them as much as possible.
Naturally there are still possible exploits, however the attack surface is much smaller when memory corruption, UB (> 200 documented cases), implicit conversions and unchecked overflows aren't part of every translation unit.
I'm just so happy seeing the GP comment downvoted. It gives me hope that that mentality is slowly dying. Maybe in 30 years we'll rid ourselves of the C/C++ shackles for something like Rust.
Almost all of those are due to code in unsafe blocks. In other words, not safe rust.
A few are cryptographic errors. No argument there, Rust won't save you from that.
FWIW Rust does badly need a standardized unsafe-block auditing mechanism. Like "show me all the unsafe blocks in my code or any of the libraries it uses, except the standard library". If that list is too long to read, that's a bug in your project.
Wow, yeah, that's exactly the technological aspect of what I had in mind.
I guess all that's left is the sociological aspect: packages' "geiger" status ought to be treated as being as important as their dependencies. In other words, lib.rs/docs.rs/crates.io ought to display these data in all the sorts of places where they list the dependencies of a package.
It would also be great if this tool were made a standard part of cargo. I think it's important enough to deserve that status.
I think this would be a docs.rs or lib.rs feature; I used to think crates.io was that place, but it is not.
I could see there being all kinds of scans of dependencies, like enforcing test coverage, builds and tests passing on certain platforms (risc-v, wasi, etc).
No, see, my point is that this is as important as dependencies.
Anything that tracks dependencies ought to be tracking transitive unsafeness.
That's the mindset shift the Rust world needs. Otherwise we're going to keep getting these (in some sense valid) complaints about how Rust isn't memory-safe because it has unsafe-blocks.
Safety is top of mind for a large percentage of the Rust community. There was major political drama when the fastest http framework liberally used unsafe. That framework was almost universally shunned because of it, and now the vast majority of the uses of unsafe in that project have been removed. I think we are in a good position, but there could be, in effect, a Sybil attack against norms where, if unsafe were an OK thing to do for perf or expediency, the value of Rust would be largely obliterated.
My personal hope is that lib.rs and docs.rs replace crates.io and that safety, code coverage, perf and other dimensions of quality are prominently displayed and queryable. Crates.io as it is now has outlived its usefulness.
I believe you and I agree on this issue completely.
I've been looking for a good project to get my feet wet in rust and your idea sounds like a perfect one, but a quick Google search makes it seem like a solved problem. I'd like to understand what's still left that I might add.
> My suggestion is just learn how to write secure code in C
Decades of evidence demonstrate that this cannot be done. Even world experts introduce vulns. Writing secure code in languages with tons of guardrails is hard. Writing and evolving secure C is impossible at almost any scale.
That’s like saying “learn to drive a Formula One car if you want to feel safe driving at 65 miles an hour.” Sure, it works, but it’s impractical and unnecessary for everyone to do this.
Also, writing secure C is much harder than driving a Formula One car (as evidenced by the number of competent practitioners of both disciplines in existence).
Yeah, linked lists are bad for the data cache since each element is in some totally random area of memory and thus less likely to be in a cache. Whereas for a linear array the data is next to each other and can be cached effectively and accesses can be easily predicted.
Some 20 years ago, when I was a kernel developer with high performance requirements, my favorite data structure was a linked list of smallish arrays. Each array was small enough to be cheap to completely rewrite on insert/delete, if you needed to e.g. preserve insertion order; most of the time, it was enough to preserve order only across the linked list, treat each array like a hash table slot, or sort at consumption time.
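A minimal sketch of that shape in C-style C++ (capacity and names are placeholders; in practice, pick the chunk size from benchmarks):

    #include <cstddef>
    #include <cstdint>

    // One chunk: a small fixed-capacity array plus a link to the next chunk.
    // 16 entries is a placeholder; size it so a chunk spans a few cache lines.
    struct Chunk {
        uint64_t items[16];
        size_t   count;     // number of slots in use
        Chunk*   next;
    };

    // Ordered insert inside a chunk: shifting the tail is cheap because the
    // whole chunk is contiguous and already hot in cache.
    // Caller guarantees count < 16 (otherwise split the chunk first).
    void chunk_insert(Chunk* c, size_t pos, uint64_t value) {
        for (size_t i = c->count; i > pos; --i)
            c->items[i] = c->items[i - 1];
        c->items[pos] = value;
        c->count++;
    }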
I’ve considered making a vector sorta thing implemented as a linked list of arrays. Assuming you sized the arrays to fit well inside the CPU cache, are there really many tradeoffs?
Even if the array overflows the cache, it's not too bad for most access patterns. CPUs will easily prefetch data on a linear scan, so the "second half of the array" will most definitely be in cache by the time you get there. And a linear scan-once shouldn't even evict too much other stuff out of the cache.
In practice, I would let benchmarks decide the array size. Modern CPUs are surprisingly good at prefetching even linked lists, and x86 works weird out-of-order magic and speculation while waiting for memory. My two rules of thumb were 1) where performance really matters, don't trust your intuition; program and benchmark alternative implementations, 2) don't trust microbenchmarks, e.g. total icache pressure changes whether inlining is good or not.
That's how Clojure implements vectors, except it's a tree of 32-element arrays. This has nice functional programming properties because it makes it easy to make immutable copies of large vectors with good performance and memory characteristics.
It sounds like you are talking about an unrolled linked list[1]. While they are better than linked lists in just about every way (except for data objects larger than a cache line), I'd argue that in most cases there is a better data structure to use. I'd find that the hashed array tree[2] would likely be a better/more versatile data structure solely due to the improved lookup time.
You can also force-align (with padding if necessary) your structures to be along whatever boundaries make sense for your processor's cache lines.
Ensuring an oft-loaded structure always fits inside a single cache line (instead of splaying across ~1.9 cache lines on average) is good not only for fitting lots of those structures into your L1 but for not blowing out cache bandwidth (read _and_ write, if you're modifying them).
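For instance, in standard C++ (assuming 64-byte cache lines, which is typical but not universal):

    #include <cstdint>

    // Aligning the hot structure to 64 bytes means one instance never
    // straddles two cache lines; the compiler pads sizeof up to 64.
    struct alignas(64) HotEntry {
        uint64_t key;
        uint64_t value;
        uint32_t flags;
    };

    static_assert(sizeof(HotEntry) == 64, "one HotEntry per cache line");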
It's much more difficult to prefetch them, though - if you're traversing a large list it misses the cache every time, rather than just the first element.
That's right. The reason why you see linked lists so much in old C stuff is (IMO) similar to the reason you see null-terminated strings (and the associated bugs if you truncate incorrectly). And that's because of one of C's original sins: the failure to include size information in array objects.
There is also the claim that linked lists are better in certain types of performance sensitive code because you can sometimes avoid allocating. I don't fully understand the logic there myself but I trust that there are cases where this is true.
The actual reason why you see linked lists (and similar pointer-chasing data structures) so much in old code is that the entire problem of slow-vs-fast memory accesses didn't exist back then. Memory access always happened in a fixed and very low number of CPU cycles (the entire RAM essentially had L1 cache performance). The CPU-memory gap only slowly started to widen during the 90's when CPU frequencies started to skyrocket and caches had to be added to CPUs. Unfortunately this problem still isn't as widely recognized as it should be in high-level programming circles even today.
If your primary operation is inserting at a random location in the list, linked lists are faster than arrays at large sizes. You avoid having to move all the memory after the index you are modifying (to make space for the inserted element).
A linked list also avoids copying the entire array when you need to insert more elements than you have allocated space for.
> If your primary operation is inserting at a random location in the list, linked lists are faster than arrays at large sizes. You avoid having to move all the memory after the index you are modifying (to make space for the inserted element).
This is false. Big O notation says it should be true, you'll get marked wrong if you say arrays are faster in your algorithms & data structures final, but when you're running on actual hardware the array is faster at all sizes of n and as n becomes larger so does the gap in performance.
Here is a talk[0] by Bjarne Stroustrup (Creator of C++) that even includes imaginary graphs demonstrating this phenomenon. If you want a visual for what the missing graph was supposed to look like, here's a similar one.[1]
Here's another video[2] by Scott Meyers (Author of Effective C++) that goes into more detail about why this happens.
Summary of the Stroustrup video: inserting or removing an item at a point in an array of 100k items needs a linear scan from the start of the data structure to get to that point. An array does that very quickly; a linked list is much slower, with lots of random memory indirection and a pointer for every list node blowing out the cache. In practice this linear scan to the change point dominates the runtime, and arrays come out much faster.
While the array change does need on average 50k items to be shuffled up to make room or close the gap, modern caches are very good at that.
If the array is sorted it can be binary searched to get to the change point, which improves its performance even more, linked lists can’t do that.
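The two operations being compared look roughly like this (an illustrative sketch, not the benchmark from the talk):

    #include <cstddef>
    #include <vector>

    // Array version: a linear, prefetch-friendly scan to find the spot,
    // then one contiguous shuffle of the tail (effectively a memmove).
    void array_insert_sorted(std::vector<int>& v, int x) {
        std::size_t i = 0;
        while (i < v.size() && v[i] < x) ++i;
        v.insert(v.begin() + i, x);
    }

    struct Node { int value; Node* next; };

    // List version: the same linear scan, but every step is a dependent
    // pointer chase to a node that is probably not in cache. The splice
    // itself is O(1), but by then the scan has already dominated the cost.
    void list_insert_sorted(Node*& head, Node* n) {
        Node** link = &head;
        while (*link && (*link)->value < n->value) link = &(*link)->next;
        n->next = *link;
        *link = n;
    }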
I can definitely see array scanning being faster than pointer chasing on a modern CPU, but I wouldn't have expected that to survive an insertion with a 50K-element move. Wow!
And if you're not doing ordered insertion you wouldn't have to move the data in the array anyway; you would keep track of the size and jump to the end, so I'm not sure I understand the binary search comment.
The next question is at what level of growth the waste of empty space in the array becomes too much. Some kind of data structure (tree/linked list) with largish arrays (whatever size is applicable for a modern CPU), as probably mentioned in other comments, does seem the most versatile approach while keeping the performance. Or perhaps the handling of that data structure might overwhelm the array advantage?
> > If the array is sorted it can be binary searched to get to the change point, which improves its performance even more, linked lists can’t do that.
> And if you're not doing ordered insertion you wouldn't have to move the data in the array anyway; you would keep track of the size and jump to the end, so I'm not sure I understand the binary search comment.
It just means that if I want to view the nth element of an array, that's a constant time operation. I just take the pointer and add n times the size of the elements.
But for a linked list if I want to view the nth element of the list, I have to view the (n-1)th element first, all the way back until the first element I have a reference to.
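In code, that difference is one address computation versus a chain of dependent loads (sketch):

    #include <cstddef>

    // Array: compute the address, do one load.
    int array_nth(const int* a, std::size_t n) {
        return a[n];                     // *(a + n)
    }

    struct Node { int value; Node* next; };

    // List: n dependent loads; each next address is unknown until the
    // previous load completes, so the CPU can't run ahead.
    int list_nth(const Node* head, std::size_t n) {
        while (n--) head = head->next;   // assumes the list has more than n nodes
        return head->value;
    }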
> The next question is at what level of growth the waste of empty space in the array becomes too much.
Everything I've tried and everything I've seen from people testing on real hardware is that the gap in performance widens with larger values of n.
You might expect different performance as the system runs out of memory. But arrays have a size of n * element size, and linked lists have a size of n * (pointer(s) + element size), so the linked list would hit memory limitations more quickly regardless.
But the memory usage isn't considering growth. If I grow by doubling the size (which seems common), then at element n+1, where n was the last allocation, the array momentarily needs 2n + n elements' worth of memory (the new allocation plus the original until the copy finishes) vs. n * (pointer(s) + element size) for the list.
But yes, the size advantage is very reduced for most use cases. What would you imagine are the cases where a LinkedList is still a valid data structure?
True. Then it comes down to the size of your data vs the size of your pointers.
> What would you imagine are the cases where a LinkedList is still a valid data structure?
Compared to an array? When the array doesn't have the behavior that you need.
For instance, if you need to keep a stable reference to a particular element in a list, even while that list is being inserted into, then an array is not going to cut it. The linked list handles this case with ease.
That's not to say you can't write a wrapper for the array that does the right thing, probably for cheaper. But out of the box the linked list can do things that the array cannot and if you rely on them, then use the right tool for the job.
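A concrete example of the behavioural difference, using the standard containers (sketch):

    #include <list>
    #include <vector>

    int main() {
        std::vector<int> vec = {1, 2, 3};
        int* p = &vec[0];
        vec.push_back(4);        // may reallocate: p is now potentially dangling
        (void)p;                 // so don't touch it

        std::list<int> lst = {1, 2, 3};
        int* q = &lst.front();
        lst.push_back(4);        // list nodes never move: q is still valid
        return *q;
    }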
I think it's also worth noting that this is specific to (common) CPU architecture where cache is sufficiently big and orders of magnitude faster than main memory, and where main memory reads and writes have similar cost.
The tradeoff in moving memory at the end of the array vs scanning to the point of linked list insertion could be different on some existing and some hypothetical systems where these concepts may be relevant. For example, storage write cycle minimization could be a priority in rare cases.
That said this is an excellent point about the majority of use cases. I hadn't considered the dominance of the time it takes to locate the element to modify. I've never had a reason to benchmark a linked list since even a basic idea of the effect of cache/prefetch precludes its use in every use case I've actually come across.
Probably useful for CS education to start comparing linked lists to bubble sort. There's always a better algorithm, but here's a simple one to learn so you understand how to compare data structure performance.
Are you reading the whole 1 MiB in linear order? If so, then the prefetcher should help in exactly the same way.
If you're not, then why is all of that data packed together? Is there an alternate layout you could use instead where you iterate over the exact values you need? If performance is important to you in that context it might be worth it.
Right - I suppose any large associated data that does not need to be searched can be malloc()'d (via standard or fancy allocator) and you just store a pointer. Or if the other data is relatively small, just in another array.
One case where I don't know what to think: the implications of memmoving the subarray on NUMA cache invalidation in an extremely multithreaded application.
Depends on element size and some other stuff, but if it is a singly linked list and a truly random insertion location, iterating to that location is N/2 on average, while inserting and copying the rest of the array is also N/2. With small elements the array could still be much faster, since they are all prefetched during the copy, vs jumping all over memory during the list iteration and potentially stalling on each node for ~200 cycles waiting on main memory.
If a bunch of insertions happen like that, based on some other data structure, and then a periodic iteration eventually happens at some point, you could also imagine an array version that batches all the inserts together in another simple array and applies them all at once before the iteration.
Expanding on this: usually linked lists allocate each element with a separate malloc() on the heap, so they end up all over the place in memory.
You could, in principle, allocate your linked list out of contiguous memory in such a way that sequentially reading the list doesn't cause so many page accesses. That would itself be a complicated data structure to maintain, so simply using a linear array is often the right tradeoff.
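A minimal sketch of the idea (fixed capacity, bulk release only; per-node freeing and reuse are where the real complexity appears):

    #include <cstddef>

    struct Node { int value; Node* next; };

    // Carve nodes out of one contiguous slab so that walking the list stays
    // within a handful of pages. The whole pool is released at once, so
    // there's no per-node free and no fragmentation to manage.
    struct NodePool {
        Node        slab[4096];   // placeholder capacity
        std::size_t used = 0;

        Node* alloc(int value) {
            if (used == 4096) return nullptr;   // pool exhausted
            Node* n = &slab[used++];
            n->value = value;
            n->next  = nullptr;
            return n;
        }
    };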
That's just a slab allocator. They're not what I'd call trivial, but still simple enough to be common in small projects and CS homework. I agree that an array is still generally simpler though.
Once you are implementing your own allocator you need to worry about fragmentation under many operations, which would be the majority of the complexity.
I was being loose with the words "everything" and "only".
GP comment was curious about why they've only heard about "linked lists" in a language, C, which like Modula-2 does not use them as the basic standard library collections type. When you have to implement linked lists yourself if you want to use them, you're always going to refer to them by implementation detail. In the vast majority of cases where they're used, though, it's as the fundamental standard library collection, and in that case they're usually just called Lists without telling the caller about their implementation.
(This idiom is true to the point that the Python implementers feel the need to point out that CPython uses arrays to implement the list type, as they feel that this would be unexpected to users of other languages that call their basic collections type a "list".)
Finding the bug that allowed this exploit took this researcher weeks. QA can't find all defects without somehow testing every conceivable scenario, and they can't know every conceivable scenario; code review can only catch defects if at least one reviewer is able to somehow know that specific methods make an exploit possible. Given that the exact code of underlying methods used may not be known to code reviewers, or that a reviewer might simply not know the full potential use cases for new code at the time of review, it is entirely understandable that defects and resulting exploits happen.
This is why researchers like the OP exist. They find exploits and report them to the manufacturer (hopefully) before they can be used. The fact that this is an effective way of protecting us is also why major software companies offer bug/exploit bounties to researchers.
To demand that all possible exploits of this nature never find their way into production builds is to demand perfection from humans. There is too much to know and think about, and definitely too many unknowns about the future, to make such a fantasy possible while still meeting release deadlines. We software developers often have a hard enough time just meeting feature and documentation deadlines, and adding more people just makes organizing your efforts more complex and difficult which then requires even more people until you reach the point that the scope of organizing your development teams is financially impossible.
I was referring specifically to the list of `IO80211AWDLPeer`s the author was reverse-engineering. His assumption was that the `IO80211AWDLPeer`s were in a linked-list type of data structure (which is a pretty sensible guess). In fact, it ended up being more akin to a priority queue:
> The data structure holding the peers is in fact much more complex than a linked list, it's more like a priority queue with some interesting behaviours when the queue is modified and a distinct lack of safe unlinking and the like.
I amended my post for clarification, I'm sure Apple uses linked lists all the time :)
The scary thing is that even though this sounds like a monstrous effort to pull off this hack, it's not out of reach for large governments. It's basically known as a fact that they have loads of these exploits sitting in their toolbox, ready to use when they have an enticing enough target.
Short of rewriting the whole of iOS in a memory safe language I'm not sure how they could even solve this problem. Assigning a researcher to search for 6 months only to find one bug is financially prohibitive.
The research would've been much shorter if Apple would actually provide researchers with debug symbols. Or you know, if Apple open sourced their security-critical software.
> One of the most time-consuming tasks of this whole project was the painstaking process of reverse engineering the types and meanings of a huge number of the fields in these objects. Each IO80211AWDLPeer object is almost 6KB; that's a lot of potential fields. Having structure layout information would probably have saved months.
> Six years ago I had hoped Project Zero would be able to get legitimate access to data sources like this. Six years later and I am still spending months reversing structure layouts and naming variables.
It’s intensely frustrating, because for some reason Apple thinks it’s a good idea to strip out security code from the source that they do release (months late), and they tend to strip (and until recently, encrypt) kernel code. This is what a company from the last decade might do to hide security issues, except it’s coming from the world’s largest company with a highly skilled security team. Is there some old-school manager with so much influence that they’re able to override any calls from internal and external sources? It’s gotten to the point where Apple engineers privately brag about their new proprietary security mitigations after researchers who scrounge for accidentally symbolicated kernels (thank you, iOS 14 beta) do the work to find them. Why does this situation exist?
There were some Hacker News threads the other day about Marcan's Patreon campaign for porting Linux to Apple Silicon. Everyone basically expects that Marcan will need to reverse engineer everything on his own, and my gut tells me they're right.
But, if you actually stop and think about it for a moment... isn't this situation completely bizarre? Apple Silicon Macs explicitly support booting alternate OSs, because Apple went out of their way to add a `permissive-security` option to the boot-loader. They know Linux is important—the initial Apple Silicon reveal included a Linux VM demonstration—and now a well-known and talented developer is planning to do a native Linux port, at no cost to Apple, and we all fully expect that Apple won't make any documentation available or answer any questions? And, we're probably right?
The more I consider it, the more crazy it all seems. Why is Apple so private about the internals of their products? It won't affect marketing—normal consumers don't care—and I can't think of a plausible scenario where this type of information could help a competitor.
Is Apple using source code stolen from Oracle? Are they scared someone will discover an internal library written in COBOL and make fun of them? Are they worried their documentation could revive Steve Jobs as a vengeful ghost? I just don't get it.
> Why is Apple so private about the internals of their products?
Because they don't care. The extent they care is directly linked to the amount of money they will make from caring. They won't sell more macs if macs can run Linux better; but they will sell more Apple Music subscriptions if macs keep running macOS.
> They know Linux is important
No, they know Linux is a pain in the ass. The bootloader option assuages the executives' conscience enough to be able to talk to a journalist and keep a straight face when asked about "openness" or being "hacker-friendly", stuff those 1980s-style Linux hobbyists keep talking about and nobody else gives a shit about.
Apple makes money by selling iDevices to consumers and selling Macs to enough developers to build apps for iDevices. Everything else is a bonus, and not worth spending much time on. They do the minimum and leave it as that. There is no inconsistency or secret motive. They just don't care. When they cared, in the early '00s, they did a bit more; now they do less. The attitude is the same.
> Apple makes money by selling iDevices to consumers
For now. They haven't made any "I have to buy this right fucking now!"-worthy revolutionary improvements in the iDevices lineup recently, the Western market for smartphones is near saturation - and with Corona tanking the US economy for wide masses, people don't have hundreds of dollars just lying around to shell out for the latest iteration.
I believe that both the last Mac Pro (horribly expensive at that, and still people kept buying it) and the new M1 lineup are a sign that Apple wants to shift attention back to the non-mobile sector - because the competition there is asleep at the wheel. Everyone uses Intel, who has managed to fuck up its lineup for many years now; Microsoft has thoroughly pissed off the privacy-conscious folks with Win10; and (judging by a random walk through a Mediamarkt) build quality in Windows laptops still hovers barely above "acceptable" - cheap plastics, tiny touchpads and abysmal screens are the norm, whereas Apple offers robust aluminium cases, giant touchpads and crystal-clear, bright screens.
What I'm really excited for is when Apple decides to put an Mx chip into an iMac, paired with a decent AMD GPU. The thermals and energy profile should allow a lot more leeway for resource usage than a MacBook...
> they will sell more Apple Music subscriptions if macs keep running macOS.
The type of person who buys an Apple Silicon Mac to run Linux is not going to buy an Apple Silicon Mac to run macOS. However...
> They won't sell more macs if macs can run Linux better.
They would sell some more Macs. Possibly hundreds of thousands more. A drop in the bucket for Apple, but still money—and all they have to do to get it is answer some questions.
Even without being able to compile it I've successfully used their source dumps to debug problems in my code quite a few times (and occasionally find bugs in their code which I have to work around). Having code with comments to read is a huge step up from having to rely on decompilers.
(Quick note for others that my GP comment originally contained a paragraph about the stuff Apple does open source. I edited this out because I felt it was beside the point.)
> P.S. And what's with the stuff Apple does release as open source? Don't get me wrong, I'm glad they do it—because I'll take what I can get—but I have no clue who it's for! A lot of the code is either extremely difficult or impossible to actually compile, because it relies on internal Apple tools or libraries which aren't public
Even when it doesn't rely on anything Apple-specific, it can be unclear how to build it.
I noticed that if I ctrl-z dc, then resume it, it silently exits. I grabbed the source to see if I could build it, and then perhaps debug this.
The source is part of bc. When you extract it there is a directory containing a bc dir, a patches dir, a bc sources tarball, and a Makefile. The bc directory is the contents of the tarball with the patches from the patches directory applied.
Optimistically typing "make" does not work. It runs configure somewhere (in the bc directory, I think), decides that gcc is /Applications/Xcode.app/Contents/Developer/Toolchains/XcodeDefault.xctoolchain/usr/bin/cc, and decides that this cannot create executables and exits.
Maybe just going into the bc directory and running configure and make there will do the trick? ./configure works and builds a makefile. Trying to compile with that gets fatal errors, apparently due to missing <string.h> in some C files.
OK, I don't actually care about bc, so how about just trying to build dc, which lives in a subdirectory under the bc directory.
That gets a fatal error due to a conflict between "#define ptrdiff_t size_t" in the config.h that configure made, and "typedef __darwin_size_t size_t" from somewhere. Based on the comments in config.h, apparently it should only be defining that if it is not defined by the system. Commenting it out in config.h and trying again...and all the compiling steps for dc actually finish!
Alas...it then fails because it needs ../lib/libbc.a, which presumably would have been built before building dc if the bc build had worked.
Maybe if I go to ../lib and type make? Nope. In fact, the errors are identical to when I typed make for bc, because it turns out that making libbc.a is the first thing the bc make tries to do.
Tossing in "#include <string.h>" in lib/getopt.c and lib/number.c makes everything build, finally giving me a locally built dc.
Is it too much to ask that when I download the source from Apple to their version of a simple command line knows-nothing-about-MacOS utility like this, I should just be able to type "make" somewhere and have it build? Or at least have a README in the tarball that tells me what I need to do?
In this case, the top-level Makefile includes a bunch of internal junk, and the configure script thinks your system is very broken because it's old and Xcode 12 ups the warning for a missing prototype to an error. I was able to get it to build with
$ CC="clang -Wno-implicit-function-declaration" ./configure
$ make
I can only speculate, but Apple seems to have very tightly coupled software and hardware. Since this coupling probably holds trade secrets (which we don't know about by definition), it seems likely to me that they are controlling access to as much of the stack as they can while still protecting those secrets.
Yes, but that doesn’t really make sense for things they have already shipped: researchers have to reverse engineer those for what seems like no reason. For example, the newest iPhones have entirely custom privilege levels that are lateral to the typical ARM exception levels and entered using proprietary instructions that their own silicon understands. This is something you can find if you load the kernel into a disassembler and poke at it a bit. But Apple doesn’t mention it at all or document it…what’s the point? Why put up such petty barriers in the face of people trying to audit this?
Apple doesn't really obfuscate their code outside of their DRM stuff–usually they just remove all the symbols and do a ⌘F for certain terms in their open source releases and strip those out. It really seems to be a manual process, since sometimes they miss things…
> Apple thinks it’s a good idea to strip out security code from the source...
Because leaving it in would make it easier for attackers to find vulns. That's why they strip it out. The high payouts on these platforms are evidence that it isn't as simplistic as "only defenders would look at this, why won't they release it!?"
But doesn't it work in some ways? It's not going to save them, but it seems to significantly increase the time/cost of exploiting the vulnerability. One more layer to the security system.
Obfuscating source? No, not at all. It just annoys legitimate security researchers (making them not want to deal with you) and is something that black hat bug finders largely don't care much about. Not only do they have more resources and patience, they are also more willing to use questionable methods to make their lives easier.
What makes it less of an issue for black hats? Do they have access to symbols/source code that security researchers do not/are not willing to use?
I certainly understand the frustration for legitimate researchers, and there's plenty to be said about having the source code available to make auditing easier, but in itself it seems that making a black hat take 6 months instead of 1 to create an exploit raises the skill/patience level needed and busies them for a while during which they are not working on the next exploit.
Yes: black hats have much more incentive and generally larger, more focused teams to find these bugs, and they aren't concerned with the issues of buying stolen devices and source code on the black market. (If you're curious, search for "dev-fused iPhone" and "iBoot source code". The Project Zero team works from about the worst situation possible, choosing to even forgo using services like Corellium.)
> It looks like we won't be able to use the Apple "Security Research Device" due to the vulnerability disclosure restrictions, which seem specifically designed to exclude Project Zero and other researchers who use a 90 day policy.
Advanced users that want a secure device require devices that can be reinitialized to a known state without external input.
This is no longer possible on any phone, tablet, or computer Apple sells: all require online activation with device-specific info. There is no way to put the device back into a known state offline or without Apple having an opportunity to tamper with it (or be forced to tamper with it).
> This is no longer possible on any phone, tablet, or computer Apple sells
It is still possible on all of their computers, just not their phones or tablets. Intel Macs (which are still being sold in large numbers) can always be wiped and restored from USB without an internet connection, and Apple Silicon Macs can do it if you set the boot-loader to "Reduced Security" mode.
This is a false statement. Intel macs have the T2 boot security chip, which requires online activation to be able to access the internal disk after a full system wipe. The M1, even in reduced security mode, also requires online activation after a full system wipe. I've tested this this week; if you know of something I'm missing please tell me the exact steps to take to wipe and reinstall a T2/M1 mac offline, as I am confident now that it is not possible to do so.
I would love to be wrong about this.
This is the case even if you have a full offline boot/restore USB.
I have a post coming out today about just this, and how it renders all current macs unsuitable for long term offline/airgap applications.
These are just phones that you are officially permitted to attach a root shell and kernel debugger to, like any other device that's not an iPhone. Researchers have been working around that for years by using private jailbreaks / exploits to get similar levels of access, and with checkm8/ktrw you yourself can get similar access to any vulnerable iPhone 7/8/X.
No sources or structure layout or symbols, so you're still stuck wading through megabytes of compiled code to reverse-engineer everything from scratch.
It's Apple drumming up absolutely nothing, and from my point of view it's mostly a PR stunt.
> It's Apple drumming up absolutely nothing, and from my point of view it's mostly a PR stunt.
Well, I don't think it's quite "nothing". Newer phones don't have access to checkm8, and getting a private jailbreak or exploit working can be non-trivial. And in some cases, researchers may need to avoid reporting that exploit to Apple in order to keep using it.
It's a good step. It's just not sufficient, especially given all the other restrictions.
> And in some cases, researchers may need to avoid reporting that exploit to Apple in order to keep using it.
And this will continue to happen until Apple just starts selling the damn things to anyone who wants them, instead of trying to gatekeep them to people who are playing by their ridiculous security disclosure rules.
Right! It would solve so many issues! Put them on an unlisted page of your online store, charge a 50% markup over a normal iPhone, make the boot screen bright red, and do something ugly and obvious with the phone's exterior.
Sure, some crazy people who aren't security researchers will probably buy them too and use them as daily drivers (I'd probably be one of them). So what? I don't understand why Apple feels the need to hold this stuff so close to their chest. Everyone in this scenario knows exactly what they're buying.
Oh, that's a shame. The slide in the referenced tweet says, "advanced debug capabilities", so I'd assumed that's what it meant. I wonder what else that could mean?
The ability to attach a debugger to the kernel. No, really, that’s “advanced” for an iOS device, because normally you don’t get to do anything even close to that. You can’t even debug userspace processes that aren’t ones that you put there yourself (as a developer writing apps) on normal iPhones.
Believe it or not, open sourcing the security code is actually not a great idea. Most of the world's botnets run on WordPress, which is open source. Most of the time legitimate actors are not going to read through an entire code base because they have better things to do. Illegitimate actors, however, have a very high incentive to read through a widely used public code base, and they do so.
By 'wanting to prove something', he caused the vendor to act urgently, instead of sweeping this aside as a maybe-exploitable-maybe-not bug that would get lazily patched whenever.
By 'wanting to prove something', he showed the shortcomings of multiple security mitigations, all defeated by simple bugs.
By 'wanting to prove something', he also discovered two other exploitable 0days, that wouldn't have been discovered otherwise. Those 0days were likely already in the hands of bad actors, too.
Finally, the reason he even discovered the original bug is because Apple accidentally once or twice forgot to strip function names from a binary. If this didn't happen, that bug very likely would still be out there in the wild.
I'm not sure you understand how security research works.
This is a weird statement, since the premise of this blog post is that these kinds of attacks aren't out of reach for a single talented researcher on a Google salary. It's not out of reach for any government. Nauru, Grenada, Tonga, the Comoros --- they can all afford this.
I believe the point of SulfurHexaFluri's final sentence is that it is cost prohibitive for Apple to dedicate a bunch of employees to search for bugs in order to fix them all. That is, it's cost-effective to find 1 bug, but not to find all of them. The sentence could have been worded better.
I'd personally phrase things a bit differently: an _individual_ was able to pull this off while surrounded by screaming children. A large government, with all its resources and hundreds+ of people, would pull this off regularly and without breaking a sweat.
> Short of rewriting the whole of iOS in a memory safe language I'm not sure how they could even solve this problem. Assigning a researcher to search for 6 months only to find one bug is financially prohibitive.
Note that memory safe languages won't solve security. They only eliminate a class of security bugs, which would be amazing progress, but not all of them.
Didn't they move WiFi drivers, among other things, into the userspace in macOS Big Sur? I've heard somewhere that they're going in the direction towards microkernel for this particular reason of reducing the attack surface.
(yes I know I'm talking about macOS but the vulnerability was found in iOS, but there's a lot of shared code between them, especially on levels this low)
Google "NSA TAO" -- Tailored Access Operations. AIUI, among other things they're responsible for developing, discovering, and weaponizing exploits used to access high value targets -- sometimes through fun techniques like "Quantum Insert", a sort of faster-man-in-the-middle attack. The wealth of exploits released in the equation group hack should put all doubts to rest.
There's a market for exploits that pays pretty well. Someone is throwing millions of dollars at them, and from what we can glean from investigations, leaks and whistle blowers, it's states that are buying them. One company in that space made world-wide news[1] by selling to governments.
Because that is the engine, the cosmetic is irrelevant. Unless you are choosing for the interface but that is not the concern of 99% of the people. We need a real independent searcher
The whole NSA leaks thing proved it. They had a tool built for exploiting Windows boxes which was leaked and converted into the ransomware WannaCry, which spread globally a few years ago.
I believe they described their toolbox as metasploit on steroids. Some other episodes of darknet diaries also interview former and current government hackers.
It is not just not out of reach for large governments; it's probably not even out of reach for most organizations with 5-10 people. As the author says, 6 months of "one person, working alone in their bedroom, was able to build a capability which would allow them to seriously compromise iPhone users they'd come into close contact with". Even if we assume the author is paid $1,000,000 a year, that is still only $500,000 of funding, which is an absolute drop in the bucket compared to most businesses.
The average small business loan is more than that at $633,000 [1]. Hell, a single McDonalds restaurant [2] costs more than that to setup. In fact, it is not even out of the reach of vast numbers of individuals. Using the net worth percentiles in the US [3], $500,000 is only the 80th percentile of household net worth. That means in the US alone, which has 129 million households, there are literally 25.8 million households with the resources to bankroll such an effort (assuming they were willing to liquidate their net worth). You need to increase the cost by 1,000x to 10,000x before you get a point where it is out of reach for anybody except for large governments and you need to increase the cost by 100,000x to 1,000,000x before it actually becomes infeasible for any government to bankroll such attacks.
tl;dr It is way worse than you say. Every government can fund such an effort. Every Fortune 500 company can fund such an effort. Every multinational can fund such an effort. Probably ~50% of small businesses can fund such an effort. ~20% of people in the US can fund such an effort. The costs of these attacks aren't rookie numbers, they are baby numbers.
For those who don't see why a company would want to use such exploits, consider how valuable it would be to know if a company's employees were planning to organize or strike.
There are also paranoid people in positions of power, and bureaucracies that can justify spying on employees. One of the interesting things about this lockdown was finding out that many companies put spyware on their employee-issued computers to monitor their usage.
How is it financially prohibitive to pay a researcher a salary to find a 0day like this, when the bounty programs pay $100k-$500k for 0days on the same?
source: https://developer.apple.com/security-bounty/payouts/
Unfortunately, it's the same old story. A fairly trivial buffer overflow programming error in C++ code in the kernel parsing untrusted data, exposed to remote attackers. In fact, this entire exploit uses just a single memory corruption vulnerability to compromise the flagship iPhone 11 Pro device. With just this one issue I was able to defeat all the mitigations in order to remotely gain native code execution and kernel memory read and write.
Yes, the same old C/C++ buffer overflow problem. We have mainstream alternatives now. C#. Go. Rust. It's time to move on.
The code where the bug happens is legal C++, but it uses absolutely none of the memory safety improvements which were added to the language in the past... twenty years probably. It's basically C with classes.
If they haven't kept up with the changes in their current language, what makes one think that they would "move on" to the alternatives, two of which aren't even alternatives?
Before they switch to Rust it would be much faster and more efficient to use smart pointers, std::array, std::vector and stop using memcpy.
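A userspace-flavoured sketch of what that buys you (hypothetical names and limits; as the reply below notes, kernel extensions can't necessarily use these facilities):

    #include <cstddef>
    #include <cstdint>
    #include <vector>

    struct SyncTreeState {
        std::vector<uint8_t> tree;   // the container owns and tracks its size
    };

    // The copy can never be larger than the length that was just validated,
    // and there is no fixed buffer to overflow into the rest of the object.
    bool parse_sync_tree_tlv(SyncTreeState& st, const uint8_t* payload, std::size_t len) {
        constexpr std::size_t kMaxTreeLen = 1024;   // hypothetical protocol limit
        if (len > kMaxTreeLen)
            return false;                           // reject oversized input
        st.tree.assign(payload, payload + len);
        return true;
    }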
Note that this code is shipping as a kernel extension, which uses Embedded C++, not standard C++. Notably, things like templates and exceptions are not available. It would be nice if they could work on this instead, but looking at the dyld and Security sources (which have no such limitations, as they run in userspace) I don't have much confidence.
I agree, that was the path taken by Solaris SPARC and it is the only way to make it work, because even if a language level safety would be introduced today, not everyone would bother adopting it.
As much as I like to bash security critical code written in memory-unsafe languages, I don't think that this is the crux of the problem here.
To me it's that this extremely trivial bug (the heap overflow, let's ignore the rest for now) passed through code review, security review, security audits, fuzzing... Or that Apple didn't have these in place at all. Not sure which option is worse.
We have 30 years of experience showing that ordinary heap overflows are not in fact easy to spot in code review, security review, security audits, and fuzzing. Each of those modalities eliminates a slice of the problem, and some of them --- manual review modalities --- will remove different slices every time they're applied; different test team, different bugs.
To me, this strongly suggests that the problem is in fact memory-unsafe languages, and not general engineering practices.
Apple, by the way, has all the things you're talking about in place, and in spades.
OTOH, we don't really have evidence to show that memory safety is effective in kernels/drivers because no memory safe language has ever been deployed at scale for that purpose.
> the problem is in fact memory-unsafe languages, and not general engineering practices.
Languages don't introduce bugs by themselves. Engineers produced those bugs.
I always thought that bugs are the programmers' fault, and not to blame the language. It's like blaming the English language because it allows you to misuse it and manufacture very offensive racial slurs, or to be rude and cruel, and thus we should replace it with another language that doesn't allow to exploit these weaknesses. We won't be able to express ourselves with beautifully (low-level) crafted poems anymore, but that's the price to pay.
There are inherent features of human languages that force you into weird issues. English for example has gender pronouns and that's why you see in profiles how you should approach someone. It's not like they want to add it, it's that they have to if it bothers them when people misuse them.
In Hungarian we don't have this problem at all, there's no concept of gender specific pronouns.
Such bugs are extremely difficult to prevent at scale. Even the most talented engineers make such mistakes and programming quality varies significantly even within top engineering teams which are usually comprised of people with different skill sets (+ junior engineers that need training).
Safe languages are the only way forward to drastically reduce the problem. It can’t be guaranteed to be eliminated 100% obviously because there are still escape hatches available, but it will be significantly improved because you can limit the surface area where such bugs can live.
Not sure if this is common for everyone but I find whenever I get assigned for a review for a monster change, I spend over an hour just working out what the change does and if it seems like it will work. There is no way I could spot tiny potential security exploits in 3000 lines of changed code.
Sending 3,000+ LoC code reviews is not considered good engineering practice. In general it's important to keep code reviews small for the reasons you describe. If it's impossible to make incremental changes in smaller units, that's a sign of an unhealthy code base.
Although it is certainly challenging, you shouldn't spend only an hour reviewing a 3,000 LoC review, unless most of it is trivial refactoring, especially if that code is security-critical and handles untrusted input. That's only 1.2s per line of code. No chance you have good quality control with that amount of attention.
If it's taken someone a whole fortnight to write it, you should expect to spend at least half a day reviewing it, IMO.
Does any software producer do fuzzing on their own product? I have never heard of this being done by software developers. Usually it's done by exploit developers. Of course there are static analysis tools that should uncover a problem like this, and I know that high-reliability embedded software developers use them, but I don't know if the likes of Apple does.
There are systems languages and then there are systems languages. As the Golang team have pointed out, there's lots of systems programming going on outside of OS kernels. Neither of those links mention kernel development. Pervasive refcount updates (ARC) and a vtable-unfriendly dynamic dispatch mechanism inherited from Objective-C are fine for userspace, but most kernel code is very performance-sensitive.
The Go team only changed their message due to the pitchforks they were getting from the UNIX/C crowd.
There are people who believe systems programming in an OS kernel is doable with some form of GC, and then there are those who will never change their mind.
I'm familiar with Biscuit and Midori. The Biscuit paper estimates a 10% performance hit in syscall-intensive benchmarks due to using Golang vs. C.
That being said, ARC may fare much better than a tracing GC, particularly in latency variance, throughput variance, and mean heap usage.
Though, I think we're best off moving off of hypervisors and onto 4th generation microkernels with isolation similar to AIX LPARs / Solaris Containers. After all, a hypervisor is essentially a microkernel that's forced to use hardware traps/faults plus hardware emulation for its syscall interface (plus upcalls for performance, which use a calling convention much closer to traditional syscalls). There are stability, security, and performance advantages to throwing out all of that hardware emulation code, moving everything to upcalls, and getting rid of the second (guest OS) kernel running between the application and the hardware. If you push most of the system functionality out of kernel space, then rewriting some of the less performance-critical components in a language with ARC or tracing GC starts to make more sense.
ARC is slower than tracing GC, as the benchmarks involving Swift show.
Biscuit was a research project whose goal was to get the thesis done; once the thesis was done, no more effort was spent.
Bing was powered by Midori for the Asian countries for part of its life, and Joe Duffy has stated in the RustConf keynote that Windows Dev team was a reason why Midori faced so much internal resistance, even when they were proven wrong.
Before Google was willing to put money into JavaScript, many would assert that JavaScript was worthless for anything besides form validation and DHTML.
Mainframes to this day still make use of their own systems programming languages, way safer than C, and you don't see people crying that their kernels are too slow.
In fact, Unisys uses this fact to sell ClearPath MCP to customers that value security above anything else, Fort Knox style.
Very interesting! I wasn't aware Midori had been widely deployed in production! My understanding is that Midori ran the kernel and all programs in ring 0 and relied on the classloader enforcing type safety and the managed runtime enforcing the other security and stability constraints normally enforced by hardware, so syscalls were just normal method calls and there were no context switches.
Burroughs MCP was written in essentially an extended Algol 60 dialect. Algol 60 only had very limited heap allocation for dynamic arrays, no GC, and I haven't read any indications that Burroughs added GC to their extended dialects.
Multics was written in a PL/I dialect, without tracing GC. Likewise, IBM OS/360 and descendants are written in the PL/S dialect of PL/I, and I haven't seen any indication it has tracing GC.
With tracing GC, you have a trade-off between the peak amount of unclaimed garbage and the GC overhead. ARC should have lower variance in both latency and heap usage, which I presume is the reason Apple moved the whole Objective-C and Swift ecosystem to ARC and deprecated the Objective-C tracing GC.
I used to be a True Believer(tm) in the JVM and other managed runtimes. I was one of 5 developers of the most popular Java desktop application in the mid 2000s. Then I moved to Google and started developing web search infrastructure. I was at Google when V8 was created, and I put a lot of effort into running all of the JavaScript that the indexing system found, across the entire visible web. For things at massive scale, spending millions of dollars per year just in electricity bills, it's extremely tough to beat highly tuned C++. Yes, it's a lot of effort. Yes, I hope safer languages like Rust replace C++ and static analysis tools continue to improve.
I still kind of want to be a managed runtime true believer again, but it's tough to go back after believing for so many years that managed runtimes were going to match expertly hand-optimized C++ in latency- and throughput-critical applications "any day now".
As for the mainframe languages, yes they don't have a GC, but they have the right defaults regarding bounds checking, implicit conversions, explicit unsafe code.
Regarding managed runtimes versus C++: languages like C#, D, Modula-3, and Swift have the features to write C++-like code when needed; the main problem is that many developers don't bother to learn the language features available to them.
At Microsoft, stories about hard-core C++ devs having to be proven wrong by C# running right in front of them are relatively well known; Joe Duffy has shared a couple of such stories.
His experience in Singularity and Midori is also what made him bet on Go for Pulumi.
Has Chris specifically mentioned OS kernel programming?
As the Golang team have pointed out, there's lots of systems programming going on in userspace, which is what they refer to when they call Golang a systems programming language.
That being said, ARC is probably easier to get to work well in-kernel vs. tracing garbage collection. There's a performance cost to all of those reference count updates, but at least the variance is extremely low.
That's an interesting experiment, but that's all it is. The project relies on ASM/C/C++ to boot into a microkernel and to interpret and run the C#. But I suppose it would greatly reduce the attack surface of C/C++/ASM code.
I just wonder, for example, how a capable hardware abstraction layer would work in C#, interrupt handling, CPU and IO scheduling, etc.
C#, Go, and Java all go in the same category (roughly)—they wouldn't work for kernel code. Rust will be a valid replacement for C++ kernel code in the near future, I'm sure.
> you definitely cannot build a performant and robust kernel from scratch with these languages.
I think this is irrelevant (and less importantly, false). It's fine to use e.g. seL4 as a foundation, or, as an incremental step, even just have a safety-focused driver runtime inside the kernel proper.
Possible, perhaps, but feasible? Microsoft certainly had a go at it with the likes of Midori and Singularity, but these were met with the same fate that will likely befall any managed code kernel. While it's an honorable pursuit with certain merit, to produce a fully featured OS in this way -- without serious concessions -- is just not feasible.
C# is GC'd, so there's a massive memory hit, and it's also not a language you can have in a kernel.
Go: GC again, so no go.
Rust: most sane of the examples you've given.
Apple has already started migrating to Swift which is a memory safe language.
However, the real reason Rust and Go aren't feasible is that they're both essentially all-or-nothing, and neither offers even the most basic semblance of ABI compatibility. Their only nod to ABI stability is "use FFI to C", which means your APIs remain unsafe, and which doesn't work for non-C languages without layering other languages on top of all your system APIs.
Swift at least lets you replace individual objc classes one at a time, and is ABI stable, but has no C++ interaction.
Yes, but what about these huge legacy codebases like the iOS kernel? I assume we will have to deal with this type of vulnerability for years to come...
I had that argument with the C standards people a decade ago. [1] Consensus was that it would work technically but not politically. The C++ people are too deep into templates to ever get out.
The basic trick for backwards compatibility is that all arrays have sizes, but you get to specify the expression which represents the size and associate it with the array. So you don't need array descriptors and can keep many existing representations.
Also, if you have slices, you rarely need pointer arithmetic. Slices are pointer arithmetic with sane semantics.
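A minimal sketch of what carrying the size alongside the pointer buys you in today's C; the names here are illustrative, not taken from any actual standards proposal:

```c
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>
#include <stdlib.h>

// A slice carries its length with the pointer, so every check has the
// information it needs.
typedef struct {
    uint8_t *ptr;
    size_t   len;
} slice;

// Bounds-checked sub-slicing is what replaces most raw pointer arithmetic.
static slice subslice(slice s, size_t off, size_t n) {
    if (off > s.len || n > s.len - off) {
        fprintf(stderr, "subslice out of range: off=%zu n=%zu len=%zu\n",
                off, n, s.len);
        abort();
    }
    return (slice){ s.ptr + off, n };
}

// Bounds-checked element access.
static uint8_t slice_at(slice s, size_t i) {
    if (i >= s.len) {
        fprintf(stderr, "index %zu out of range (len=%zu)\n", i, s.len);
        abort();
    }
    return s.ptr[i];
}
```

The standards proposal described above goes further by letting existing representations keep their layout and merely name the expression that holds the size, but the day-to-day effect is the same: the length travels with the pointer instead of living in the programmer's head.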
I'm tired of seeing decade after decade of C/C++ buffer overflows. It speaks badly of software engineering as a profession.
The political aspect is why I suggest the solution is to just up and fire all those guys. More realistically, Microsoft, Apple, and Linus could just force the issue: get it added to Visual C/C++, LLVM, and GNU C as an extension, and then start polluting code bases and APIs with it.
Microsoft is already kind of doing it: if you compile in debug mode, you get bounds checking in all STL types, and you can enable it in release builds as well.
But yeah, it only works if you use those types.
There are other divisions pushing for .NET and Rust systems code in Windows, but the political wars between WinDev and DevTools are quite well known, e.g. Longhorn (in .NET) vs WinRT (same thing just in COM).
The proper thing is for bounds checking to be on by default, so that you have to explicitly turn it off for hot-path code, because 99% of the code people write isn't memory- or CPU-bound. If a little-used code path is sometimes throwing bounds exceptions, you want that logged in production.
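As a sketch of that default (the names and policy here are mine, purely illustrative): checked access is the ordinary spelling and makes noise in production, while the unchecked variant has to be asked for by name.

```c
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>
#include <stdlib.h>

// Default spelling: checked. An out-of-bounds index is logged so it shows
// up in production telemetry instead of silently corrupting memory.
static uint8_t buf_get(const uint8_t *buf, size_t len, size_t i) {
    if (i >= len) {
        fprintf(stderr, "bounds violation: index %zu, length %zu\n", i, len);
        abort();
    }
    return buf[i];
}

// Opt-out spelling for measured hot paths only; the name makes the
// trade-off visible at every call site.
static inline uint8_t buf_get_unchecked(const uint8_t *buf, size_t i) {
    return buf[i];
}
```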
Check C. A. R. Hoare's Turing Award speech from 1981 regarding the use of bounds checking in the '60s, and how customers of their Algol compiler felt it should be against the law to even turn the checks off.
I'm not sure what exactly you are trying to say. As far as I can tell, there are indeed safe variants for arrays in the standard - both static and dynamic. People just choose not to use them for some arbitrary reasons.
> What's more, with directional antennas, higher transmission powers and sensitive receivers the range of such attacks can be considerable.
I'm reminded of ye olde Gumstix BlueSniper rifle. Back in the early 2000s there was a series of exploits against Bluetooth stacks. The standard response by the industry was that the attacks weren't practically exploitable due to the low power of typical Bluetooth devices.
The BlueSniper was a cantenna plus a Gumstix SBC, constructed specifically to demonstrate how cheap the threat actually was.
Apple sits on this giant stack of unused money [1]. Why don't they get the best security researchers in the world, pay each of them north of $1M / year in salary, and create the ultimate red team whose only task is to try to hack Apple devices?
If they get a team of 1000(!) people, each with $1M(!) in salary that would be less than 0.5%(!) of their revenue in 2019 [2].
There are dozens, perhaps hundreds, of people working at the level we're talking about here --- vulnerability research is highly specialized. So the better question is perhaps why Apple doesn't build a program to train 1000 researchers to compete.
I get the impression that while Apple is world-class at HW ops, they are very mediocre at people ops. (and I get the impression that Google is the opposite)
I guess. Project Zero has a sort of unique history; as I understand it, it's less a reflection of Google's distinctive culture than it is Google's savvy in acquiring and nurturing a pre-existing research culture, and that might not be replicable. But you can also ask the question: how much of an impact has P0 had on shielding Google and its partners from similar vulnerabilities? If your impression is that, because of people like Ian Beer, Android phones are basically impregnable, I'll submit without a lot of insider knowledge that you're probably mistaken.
What an Apple P0 buys Apple might just be a bunch of favorable nerd press cycles. But that's not a problem Apple really has.
I am, however, convinced that with the right resource commitment, you could scale up a world-class research capability --- to potentially arbitrary levels --- without headhunting existing researchers, which is where I see the bottleneck right now.
Or, I mean, Apple could just rewrite their OS infrastructure in a memory-safe language. If I had the two options, I would put all my chips on the language change.
(I think P0 is extremely cool and valuable to Google in a bunch of ways, and I would be thrilled to see more major vendors try to replicate it, even though I doubt they'll be successful.)
I think you're very _very_ wrong about people ops at Apple.
The reason Apple, in 20 years, went from being 90 days away from bankruptcy to a revolutionary machine and the most valuable company in the world, at $2+ trillion, is not HW ops or anything else; it's people.
While we know Steve Jobs had "issues with people", he also clearly stated:
> My model for business is The Beatles. They were four guys who kept each other's kind of negative tendencies in check. They balanced each other and the total was greater than the sum of the parts. That's how I see business: great things in business are never done by one person, they're done by a team of people.
It takes a lot of people effort, talent and operations to achieve what Apple has achieved. So I think saying Apple is mediocre at people ops is unfair.
I am just surprised because there are so many problems in tech where throwing money at it is not going to improve things.
However in this case, shouldn’t they be able to attract the best in the world just by turning the money gauge up?
If you are one of the most highly specialised vulnerability researchers in the world, would you seriously reject a $10m / year offer from Apple where you’d be able to spend all your time doing what you love with the only condition being that you report findings to Apple?
It mystifies me too. I'm an independent security researcher that currently has a vulnerability in macOS with grave implications. I'd like to sell it to Apple for a fair price, but their security email is a dead end. Every time I've reached out they want me to disclose all of my research up front, no price negotiation. After doing as many bug bounties as I have, I've been burned one too many times by companies giving ~$200 for weeks or months of effort (less than minimum wage of course) on P1/P2 vulnerabilities in their infrastructure. I'm talking to a few groups who are willing to negotiate a price with me, but I can't be sure of their intent. I want to get it patched, but it's difficult when Apple themselves are disinterested.
Do you have any reason to think that Apple could stiff people that submit vulnerabilities to them?
My understanding of game theory says that Apple’s incentives are to act with integrity and to pay their bounties. There may be corner cases where confusion reigns, and where Apple mistakes someone for a fraud, but I would presume those need to be very rare – otherwise Apple’s reputation as a buyer would suffer and people would sell to other buyers who cared for their reputation better (and every vulnerability sold to a third party has a high expected cost to Apple. Edit: on second thought, maybe the cost to Apple is fairly low - certainly the maximum bounty size suggests that).
Edit: I agree that Apple stating a maximum payout is hardly helpful. I presume third party buyers indicate a minimum value they will pay depending on the value of the vulnerability to them. There is a market here, and it isn’t clear that Apple is willing to pay market prices, perhaps because too many people/teams give their vulnerabilities to Apple for $0 (e.g. projectzero!)
I think it's more complicated than just what they list on the bounty site. In this case the parent commenter has to provide all of their work to Apple before any discussion of what it's worth. Additionally, it's not like there is a clear and transparent market around bug bounties. Unlike the Chrome bug program, which releases all of its reports, discussions, and payouts after ~90 days or so, there's no way to see the history of what's been reported to Apple.
You can hire all of the smart people willing to work for you, but there will always be someone smarter not able to join you. That's either because they don't like you, or something else preventing them. Either way, you cannot guarantee that you will catch 100% of the vulns 100% of the time.
No, because there is no reason to assume that would materially improve security. Do you think a bulletproof vest manufacturer hiring the best gunmakers in the world would dramatically improve their bulletproof vests? It could help, and it is certainly essential to have good bullet/gun engineers on staff, but you would probably be better off hiring people who know materials science and the actual job of making bulletproof vests.
It would be far more beneficial for them to just use the tried-and-true techniques that have already been deployed for decades in high-reliability/high-security systems. If such things are too onerous, they could run development-methodology experiments to remove the elements that provide the least security ROI, producing lesser, but still good, systems at reduced cost. This would be far more likely to produce a good outcome than taking the standard high-velocity commercial methodology, which has failed to produce meaningful security despite decades of attempts, and trying to enhance it into a high-security process. With the former you can be reasonably confident you get good security, though possibly at a higher cost than desired. With the latter, although the cost may be lower, the security is a complete unknown, since you are using a new process invented by people who have never used, let alone made, a high-security process before, and it is a class of strategy that has literally never succeeded over multiple decades of attempts. Not to say it could not happen, since it took hundreds or possibly even thousands of years of failed attempts before heavier-than-air flight was cracked, but they would probably be better served just using the existing techniques that are known to solve the problem.
Because there are always more bugs to be found in unsound software.
This finding is not about this single bug, it's just that someone bothered to scrape the surface.
(Note that 99% of the effort went into crafting the demo exploit once the vulnerability was found, which is basically wasted effort in the context of eliminating vulnerabilities - the vulnerability finding was easy)
Well, they might be trying: they recently hired Brandon Azad from P0, who is definitely up there. The problem is that a lot of high-calibre security people simply don't want to work for Apple. I suppose it's out of spite for all their shitty policies.
I am actually not convinced about your assumption that it wouldn’t make them any money in the long-term.
My theory is: people that are quite tech savvy (like the HN crowd) would look at such an effort quite favourably and these folks are often micro-influencers when it comes to buying decisions of their direct peers.
Just an anecdote, but my entire family uses Apple devices, because I am the go-to computer guy in that circle and I advised them to buy Apple. The company that I co-founded used Apple hardware and so on.
Maybe that is just wishful thinking and it is hard to quantify, but I’d like to believe that increasing your reputation with developers (who are themselves a niche) helps you grow revenue in the long term nevertheless.
I mostly agree with you. I'd like to point out this seems to be the first project zero post on HN that hasn't had a handful of posts suggesting project zero is a hit squad going after Apple.
Well your anecdote doesn't seem to support your argument.
Google is the one paying all those researchers at Project Zero, Apple doesn't seem to have that kind of security group, and yet you still buy/recommend Apple instead of Android.
I am sure it can move some people, but the reality is that this kind of effort is so down the list of priorities when buying for most people that it is certainly not worth $1B per year.
People need to accept that the problem is the language. We will never solve the developer problem, but we will/can/have produced languages that make these types of errors impossible/extremely unlikely.
'AWDL is an Apple-proprietary mesh networking protocol designed to allow Apple devices like iPhones, iPads, Macs and Apple Watches to form ad-hoc peer-to-peer mesh networks.
...
And even if you haven't been using those features, if people nearby have been then it's quite possible your device joined the AWDL mesh network they were using anyway.'
Wow, so Apple was ahead of Amazon's Sidewalk with AWDL. Can you disable this?
> Wow, so Apple was ahead of Amazon's Sidewalk with AWDL.
Not exactly. The wording in the article implies that AWDL forms some kind of multi-hop network topology, but it doesn’t - it just enables nearby devices to communicate with each other directly at Wi-Fi speeds without the burden of pairing (like Wi-Fi Direct) or being associated with the same Wi-Fi network.
This is used not just in AirDrop but also in the Multipeer Connectivity Framework, AirPlay 2 and the Continuity framework. The standard discovery mechanism for these services is mDNS over AWDL, so for a device to browse for or advertise these services, it needs to be aware of other nearby AWDL neighbours first. (For example, you can browse for and discover other nearby AirDrop devices even if you don’t allow incoming AirDrop enabled yourself.)
It’s also worth noting that Apple devices very strictly do not send or receive AWDL traffic when they are locked/asleep, and will often even stop listening on the AWDL social channels when there are no services being advertised or in use.
It looks like disabling airdrop doesn't do anything:
> All iOS devices are constantly receiving and processing BLE advertisement frames like this. In the case of these AirDrop advertisements, when the device is in the default "Contacts Only" mode, sharingd (which parses BLE advertisements) checks whether this unsalted, truncated hash matches the truncated hashes of any emails or phone numbers in the device's address book.
Then follows the section on brute-forcing 2 bytes (only) of a SHA256 hash.
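For a sense of how little protection 2 bytes offer, here is a rough sketch of grinding candidate phone numbers against a truncated hash, assuming OpenSSL's `SHA256()` and glossing over the exact identifier formatting AirDrop applies before hashing:

```c
#include <openssl/sha.h>   // link with -lcrypto
#include <stdint.h>
#include <stdio.h>
#include <string.h>

// Hypothetical target: the 2 truncated hash bytes observed in a BLE
// advertisement (values are made up for illustration).
static const uint8_t target[2] = { 0xab, 0xcd };

int main(void) {
    char candidate[32];
    uint8_t digest[SHA256_DIGEST_LENGTH];

    // Enumerate candidate numbers; a real attacker would restrict this to a
    // plausible numbering plan for the victim's region.
    for (unsigned long long n = 0; n < 10000000000ULL; n++) {
        snprintf(candidate, sizeof candidate, "+1%010llu", n);
        SHA256((const uint8_t *)candidate, strlen(candidate), digest);
        if (memcmp(digest, target, sizeof target) == 0) {
            // Only 16 bits are compared, so ~1 in 65,536 candidates match.
            printf("possible match: %s\n", candidate);
        }
    }
    return 0;
}
```

With only 16 bits of the hash retained, the search space an attacker has to cover is tiny and collisions are plentiful, which is the whole point of the brute-forcing section.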
I don't think that proves what you think it does - that's with AirDrop on, but in a limited mode.
If you turn AirDrop/Bluetooth off, you may well disable this.
On my phone:
- If you disable Bluetooth in the notification tray, then it goes to Bluetooth "Not Connected", but not Off.
- If you disable Bluetooth in settings, AirDrop automatically goes into "Receiving Off".
- If you then enable AirDrop, it'll automatically turn Bluetooth on.
So I don't think it's true that you can't disable it - unless the UI is misleading about Off.
A bit OT - how do I work on developing the skill set necessary to find vulnerabilities like these? Should I take some particular courses, or some other “track” of sorts? At the moment, I have an undergraduate in Computer Sciences, and I’d say I’m a fairly OK programmer.
Check out LiveOverflow on YT. Maybe play some CTFs, but don't do that super seriously, just enough to get you hooked on binary exploitation. They're fun, especially if you find some teammates to cooperate with.
And then just, well, practice. A lot of practice. Mostly driven by curiosity about how things work - bugs will then just start to pop up and you are free to investigate whatever piques your interest. The more likely you are to just open up a debugger when a piece of software annoys you and try to binary patch it, the closer you are to being a security researcher :).
There aren't many books or courses on this; low-level hacking is something that you kind of just learn as you go. But, for instance, if you've never touched gdb/lldb, or never looked at assembly code, or never written C, you should work on those base skills first.
There is an excellent pre-packaged VM with levels of challenges that take you through the basics of exploitation to quite advanced levels called "Modern Binary Exploitation" [0]. I would highly recommend it.
You can also do the challenges using IDA/Ghidra instead of looking at the source for a proper challenge and I recommend doing this initially for each challenge.
I'd recommend CTF'ing a bit stronger than the other commenter. While there can be a distinct gap between the vulnerabilities in ctfs and real world applications, CTFs provide a great means of deliberate practice (work on a problem, potentially figure it out, and then read other peoples' write-ups after the competition ends).
I didn't mean to discourage anyone from playing CTFs; I just became jaded by seeing the same kind of heap feng shui tasks over and over and over again :). You know, the note-management linked-list task with a simple CLI menu. Not to mention the proliferation of 0/1-day tasks, which are IMO just lazy.
Do play CTFs. Just pick the fun challenges. pwnable.kr used to have some good stuff if you want to level up.
I think we're on the same page. Once someone gets good enough at heap shenanigans, they likely have a good enough skill baseline to go after real targets. In terms of skill development though, I found ctf'ing gave me a decent sense of what may be exploitable, that it would be hard to get otherwise.
It would be amazing to plot the 2.4 GHz amplitude vs. time series plot of this exploit.
Think about it, an ocean of electrons in the copper WiFi antenna bump along with a certain guiding EM wave and in so doing, they inadvertently cause the information moving electrons in the silicon crystal to disconnect from the electrons being pushed out of the Li-ion battery.
This amplitude fluctuation could in principle have been broadcast by the motions of stars in the universe, as astronomy does peer into the deep with these frequencies [0].
In the future, one could imagine a bad actor with control over a global network of low-orbit satellites spewing out this code for decades, preventing such devices from being turned on long enough to receive updates and deactivating billions of dollars of human capital.
Probably more than a hundred; there are teams of dozens at the good corporate security groups and an unknown number working for governments and other organizations that don’t appear as publicly.
The link to the clang pointer auth doc is broken, Apple changed their default branch name to 'main' instead of 'master'. A (more?) permanent link is [1].
I checked and airplane mode seems to disable wifi and 4g but not bluetooth. Airdrop refuses to work without wifi. Not sure to what extent wireless is actually turned off for airplane mode now though.
Both Wi-Fi and Bluetooth are usually turned off by default when you turn on airplane mode, but you can turn on both without turning off airplane mode, at least in my experience.
The article explains bypassing exactly this (PA/PAC).
> Vulnerability discovery remains a fairly linear function of time invested. Defeating mitigations remains a matter of building a sufficiently powerful weird machine. Concretely, Pointer Authentication Codes (PAC) meant I could no longer take the popular direct shortcut to a very powerful weird machine via trivial program counter control and ROP or JOP. Instead I built a remote arbitrary memory read and write primitive which in practise is just as powerful and something which the current implementation of PAC, which focuses almost exclusively on restricting control-flow, wasn't designed to mitigate.
Signed pointers are just a mitigation. With enough time to find other primitives/constructs (from less severe but more common bugs) you will work around them.
AWDL is a wireless protocol that Apple used for things like AirDrop. In the AWDL handling code in the kernel there is a 60-byte buffer that gets copied over by an up-to 1024 byte buffer supplied by an attacker. Using other bugs and poor address randomization Ian Beer from Google Project Zero discloses kernel memory, then constructs a kernel read and write primitive. Then he demonstrates how this can be used to gain privileged code execution in userspace by launching the calculator and making a program to extract user photos.
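In hypothetical form, the bug class being described looks something like this (illustrative code, not Apple's):

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define SYNC_TREE_BUF_SIZE 60  // fixed-size destination inside the peer object

struct awdl_peer {
    uint8_t sync_tree[SYNC_TREE_BUF_SIZE];
    /* ... other per-peer state ... */
};

// Hypothetical TLV handler illustrating the bug class described above.
void handle_sync_tree_tlv(struct awdl_peer *peer,
                          const uint8_t *payload, size_t payload_len) {
    // BUG: payload_len comes straight off the air and can be far larger
    // than 60 bytes, so this smashes whatever follows sync_tree.
    memcpy(peer->sync_tree, payload, payload_len);

    // FIX: reject or clamp before copying, e.g.:
    // if (payload_len > sizeof peer->sync_tree) return;
}
```

The real kernel code is C++ and the copy was a memmove, but the shape of the mistake is the same: a length taken from the network, trusted against a fixed-size destination.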
Are Androids without crapware as insecure as iPhones?
I wonder if the daily HN article about Apple failing to be secure is a result of 1 OS, 1 phone, whereas no one is going to put the effort into finding an exploit on a phone with 1% market share.
Android has had several critical flaws recently. The ones I can remember are Stagefright and Dirty COW. Stagefright was easily remotely triggerable since it was in a media library that runs on received media.
The main difference between the two that I have seen is that iOS users get an update that fixes the issue, often even after their device has stopped getting feature updates, while many Android users are on kernels that haven't received an update in the last few years.
Stagefright was ~5 years ago now, though. I remember it, because the company I worked for at the time flipped out and banned all Android phones from their network for over a year.
It was fantastic; I got a whole year of not being able to see work emails after I went home, and then they let us opt out of the invasive MDM software that they wanted to put on Android phones to let them access corporate email. All for a bug that my phone wasn't even vulnerable to.
By the time I left, I had gone 4 years without ever responding to unexpected evening emails. And now that I know it's possible, I'm never going back! :)
Dirty COW was a Linux kernel bug. The reason the news and drama were all in the Android scene was that every desktop/server Linux installation would be patched long before a malicious binary could be run on the machine (I think it was patched before the news even broke), while Android would be left with the exploit until all of the older devices made it to e-waste. It also meant the rooting efforts would be helped somewhat.
The problem with Android is that there is a lot of software in the end OS that's not open source and is delivered as binaries from component manufacturers (the GPU drivers tend to be the worst; they almost universally come from Qualcomm since most phones now use the same series of SoCs). Once the hardware is released these are rarely, if ever, updated, which means the vulnerabilities aren't patched. The phone manufacturers are just as helpless as the community is in this situation. Project Treble mitigates this to some degree, but the individual software components still can't be updated.
> After a day or so of analysis and reversing I realize that yes, this is in fact another exploitable zero-day in AWDL. This is the third, also reachable in the default configuration of iOS.