My personal experience, over 20 years, is that if something like credential rota...

Silhouette · on March 23, 2020

It sounds complicated, but if you have decent abstractions, this kind of stuff is actually pretty easy to accomplish.

I'd be interested in seeing any end-to-end examples of how people are doing this in practice.

For example, suppose you're maintaining a SaaS application and you have a private key to access some third party API that certain parts of your back end code need. How do you automate this process, so you change your private key on a regular schedule and update all affected hosts so your application code picks up the new one?

Ideally this needs to avoid introducing risks like a single point of failure, a new attack surface, or the possibility of losing access to the API altogether if something goes wrong. Assuming the old key is immediately invalidated when you request a new one via some API, you also need a real time way of looking up the current active key from any of your application hosts when they need it, again without creating single points of failure, etc.

No doubt this could be done with enough work, but it doesn't feel like a trivial problem.

infogulch · on March 24, 2020

The extra complication of synchronizing multiple keyholders onto an updated credential is neatly solved by having two keys that are both valid at the same time. You rotate one at a time, and you only retire the old key after every node that needs it has moved to the new one.
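To make the two-key idea concrete, here is a rough sketch in Python. It is a toy simulation, not a real integration: `Provider`, `Node`, `rotate`, and the slot names `"a"` and `"b"` are all hypothetical, and it assumes the third party lets you regenerate one key slot while the other stays valid.

```python
import secrets


class Provider:
    """Hypothetical third-party API that keeps two key slots valid at once."""

    def __init__(self):
        self.keys = {"a": secrets.token_hex(16), "b": secrets.token_hex(16)}

    def regenerate(self, slot):
        # Replaces the key in one slot; the other slot is untouched.
        self.keys[slot] = secrets.token_hex(16)
        return self.keys[slot]

    def is_valid(self, key):
        return key in self.keys.values()


class Node:
    """Hypothetical application host holding one credential."""

    def __init__(self, key):
        self.key = key

    def call_api(self, provider):
        # Stands in for a real API call; True means the key was accepted.
        return provider.is_valid(self.key)


def rotate(provider, nodes, active_slot):
    """Move every node to a fresh key, then retire the old one.

    Returns the name of the slot that is active after rotation.
    """
    standby = "b" if active_slot == "a" else "a"
    # Step 1: issue a new key in the standby slot. The active key still works.
    new_key = provider.regenerate(standby)
    # Step 2: roll the new key out node by node, at whatever pace is safe.
    for node in nodes:
        node.key = new_key
    # Step 3: only once every node has confirmed the move, invalidate the old key.
    if all(node.key == new_key for node in nodes):
        provider.regenerate(active_slot)
    return standby
```

At no point in `rotate` does a node hold a key the provider would reject, which is the whole point: the rollout in step 2 can be slow or partially fail, and nothing breaks until you choose to run step 3.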

Silhouette · on March 24, 2020

Sure, if the API you're dealing with supports that then it's easy. But if it doesn't, you have a non-trivial timing problem.

infogulch · on March 31, 2020

I would say that an API that requires authorization keys is incomplete if it doesn't provide the tools to manage them securely. How could a service even offer scalability and security if it doesn't support two keys with rotation? It's a "non-trivial problem" because consistent, available, distributed API key rotation is not just hard, it's impossible. (See: the CAP theorem.)

That doesn't solve your problem, but it does mean you should take this complaint to whichever API provider you are using.

sitkack · on March 24, 2020

If the skeleton of your system starts with these processes in place, then you can evolve the architecture while maintaining these invariants. If something is an invariant rule, then it needs to exist at the start of the system's life. If you patch it into the system later, it won't have proper coherence.