Isn't choasmonkey just randomly restarting virtual servers on Amazon?
Kudos to Netflix, but restarting a virtual server vs a physical or a whole data center are different things.
I think every company that cares a bit about high availability knocks out stuff randomly or at least in different ways, does stuff like introducing packet loss, etc. It's another layer and another thing to test that on service/virtual server layer than on close to physical layers.
Of course, one should test that too and it's nowhere near impossible, but Chaosmonkey is for a somewhat different use case.
Kudos to Netflix, but restarting a virtual server vs a physical or a whole data center are different things.
I think every company that cares a bit about high availability knocks out stuff randomly or at least in different ways, does stuff like introducing packet loss, etc. It's another layer and another thing to test that on service/virtual server layer than on close to physical layers.
Of course, one should test that too and it's nowhere near impossible, but Chaosmonkey is for a somewhat different use case.
Also the "article" mentions that tests are done.