Finally, use the project management style that suits the project in its current state.
Bobby Marleyhas quoted8 years ago
A matrix of all possible combinations of disasters with plans to address each of these disasters permits you to sleep soundly for at least one night; keeping your recovery plans current and exercised permits you to sleep the other 364 nights of the year.
Bobby Marleyhas quoted8 years ago
The cost of failure is education. Devin Carraway
Bobby Marleyhas quoted8 years ago
Your monitoring system should address two questions: what’s broken, and why?
Bobby Marleyhas quoted8 years ago
In general, an SRE team is responsible for the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning of their service(s).
Bobby Marleyhas quoted8 years ago
Monitoring should never require a human to interpret any part of the alerting domain.
Darkohas quotedlast month
request success rate
Darkohas quotedlast month
Whether it is at Google or elsewhere, monitoring is an absolutely essential compo‐
nent of doing the right thing in production. If you can’t monitor a service, you don’t know what’s happening, and if you’re blind to what’s happening, you can’t be reliable.
Darkohas quotedlast month
store mapping between BNS paths and IP address:port pairs.
Darkohas quotedlast month
Load balancing at the Remote Procedure Call (RPC) level,