Finally, use the project management style that suits the project in its current state.
Bobby Marleyhas quoted8 years ago
A matrix of all possible combinations of disasters with plans to address each of these disasters permits you to sleep soundly for at least one night; keeping your recovery plans current and exercised permits you to sleep the other 364 nights of the year.
Bobby Marleyhas quoted8 years ago
The cost of failure is education. Devin Carraway
Bobby Marleyhas quoted8 years ago
Your monitoring system should address two questions: what’s broken, and why?
Bobby Marleyhas quoted8 years ago
In general, an SRE team is responsible for the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning of their service(s).
Bobby Marleyhas quoted8 years ago
Monitoring should never require a human to interpret any part of the alerting domain.
Darkohas quoted2 months ago
request success rate
Darkohas quoted2 months ago
Whether it is at Google or elsewhere, monitoring is an absolutely essential compo‐
nent of doing the right thing in production. If you can’t monitor a service, you don’t know what’s happening, and if you’re blind to what’s happening, you can’t be reliable.
Darkohas quoted2 months ago
store mapping between BNS paths and IP address:port pairs.
Darkohas quoted2 months ago
Load balancing at the Remote Procedure Call (RPC) level,