26:13 SREcon21 - Take Me Down to the Paradise City Where the Metric Is Green and Traces Are Pretty USENIX
28:50 SREcon21 - Microservices above the Cloud—Designing the International Space Station for Reliability USENIX
30:33 SREcon21 - How We Built Out Our SRE Department to Support over 100 Million Users for the World's 3rd USENIX
13:12 SREcon21 - Scaling for a Pandemic: How We Keep Ahead of Demand for Google Meet during COVID-19 USENIX
49:04 SREcon21 - DevOps Ten Years After: Review of a Failure with John Allspaw and Paul Hammond USENIX
28:45 SREcon21 - Demystifying Machine Learning in Production: Reasoning about a Large-Scale ML Platform USENIX
15:10 SREcon21 - When Systems Flatline—Enhancing Incident Response with Learnings from the Medical Field USENIX
14:39 SREcon21 - How Our SREs Safeguard Nanosecond Performance—at Scale—in an Environment Built to Fail USENIX
27:22 SREcon21 - Taking Control of Metrics Growth and Cardinality: Tips for Maximizing Your Observability USENIX
25:33 SREcon21 - Latency Distributions and Micro-Benchmarking to Identify and Characterize Kernel Hotspots USENIX