1. Monitoring & Observability What are the key metrics you monitor to ensure service reliability? How do you prioritize them? Can you explain the difference between monitoring, logging, and tracing, and give an example of when you’d use each? Describe a time when you set up monitoring or alerting for a critical system. What were the challenges, and how did you address them? 2. Incident Management & Troubleshooting What’s your approach to diagnosing and resolving a high-severity incident? Can you walk me through an example? How do you conduct post-incident reviews to prevent recurrence, and what do you look for? Explain how you would handle an incident where latency suddenly spikes for a critical application. What steps would you take? 3. Automation & Tooling How do you identify opportunities for automation in daily tasks? Give an example of a repetitive task you automated. What tools have you used for automating infrastructure deployment and configuration management? Explain how you would approach building a self-healing system. What tools and practices would you use? 4. Scalability & Performance How would you design a system to handle high traffic loads while maintaining low latency? Can you explain the concept of horizontal vs. vertical scaling and when you would use each? Describe an instance when you helped optimize a system for scalability. What methods did you employ? 5. Reliability & Availability What are Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Service Level Agreements (SLAs), and why are they important in SRE? How would you handle a situation where you’re nearing your error budget for the quarter? What are some common trade-offs you consider when balancing reliability with system performance and cost?
Sre Interview Questions
1,936 sre interview questions shared by candidates
- Career overview and presentation - Motivation - Kubernetes deployment of a web app & documentation
Tell me about a project where you successfully influenced a other team
Python coding and discussion on DSA concepts
No.
Which function is used in C to allocate a block of memory? to free? Which function is used in C++ to allocate a block of memory? to free? How many bytes are necessary to store a MAC address? What is an inode? What are the packets exchanged to establish a TCP connection? Which of the following algorithms is not Θ(log(n))?
Count consecutive equal integers in an array Depth first search using recursion Topological sort with cycle detection
Similar to "Median from data stream"
Design a Site Reliability System which scales for large number of users
1) Some string manipulation problem
Viewing 1751 - 1760 interview questions