Question 5 of 10Pro Only

How do you configure and tune Kubernetes Horizontal Pod Autoscaler for a latency-sensitive API service? What metrics do you use and what are the common tuning challenges?

Sample answer preview

The Horizontal Pod Autoscaler in Kubernetes adjusts the number of pod replicas based on observed metrics. For a latency-sensitive API service, default CPU-based autoscaling is often insufficient, and you need to configure custom metrics and carefully tune the scaling behavior to…

HPAprometheus-adaptercustom metricsstabilization windowscaling policiesreadiness probes

Unlock the full answer

Get the complete model answer, key points, common pitfalls, and access to 9+ more SRE / Platform Engineer interview questions.

Upgrade to Pro

Starting at $19/month • Cancel anytime