What is Horizontal vs Vertical Scaling in System Design?

Question

Accepted Answer

I'd start with vertical scaling for simplicity, then move to horizontal when approaching resource limits. For stateless app servers, horizontal scaling behind a load balancer is straightforward. For databases, I'd scale vertically first, then add read replicas for read-heavy workloads, and finally shard writes when single-node write throughput is the bottleneck. The key enabler for horizontal scaling is stateless services — all session and application state must live in external stores.

Related concepts