Scalability means that an application / system can handle greater loads by adapting.

Two types of scalability: vertical and horizontal.

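Vertical scalability is increasing the size of an instance (for example, more CPU or memory). Horizontal scalability is increasing the number of instances.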
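A rough sketch of the two approaches as arithmetic. The load and capacity figures and both function names are made up for illustration; nothing here comes from a real sizing tool.

```python
import math


def instances_needed(load_rps: float, capacity_per_instance_rps: float) -> int:
    """Horizontal scaling: how many identical instances cover the load."""
    if capacity_per_instance_rps <= 0:
        raise ValueError("capacity must be positive")
    # Round up, and always keep at least one instance running.
    return max(1, math.ceil(load_rps / capacity_per_instance_rps))


def capacity_needed(load_rps: float, instance_count: int) -> float:
    """Vertical scaling: how large each instance must be for a fixed count."""
    if instance_count <= 0:
        raise ValueError("instance count must be positive")
    return load_rps / instance_count


# Hypothetical numbers: 2500 requests/s, each instance handles 1000 requests/s.
print(instances_needed(2500, 1000))  # 3
print(capacity_needed(2500, 1))      # 2500.0
```

Horizontal scaling answers "how many instances?", vertical scaling answers "how big must each one be?".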

Active and passive high availability:

In passive high availability, there is a primary instance and one or more standby instances. The standby instances are not actively serving requests but are kept in sync with the primary instance.

In active high availability, multiple instances are actively serving requests simultaneously. This is often achieved through load balancing, where incoming requests are distributed across multiple instances.

Active high availability is ideal for applications with high traffic that need both high availability and scalability.
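A minimal sketch of how request routing differs between the two modes. The class names, instance names, and the `healthy` set are hypothetical, and round-robin is only one possible load balancing strategy.

```python
class ActivePassive:
    """The primary serves every request; a standby takes over only on failure."""

    def __init__(self, primary, standbys):
        self.primary = primary
        self.standbys = list(standbys)

    def route(self, healthy):
        # healthy: set of instance names currently considered up.
        for instance in [self.primary, *self.standbys]:
            if instance in healthy:
                return instance
        raise RuntimeError("no healthy instance")


class ActiveActive:
    """All instances serve requests; round-robin spreads the load."""

    def __init__(self, instances):
        self.instances = list(instances)
        self._next = 0

    def route(self, healthy):
        # Try each instance at most once, skipping unhealthy ones.
        for _ in range(len(self.instances)):
            instance = self.instances[self._next]
            self._next = (self._next + 1) % len(self.instances)
            if instance in healthy:
                return instance
        raise RuntimeError("no healthy instance")


ap = ActivePassive("a", ["b"])
print(ap.route({"a", "b"}))  # a (the standby stays idle)
print(ap.route({"b"}))       # b (failover to the standby)

aa = ActiveActive(["a", "b", "c"])
print([aa.route({"a", "b", "c"}) for _ in range(4)])  # ['a', 'b', 'c', 'a']
```

In the passive setup the standby adds availability but no capacity. In the active setup every instance adds both.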

High availability (HA) ensures that a system remains operational and accessible even in the event of failures.

Horizontal scaling, by contrast, is adding more instances.

Elastic Load Balancer:

/health defines the route the load balancer requests to check whether an instance is healthy.
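A minimal sketch of what an instance might expose on that route, using only the Python standard library. The notes give only the /health path; the port 8080, the "OK" body, and the names `health_response`, `HealthHandler`, and `serve` are assumptions for illustration. A load balancer health check typically treats a 200 response as healthy.

```python
from http.server import BaseHTTPRequestHandler, HTTPServer
from typing import Tuple


def health_response(path: str) -> Tuple[int, bytes]:
    """Return (status code, body) for a request path."""
    # Ignore any query string so /health?probe=1 still matches.
    if path.split("?", 1)[0] == "/health":
        return 200, b"OK"
    return 404, b"Not Found"


class HealthHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        status, body = health_response(self.path)
        self.send_response(status)
        self.send_header("Content-Type", "text/plain")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


def serve(port: int = 8080) -> None:
    """Start the server; blocks until the process is interrupted."""
    HTTPServer(("0.0.0.0", port), HealthHandler).serve_forever()
```

Calling `serve()` starts the server. The load balancer's health check would then be pointed at the path /health on that port.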
