DEV Community

Pragya Sapkota

Posted on Jan 22, 2023 • Originally published at pragyasapkota.Medium

Clustering: How much does it differ from Load Balancing?

#clustering #architecture #distributedsystems #beginners

A cluster can be defined as a group of stuff. Likewise, a computer cluster is a group of computers that works for a common goal. The functionality of clustering is to divide workloads among the computers that act like nodes so that their combined memory and processing power of them all together gives a better overall performance.

All the nodes in a cluster are connected to a network where they form an internode connection and later form a cluster. They even might have shared storage or local storage on each node.

On a cluster, there is always one node that acts as a leader and is an entry point for the cluster. It delegates the tasks to the other fellow nodes and sometimes even returns the result to the user. The whole cluster works like a single system and there is no way for the user to know if they are interacting with a cluster or an individual machine. The clusters lead to low latency and prevent bottlenecks in node-to-node communication as well.

We can see clustering in many technologies these days. There are Kubernetes and Amazon ECS as containers, Cassandra and MongoDB as databases, and Redis as caches. There are three types of usual clusters,

Highly Available or fail-over
Load Balancing
High-Performance computing

For the highly available clusters, there are two kinds of configurations. Let’s look at them individually.

Active-Active Configurations

In the active-active configurations, there are at least two nodes that work together for the same kind of service. The concept of active-active comes from load balancing that distributes the work across the devices in the group. This helps the nodes remain intact since the overload is prevented in the configuration.

In addition, some nodes are spared to be used later which can help improve throughput and response time.

Active-Passive Configurations

Further, there is an active-passive configuration where there are at least two nodes like the active-active configurations. However, the node from the active-passive cluster has a special feature. Not all nodes are active in the configuration, but some will be in passive or standby mode as well. They only come forward when the active node fails or goes passive due to any reason.

How much different is Clustering from Load Balancing?

While load balancing and clustering might sound the same, they are not. They do share some common features though. While the nodes or the servers have no idea of each other’s existence in the load balancer, they are aware of the nodes in clustering. Clustering also gives us redundancy and boosts capacity and availability because all the servers work for the same goal. Likewise, the load balancer makes its node react to the requests they receive respectively.

To note, we can implement both load balancing and clustering in the system as well. It can also be in use when independent servers are for the same goal. They can be beneficial when specific web services are required.

Advantages of Clustering

Highly Scalable
Availability
Improved Performance
Low Latency
Cost Effective

Challenges of Clustering

More complexity for installation and maintenance
Increased complications if the nodes are not homogenous
Managing Storage is hard since the nodes can sometimes over-write one another in shared space
Extra hard work — distributed data needs to be in sync always

pragyaasapkota / System-Design-Concepts

Though the concepts of system design might be tricky, let's see them individually to their core concepts and have a better understanding.

System Design

Systems design is the process of defining elements of a system like modules, architecture, components and their interfaces and data for a system based on the specified requirements.

This is a index for the concepts of system.

If you wish to open these in a new tab, Press CTRL+click

S.N.	Table of Content
1.	Caching
2.	Network Protocols
3.	Storage: The Underrated Topic
4.	Latency and Throughput
5.	System Availability
6.	Leader Election
7.	Proxies
8.	Load Balancing
9.	Endpoint Protection
10.	HTTPS: Is it better than HTTP?
11.	Polling and Streaming
12.	Long Polling
13.	Hashing
14.	CAP Theorem
15.	PACELC Theorem
16.	Messaging and Pub-Sub
17.	Database Relational Database Non-relational Database
18.	Logging, Monitoring, and Alerting
19.	Distributed System
20.	Scaling
21.	Event Driven Architecture (EDA)
22.	CQRS
23.	Message Queue
24.	Architectural Patterns
25.	Enterprise Service Bus (ESB)
26.	SLA, SLO, and SLI
27.	Heartbeat

…

View on GitHub

I hope this article was helpful to you.

Please don’t forget to follow me!!!

Any kind of feedback or comment is welcome!!!

Thank you for your time and support!!!!

Keep Reading!! Keep Learning!!!

DEV Community

Clustering: How much does it differ from Load Balancing?

Active-Active Configurations

Active-Passive Configurations

How much different is Clustering from Load Balancing?

Advantages of Clustering

Challenges of Clustering

pragyaasapkota / System-Design-Concepts

Though the concepts of system design might be tricky, let's see them individually to their core concepts and have a better understanding.

System Design

Top comments (0)

Read next

Understanding the Barrel Pattern in JavaScript/TypeScript

Simplify Environment Variable Management with GitHub Environments

10 Cool Ideas for Discord Bots You Can Build Today

Understanding Software Architecture Diagrams: Beyond the Blueprint