The cluster module allows us to create worker processes to improve our NodeJS applications performance. This is specially important in web applications, where a master process receives all the requests and load balances them among the worker processes.
But all this power comes with the cost that must be the application who manages all the complexity associated with process managements: what happens if a worker process exists unexpectedly, how exit gracefully the worker processes, what if you need to restart all your workers, etc.
In this post we present PM2 tool. although it is a general process manager, that means it can manage any kind of process like python, ruby, ... and not only NodeJS processes, the tool is specially designed to manage NodeJS applications that want to work with the cluster module.
- Understanding the NodeJS cluster module
- Using cluster module with HTTP servers
- Using PM2 to manage a NodeJS cluster
- Graceful shutdown NodeJS HTTP server when using PM2
Introducing PM2
As said previously, PM2 is a general process manager, that is, a program that controls the execution of other process (like a python program that check if you have new emails) and does things like: check your process is running, re-execute your process if for some reason it exits unexpectedly, log its output, etc.
The most important thing for us is PM2 simplifies the execution of NodeJS applications to run as a cluster. Yes, you write your application without worrying about cluster module and is PM2 who creates a given number of worker processes to run your application.
The hard part of cluster module
Lets see an example where we create a very basic HTTP server using the cluster module. The master process will spawn as many workers as CPUs and will take care if any of the workers exists to spawn a new worker.
const cluster = require('cluster');
const http = require('http');
const numCPUs = require('os').cpus().length;
if (cluster.isMaster) {
masterProcess();
} else {
childProcess();
}
function masterProcess() {
console.log(`Master ${process.pid} is running`);
for (let i = 0; i < numCPUs; i++) {
console.log(`Forking process number ${i}...`);
cluster.fork();
}
cluster.on('exit', (worker, code, signal) => {
console.log(`Worker ${worker.process.pid} died`);
console.log(`Forking a new process...`);
cluster.fork();
});
}
function childProcess() {
console.log(`Worker ${process.pid} started...`);
http.createServer((req, res) => {
res.writeHead(200);
res.end('Hello World');
process.exit(1);
}).listen(3000);
}
The worker process is a very simple HTTP server listening on port 3000 and programmed to return a Hello World
and exit (to simulate a failure).
If we run the program with $ node app.js
the output will show something like:
$ node app.js
Master 2398 is running
Forking process number 0...
Forking process number 1...
Worker 2399 started...
Worker 2400 started...
If we go to browser at URL http://localhost:3000
we will get a Hello World
and in the console see something like:
Worker 2400 died
Forking a new process...
Worker 2401 started...
That's very nice, now lets go to see how PM2 can simplify our application.
The PM2 way
Before continue, you need to instal PM2 on your system. Typically it is installed as a global module with $ npm install pm2 -g
or $ yarn global add pm2
.
When using PM2 we can forget the part of the code related with the master process, that will responsibility of PM2, so our very basic HTTP server can be rewritten as:
const http = require('http');
console.log(`Worker ${process.pid} started...`);
http.createServer((req, res) => {
res.writeHead(200);
res.end('Hello World');
process.exit(1);
}).listen(3000);
Now run PM2 with $ pm2 start app.js -i 3
and you will see an output similar to:
Note the option
-i
that is used to indicate the number of instances to create. The idea is that number be the same as your number of CPU cores. If you don't know them you can set-i 0
to leave PM2 detect it automatically.
$ pm2 start app.js -i 3
[PM2] Starting /Users/blablabla/some-project/app.js in cluster_mode (3 instances)
[PM2] Done.
| Name | mode | status | ↺ | cpu | memory |
| ----------|---------|--------|---|-----|-----------|
| app | cluster | online | 0 | 23% | 27.1 MB |
| app | cluster | online | 0 | 26% | 27.3 MB |
| app | cluster | online | 0 | 14% | 25.1 MB |
We can see the application logs running $ pm2 log
. Now when accessing the the http://localhost:3000
URL we will see logs similar to:
PM2 | App name:app id:0 disconnected
PM2 | App [app] with id [0] and pid [1299], exited with code [1] via signal [SIGINT]
PM2 | Starting execution sequence in -cluster mode- for app name:app id:0
PM2 | App name:app id:0 online
0|app | Worker 1489 started...
We can see how PM2 process detects one of our workers has exit and automatically starts a new instance.
Conclusions
Although the NodeJS cluster module is a powerful mechanism to improve performance it comes at the cost of complexity required to manage all the situations an application can found: what happens if a worker exists, how can we reload the application cluster without down time, etc.
PM2 is a process manager specially designed to work with NodeJS clusters. It allow to cluster an application, restart or reload, without the required code complexity in addition to offer tools to see log outputs, monitorization, etc.
Top comments (2)
If there is a node js module for handling REST requests, will running two identical instances of it using PM2 make the performance (throughput) better - (note it is in fork mode since each slightly has different configuration)?
Do you know how to cluster with PM2 programmatically?
Can you point to any examples please?