proc-watch: find out what is using the most CPU and memory

#prometheus #devops #python

When you run multiple EC2 instances or a shared environment, it is useful to monitor how much CPU and memory are being used over time.

For example, if there's a lot of processing happening and CPU usage is high:

there could be something strange going on
...or it could be an expected amount of CPU usage depending on the workloads of the compute nodes

If there's a lot of unexpected memory usage:

it could be a memory leak
...or a newly launched process that is consuming more resources

If more users are logging in and starting various programs, there could be more processes running, or more CPU or memory usage at particular times of day.

It can be hard to tell what's actually happening unless you SSH into the instance and check the CPU/memory usage with top or htop.

proc-watch is meant to help with this. It is a Python program that parses the output of ps on Linux and macOS systems to tell you:

the number of processes that are currently running
the process that is using the most memory
the process that is using the most CPU
the highest memory usage of any single process
the highest CPU usage of any single process
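
To make the list above concrete, here is a minimal sketch of how ps output could be parsed into those five values. It is not taken from the proc-watch source: the ps flags, the function names (run_ps, parse_ps, summarize) and the dictionary keys are all assumptions chosen for illustration.

```python
import subprocess

# Assumed command, not necessarily the one proc-watch uses. The empty
# "name=" headers suppress the header line; separate -o options are used
# because combined "pid=,pcpu=" forms are parsed differently across ps builds.
PS_COMMAND = ["ps", "-A", "-o", "pid=", "-o", "pcpu=", "-o", "pmem=", "-o", "comm="]


def run_ps():
    """Return the raw text output of ps, one process per line."""
    result = subprocess.run(PS_COMMAND, capture_output=True, text=True, check=True)
    return result.stdout


def parse_ps(text):
    """Turn ps output into a list of dicts with pid, cpu, mem and command."""
    rows = []
    for line in text.splitlines():
        # Split into at most 4 fields so commands containing spaces stay whole.
        parts = line.split(None, 3)
        if len(parts) < 4:
            continue  # skip blank or malformed lines
        pid, cpu, mem, command = parts
        rows.append({
            "pid": int(pid),
            # Some locales print "0,1" instead of "0.1".
            "cpu": float(cpu.replace(",", ".")),
            "mem": float(mem.replace(",", ".")),
            "command": command.strip(),
        })
    return rows


def summarize(rows):
    """Compute the process count and the top CPU and memory consumers."""
    if not rows:
        return {"process_count": 0}
    top_cpu = max(rows, key=lambda row: row["cpu"])
    top_mem = max(rows, key=lambda row: row["mem"])
    return {
        "process_count": len(rows),
        "top_cpu_command": top_cpu["command"],
        "top_cpu_percent": top_cpu["cpu"],
        "top_mem_command": top_mem["command"],
        "top_mem_percent": top_mem["mem"],
    }
```

On a machine with ps available, summarize(parse_ps(run_ps())) would give a snapshot of the current state. Keeping the parsing separate from the subprocess call means it can be tested against a saved sample of ps output.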

It exports these metrics to Prometheus so that you can monitor the changes over time.
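
Prometheus typically collects metrics by scraping a plain-text page over HTTP. As a rough sketch of what the exported metrics might look like in that text format, the snippet below renders a snapshot as gauges. The metric names (proc_watch_process_count and so on), the snapshot keys and the function names are hypothetical, and proc-watch itself may well use a client library or different names.

```python
def escape_label(value):
    """Escape backslash, double quote and newline for a label value."""
    return value.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")


def render_gauge(name, help_text, value, labels=None):
    """Render one gauge with its HELP and TYPE lines."""
    lines = [f"# HELP {name} {help_text}", f"# TYPE {name} gauge"]
    label_text = ""
    if labels:
        pairs = ",".join(
            f'{key}="{escape_label(val)}"' for key, val in sorted(labels.items())
        )
        label_text = "{" + pairs + "}"
    lines.append(f"{name}{label_text} {value}")
    return "\n".join(lines)


def render_metrics(snapshot):
    """Render a snapshot dict (hypothetical keys) as Prometheus text."""
    blocks = [
        render_gauge(
            "proc_watch_process_count",
            "Number of running processes.",
            snapshot["process_count"],
        ),
        render_gauge(
            "proc_watch_top_cpu_percent",
            "Highest CPU usage of any single process.",
            snapshot["top_cpu_percent"],
            {"command": snapshot["top_cpu_command"]},
        ),
        render_gauge(
            "proc_watch_top_mem_percent",
            "Highest memory usage of any single process.",
            snapshot["top_mem_percent"],
            {"command": snapshot["top_mem_command"]},
        ),
    ]
    return "\n".join(blocks) + "\n"


# Example snapshot with made-up values.
example = {
    "process_count": 212,
    "top_cpu_command": "chrome",
    "top_cpu_percent": 87.3,
    "top_mem_command": "chrome",
    "top_mem_percent": 14.1,
}
print(render_metrics(example))
```

Putting the command name in a label is one possible design: it lets a dashboard show which process was on top at each point in time, at the cost of creating a new time series whenever the top process changes.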

Here's how it would be useful:

you will be able to see that the system is idle when the highest CPU or memory usage barely hovers above 0.1%
when a resource-intensive process is started, you will be able to see that, a-ha! It's headless Chrome starting an automated test run!

Check out the code here: https://github.com/rudolfolah/proc-watch
