DEV Community

Site Reliability Engineering

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Roles and Responsibilities Matrix

Roles and Responsibilities Matrix

Comments
5 min read
Matriz de Papéis e Responsabilidades

Matriz de Papéis e Responsabilidades

1
Comments
6 min read
Docker Log Observability: Analyzing Container Logs in HashiCorp Nomad with Vector, Loki, and Grafana

Docker Log Observability: Analyzing Container Logs in HashiCorp Nomad with Vector, Loki, and Grafana

6
Comments
8 min read
How to send Alerts and Notifications with Telegram

How to send Alerts and Notifications with Telegram

Comments
3 min read
Kubectl Port-forward Flow Explained

Kubectl Port-forward Flow Explained

Comments
3 min read
2024 Site Reliability Engineering: Key Trends and Focus Areas for SREs

2024 Site Reliability Engineering: Key Trends and Focus Areas for SREs

Comments
7 min read
Inside the Kubernetes Control Plane

Inside the Kubernetes Control Plane

14
Comments 2
5 min read
Expand your root EBS Volume attached to your Windows EC2

Expand your root EBS Volume attached to your Windows EC2

Comments
2 min read
ARM vs x86 em Docker

ARM vs x86 em Docker

2
Comments
6 min read
Effortless Database Scaling: Migrate from RDS to Aurora Serverless V2

Effortless Database Scaling: Migrate from RDS to Aurora Serverless V2

Comments
2 min read
Why Should Devops/SRE learn Golang?

Why Should Devops/SRE learn Golang?

Comments
4 min read
Reciprocity, Companion Planting & DevSecOps

Reciprocity, Companion Planting & DevSecOps

1
Comments
3 min read
Kubernetes Debugging: Handling Multiple kubectl port-forward from Tray

Kubernetes Debugging: Handling Multiple kubectl port-forward from Tray

2
Comments
6 min read
Observability Maturity Model for AWS

Observability Maturity Model for AWS

4
Comments
3 min read
On The Importance of End-to-End Monitoring for IoT

On The Importance of End-to-End Monitoring for IoT

2
Comments
2 min read
How does SRE differ from traditional IT operations?

How does SRE differ from traditional IT operations?

Comments
3 min read
Reliability in Legacy Software

Reliability in Legacy Software

1
Comments
3 min read
Por que os times precisam de SLOs, SLIs e Error Budget?

Por que os times precisam de SLOs, SLIs e Error Budget?

Comments
4 min read
Por que os times precisam de SLOs, SLIs e Error Budget?

Por que os times precisam de SLOs, SLIs e Error Budget?

5
Comments
4 min read
Smart Chaos: LLMs, No More Human Modeling

Smart Chaos: LLMs, No More Human Modeling

4
Comments
6 min read
5 Strategies For Managing High-Performance DevOps Teams

5 Strategies For Managing High-Performance DevOps Teams

1
Comments
6 min read
Instalando Kubernetes do Zero

Instalando Kubernetes do Zero

Comments
11 min read
Exploring the World of LLMs for SRE Powered by PartyRock (Claude, Jurassic-2, Titan, Command, Liama 2 & Stable Diffusion XL)

Exploring the World of LLMs for SRE Powered by PartyRock (Claude, Jurassic-2, Titan, Command, Liama 2 & Stable Diffusion XL)

6
Comments
7 min read
CI/CD Observability and Why it matters.

CI/CD Observability and Why it matters.

2
Comments
2 min read
Netdata vs Prometheus: Performance Analysis

Netdata vs Prometheus: Performance Analysis

Comments
12 min read
#DevOps para noobs - Proxy Reverso

#DevOps para noobs - Proxy Reverso

192
Comments 12
3 min read
Discovering the Magic of Service Mesh: Navigating the Microservices Maze đŸŒđŸ•žïžđŸ•”ïžâ€â™‚ïž

Discovering the Magic of Service Mesh: Navigating the Microservices Maze đŸŒđŸ•žïžđŸ•”ïžâ€â™‚ïž

8
Comments
3 min read
Bridging Salesforce and AWS: Unveiling Event-Driven Architectures

Bridging Salesforce and AWS: Unveiling Event-Driven Architectures

Comments
2 min read
Karpenter vs. Cluster Autoscaler in EKS: A Comparative Guide

Karpenter vs. Cluster Autoscaler in EKS: A Comparative Guide

1
Comments
4 min read
Charting the Course: An IT Journey from Clouds to Code

Charting the Course: An IT Journey from Clouds to Code

Comments
1 min read
Certified Enterprise Chaos Engineer

Certified Enterprise Chaos Engineer

Comments
2 min read
Observability for DevOps and SRE - free certificate course on Feb 8th

Observability for DevOps and SRE - free certificate course on Feb 8th

1
Comments
1 min read
#DevOps para noobs - Requests x limits no Kubernetes

#DevOps para noobs - Requests x limits no Kubernetes

94
Comments 15
2 min read
Observability for DevOps and SREs - free certificate course on Feb 8th

Observability for DevOps and SREs - free certificate course on Feb 8th

Comments
1 min read
Best Programming Languages for DevOps in 2024

Best Programming Languages for DevOps in 2024

Comments
6 min read
Evite Custos Desnecessårios: Diagnóstico e Otimização de Desempenho em Sua Aplicação

Evite Custos Desnecessårios: Diagnóstico e Otimização de Desempenho em Sua Aplicação

Comments
1 min read
How OpenTelemetry Organizes Distributed Tracing

How OpenTelemetry Organizes Distributed Tracing

Comments
3 min read
The easiest way to send mobile push notifications for developers

The easiest way to send mobile push notifications for developers

1
Comments
2 min read
Maximizing Speed, Costs, UX - AWS ElastiCache Serverless

Maximizing Speed, Costs, UX - AWS ElastiCache Serverless

3
Comments 2
6 min read
Understanding Goal-Based Software Engineering A Path to Successful Software Development

Understanding Goal-Based Software Engineering A Path to Successful Software Development

1
Comments
4 min read
SRE é sobre criar softwares que resolvem problemas de operação de outros softwares

SRE é sobre criar softwares que resolvem problemas de operação de outros softwares

16
Comments 1
2 min read
Why AWS is poised to lead the Gartner Magic Quadrant for APM and Observability in 2024

Why AWS is poised to lead the Gartner Magic Quadrant for APM and Observability in 2024

6
Comments
11 min read
Do devs spend significant time on application rolling updates and rollbacks?

Do devs spend significant time on application rolling updates and rollbacks?

Comments
1 min read
Para quĂȘ serve o Stream Processing Offload Engine do HAProxy?

Para quĂȘ serve o Stream Processing Offload Engine do HAProxy?

Comments
1 min read
HAProxy FAQ

HAProxy FAQ

Comments
1 min read
Por que o HAProxy Ă© meu balancer/proxy favorito

Por que o HAProxy Ă© meu balancer/proxy favorito

1
Comments
2 min read
Como ir além do monitoramento båsico

Como ir além do monitoramento båsico

10
Comments
2 min read
Site Reliability Engineering: Fundamental Concepts And How To Put Them In Practice

Site Reliability Engineering: Fundamental Concepts And How To Put Them In Practice

Comments
5 min read
How to quickly realize proactive patrolling for dead-end network connectivity in large-scale clusters

How to quickly realize proactive patrolling for dead-end network connectivity in large-scale clusters

Comments
6 min read
ć€§è§„æšĄé›†çŸ€äž‹ïŒŒćŠ‚äœ•ćż«é€ŸćźžçŽ°æ— æ­»è§’çœ‘ç»œèżžé€šæ€§çš„äž»ćŠšć·ĄæŁ€

ć€§è§„æšĄé›†çŸ€äž‹ïŒŒćŠ‚äœ•ćż«é€ŸćźžçŽ°æ— æ­»è§’çœ‘ç»œèżžé€šæ€§çš„äž»ćŠšć·ĄæŁ€

Comments
2 min read
Decoding the Tech Maze: Demystifying SRE and DevOps for Everyone

Decoding the Tech Maze: Demystifying SRE and DevOps for Everyone

Comments
2 min read
Step-by-Step Guide to Setting Up Point-in-Time Recovery in PostgreSQL 16 with Scripts

Step-by-Step Guide to Setting Up Point-in-Time Recovery in PostgreSQL 16 with Scripts

Comments
1 min read
Mastering Docker: Defining Health Checks in Docker Compose

Mastering Docker: Defining Health Checks in Docker Compose

14
Comments 1
6 min read
AWS Observability: Building a Comprehensive Solution for Distributed Systems

AWS Observability: Building a Comprehensive Solution for Distributed Systems

7
Comments 2
12 min read
Books That Helped Me Become a Tech Lead

Books That Helped Me Become a Tech Lead

336
Comments 32
10 min read
Take back control of your tags with Tailwarden - Part 1

Take back control of your tags with Tailwarden - Part 1

2
Comments
7 min read
Root Cause Chronicles: Connection Collapse

Root Cause Chronicles: Connection Collapse

Comments
10 min read
Combining 2FA and Public Key Authentication for a better Linux SSH security

Combining 2FA and Public Key Authentication for a better Linux SSH security

1
Comments
6 min read
About to build an internal developer platform? Check it out here!

About to build an internal developer platform? Check it out here!

Comments
1 min read
AWS re:Invent 2023 - Empowering SREs with Game-Changing Solutions

AWS re:Invent 2023 - Empowering SREs with Game-Changing Solutions

12
Comments 2
3 min read
loading...