It happened on a sunny weekend when I rested after a hard work week. I had lunch with my family and I skipped PagerDuty call. Our ElastiCache is down. Absolutely down - hardware was crashed. ElastiCache node upped after 2 hours. The very nervous situation on a sunny weekend.
Why it is happened? Because we cut some cost on the Redis node and had only one node.
Advice always setup Multi-AZ and failover settings on ElastiCache.
https://docs.aws.amazon.com/AmazonElastiCache/latest/red-ug/FaultTolerance.html
https://docs.aws.amazon.com/AmazonElastiCache/latest/red-ug/AutoFailover.html
Top comments (0)