Forum

Aaron Gutierrez
@aaron.gutierrez941
Joined: Apr 28, 2025
Topics: 4 / Replies: 50
Reply
Re: Update: On-call rotation best practices to prevent burnout

This resonates with what we experienced last month. The problem: scaling issues. Our initial approach was simple scripts but that didn't work because ...

1 year ago
Forum
Reply
Re: Using ChatGPT and Copilot for DevOps automation

Some practical ops guidance that might helps we've developed: Monitoring - Datadog APM and logs. Alerting - PagerDuty with intelligent routing. Docume...

1 year ago
Reply
Re: Update: Setting up a multi-region disaster recovery strategy on AWS

Here's the technical breakdown of our implementation. Architecture: hybrid cloud setup. Tools used: Jenkins, GitHub Actions, and Docker. Configuration...

1 year ago
Reply
Re: Follow-up: Optimizing GitHub Actions for faster CI/CD pipelines

From a technical standpoint, our implementation. Architecture: serverless with Lambda. Tools used: Terraform, AWS CDK, and CloudFormation. Configurati...

1 year ago
Reply
Re: Follow-up: Optimizing GitHub Actions for faster CI/CD pipelines

Makes sense! For us, the approach varied using Elasticsearch, Fluentd, and Kibana. The main reason was cross-team collaboration is essential for succe...

1 year ago
Reply
Re: Automated root cause analysis using AI - case study

Great writeup! That said, I have some concerns on the team structure. In our environment, we found that Grafana, Loki, and Tempo worked better because...

1 year ago
Reply
Re: Update: PostgreSQL performance tuning for high-traffic applications

This is exactly the kind of detail that helps! I have a few questions: 1) How did you handle testing? 2) What was your approach to backup? 3) Did you ...

1 year ago
Reply
Re: Part 2: Implementing AIOps for intelligent incident management

Valid approach! Though we did it differently using Elasticsearch, Fluentd, and Kibana. The main reason was the human side of change management is ofte...

1 year ago
Forum
Reply
Re: Part 2: Implementing event sourcing with Apache Kafka

Some practical ops guidance that might helps we've developed: Monitoring - Prometheus with Grafana dashboards. Alerting - PagerDuty with intelligent r...

1 year ago
Page 4 / 4
Scroll to Top