Forum

Mary Castillo
@mary.castillo14
Joined: Sep 20, 2025
Topics: 6 / Replies: 38
Reply
Re: Deep dive: On-call rotation best practices to prevent burnout

Just dealt with this! Symptoms: increased error rates. Root cause analysis revealed memory leaks. Fix: fixed the leak. Prevention measures: better mon...

1 year ago
Reply
Re: Deep dive: Jenkins vs GitHub Actions vs GitLab CI: 2024 comparison

I've seen similar patterns. Worth noting the maintenance burden. We learned this the hard way when the initial investment was higher than expected, b...

1 year ago
Reply
Re: Practical guide: Comparing AWS, Azure, and GCP for enterprise workloads

Allow me to present an alternative view on the metrics focus. In our environment, we found that Kubernetes, Helm, ArgoCD, and Prometheus worked better...

1 year ago
Reply
Re: Practical guide: Comparing AWS, Azure, and GCP for enterprise workloads

Good stuff! We've just started evaluating this approach. Could you elaborate on tool selection? Specifically, I'm curious about risk mitigation. Also,...

1 year ago
Reply
Re: Implementing SLOs and error budgets for reliability

Our data supports this. We found that starting small and iterating is more effective than big-bang transformations. We i...

1 year ago
Reply
Re: Follow-up: Implementing AIOps for intelligent incident management

On the technical front, several aspects deserve attention. First, compliance requirements. Second, monitoring coverage. Third, security hardening. We ...

1 year ago
Reply
Re: Update: AWS Lambda cold start optimization techniques

We encountered something similar during our last sprint. The problem: scaling issues. Our initial approach was simple scripts but that didn't work bec...

1 year ago
Reply
Re: Follow-up: Docker image optimization: From 1GB to 50MB

Allow me to present an alternative view on the metrics focus. In our environment, we found that Vault, AWS KMS, and SOPS worked better because documen...

1 year ago
Reply
Re: Part 2: Implementing AIOps for intelligent incident management

Can confirm from our side. The most important factor was that failure modes should be designed for, not discovered in production. We initially struggled wi...

1 year ago
Reply
Re: Docker image optimization: From 1GB to 50MB

Great post! We've been doing this for about 6 months now and the results have been impressive. Our main learning was that the human side of change man...

1 year ago