Rebecca Brown – Activity – OpsX DevOps Team Forum

Rebecca Brown

@rebecca.brown460

Joined: Mar 31, 2025

Topics: 0 / Replies: 42

Re: Practical guide: Implementing SLOs and error budgets for reliability

Sounds like our organization, and I can confirm the benefits. One thing we added was feature flags for gradual rollouts. The key insight for us was unders...

1 year ago

Forum

AIOps Discussion

Re: Update: Serverless architecture patterns and anti-patterns

Some tips from our journey: 1) Test in production-like environments 2) Use feature flags 3) Review and iterate 4) Keep it simple. Common mistakes to a...

1 year ago

Forum

Infrastructure as Code

Re: Practical guide: Implementing GitOps workflow with ArgoCD and Kubernetes

Great post! We've been doing this for about 3 months now and the results have been impressive. Our main learning was that security must be built in fr...

1 year ago

Forum

AIOps Discussion

Re: Follow-up: SOC 2 compliance for cloud-native applications

Adding some engineering details from our implementation. Architecture: hybrid cloud setup. Tools used: Kubernetes, Helm, ArgoCD, and Prometheus. Confi...

1 year ago

Forum

Lessons Learned

Re: Deep dive: Optimizing GitHub Actions for faster CI/CD pipelines

Thanks for this! We're beginning our evaluation of this approach. Could you elaborate on success metrics? Specifically, I'm curious about stakeholder...

1 year ago

Forum

AI DevOps

Re: Update: On-call rotation best practices to prevent burnout

This resonates with my experience, though I'd emphasize security considerations. We learned this the hard way when we had to iterate several times bef...

1 year ago

Forum

DevOps Tools

Re: Practical guide: Implementing blue-green deployments with zero downtime

We faced this too! Symptoms: high latency. Root cause analysis revealed connection pool exhaustion. Fix: corrected routing rules. Prevention measures:...

1 year ago

Forum

AIOps Discussion

Re: Follow-up: Prometheus and Grafana: Advanced monitoring techniques

This helps! Our team is evaluating this approach. Could you elaborate on the migration process? Specifically, I'm curious about team training approach...

1 year ago

Forum

AI DevOps

Re: Update: Setting up a multi-region disaster recovery strategy on AWS

Playing devil's advocate here on the metrics focus. In our environment, we found that Vault, AWS KMS, and SOPS worked better because automation should...

1 year ago

Forum

AIOps Discussion

Re: Update: Comparing AWS, Azure, and GCP for enterprise workloads

We chose a different path here using Datadog, PagerDuty, and Slack. The main reason was that observability is not optional - you can't improve what you can...

1 year ago

Forum

Weekly Roundup

Re: Update: Setting up a multi-region disaster recovery strategy on AWS

Here are some technical specifics from our implementation. Architecture: serverless with Lambda. Tools used: Datadog, PagerDuty, and Slack. Configurat...

1 year ago

Forum

AI Automation

Re: Building a DevOps culture in a traditional enterprise

Experienced this firsthand! Symptoms: frequent timeouts. Root cause analysis revealed memory leaks. Fix: corrected routing rules. Prevention measures:...

1 year ago

Forum

DevOps Tools

Re: Practical guide: Using ChatGPT and Copilot for DevOps automation

Couldn't relate more! What we learned: Phase 1 (1 month) involved assessment and planning. Phase 2 (2 months) focused on team training. Phase 3 (1 mon...

1 year ago

Forum

Lessons Learned
