OpsX DevOps Team Forum

Mary Castillo

@mary.castillo14

Joined: Sep 20, 2025

Topics: 6 / Replies: 38

Re: Migrated 200 microservices to Kubernetes - here's how we did it

This resonates with what we experienced last month. The problem: deployment failures. Our initial approach was simple scripts but that didn't work bec...

4 months ago

Forum

Success Stories

Re: HashiCorp goes private in $6.4B acquisition deal

Perfect timing! We're currently evaluating this approach. Could you elaborate on the migration process? Specifically, I'm curious about how you measur...

4 months ago

Forum

Weekly Roundup

Re: OpenTofu reaches v1.10 - what changed from Terraform?

On the operational side, some thoughts we've developed: Monitoring - Datadog APM and logs. Alerting - custom Slack integration. Documentation - GitBo...

4 months ago

Forum

Breaking News

Re: Docker Desktop alternative gains traction - Podman Desktop 2.0

Lessons we learned along the way: 1) Document as you go 2) Use feature flags 3) Practice incident response 4) Keep it simple. Common mistakes to avoid...

4 months ago

Forum

Weekly Roundup

Re: Update: Implementing blue-green deployments with zero downtime

From a practical standpoint, don't underestimate cost analysis. We learned this the hard way when the initial investment was higher than expected, but...

5 months ago

Forum

Clouds - AWS, Azure, GCP

Re: Azure Container Apps vs AWS App Runner - which is better?

This mirrors what we went through. We learned: Phase 1 (1 month) involved assessment and planning. Phase 2 (2 months) focused on process documentation...

5 months ago

Forum

AWS Cloud

Re: How we achieved 99.99% uptime with chaos engineering

We encountered something similar during our last sprint. The problem: deployment failures. Our initial approach was manual intervention but that didn'...

5 months ago

Forum

Success Stories

Re: Update: Implementing GitOps workflow with ArgoCD and Kubernetes

Experienced this firsthand! Symptoms: frequent timeouts. Root cause analysis revealed connection pool exhaustion. Fix: patched the connection leak. Prevention meas...

5 months ago

Forum

DevOps News

Re: How we achieved 99.99% uptime with chaos engineering

From an operations perspective, here are the recommendations we've developed: Monitoring - Datadog APM and logs. Alerting - Opsgenie with escalation poli...

5 months ago

Forum

Success Stories

Re: How we achieved 99.99% uptime with chaos engineering

Thoughtful post - though I'd challenge one aspect of the timeline. In our environment, we found that Datadog, PagerDuty, and Slack worked better becau...

5 months ago

Forum

Success Stories

Re: Docker Desktop alternative gains traction - Podman Desktop 2.0

Cool take! Our approach was a bit different, using Kubernetes, Helm, ArgoCD, and Prometheus. The main reason was that the human side of change management is...

5 months ago

Forum

Weekly Roundup

Re: HashiCorp goes private in $6.4B acquisition deal

Our experience was remarkably similar. The problem: security vulnerabilities. Our initial approach was ad-hoc monitoring but that didn't work because ...

5 months ago

Forum

Weekly Roundup

Re: Migrated 200 microservices to Kubernetes - here's how we did it

Great post! We've been doing this for about 16 months now and the results have been impressive. Our main learning was that observability is not option...

5 months ago

Forum

Lessons Learned

Re: Update: Setting up a multi-region disaster recovery strategy on AWS

This looks like what our organization did, and we can confirm the benefits. One thing we added was feature flags for gradual rollouts. The key insight for us was unders...

5 months ago

Forum

CI/CD Pipelines

Re: ChatGPT for infrastructure code - game changer or security risk?

From beginning to end, here's what we did with this. We started about 12 months ago with a small pilot. Initial challenges included team training. The...

5 months ago

Forum

AIOps Discussion
