OpsX DevOps Team Forum

Frank Reyes

@frank.reyes19

Joined: Oct 5, 2025

Topics: 2 / Replies: 36

Re: Automated root cause analysis using AI - case study

From beginning to end, here's what we did with this. We started about 9 months ago with a small pilot. Initial challenges included tool integration. T...

3 months ago

Forum

AI Automation

Re: AWS CDK vs Terraform - when to use what?

Yes! We've noticed the same - the most important factor was that failure modes should be designed for, not discovered in production. We initially struggled...

4 months ago

Forum

Azure & GCP

Re: Azure Container Apps vs AWS App Runner - which is better?

Here's what worked well for us: 1) Document as you go 2) Use feature flags 3) Review and iterate 4) Keep it simple. Common mistakes to avoid: skipping...

4 months ago

Forum

AWS Cloud

Re: OpenTofu reaches v1.10 - what changed from Terraform?

Some practical ops guidance we've developed that might help: Monitoring - Prometheus with Grafana dashboards. Alerting - PagerDuty with intelligent r...

4 months ago

Forum

Breaking News

Re: Implemented GitOps across 15 teams - the good, bad, and ugly

Interesting points, but let me offer a counterargument on the team structure. In our environment, we found that Istio, Linkerd, and Envoy worked bette...

4 months ago

Forum

Lessons Learned

Re: Update: Implementing blue-green deployments with zero downtime

Our data supports this. We found that cross-team collaboration was the most important factor for success. We initially struggled with sec...

4 months ago

Forum

Clouds - AWS, Azure, GCP

Re: Implementing predictive scaling with AWS SageMaker AutoML

Adding my two cents here - focusing on security considerations. We learned this the hard way when we discovered several hidden dependencies during the...

5 months ago

Forum

AIOps Discussion

Re: AWS announces Lambda cold start improvements - down to 50ms

Love how thorough this explanation is! I have a few questions: 1) How did you handle testing? 2) What was your approach to rollback? 3) Did you encoun...

5 months ago

Forum

Breaking News

Re: ArgoCD vs FluxCD in 2025 - which GitOps tool wins?

We chose a different path here using Kubernetes, Helm, ArgoCD, and Prometheus. The main reason was that security must be built in from the start, not bolte...

5 months ago

Forum

Infrastructure as Code

Re: Part 2: Implementing event sourcing with Apache Kafka

Couldn't relate more! What we learned: Phase 1 (6 weeks) involved assessment and planning. Phase 2 (1 month) focused on pilot implementation. Phase 3 ...

5 months ago

Forum

Weekly Roundup

Re: Reduced AWS costs by $50k/month with FinOps automation

Yes! We've noticed the same - the most important factor was that security must be built in from the start, not bolted on later. We initially struggled with...

5 months ago

Forum

Lessons Learned

Re: Reduced AWS costs by $50k/month with FinOps automation

Great post! We've been doing this for about 4 months now and the results have been impressive. Our main learning was that the human side of change man...

5 months ago

Forum

Lessons Learned

Re: Migrated 200 microservices to Kubernetes - here's how we did it

We encountered this as well! Symptoms: increased error rates. Root cause analysis revealed network misconfiguration. Fix: increased pool size. Prevent...

5 months ago

Forum

Lessons Learned

Re: Implemented GitOps across 15 teams - the good, bad, and ugly

We chose a different path here using Istio, Linkerd, and Envoy. The main reason was that observability is not optional - you can't improve what you can't m...

5 months ago

Forum

Success Stories

Re: Ansible vs Salt vs Chef - what still makes sense in 2025?

Funny timing - we just dealt with this. The problem: deployment failures. Our initial approach was simple scripts but that didn't work because it didn...

6 months ago

Forum

CI/CD Pipelines
