From the ops trenches, here are the takes we've developed: Monitoring - Datadog APM and logs. Alerting - PagerDuty with intelligent routing. Documentati...
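For anyone curious what the custom APM side of that looks like, here's a minimal sketch assuming the ddtrace Python client; the service, resource, and function names are hypothetical, not from the comment above:

```python
from ddtrace import tracer

# Hypothetical worker; @tracer.wrap opens an APM span around every call.
@tracer.wrap(service="billing-worker", resource="process_invoice")
def process_invoice(invoice_id: int) -> None:
    # Child span so the slow step shows up on its own in the flame graph.
    with tracer.trace("invoice.render_pdf"):
        pass  # rendering work would go here

if __name__ == "__main__":
    process_invoice(42)
```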
Same experience on our end! Here's how our rollout broke down: Phase 1 (2 weeks) involved tool evaluation. Phase 2 (1 month) focused on pilot implementation. Phase 3 (1 mont...
On the technical front, several aspects deserve attention. First, compliance requirements. Second, failover strategy. Third, performance tuning. We sp...
Looking at the engineering side, there are some things to keep in mind. First, data residency. Second, failover strategy. Third, security hardening. W...
This resonates with my experience, though I'd emphasize cost analysis. We learned this the hard way when team morale improved significantly once the m...
This mirrors what happened to us earlier this year. The problem: security vulnerabilities. Our initial approach was ad-hoc monitoring but that didn't ...
This matches our findings exactly. The most important lesson was that documentation debt is as dangerous as technical debt. We initially struggled with sca...
Great post! We've been doing this for about 14 months now and the results have been impressive. Our main learning was that failure modes should be des...
I respect this view, but want to offer another perspective on the tooling choice. In our environment, we found that Elasticsearch, Fluentd, and Kibana...
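To make an EFK pipeline like that useful, the main thing on the app side is emitting structured JSON logs that Fluentd can tail and Elasticsearch can index. A minimal stdlib-only sketch (the field names are just an assumption; adjust them to your index mapping):

```python
import json
import logging

class JsonFormatter(logging.Formatter):
    """Render each record as one JSON object per line for Fluentd's tail input."""
    def format(self, record):
        return json.dumps({
            "ts": self.formatTime(record),
            "level": record.levelname,
            "logger": record.name,
            "message": record.getMessage(),
        })

handler = logging.StreamHandler()
handler.setFormatter(JsonFormatter())
logging.basicConfig(level=logging.INFO, handlers=[handler])

logging.getLogger("checkout").info("payment accepted")
```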
Lessons we learned along the way: 1) Automate everything possible 2) Implement circuit breakers 3) Share knowledge across teams 4) Keep it simple. Com...
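On point 2, here's roughly what a circuit breaker can look like; this is a bare-bones sketch, not whatever library the commenter used, and the thresholds are made up:

```python
import time

class CircuitBreaker:
    """Open the circuit after repeated failures; allow a retry after a cooldown."""

    def __init__(self, max_failures=5, reset_after_s=30.0):
        self.max_failures = max_failures
        self.reset_after_s = reset_after_s
        self.failures = 0
        self.opened_at = None

    def call(self, fn, *args, **kwargs):
        if self.opened_at is not None:
            if time.monotonic() - self.opened_at < self.reset_after_s:
                raise RuntimeError("circuit open, skipping call")
            # Cooldown elapsed: half-open, let one trial call through.
            self.opened_at = None
        try:
            result = fn(*args, **kwargs)
        except Exception:
            self.failures += 1
            if self.failures >= self.max_failures:
                self.opened_at = time.monotonic()
            raise
        self.failures = 0
        return result
```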
Experienced this firsthand! Symptoms: increased error rates. Root cause analysis revealed connection pool exhaustion. Fix: patched the connection leak. Prevention ...
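For the connection-pool case, the usual shape of the fix is a bounded pool plus making sure every checkout gets returned, e.g. via a context manager. A sketch assuming SQLAlchemy (the DSN and query are placeholders):

```python
from sqlalchemy import create_engine, text

engine = create_engine(
    "postgresql+psycopg2://app:secret@db:5432/orders",  # placeholder DSN
    pool_size=10,        # steady-state connections
    max_overflow=5,      # burst headroom before callers block
    pool_timeout=30,     # fail fast instead of hanging when exhausted
    pool_pre_ping=True,  # discard dead connections before handing them out
)

def pending_job_ids():
    # The context manager returns the connection to the pool even on error,
    # which is what closes the kind of slow leak described above.
    with engine.connect() as conn:
        rows = conn.execute(text("SELECT id FROM jobs WHERE status = 'pending'"))
        return [row.id for row in rows]
```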
Our data supports this. We found that the most important lesson was that failure modes should be designed for, not discovered in production. We initially s...
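One concrete form of "designing for failure modes" is putting a timeout and an explicit fallback on every dependency call instead of letting errors surface in production. A hedged sketch using requests (the URL and the empty-list fallback are hypothetical):

```python
import requests

def recommendations_for(user_id, timeout_s=0.5):
    """Degrade to an empty list instead of failing the whole page."""
    try:
        resp = requests.get(
            f"https://recs.internal/api/v1/users/{user_id}",  # hypothetical service
            timeout=timeout_s,
        )
        resp.raise_for_status()
        return resp.json().get("items", [])
    except requests.RequestException:
        # The failure mode is designed in: callers always get a known-safe default.
        return []
```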