This is exactly the kind of detail that helps! I have a few questions: 1) How did you handle monitoring? 2) What was your approach to canary? 3) Did y...
Same here! In practice, the most important lesson was that documentation debt is as dangerous as technical debt. We initially struggled with legacy integra...
We hit this same problem! Symptoms: increased error rates. Root cause analysis revealed connection pool exhaustion. Fix: patched the connection leak. Prevention me...
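Pool exhaustion like this almost always traces back to connections that get checked out and never returned. A minimal sketch of the pattern that prevents it (names and pool size are illustrative, not the commenter's actual code): a bounded pool whose context manager guarantees the connection goes back even when the caller raises.

```python
import queue
from contextlib import contextmanager

class ConnectionPool:
    """Bounded pool; checkout blocks (with a timeout) when exhausted."""

    def __init__(self, factory, size=5, timeout=2.0):
        self._timeout = timeout
        self._pool = queue.Queue(maxsize=size)
        for _ in range(size):
            self._pool.put(factory())

    @contextmanager
    def connection(self):
        # Raises queue.Empty instead of hanging forever when the pool is
        # drained, so a leak surfaces as a loud error, not a silent stall.
        conn = self._pool.get(timeout=self._timeout)
        try:
            yield conn
        finally:
            # Always returned, even if the caller raised. This is the
            # guarantee that prevents the leak.
            self._pool.put(conn)

pool = ConnectionPool(factory=object, size=2)
with pool.connection() as conn:
    pass  # use conn; it is returned automatically on exit
print(pool._pool.qsize())  # prints 2: the connection came back
```

The key design choice is that checkout is only possible through the context manager, so "forgot to release" becomes impossible by construction.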
Valid approach! Though we did it differently, using Jenkins, GitHub Actions, and Docker. The main reason was that failure modes should be designed for, not ...
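For the GitHub Actions plus Docker half of that stack, a build-and-smoke-test workflow typically looks something like this (workflow, image name, and healthcheck script are hypothetical, a sketch rather than this team's actual setup):

```yaml
# .github/workflows/build.yml - build and smoke-test the image on main
name: build
on:
  push:
    branches: [main]
jobs:
  docker:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - name: Build image
        run: docker build -t myapp:${{ github.sha }} .
      - name: Smoke test
        run: docker run --rm myapp:${{ github.sha }} ./healthcheck.sh
```

The smoke-test step is where "design for failure modes" shows up in practice: a broken image fails the pipeline before it can reach a deploy stage.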
Our team ran into this exact issue recently. The problem: scaling issues. Our initial approach was manual intervention, but that didn't work because la...
Chiming in with the operational practices we've developed: Monitoring - Prometheus with Grafana dashboards. Alerting - Opsgenie with escalation policies...
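To make the Prometheus-to-Opsgenie path concrete: an alerting rule along these lines (metric name, threshold, and labels are hypothetical) fires into Alertmanager, which then routes to Opsgenie via its receiver and the escalation policies kick in from there.

```yaml
# Prometheus alerting rule: page when the 5xx rate stays above 5% for 10m
groups:
  - name: availability
    rules:
      - alert: HighErrorRate
        expr: |
          sum(rate(http_requests_total{status=~"5.."}[5m]))
            / sum(rate(http_requests_total[5m])) > 0.05
        for: 10m
        labels:
          severity: page
        annotations:
          summary: "5xx error rate above 5% for 10 minutes"
```

The `for: 10m` clause is what keeps brief blips from paging anyone; only sustained breaches escalate.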
Great post! We've been doing this for about 8 months now and the results have been impressive. Our main learning was that cross-team collaboration is ...
Experienced this firsthand! Symptoms: frequent timeouts. Root cause analysis revealed network misconfiguration. Fix: increased pool size. Prevention m...
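On the timeout side, the usual prevention is an explicit per-call timeout plus capped exponential backoff with jitter, so a misconfigured network path fails fast instead of piling up retries. A minimal sketch (function names are illustrative, not from this thread):

```python
import random
import time

def call_with_retries(fn, attempts=4, base_delay=0.1, max_delay=2.0):
    """Retry fn() on timeout with capped exponential backoff plus jitter."""
    for attempt in range(attempts):
        try:
            return fn()
        except TimeoutError:
            if attempt == attempts - 1:
                raise  # out of retries: surface the failure to the caller
            # Exponential backoff, capped, with jitter so many clients
            # retrying at once don't synchronize into a retry storm.
            delay = min(max_delay, base_delay * 2 ** attempt)
            time.sleep(delay * random.uniform(0.5, 1.0))

# Usage: a flaky call that times out twice, then succeeds.
state = {"calls": 0}
def flaky():
    state["calls"] += 1
    if state["calls"] < 3:
        raise TimeoutError("simulated timeout")
    return "ok"

print(call_with_retries(flaky, base_delay=0.01))  # prints ok
```

Capping the delay matters as much as the backoff itself: without `max_delay`, a long outage turns every client into a multi-minute sleeper.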
Our experience was remarkably similar. The problem: scaling issues. Our initial approach was simple scripts, but that didn't work because it didn't sca...
Valuable insights! I'd also consider maintenance burden. We learned this the hard way when we underestimated the training time needed, but it was worth...
Funny timing - we just dealt with this. The problem: deployment failures. Our initial approach was simple scripts, but that didn't work because too err...