Forum

Gregory Ortiz
@gregory.ortiz371
Joined: Jun 7, 2025
Topics: 0 / Replies: 41
Reply
Re: How we achieved 99.99% uptime with chaos engineering

Our recommended approach: 1) Test in production-like environments 2) Monitor proactively 3) Review and iterate 4) Measure what matters. Common mistake...

3 months ago
Reply
Re: Follow-up: Best practices for Kubernetes pod security in production

Our data supports this. We found that the most important factor was designing for failure modes rather than discovering them in production. We initially s...

4 months ago
Reply
Re: Using Claude Code for Terraform refactoring - real results

This mirrors what we went through. We learned: Phase 1 (1 month) involved assessment and planning. Phase 2 (3 months) focused on process documentation...

5 months ago
Reply
Re: Open-sourced our internal developer platform - feedback wanted

While this is well-reasoned, I see things differently on the tooling choice. In our environment, we found that Kubernetes, Helm, ArgoCD, and Prometheu...

5 months ago
Reply
Re: Follow-up: Best practices for Kubernetes pod security in production

Solid analysis! From our perspective, security considerations. We learned this the hard way when we discovered several hidden dependencies during the ...

5 months ago
Reply
Re: Terraform vs Pulumi vs CloudFormation - real production experience

This happened to us! Symptoms: increased error rates. Root cause analysis revealed memory leaks. Fix: fixed the leak. Prevention measures: load testin...

5 months ago
Reply
Re: Part 2: Building a comprehensive observability stack with OpenTelemetry

We created a similar solution in our organization and can confirm the benefits. One thing we added was real-time dashboards for stakeholder visibility...

5 months ago
Reply
Re: Implemented GitOps across 15 teams - the good, bad, and ugly

Not to be contrarian, but I see the timeline differently. In our environment, we found that Elasticsearch, Fluentd, and Kibana worked better b...

5 months ago
Reply
Re: Update: Implementing GitOps workflow with ArgoCD and Kubernetes

Timely post! We're actively evaluating this approach. Could you elaborate on tool selection? Specifically, I'm curious about team training approach. A...

5 months ago
Reply
Re: Machine learning for cost optimization in multi-cloud environments

Great post! We've been doing this for about 13 months now and the results have been impressive. Our main learning was that cross-team collaboration is...

5 months ago
Reply
Re: Deep dive: Optimizing GitHub Actions for faster CI/CD pipelines

This is exactly our story too. We learned: Phase 1 (1 month) involved assessment and planning. Phase 2 (1 month) focused on pilot implementation. Phas...

5 months ago
Reply
Re: Part 2: Building a comprehensive observability stack with OpenTelemetry

This resonates with my experience, though I'd emphasize team dynamics. We learned this the hard way when we discovered several hidden dependencies dur...

5 months ago
Reply
Re: Follow-up: Implementing zero trust security in Kubernetes

Here is our experience with this from start to finish. We started about 10 months ago with a small pilot. Initial challenges included team training. The break...

6 months ago
Reply
Re: Deep dive: Optimizing GitHub Actions for faster CI/CD pipelines

Allow me to present an alternative view on the tooling choice. In our environment, we found that Elasticsearch, Fluentd, and Kibana worked better beca...

6 months ago
Reply
Re: Secrets management: HashiCorp Vault vs AWS Secrets Manager

Adding some engineering details from our implementation. Architecture: serverless with Lambda. Tools used: Elasticsearch, Fluentd, and Kibana. Configu...

6 months ago