Forum

Jeffrey Price
@jeffrey.price491
Joined: Dec 25, 2024
Topics: 3 / Replies: 46
Reply
Re: Follow-up: PostgreSQL performance tuning for high-traffic applications

This resonates strongly. We've learned that the most important factor was security must be built in from the start, not bolted on later. We initially ...

11 months ago
Forum
Topic
Forum
Replies: 15
Views: 299
Reply
Re: Deep dive: Implementing SLOs and error budgets for reliability

Great post! We've been doing this for about 7 months now and the results have been impressive. Our main learning was that documentation debt is as dan...

12 months ago
Reply
Re: Update: Migrating from monolith to microservices: Lessons learned

We built something comparable in our organization and can confirm the benefits. One thing we added was cost allocation tagging for accurate showback. ...

12 months ago
Forum
Reply
Re: Follow-up: Data lake architecture on AWS: S3, Glue, and Athena

Let me dive into the technical side of our implementation. Architecture: hybrid cloud setup. Tools used: Datadog, PagerDuty, and Slack. Configuration ...

12 months ago
Reply
Re: Deep dive: Implementing SLOs and error budgets for reliability

Our solution was somewhat different using Vault, AWS KMS, and SOPS. The main reason was automation should augment human decision-making, not replace i...

12 months ago
Reply
Re: Implementing event sourcing with Apache Kafka

From the ops trenches, here's our takes we've developed: Monitoring - Prometheus with Grafana dashboards. Alerting - custom Slack integration. Documen...

1 year ago
Forum
Reply
Re: Deep dive: On-call rotation best practices to prevent burnout

Interesting points, but let me offer a counterargument on the tooling choice. In our environment, we found that Vault, AWS KMS, and SOPS worked better...

1 year ago
Reply
Re: Follow-up: Data lake architecture on AWS: S3, Glue, and Athena

This resonates strongly. We've learned that the most important factor was cross-team collaboration is essential for success. We initially struggled wi...

1 year ago
Reply
Re: Part 2: Terraform vs Pulumi: A comprehensive comparison for IaC

On the technical front, several aspects deserve attention. First, compliance requirements. Second, failover strategy. Third, cost optimization. We spe...

1 year ago
Forum
Reply
Re: Part 2: Building a DevOps culture in a traditional enterprise

Allow me to present an alternative view on the timeline. In our environment, we found that Vault, AWS KMS, and SOPS worked better because automation s...

1 year ago
Reply
Re: Update: On-call rotation best practices to prevent burnout

Great writeup! That said, I have some concerns on the team structure. In our environment, we found that Grafana, Loki, and Tempo worked better because...

1 year ago
Forum
Reply
Re: Follow-up: Setting up a multi-region disaster recovery strategy on AWS

Our implementation in our organization and can confirm the benefits. One thing we added was chaos engineering tests in staging. The key insight for us...

1 year ago
Reply
Re: Practical guide: Implementing GitOps workflow with ArgoCD and Kubernetes

Same experience on our end! We learned: Phase 1 (1 month) involved tool evaluation. Phase 2 (2 months) focused on pilot implementation. Phase 3 (ongoi...

1 year ago
Page 3 / 4
Scroll to Top