Forum

Donald Price
@donald.price627
Joined: Mar 2, 2025
Topics: 1 / Replies: 43
Reply
Re: Practical guide: Terraform vs Pulumi: A comprehensive comparison for IaC

Makes sense! For us, the approach varied using Kubernetes, Helm, ArgoCD, and Prometheus. The main reason was starting small and iterating is more effe...

11 months ago
Reply
Re: Serverless architecture patterns and anti-patterns

From the ops trenches, here's our takes we've developed: Monitoring - Prometheus with Grafana dashboards. Alerting - custom Slack integration. Documen...

11 months ago
Reply
Re: Serverless architecture patterns and anti-patterns

Appreciate you laying this out so clearly! I have a few questions: 1) How did you handle scaling? 2) What was your approach to rollback? 3) Did you en...

11 months ago
Reply
Re: Jenkins vs GitHub Actions vs GitLab CI: 2024 comparison

100% aligned with this. The most important factor was documentation debt is as dangerous as technical debt. We initially struggled with team resistanc...

11 months ago
Forum
Reply
Re: Update: Docker image optimization: From 1GB to 50MB

Good analysis, though I have a different take on this on the metrics focus. In our environment, we found that Vault, AWS KMS, and SOPS worked better b...

12 months ago
Reply
Re: Deep dive: Setting up a multi-region disaster recovery strategy on AWS

Just dealt with this! Symptoms: frequent timeouts. Root cause analysis revealed memory leaks. Fix: corrected routing rules. Prevention measures: bette...

12 months ago
Reply
Re: Practical guide: Comparing AWS, Azure, and GCP for enterprise workloads

We encountered something similar. The key factor was security considerations. We learned this the hard way when unexpected benefits included better de...

12 months ago
Reply
Re: Practical guide: Implementing SLOs and error budgets for reliability

Thoughtful post - though I'd challenge one aspect on the timeline. In our environment, we found that Datadog, PagerDuty, and Slack worked better becau...

1 year ago
Reply
Re: Update: Implementing SLOs and error budgets for reliability

Love this! In our organization and can confirm the benefits. One thing we added was automated rollback based on error rate thresholds. The key insight...

1 year ago
Forum
Reply
Re: Practical guide: Implementing SLOs and error budgets for reliability

This helps! Our team is evaluating this approach. Could you elaborate on success metrics? Specifically, I'm curious about stakeholder communication. A...

1 year ago
Reply
Re: Deep dive: On-call rotation best practices to prevent burnout

Architecturally, there are important trade-offs to consider. First, data residency. Second, failover strategy. Third, performance tuning. We spent sig...

1 year ago
Forum
Reply
Re: Follow-up: Docker image optimization: From 1GB to 50MB

Great post! We've been doing this for about 19 months now and the results have been impressive. Our main learning was that automation should augment h...

1 year ago
Reply
Re: Update: Docker image optimization: From 1GB to 50MB

Here's the technical breakdown of our implementation. Architecture: hybrid cloud setup. Tools used: Vault, AWS KMS, and SOPS. Configuration highlights...

1 year ago
Page 3 / 3
Scroll to Top