Donald Price – Activity – OpsX DevOps Team Forum

Donald Price

@donald.price627

Joined: Mar 2, 2025

Topics: 1 / Replies: 43

Re: Practical guide: Terraform vs Pulumi: A comprehensive comparison for IaC

Makes sense! For us, the approach varied using Kubernetes, Helm, ArgoCD, and Prometheus. The main reason was that starting small and iterating is more effe...

11 months ago

Forum

Projects We Have Done

Re: Serverless architecture patterns and anti-patterns

From the ops trenches, here are the takes we've developed: Monitoring - Prometheus with Grafana dashboards. Alerting - custom Slack integration. Documen...

11 months ago

Forum

Lessons Learned

Re: Serverless architecture patterns and anti-patterns

Appreciate you laying this out so clearly! I have a few questions: 1) How did you handle scaling? 2) What was your approach to rollback? 3) Did you en...

11 months ago

Forum

Lessons Learned

Re: Jenkins vs GitHub Actions vs GitLab CI: 2024 comparison

100% aligned with this. The most important factor was that documentation debt is as dangerous as technical debt. We initially struggled with team resistanc...

11 months ago

Forum

AWS Cloud

Re: Update: Docker image optimization: From 1GB to 50MB

Good analysis, though I have a different take on the metrics focus. In our environment, we found that Vault, AWS KMS, and SOPS worked better b...

12 months ago

Forum

Breaking News

Re: Deep dive: Setting up a multi-region disaster recovery strategy on AWS

Just dealt with this! Symptoms: frequent timeouts. Root cause analysis revealed memory leaks. Fix: corrected routing rules. Prevention measures: bette...

12 months ago

Forum

Infrastructure as Code

Re: Practical guide: Comparing AWS, Azure, and GCP for enterprise workloads

We encountered something similar. The key factor was security considerations. We learned this the hard way when unexpected benefits included better de...

12 months ago

Forum

Clouds - AWS, Azure, GCP

Re: Practical guide: Implementing SLOs and error budgets for reliability

Thoughtful post - though I'd challenge one aspect on the timeline. In our environment, we found that Datadog, PagerDuty, and Slack worked better becau...

1 year ago

Forum

AIOps Discussion

Re: Update: Implementing SLOs and error budgets for reliability

Love this! We do this in our organization and can confirm the benefits. One thing we added was automated rollback based on error rate thresholds. The key insight...

1 year ago

Forum

AWS Cloud

Topic

Deep dive: Building a DevOps culture in a traditional enterprise

1 year ago

Forum

AI DevOps

Replies: 20

Re: Practical guide: Implementing SLOs and error budgets for reliability

This helps! Our team is evaluating this approach. Could you elaborate on success metrics? Specifically, I'm curious about stakeholder communication. A...

1 year ago

Forum

AIOps Discussion

Re: Deep dive: On-call rotation best practices to prevent burnout

Architecturally, there are important trade-offs to consider. First, data residency. Second, failover strategy. Third, performance tuning. We spent sig...

1 year ago

Forum

AWS Cloud

Re: Follow-up: Docker image optimization: From 1GB to 50MB

Great post! We've been doing this for about 19 months now and the results have been impressive. Our main learning was that automation should augment h...

1 year ago

Forum

Lessons Learned

Re: Update: Docker image optimization: From 1GB to 50MB

Here's the technical breakdown of our implementation. Architecture: hybrid cloud setup. Tools used: Vault, AWS KMS, and SOPS. Configuration highlights...

1 year ago

Forum

Weekly Roundup
