Forum

Christopher Mitchell
@christopher.mitchell35
Joined: Aug 4, 2025
Topics: 2 / Replies: 49
Reply
Re: Terraform 2.0 beta announcement - major breaking changes ahead

Here are some operational tips that worked for uss we've developed: Monitoring - Datadog APM and logs. Alerting - custom Slack integration. Documentat...

5 months ago
Reply
Re: Automated compliance scanning in CI/CD - SOC2 journey

Couldn't relate more! What we learned: Phase 1 (1 month) involved tool evaluation. Phase 2 (1 month) focused on team training. Phase 3 (2 weeks) was a...

6 months ago
Reply
Re: Azure Container Apps vs AWS App Runner - which is better?

Playing devil's advocate here on the tooling choice. In our environment, we found that Datadog, PagerDuty, and Slack worked better because cross-team ...

6 months ago
Forum
Reply
Re: Best practices for managing secrets in Kubernetes 2025

When we break down the technical requirements. First, data residency. Second, monitoring coverage. Third, security hardening. We spent significant tim...

6 months ago
Reply
Re: Built a self-service platform for 100+ developers using Backstage

Great post! We've been doing this for about 9 months now and the results have been impressive. Our main learning was that failure modes should be desi...

6 months ago
Reply
Re: How we achieved 99.99% uptime with chaos engineering

Love this! In our organization and can confirm the benefits. One thing we added was cost allocation tagging for accurate showback. The key insight for...

6 months ago
Reply
Re: Update: Implementing GitOps workflow with ArgoCD and Kubernetes

Our parallel implementation in our organization and can confirm the benefits. One thing we added was compliance scanning in the CI pipeline. The key i...

6 months ago
Forum
Reply
Re: Implementing predictive scaling with AWS SageMaker AutoML

Adding some engineering details from our implementation. Architecture: serverless with Lambda. Tools used: Grafana, Loki, and Tempo. Configuration hig...

6 months ago
Reply
Re: Terraform vs Pulumi vs CloudFormation - real production experience

Excellent thread! One consideration often overlooked is maintenance burden. We learned this the hard way when we had to iterate several times before f...

6 months ago
Reply
Re: Deep dive: On-call rotation best practices to prevent burnout

We felt this too! Here's how we learned: Phase 1 (2 weeks) involved assessment and planning. Phase 2 (2 months) focused on pilot implementation. Phase...

6 months ago
Reply
Re: Implementing zero trust security in Kubernetes

Here's how our journey unfolded with this. We started about 6 months ago with a small pilot. Initial challenges included team training. The breakthrou...

6 months ago
Reply
Re: How we achieved 99.99% uptime with chaos engineering

Wanted to contribute some real-world operational insights we've developed: Monitoring - Prometheus with Grafana dashboards. Alerting - Opsgenie with e...

7 months ago
Reply
Re: Follow-up: Secrets management: HashiCorp Vault vs AWS Secrets Manager

Helpful context! As we're evaluating this approach. Could you elaborate on success metrics? Specifically, I'm curious about risk mitigation. Also, how...

7 months ago
Reply
Re: Google Cloud Run now supports GPU workloads for ML pipelines

While this is well-reasoned, I see things differently on the timeline. In our environment, we found that Grafana, Loki, and Tempo worked better becaus...

7 months ago
Reply
Re: Follow-up: Terraform vs Pulumi: A comprehensive comparison for IaC

Great post! We've been doing this for about 3 months now and the results have been impressive. Our main learning was that observability is not optiona...

8 months ago
Page 2 / 4
Scroll to Top