Forum

Joan Hill
@joan.hill519
Joined: Aug 28, 2025
Topics: 3 / Replies: 30
Reply
Re: AI-driven incident response - our experience with PagerDuty Copilot

We went a different direction on this using Terraform, AWS CDK, and CloudFormation. The main reason was that failure modes should be designed for, not disc...

6 months ago
Reply
Re: Update: MLOps: Building ML pipelines with Kubeflow and MLflow

Great post! We've been doing this for about 13 months now and the results have been impressive. Our main learning was that automation should augment h...

7 months ago
Reply
Re: Update: MLOps: Building ML pipelines with Kubeflow and MLflow

Experienced this firsthand! Symptoms: increased error rates. Root cause analysis revealed network misconfiguration. Fix: fixed the leak. Prevention me...

7 months ago
Reply
Re: Open-sourced our internal developer platform - feedback wanted

We faced this too! Symptoms: frequent timeouts. Root cause analysis revealed network misconfiguration. Fix: increased pool size. Prevention measures: ...

7 months ago
Reply
Re: Update: Implementing AIOps for intelligent incident management

We went through something very similar. The problem: deployment failures. Our initial approach was ad-hoc monitoring, but that didn't work because lack...

7 months ago
Reply
Re: Update: MLOps: Building ML pipelines with Kubeflow and MLflow

Great post! We've been doing this for about 11 months now and the results have been impressive. Our main learning was that automation should augment h...

7 months ago
Reply
Re: Practical guide: Optimizing GitHub Actions for faster CI/CD pipelines

Some guidance based on our experience: 1) Document as you go 2) Implement circuit breakers 3) Review and iterate 4) Measure what matters. Common mista...

7 months ago
Topic
Forum
Replies: 13
Views: 158
Reply
Re: Implementing AIOps for intelligent incident management

Yes! We've noticed the same. The most important factor was that the human side of change management is often harder than the technical implementation. We ...

10 months ago
Reply
Re: Practical guide: Terraform vs Pulumi: A comprehensive comparison for IaC

Technical perspective from our implementation. Architecture: microservices on Kubernetes. Tools used: Istio, Linkerd, and Envoy. Configuration highlig...

11 months ago
Reply
Re: Deep dive: Setting up a multi-region disaster recovery strategy on AWS

Timely post! We're actively evaluating this approach. Could you elaborate on success metrics? Specifically, I'm curious about stakeholder communicatio...

11 months ago
Reply
Re: Practical guide: Comparing AWS, Azure, and GCP for enterprise workloads

This is a really thorough analysis! I have a few questions: 1) How did you handle security? 2) What was your approach to rollback? 3) Did you encounte...

1 year ago
Reply
Re: Deep dive: Jenkins vs GitHub Actions vs GitLab CI: 2024 comparison

Here's the technical breakdown of our implementation. Architecture: hybrid cloud setup. Tools used: Datadog, PagerDuty, and Slack. Configuration highl...

1 year ago