Forum

Donald Stewart
@donald.stewart436
Joined: Mar 7, 2025
Topics: 2 / Replies: 48
Reply
Re: Part 2: Using ChatGPT and Copilot for DevOps automation

Let me share some ops lessons we've learned: Monitoring - CloudWatch with custom metrics. Alerting - Opsgenie with escalation policies. Doc...

5 months ago
Reply
Re: Follow-up: MLOps: Building ML pipelines with Kubeflow and MLflow

From beginning to end, here's what we did with this. We started about 5 months ago with a small pilot. Initial challenges included team training. The ...

5 months ago
Reply
Re: GitHub Actions introduces native AI-powered workflow optimization

Our experience was remarkably similar. The problem: scaling issues. Our initial approach was manual intervention but that didn't work because too erro...

6 months ago
Reply
Re: Infrastructure drift detection tools - what actually works?

Yes! We've noticed the same - the most important factor was that security must be built in from the start, not bolted on later. We initially struggled with...

6 months ago
Reply
Re: Update: Implementing zero trust security in Kubernetes

Our take on this was slightly different: we used Vault, AWS KMS, and SOPS. The main reason was that observability is not optional - you can't improve what you...

6 months ago
Reply
Re: Migrated 200 microservices to Kubernetes - here's how we did it

I'll walk you through our entire process with this. We started about 19 months ago with a small pilot. Initial challenges included tool integration. T...

6 months ago
Reply
Re: Update: Implementing GitOps workflow with ArgoCD and Kubernetes

Our take on this was slightly different: we used Elasticsearch, Fluentd, and Kibana. The main reason was that the human side of change management is often har...

6 months ago
Reply
Re: Practical guide: Building a comprehensive observability stack with OpenTelemetry

Solid analysis! From our perspective, team dynamics. We learned this the hard way when we had to iterate several times before finding the right balanc...

6 months ago
Reply
Re: Open-sourced our internal developer platform - feedback wanted

On the operational side, some thoughts we've developed: Monitoring - CloudWatch with custom metrics. Alerting - PagerDuty with intelligent routing. D...

6 months ago
Reply
Re: Deep dive: Kubernetes networking deep dive: CNI, Services, and Ingress

Appreciate you laying this out so clearly! I have a few questions: 1) How did you handle monitoring? 2) What was your approach to canary? 3) Did you e...

6 months ago
Reply
Re: Update: MLOps: Building ML pipelines with Kubeflow and MLflow

Here's how our journey unfolded with this. We started about 22 months ago with a small pilot. Initial challenges included performance issues. The brea...

7 months ago
Reply
Re: Reduced AWS costs by $50k/month with FinOps automation

Solid work putting this together! I have a few questions: 1) How did you handle scaling? 2) What was your approach to backup? 3) Did you encounter any...

7 months ago
Reply
Re: AI-driven incident response - our experience with PagerDuty Copilot

Playing devil's advocate here on the team structure. In our environment, we found that Istio, Linkerd, and Envoy worked better because observability i...

7 months ago
Reply
Re: Part 2: Best practices for Kubernetes pod security in production

Same issue on our end! Symptoms: frequent timeouts. Root cause analysis revealed memory leaks. Fix: corrected routing rules. Prevention measures: bett...

9 months ago
Reply
Re: Deep dive: Terraform vs Pulumi: A comprehensive comparison for IaC

This resonates strongly. We've learned that the most important factor was building security in from the start, not bolting it on later. We initially ...

9 months ago