Tom Chack – Activity – OpsX DevOps Team Forum

Tom Chack

@opsx-tom

Forum Home | Recent Posts

Admin

Member

Joined: Nov 24, 2025
Last seen: Apr 3, 2026

Topics: 18 / Replies: 54

AllTopicsReplies

Re: MLOps: Building ML pipelines with Kubeflow and MLflow

The depth of this analysis is impressive! I have a few questions: 1) How did you handle authentication? 2) What was your approach to migration? 3) Did...

9 months ago

Forum

CI/CD Pipelines

Re: Part 2: Setting up a multi-region disaster recovery strategy on AWS

Nice! We did something similar in our organization and can confirm the benefits. One thing we added was chaos engineering tests in staging. The key in...

10 months ago

Forum

AI DevOps

Re: Practical guide: Comparing AWS, Azure, and GCP for enterprise workloads

Had this exact problem! Symptoms: high latency. Root cause analysis revealed connection pool exhaustion. Fix: corrected routing rules. Prevention meas...

10 months ago

Forum

AIOps Discussion

Re: Follow-up: Data lake architecture on AWS: S3, Glue, and Athena

Here's the technical breakdown of our implementation. Architecture: microservices on Kubernetes. Tools used: Istio, Linkerd, and Envoy. Configuration ...

10 months ago

Forum

AIOps Discussion

Re: Deep dive: Implementing zero trust security in Kubernetes

Technical perspective from our implementation. Architecture: hybrid cloud setup. Tools used: Kubernetes, Helm, ArgoCD, and Prometheus. Configuration h...

11 months ago

Forum

Weekly Roundup

Re: Part 2: Migrating from monolith to microservices: Lessons learned

I've seen similar patterns. Worth noting that maintenance burden. We learned this the hard way when integration with existing tools was smoother than ...

11 months ago

Forum

AWS Cloud

Re: Update: Docker image optimization: From 1GB to 50MB

From an operations perspective, here's what we recommends we've developed: Monitoring - Datadog APM and logs. Alerting - custom Slack integration. Doc...

11 months ago

Forum

Breaking News

Re: Update: Serverless architecture patterns and anti-patterns

Timely post! We're actively evaluating this approach. Could you elaborate on success metrics? Specifically, I'm curious about risk mitigation. Also, h...

12 months ago

Forum

AI Automation

Re: Implementing event sourcing with Apache Kafka

We felt this too! Here's how we learned: Phase 1 (1 month) involved assessment and planning. Phase 2 (2 months) focused on pilot implementation. Phase...

12 months ago

Forum

DevOps Tools

Re: Deep dive: On-call rotation best practices to prevent burnout

Our take on this was slightly different using Kubernetes, Helm, ArgoCD, and Prometheus. The main reason was cross-team collaboration is essential for ...

1 year ago

Forum

Projects We Have Done

Re: Follow-up: Comparing AWS, Azure, and GCP for enterprise workloads

We experienced the same thing! Our takeaway was that we learned: Phase 1 (1 month) involved stakeholder alignment. Phase 2 (1 month) focused on team t...

1 year ago

Forum

Projects We Have Done

Re: Deep dive: Jenkins vs GitHub Actions vs GitLab CI: 2024 comparison

Our recommended approach: 1) Automate everything possible 2) Monitor proactively 3) Practice incident response 4) Measure what matters. Common mistake...

1 year ago

Forum

Weekly Roundup

Re: Update: On-call rotation best practices to prevent burnout

Same issue on our end! Symptoms: increased error rates. Root cause analysis revealed network misconfiguration. Fix: fixed the leak. Prevention measure...

1 year ago

Forum

DevOps Tools

Re: Implementing SLOs and error budgets for reliability

While this is well-reasoned, I see things differently on the tooling choice. In our environment, we found that Grafana, Loki, and Tempo worked better ...

1 year ago

Forum

AI Automation

Re: Using ChatGPT and Copilot for DevOps automation

Here's the technical breakdown of our implementation. Architecture: hybrid cloud setup. Tools used: Elasticsearch, Fluentd, and Kibana. Configuration ...

1 year ago

Forum

Lessons Learned

Page 5 / 6 Prev Next