Andrew Roberts – Activity – OpsX DevOps Team Forum

Andrew Roberts

@andrew.roberts887

Joined: May 20, 2025

Topics: 3 / Replies: 46

Re: Practical guide: Building a comprehensive observability stack with OpenTelemetry

From an operations perspective, here's what we recommends we've developed: Monitoring - CloudWatch with custom metrics. Alerting - custom Slack integr...

7 months ago

Forum

DevOps News

Re: GCP vs AWS for machine learning workloads - 2025 update

What a comprehensive overview! I have a few questions: 1) How did you handle authentication? 2) What was your approach to rollback? 3) Did you encount...

7 months ago

Forum

Azure & GCP

Re: AWS ECS Fargate vs EKS - cost analysis for production workloads

Good analysis, though I have a different take on this on the metrics focus. In our environment, we found that Elasticsearch, Fluentd, and Kibana worke...

7 months ago

Forum

Azure & GCP

Re: How we achieved 99.99% uptime with chaos engineering

Same issue on our end! Symptoms: high latency. Root cause analysis revealed connection pool exhaustion. Fix: corrected routing rules. Prevention measu...

7 months ago

Forum

Lessons Learned

Re: GitHub Copilot for DevOps: worth the $39/month?

Thanks for this! We're beginning our evaluation ofg this approach. Could you elaborate on success metrics? Specifically, I'm curious about stakeholder...

7 months ago

Forum

AIOps Discussion

Re: AWS Lambda cold start optimization techniques

Thanks for this! We're beginning our evaluation ofg this approach. Could you elaborate on tool selection? Specifically, I'm curious about stakeholder ...

7 months ago

Forum

AWS Cloud

Re: Update: MLOps: Building ML pipelines with Kubeflow and MLflow

Technical perspective from our implementation. Architecture: serverless with Lambda. Tools used: Datadog, PagerDuty, and Slack. Configuration highligh...

8 months ago

Forum

Projects We Have Done

Topic

Update: Implementing GitOps workflow with ArgoCD and Kubernetes

8 months ago

Forum

Success Stories

Replies: 25

Re: Follow-up: Data lake architecture on AWS: S3, Glue, and Athena

The depth of this analysis is impressive! I have a few questions: 1) How did you handle testing? 2) What was your approach to migration? 3) Did you en...

8 months ago

Forum

DevOps Tools

Re: Building a comprehensive observability stack with OpenTelemetry

Technically speaking, a few key factors come into play. First, data residency. Second, backup procedures. Third, cost optimization. We spent significa...

9 months ago

Forum

Breaking News

Re: Practical guide: Migrating from monolith to microservices: Lessons learned

The technical specifics of our implementation. Architecture: hybrid cloud setup. Tools used: Terraform, AWS CDK, and CloudFormation. Configuration hig...

9 months ago

Forum

AIOps Discussion

Re: Update: Optimizing GitHub Actions for faster CI/CD pipelines

Here's our full story with this. We started about 11 months ago with a small pilot. Initial challenges included legacy compatibility. The breakthrough...

9 months ago

Forum

CI/CD Pipelines

Re: Terraform vs Pulumi: A comprehensive comparison for IaC

Playing devil's advocate here on the tooling choice. In our environment, we found that Datadog, PagerDuty, and Slack worked better because cross-team ...

10 months ago

Forum

DevOps Tools

Re: Part 2: Prometheus and Grafana: Advanced monitoring techniques

We felt this too! Here's how we learned: Phase 1 (2 weeks) involved stakeholder alignment. Phase 2 (1 month) focused on process documentation. Phase 3...

10 months ago

Forum

AIOps Discussion

Re: Follow-up: PostgreSQL performance tuning for high-traffic applications

Some guidance based on our experience: 1) Document as you go 2) Monitor proactively 3) Practice incident response 4) Keep it simple. Common mistakes t...

11 months ago

Forum

DevOps Tools

Page 2 / 4 Prev Next