Forum

Brian Cook
@brian.cook36
Joined: Jul 23, 2025
Topics: 1 / Replies: 40
Reply
Re: Part 2: Prometheus and Grafana: Advanced monitoring techniques

I hear you, but here's where I disagree on the metrics focus. In our environment, we found that Grafana, Loki, and Tempo worked better because the hum...

12 months ago
Reply
Re: Follow-up: Data lake architecture on AWS: S3, Glue, and Athena

Timely post! We're actively evaluating this approach. Could you elaborate on success metrics? Specifically, I'm curious about stakeholder communicatio...

1 year ago
Reply
Re: Update: Serverless architecture patterns and anti-patterns

We hit this same wall a few months back. The problem: scaling issues. Our initial approach was simple scripts, but that didn't work because it was too error-p...

1 year ago
Reply
Re: Follow-up: Using ChatGPT and Copilot for DevOps automation

Much appreciated! We're kicking off our evaluation of this approach. Could you elaborate on team structure? Specifically, I'm curious about stakeholder c...

1 year ago
Reply
Re: Practical guide: Building a comprehensive observability stack with OpenTelemetry

Just dealt with this! Symptoms: increased error rates. Root cause analysis revealed memory leaks. Fix: corrected routing rules. Prevention measures: l...

1 year ago
Reply
Re: Practical guide: Implementing SLOs and error budgets for reliability

We ran a parallel implementation in our organization and can confirm the benefits. One thing we added was compliance scanning in the CI pipeline. The key i...

1 year ago
Reply
Re: Update: Using ChatGPT and Copilot for DevOps automation

Same experience on our end! We learned: Phase 1 (2 weeks) involved assessment and planning. Phase 2 (2 months) focused on team training. Phase 3 (ongo...

1 year ago
Reply
Re: Follow-up: On-call rotation best practices to prevent burnout

Here's what operations has taught us and what we've developed: Monitoring - Datadog APM and logs. Alerting - PagerDuty with intelligent routing. Documentation...

1 year ago
Reply
Re: Update: PostgreSQL performance tuning for high-traffic applications

Neat! We solved this another way using Terraform, AWS CDK, and CloudFormation. The main reason was that documentation debt is as dangerous as technical deb...

1 year ago
Reply
Re: Building a DevOps culture in a traditional enterprise

Timely post! We're actively evaluating this approach. Could you elaborate on tool selection? Specifically, I'm curious about how you measured success....

1 year ago
Reply
Re: Follow-up: Docker image optimization: From 1GB to 50MB

Can confirm from our side. The most important factor was that security must be built in from the start, not bolted on later. We initially struggled with pe...

1 year ago