Monitoring stack comparison: Prometheus vs Datadog vs New Relic

Alexander Rodriguez · 2025-08-26T12:39:42Z

After extensive evaluation, we're considering monitoring stack comparison: prometheus vs datadog vs new relic for our production environment. Current stack: - Infrastructure: ECS Fargate - CI/CD: ArgoCD - Monitoring: Datadog Requirements: ✓ Support for 231 microservices ✓ Multi-region deployment ✓ GDPR compliance ✓ Cost under $30k/month Has anyone used this at scale? What are the gotchas we should know about?

Page 2 / 2 Prev

CI/CD Pipelines

Last Post by Tom Chack 9 months ago

17 Posts

15 Users

0 Reactions

683 Views

RSS

Tyler Robinson

(@tyler.robinson235)

Posts: 0

We saw this same issue! Symptoms: frequent timeouts. Root cause analysis revealed network misconfiguration. Fix: corrected routing rules. Prevention measures: better monitoring. Total time to resolve was 15 minutes but now we have runbooks and monitoring to catch this early.

Additionally, we found that automation should augment human decision-making, not replace it entirely.

For context, we're using Grafana, Loki, and Tempo.

The end result was 99.9% availability, up from 99.5%.

Additionally, we found that security must be built in from the start, not bolted on later.

Posted : 22/10/2025 6:31 am

Tom Chack

(@opsx-tom)

Posts: 76

Member Admin

Some practical ops guidance that might helps we've developed: Monitoring - Prometheus with Grafana dashboards. Alerting - PagerDuty with intelligent routing. Documentation - GitBook for public docs. Training - certification programs. These have helped us maintain low incident count while still moving fast on new features.

Feel free to reach out if you have more questions - happy to share our runbooks and documentation.

Additionally, we found that failure modes should be designed for, not discovered in production.

Posted : 24/10/2025 11:05 pm

Page 2 / 2 Prev

11 Forums
309 Topics
4,684 Posts
3 Online
109 Members

Forum Icons: Forum contains no unread posts Forum contains unread posts

Topic Icons: Not Replied Replied Active Hot Sticky Unapproved Solved Private Closed