Multi-cloud Terraform modules - how we manage 3 cloud providers

Alex Chen · 2025-11-06T03:41:42Z

We're running multi-cloud Terraform modules across 3 cloud providers in production and wanted to share our experience.

Scale:
- 782 services deployed
- 21 TB data processed/month
- 4M requests/day
- 5 regions worldwide

Architecture:
- Compute: EKS
- Data: RDS Aurora
- Queue: EventBridge

Monthly cost: ~$87k

Lessons learned:
1. Multi-AZ costs add up fast
2. NAT Gateways are costly
3. Autoscaling needs careful tuning

AMA about our setup!
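For a rough sense of unit economics, the scale and cost figures above can be turned into per-unit numbers. This is only a back-of-the-envelope sketch: the function name `unit_costs` and the 30-day month are assumptions, and it spreads the whole ~$87k evenly, which the original post does not claim.

```python
def unit_costs(monthly_cost_usd: float, services: int,
               requests_per_day: float, tb_per_month: float,
               days_per_month: int = 30) -> dict:
    """Divide a monthly bill evenly across services, requests, and data volume.

    Assumes a flat 30-day month by default and ignores that costs are not
    actually distributed evenly across workloads.
    """
    requests_per_month = requests_per_day * days_per_month
    return {
        "usd_per_service": monthly_cost_usd / services,
        "usd_per_million_requests": monthly_cost_usd / (requests_per_month / 1_000_000),
        "usd_per_tb": monthly_cost_usd / tb_per_month,
    }


# Figures quoted in the post: ~$87k/month, 782 services, 4M requests/day, 21 TB/month.
costs = unit_costs(87_000, 782, 4_000_000, 21)
for name, value in costs.items():
    print(f"{name}: {value:,.2f}")
```

Tracking these ratios month over month is one way to notice when items like NAT Gateway or Multi-AZ charges grow faster than traffic.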


Azure & GCP


Samuel Miller

(@samuel.miller567)


Funny timing - we just dealt with this. The problem: security vulnerabilities. Our initial approach was manual intervention, but it didn't scale. What actually worked: drift detection with automated remediation. The key insight was that failure modes should be designed for, not discovered in production. Now we're able to deploy with confidence.
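For readers who haven't set this up, drift detection with optional remediation can be sketched roughly as below. This is not the setup from the post, just a minimal illustration that relies on `terraform plan -detailed-exitcode` (exit code 0 = no changes, 1 = error, 2 = changes present). The function names, the `auto_apply` switch, and the injectable `run` parameter are all assumptions made for the example.

```python
import subprocess
from typing import Callable

# Exit codes returned by `terraform plan -detailed-exitcode`.
NO_CHANGES, PLAN_ERROR, CHANGES_PRESENT = 0, 1, 2


def classify_plan_exit_code(code: int) -> str:
    """Map a plan exit code to a drift verdict."""
    if code == NO_CHANGES:
        return "in_sync"
    if code == CHANGES_PRESENT:
        return "drifted"
    return "error"


def detect_drift(workdir: str, run: Callable = subprocess.run) -> str:
    """Run a read-only plan in `workdir` and report whether infra has drifted.

    `run` is injectable (hypothetical design choice) so the logic can be
    tested without a Terraform binary.
    """
    cmd = ["terraform", "plan", "-detailed-exitcode", "-input=false", "-no-color"]
    result = run(cmd, cwd=workdir, capture_output=True, text=True)
    return classify_plan_exit_code(result.returncode)


def remediate(workdir: str, verdict: str, auto_apply: bool = False,
              run: Callable = subprocess.run) -> bool:
    """Re-apply the configuration only when drift was found and auto_apply is on.

    Returns True if an apply ran and succeeded, False otherwise.
    """
    if verdict != "drifted" or not auto_apply:
        return False
    cmd = ["terraform", "apply", "-auto-approve", "-input=false", "-no-color"]
    result = run(cmd, cwd=workdir, capture_output=True, text=True)
    return result.returncode == 0
```

A scheduled job could call `detect_drift` per module directory and alert on "drifted" or "error". Leaving `auto_apply` off by default keeps a human in the loop until the failure modes are understood.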

Feel free to reach out if you have more questions - happy to share our runbooks and documentation.

The end result was a 3x increase in deployment frequency.

Posted : 23/12/2025 11:09 pm

Maria Carter

(@maria.carter392)


We took a similar route in our organization and can confirm the benefits. One thing we added was drift detection with automated remediation. The key insight for us was that failure modes should be designed for, not discovered in production. We also saw some unexpected benefits: better developer experience and faster onboarding. Happy to share more details if anyone is interested.

For context, we're using Datadog, PagerDuty, and Slack.

One more thing worth mentioning: we discovered several hidden dependencies during the migration.

Posted : 24/12/2025 10:14 pm

Donald Lee

(@donald.lee803)


We felt this too! Here's how it went for us: Phase 1 (1 month) involved stakeholder alignment. Phase 2 (1 month) focused on team training. Phase 3 (1 month) was all about knowledge sharing. Total investment was $200K, but the payback period was only 3 months. Key success factors: executive support, dedicated team, clear metrics. If I could do it again, I would set clearer success metrics up front.
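To make the payback figure concrete: under a simple, undiscounted payback model, a $200K investment recovered in 3 months implies a benefit of roughly $67K per month. The post does not state the monthly benefit, so that number is derived rather than reported, and the helper names below are hypothetical.

```python
def implied_monthly_benefit(investment_usd: float, payback_months: float) -> float:
    """Monthly benefit implied by a simple (undiscounted) payback period."""
    if payback_months <= 0:
        raise ValueError("payback_months must be positive")
    return investment_usd / payback_months


def payback_period_months(investment_usd: float, monthly_benefit_usd: float) -> float:
    """Months needed for a constant monthly benefit to cover the investment."""
    if monthly_benefit_usd <= 0:
        raise ValueError("monthly_benefit_usd must be positive")
    return investment_usd / monthly_benefit_usd


# Figures from the post: $200K total investment, 3-month payback.
benefit = implied_monthly_benefit(200_000, 3)
print(f"Implied monthly benefit: ${benefit:,.0f}")
```

Writing the target down this way before the project starts is one way to get the clearer success metrics mentioned above.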

Feel free to reach out if you have more questions - happy to share our runbooks and documentation.

Posted : 29/12/2025 12:12 am
