AI Search

Classic Search

Search Phrase:

Search Type:

Advanced search options

Search in Forums:

Search in date period:

Sort Search Results by:

AI Assistant

Notifications

Clear all

GCP Cloud Run vs AWS Lambda - real performance comparison

Nicholas Morgan · 2025-10-05T09:41:42Z

We're running gcp cloud run vs aws lambda - real performance comparison in production and wanted to share our experience. Scale: - 697 services deployed - 63 TB data processed/month - 6M requests/day - 11 regions worldwide Architecture: - Compute: App Runner - Data: DocumentDB - Queue: MSK (Kafka) Monthly cost: ~$121k Lessons learned: 1. Spot instances are production-ready 2. S3 lifecycle policies are essential 3. FinOps team paid for itself AMA about our setup!

✦ Summarize Topic

Page 2 / 2 Prev

Azure & GCP

Last Post by Nicholas Morgan 4 months ago

21 Posts

20 Users

0 Reactions

279 Views

RSS

Matthew Ramos

(@matthew.ramos738)

Posts: 0

Translate ▼

Great post! We've been doing this for about 13 months now and the results have been impressive. Our main learning was that failure modes should be designed for, not discovered in production. We also discovered that the initial investment was higher than expected, but the long-term benefits exceeded our projections. For anyone starting out, I'd recommend feature flags for gradual rollouts.

For context, we're using Grafana, Loki, and Tempo.

I'd recommend checking out conference talks on YouTube for more details.

Posted : 18/11/2025 9:19 am

Jose Williams

(@jose.williams694)

Posts: 0

Translate ▼

Parallel experiences here. We learned: Phase 1 (1 month) involved assessment and planning. Phase 2 (3 months) focused on pilot implementation. Phase 3 (1 month) was all about knowledge sharing. Total investment was $200K but the payback period was only 3 months. Key success factors: executive support, dedicated team, clear metrics. If I could do it again, I would set clearer success metrics.

For context, we're using Grafana, Loki, and Tempo.

The end result was 90% decrease in manual toil.

Posted : 18/11/2025 11:33 pm

Christine Carter

(@christine.carter463)

Posts: 0

Translate ▼

We went through something very similar. The problem: security vulnerabilities. Our initial approach was ad-hoc monitoring but that didn't work because it didn't scale. What actually worked: drift detection with automated remediation. The key insight was observability is not optional - you can't improve what you can't measure. Now we're able to deploy with confidence.

Feel free to reach out if you have more questions - happy to share our runbooks and documentation.

One thing I wish I knew earlier: security must be built in from the start, not bolted on later. Would have saved us a lot of time.

Posted : 22/11/2025 5:34 am

Gregory Brooks

(@gregory.brooks453)

Posts: 0

Translate ▼

Spot on! From what we've seen, the most important factor was observability is not optional - you can't improve what you can't measure. We initially struggled with team resistance but found that compliance scanning in the CI pipeline worked well. The ROI has been significant - we've seen 3x improvement.

Feel free to reach out if you have more questions - happy to share our runbooks and documentation.

Posted : 25/11/2025 10:16 am

Donald Stewart

(@donald.stewart436)

Posts: 0

Translate ▼

I've seen similar patterns. Worth noting that maintenance burden. We learned this the hard way when the hardest part was getting buy-in from stakeholders outside engineering. Now we always make sure to document in runbooks. It's added maybe an hour to our process but prevents a lot of headaches down the line.

I'd recommend checking out relevant blog posts for more details.

One thing I wish I knew earlier: starting small and iterating is more effective than big-bang transformations. Would have saved us a lot of time.

Posted : 27/11/2025 12:03 pm

Nicholas Morgan

(@nicholas.morgan692)

Posts: 0

Topic starter

Translate ▼

Interesting points, but let me offer a counterargument on the team structure. In our environment, we found that Kubernetes, Helm, ArgoCD, and Prometheus worked better because automation should augment human decision-making, not replace it entirely. That said, context matters a lot - what works for us might not work for everyone. The key is to focus on outcomes.

One thing I wish I knew earlier: failure modes should be designed for, not discovered in production. Would have saved us a lot of time.

Posted : 28/11/2025 7:39 am

Page 2 / 2 Prev

11 Forums
309 Topics
4,684 Posts
0 Online
109 Members

Forum Icons: Forum contains no unread posts Forum contains unread posts

Topic Icons: Not Replied Replied Active Hot Sticky Unapproved Solved Private Closed