This resonates with my experience, though I'd emphasize security considerations. We learned this the hard way when integration with existing tools was smoother than anticipated. Now we always make sure to monitor proactively. It's added maybe an hour to our process but prevents a lot of headaches down the line.
The end result was 60% improvement in developer productivity.
For context, we're using Terraform, AWS CDK, and CloudFormation.
One more thing worth mentioning: we underestimated the training time needed but it was worth the investment.
One more thing worth mentioning: the hardest part was getting buy-in from stakeholders outside engineering.
For context, we're using Grafana, Loki, and Tempo.
I'd recommend checking out relevant blog posts for more details.
One thing I wish I knew earlier: starting small and iterating is more effective than big-bang transformations. Would have saved us a lot of time.
Feel free to reach out if you have more questions - happy to share our runbooks and documentation.