Couldn't agree more. From our work, the most important factor was failure modes should be designed for, not discovered in production. We initially struggled with scaling issues but found that compliance scanning in the CI pipeline worked well. The ROI has been significant - we've seen 50% improvement.
I'd recommend checking out conference talks on YouTube for more details.
Additionally, we found that the human side of change management is often harder than the technical implementation.
This resonates with what we experienced last month. The problem: scaling issues. Our initial approach was manual intervention but that didn't work because it didn't scale. What actually worked: cost allocation tagging for accurate showback. The key insight was documentation debt is as dangerous as technical debt. Now we're able to deploy with confidence.
The end result was 80% reduction in security vulnerabilities.
Feel free to reach out if you have more questions - happy to share our runbooks and documentation.