You’re spending somewhere between $60,000 and $150,000 a month on cloud infrastructure. Maybe more. It’s grown steadily for two years. Every time you ask engineering about it, they say it’s fine — you’re growing, the bill grows with it. Your CFO is asking pointed questions. And honestly, you don’t know who’s right.

Here’s what I’ve learned from doing cloud infrastructure reviews for companies at exactly this stage: both sides are usually partially correct, and neither side has the full picture.

Why This Happens

Cloud costs grow unchecked for a predictable set of reasons, none of which are malicious.

Engineers make sizing decisions under uncertainty. When you’re building something new, you don’t know the load, so you oversize and promise yourself you’ll come back to right-size later. You never do, because there’s always something more important. That decision compounds across every service, every environment, every year.

Development and staging environments silently balloon. I’ve walked into companies where staging was costing more than production because nobody had ever thought to right-size non-production infrastructure or schedule it to shut down at night.

Nobody owns the bill. This is the core problem. If engineering is judged on uptime and delivery speed, and there’s no one whose performance review includes cloud spend, costs are nobody’s job. They’re everyone’s assumption.

The First 48 Hours

Before you do anything else, get actual visibility. Not “here’s our AWS bill” visibility. Cost-by-service, cost-by-team, cost-by-environment visibility. If you can’t see that breakdown today, that’s your first problem to solve.

In AWS, this means enabling Cost Explorer with tag-based allocation. In GCP, it’s billing exports into BigQuery. In Azure, it’s cost management plus resource groups structured by team and environment. If your resources aren’t tagged — and at a lot of fast-growing companies, they’re not — you’re about to find out what’s been hiding in that undifferentiated blob.

Once you can see the breakdown, look for three things immediately:

Compute that runs 24/7 and doesn’t need to. Dev, staging, QA environments. Internal tools. Anything that isn’t customer-facing production has no business running at 3am on a Saturday. Scheduled scaling for non-production environments alone often saves 15-20% of the total bill.

Storage that nobody is using. Old EBS snapshots from terminated instances. S3 buckets full of logs that haven’t been queried in 18 months. Database backups retained at 5x your actual policy. I found $4,200/month in orphaned EBS snapshots at one company — they’d been paying for them for over a year and no one knew they existed.

Reserved capacity you didn’t buy but should have. If your baseline compute load has been stable for more than six months, you’re paying on-demand prices for load that qualifies for 40-60% reserved pricing discounts. This is money sitting on the table.

What Engineering Needs From You

Your engineers aren’t sandbagging when they say the bill is fine. They genuinely don’t have the context to know what “fine” looks like at your stage and margin structure. That’s not their fault — it’s a communication gap you can fix.

Give each engineering team a weekly view of their own spend, with trend data. Not a lecture. Not a cost-freeze policy. Just the number and the direction it’s moving. Engineers who can see that their service costs $8,000/month and went up 18% after last week’s deploy start thinking about cost the same way they think about latency — as a quality signal, not someone else’s problem.

Build cloud cost into your architecture review process. “What will this cost to run at scale?” should be a standard question alongside “will this scale?” and “is this secure?” Not as a veto — as a design input.

What’s Actually Normal

At your revenue stage, 8-15% of revenue going to cloud infrastructure is common for SaaS businesses. If you’re significantly above that, you have optimization work to do. If you’re significantly below it, you may be underinvesting in reliability and scale.

But the percentage is a starting point, not a verdict. The right number depends on your architecture, your growth rate, and your margin targets. What’s not acceptable is not knowing whether your number is right.

The pattern I’ve seen at companies spending $60-150k/month: two to four weeks of focused optimization work typically yields 25-40% savings with zero performance impact. That’s $200,000-$600,000 a year that can go to hiring, product, or runway.

This is the kind of problem a fractional CTO is built for — someone who can come in, get visibility fast, tell you what’s waste versus what’s investment, and give engineering a framework that keeps costs from creeping back. If your bill has become a source of tension between engineering and finance, let’s talk. You can book a 15-minute call here.


Related: What Is FinOps? | Tech Debt Translation: Making Your CFO Care | Engineering Metrics That Actually Matter