Skip to main content

Usage-Based Pricing for Copilot Is Good for Microsoft's Investors. Read That Sentence Again.

The Meter is running hot. Image by TW with some help from Claude and ChatGPT

TheStreet ran a piece this week arguing that, of Microsoft's two Copilot announcements, the shift to usage-based pricing matters more to investors than the DeepSeek flirtation. That read is correct. It is also the tell.

Here is what Microsoft actually did. Copilot Cowork, the agent that reaches across Microsoft 365 to run multi-step work on your data, is coming off the flat per-seat add-on and moving onto consumption billing the company calls "Copilot Credits." Charles Lamanna, who runs Copilot, told Axios the product could not be offered on an unlimited-use basis. The users he pointed to are the ones doing hundreds of tasks a week. He called them "way productive." And then he said the part vendors normally keep off the slide: their costs go very high.

So the most productive users are the expensive ones. Hold that thought, because the whole argument lives there.

What "good for investors" is really saying

A pricing model earns the label "good for investors" when three things are true. Revenue starts to track cost-to-serve. Revenue scales with consumption instead of sitting flat per seat. And the vendor stops eating the margin on its heaviest users. All three are true here. None of them is a statement about whether a customer got value.

That is the gap I want to sit in for a minute.

Usage-based pricing meters an input. Tokens, compute, credits, whatever the unit. The customer does not buy tokens because they want tokens. They want a finished report, a resolved ticket, a reconciled spreadsheet. The token count is the cost of producing the outcome, not the outcome. And the relationship between the two is loose at best.

TSIA put it plainly in its May analysis of AI pricing: usage does not equal value, and consumption models often fail to reflect actual business value. That is not a critic talking. That is a research firm whose audience is the vendors building these models.

Agents make the coupling worse, not better

A chatbot answers and stops. An agent keeps going. It reads files, calls tools, checks its own work, hits a wall, tries again. Each of those steps burns compute, and the steps are what get metered. Every retry, every verbose detour, every loop the agent runs to second-guess itself adds to the bill. The customer pays for all of it.

Now ask yourself the hard question. Does a workflow that took the agent three retries and a long chain of self-checks deliver more value than the same workflow done cleanly in one pass? Of course not. It delivers the same outcome and costs more. Under seat pricing, that inefficiency was the vendor's problem. Under usage pricing, it is line-itemed onto the buyer's invoice.

The billing platform Flexprice, which sells the plumbing for this, says it out loud to its own customers: retries, loops, and background jobs are friction, not value, and usage-based pricing only works when customers gain something real as the meter climbs. Their warning to vendors is the buyer's whole case.

The people who get punished are the people who bought in

We do not have to guess how this lands, because GitHub Copilot already ran the experiment. On June 1 it moved to token billing. The median user barely noticed. The pain landed on the top five to ten percent, and it landed hard: community projections of bills jumping ten to fifty times, one developer modeling a move from roughly $29 a month to nearly $750, another claiming $50 to $3,000. TechCrunch called it the end of Copilot's golden age.

Look at who those heavy users are. They are not abusers. They are the people who took the vendor's three-year advice to use the tool for everything, built agentic workflows around it, and made it part of how they work. The pricing change penalizes exactly the depth of adoption every vendor claims to want. And the old safety net, where running out of premium budget dropped you to a cheaper model so you could keep working, is gone. What is gone, too, is the cost ceiling.

Lamanna's "way productive" power user and GitHub's top-decile developer are the same person. The model charges most to the customer who is succeeding most. Reward and penalty have swapped places. The reward now goes to the vendor.

The structural problem, which is bigger than the bill

Now the second half, and this part is more important than any individual invoice.

Think about where accountability for value sits in each pricing model. With outcome-based pricing, the vendor gets paid when a result lands and not before. Fin's (formerly known as Intercom) Fin charges 99 cents per resolution, billed only when the customer confirms the AI actually solved the problem. Under that model, every failed attempt costs the vendor. So the vendor has a direct, financial reason to make the agent efficient, accurate, and sparing with compute. Their margin depends on it.

Usage pricing inverts that incentive. The vendor is paid for activity regardless of whether the activity worked. An agent that burns more tokens, retries more often, and reasons more verbosely produces more revenue, not less. I am not claiming Microsoft will deliberately bloat Cowork to pump credits. I am saying the financial pressure that used to push toward lean, effective agents has been switched off; and switched-off incentives have a tendency of showing up in the product eventually.

The demand side pulls in the same direction. There is a name for it now: tokenmaxxing, the workplace habit of treating AI usage as a proxy for productivity, where people get judged on how many token they burn rather than on what they shipped. Built In's writeup is blunt about it: the habit rewards visible activity, not results. So stack the three forces. Buyers under pressure to run up consumption as a status signal, a vendor that meters by consumption, and an agent that inflates consumption on its own. Everything drives the meter up. Nothing points it at the outcome.

That is the real cost of the model. It moves the vendor one step further from owning the question of whether you got value, and it hands that entire question to you. The vendor essentially plays Pontius Pilate. The buyer now runs FinOps for AI. You set the budget caps. You write the spending policies. You read the consumption dashboard. You type /cost to see what a task burned. Microsoft, to its credit, is shipping all of those controls, and they are better than the ones GitHub fumbled out the door. But notice what they are. They are tools for the customer to govern value. They are not the vendor guaranteeing it.

The honest counterargument

I would be doing the same vendor-spin thing I just criticized if I left out the other side.

Flat pricing for agentic tools is inherently unsustainable. The economics are upside down: the model subsidizes the heaviest five percent and overcharges the lightest fifty. Metered billing is the rational fix for that, and for a low-volume or experimental buyer it is a better deal than paying a fat seat fee to barely use the thing. Aligning price with cost-to-serve is a good thing. It is just a vendor virtue, not a customer one, and the trick to watch is anyone presenting the first as if it were the second.

Cheaper models do not solve the coupling problem. A fine-tuned DeepSeek on Azure lowers the unit price of the metered thing. It does not make the metered thing track value. You are paying less per token for a number that still has a loose relationship to your outcome. And who knows how many additional tokens a potentially inferior model burns.

Where this lands

On my orchestration battleground, Cowork is the M365 layer that coordinates work across your apps and your Graph. Pricing that orchestration by consumption reframes it from a capability you own into a utility you rent by the drink. That is a substantial shift in who carries the risk when an orchestrated workflow goes long, and it is the buyer.

So, on the null hypothesis. Is usage-based pricing good for customers? Largely no, and for the reasons the question assumed. The metered unit is loosely coupled to value, agents widen that gap rather than closing it, the model bills the most engaged users the most, and it relocates the entire burden of value accountability from the vendor onto the buyer. Good for investors and good for customers are not in alignment here. On this one, they partly trade off.

Three things to do if you are buying

Model your power users, not your average. The average user will not break your budget. The fifteen people who actually adopted the thing will, and they may very well be the ones delivering your return.

Make the vendor define the unit before you sign. If a task can cost anywhere from a few credits to a few hundred depending on how many times the agent talks to itself, that is not a price, it is a range. And you'll end up at the upper end, trust me. Ask vendors to commit to a per-outcome cost and watch how fast the conversation gets vague.

Push for outcome terms on anything that has a definable outcome. Resolution, completion, ticket closed. If the vendor will only price the effort and not the result, they are telling you something about how confident they are in the result.

The interesting question is not whether usage pricing is here. It is. The question is whether buyers will accept a model where the vendor is paid the same whether the agent nails it on the first try or flails through ten, or whether the market pushes back toward paying for outcomes the way Fin does. Or Zendesk. Or Hubspot. Or others. I do not know which way that goes. But the vendor whose margin improves when its agent works harder is not, structurally, the vendor most motivated to make the agent work better.

 

Comments

Last Year's Top 5 Popular Posts

You are only as good as your customer remembers

As you know, I am very interested in how organizations are using business applications, which problems they do address, and how they review their success. In a next instance of these customer interviews, I had the opportunity to talk with Melissa Gordon , Executive Vice President, Enterprise Solutions at Tidal Basin about their journey with Zoho. You can watch the full interview on YouTube. Tidal Basin is a government contractor that provides various services throughout the government space, including disaster response, technology and financial services, and contact centers. Tidal Basin started with Zoho CRM and was searching for a project management tool in 2019. This was prompted by mainly two drivers. First, employees were asking for tools to help them running their projects. Second, with a focus on organizational growth and bigger projects that involved more people, Tidal Basin wanted to reduce its risk exposure and increase the efficiency of project delivery. This way, the compa...

SAP Draws a Perimeter around Agentic AI and What That Means for the Rest of US

The most consequential enterprise AI governance document published this year arrived in late April with surprisingly little fanfare. SAP's updated API Policy, version 4/2026 , is a short document in plain English. The clause that is most interesting is Section 2.2.2. It restricts how autonomous and generative AI systems are permitted to interact with SAP APIs. Read literally, it has the potential to change the architecture of agentic AI projects across every SAP customer landscape. Read carefully, it is also more interesting than the lock-in headlines suggest. The policy targets a specific category of AI behavior, not AI as such. It connects to commercial mechanics that go well beyond API stability. And the literal text, in its current form, will probably not survive the next two policy revisions intact. There is a lot to unpack. I will walk through what the policy actually says, how the SAP-watching community is reading it, what the rest of the major enterprise vendors are doin...

The Illusion of Value: Why Salesforce’s Agentic Work Unit is the New "Bad Query" of the AI Era

The News On February. 25, 2026, Salesforce announced a pricing and metrics update . During the company’s Q4 FY2026 earnings call, CEO Marc Benio ff, together with CMO Patrick Stokes , unveiled the Agentic Work Unit (AWU). Positioned as a metric to quantify the labor performed by autonomous digital systems, Salesforce defines an AWU as one discrete task accomplished by an AI agent. According to Salesforce, this discrete task represents the exact moment " raw intelligence is converted into real work ". It is not a fixed unit but measured as a processed prompt, a completed reasoning chain, or an invoked tool. Salesforce explicitly designed the AWU to move the industry conversation away from the raw consumption of Large Language Model (LLM) tokens. As Benioff noted, tokens only measure "how much an AI talks," whereas the AWU is intended to measure actual business execution. The scale of this rollout is massive. Salesforce reported that its platform has already processe...

Data Wars: SAP Vs. Salesforce In The AI-Driven Enterprise Future

The past weeks certainly brought a lot of news, with SAP Sapphire and Salesforce's surely strategically timed announcement of acquiring Informatica , ranging at the top. I have covered both in recent articles. The enterprise software landscape is crackling with energy, and Artificial Intelligence (AI) is certainly the star of the show. It isn't anymore about AI as a mere feature; it's about AI as the strategic core of enterprise software. Two recent announcements underscored this shift: SAP's ambitious AI-centric vision that was unveiled at its Sapphire 2025 conference, and, arriving hot on its heels, Salesforce's agreement to acquire data management titan Informatica for $8 billion. Both signal an intensified battle for AI supremacy, where trusted, enterprise-wide data is the undisputed new monarch. Of course, SAP and Salesforce are not the only ones duking this one out. SAP's Sapphire Vision: An AI-Powered, Integrated Enterprise At its Sapphire 2025 event in ...

CPQ, Meet Price Optimization: Your Revenue Lifecycle Just Got Serious

The news On October 1, 2025, Conga announced its intent to acquire the B2B business of PROS , following PRO’s acquisition by Thomas Bravo . At the same time, ThomaBravo and PROS announced that PRO’s travel business segment will be run as a standalone business . The bigger picture Revenue operations, revenue management and revenue lifecycle management have become a thing in the past years, as evidenced by the number of specialized companies that solve parts of the overall problem of optimizing revenue. It also got abused to some extent (e.g., surge pricing models) when the users of the corresponding capabilities consider optimizing being the same as maximizing. Reality check: It is not. While optimizing involves a bit of identifying how much a customer is willing to pay, it also involves the thought of repeat business, or in other words customer loyalty, even without a formal loyalty program. And that involves the customer experience, part of which the speed of creating a quote with mat...