
We are a digital agency helping businesses develop immersive, engaging, and user-focused web, app, and software solutions.
2310 Mira Vista Ave
Montrose, CA 91020
2500+ reviews based on client feedback

What's Included?
ToggleBrian Armstrong, the head of Coinbase, recently shared a short list of ways his team is trying to keep the price tag of artificial intelligence under control. The crypto exchange is a fast‑moving place. New features land every few weeks, and a lot of that work uses large language models and other AI tools. Those tools can be expensive if you let them run unchecked. Armstrong’s goal is simple: let engineers experiment, but make sure the bill stays reasonable. He calls the idea “tokenmaxxing,” which basically means getting the most out of every token you spend on an AI model. The approach is a mix of culture, tooling, and clear limits. In this post I’ll walk through the five steps he mentioned and add my own take on what works and what could be improved.
The first rule is to give each team a token budget before they start a new AI experiment. Think of it like a mileage limit for a road trip. Engineers write a short plan that says how many tokens they expect to use and why. If they go over, they need to explain the extra cost. This forces people to think about efficiency early on instead of discovering waste later. In practice, it also creates a paper trail that finance can audit. I like this because it makes the cost visible to everyone, not just the finance department. It also encourages developers to ask, “Do I really need a 4‑kilobyte prompt, or can I get the same answer with less?”
Armstrong’s team built a thin layer around the external AI APIs. The wrapper automatically cuts off requests that exceed the set token budget. It can also trim prompts, remove unnecessary context, and suggest cheaper model alternatives. By putting the guardrail in code, they remove the temptation to click “send” on a massive request just to see what happens. From my perspective, this is a smart move. It’s easy to forget limits when you’re deep in debugging, and a software‑level block saves both time and money. The downside is that the wrapper needs regular updates as models evolve, but the trade‑off feels worth it.
Instead of punishing people for going over budget, Coinbase introduced a small reward system for teams that stay under. They get a badge on their internal dashboard and sometimes a modest budget increase for future projects. The idea is to make frugality feel like a win, not a chore. I think this works because it taps into the same competitive spirit that drives engineers to ship faster. When the reward is visible to peers, it creates a healthy rivalry. However, the program must stay transparent; otherwise, it could turn into a game of who hides usage rather than who truly optimizes it.
Armstrong’s list also mentions a shift toward open‑source alternatives for internal tools. Running a model on your own servers can be cheaper than paying per‑token to a cloud provider, especially at scale. Coinbase has started hosting smaller language models for tasks like code linting or simple text classification. The trade‑off is that these models may not be as powerful as the biggest commercial options, but for many internal jobs they are more than enough. My view is that this hybrid approach—using open‑source for routine work and reserving the pricey APIs for high‑impact tasks—makes a lot of sense. It also gives the engineering team more control over updates and security.
The final piece is a regular audit of AI spend. Every month, a small cross‑functional team looks at the token usage reports, checks for anomalies, and asks the engineers why certain spikes happened. They then feed that information back into the budgeting process. This loop makes sure that the rules don’t become stale and that new use cases get evaluated fairly. In my experience, a continuous feedback cycle is the only way to keep costs low in a fast‑changing environment. It also builds trust: finance sees that the engineering side is proactive, and engineers see that their concerns are heard.
Coinbase’s five‑point plan shows that you can still let engineers play with AI without letting the bill explode. By setting budgets, building technical guardrails, rewarding frugality, using cheaper models when they fit, and keeping a regular audit, the company creates a culture where cost awareness is part of the daily workflow. I think other tech firms can learn from this playbook, especially those that rely heavily on third‑party AI services. The key is to make the limits visible, enforceable, and even rewarding. When you do that, you get the best of both worlds: innovative products and a sustainable bottom line.
Source: Original Article



Comments are closed