What just happened — two labs, one quarter
If you build on a lab's free CLI or flat-rate plan, the last two months were a warning. In April, Anthropic stopped letting flat-rate Claude Pro/Max subscriptions run third-party agents like OpenClaw — pushing that usage onto pay-as-you-go. Heavy users saw bills jump as much as 50× (a $200 plan turning into $1,000–5,000 a month) before Anthropic softened it into a capped 'Agent SDK credit' (then pay-as-you-go above the cap). Then Google announced it's retiring the open-source Gemini CLI — roughly 100,000 GitHub stars — on June 18, 2026, replacing it with the closed-source Antigravity CLI. Free, Pro, and Ultra users lose access; only enterprise licences and paid API keys keep working. Neither move is evil; labs were subsidizing usage they'd underpriced. The lesson for builders is blunt:
- Anthropic (Apr 4): flat-rate plans blocked from third-party agents → metered credits, then pay-as-you-go
- Google (Jun 18): open-source Gemini CLI retired → closed-source Antigravity CLI; free/Pro/Ultra cut off
- Two of the three major labs restricted open/flat access in one quarter — that's a pattern, not a coincidence
The real lesson: a lab's CLI is not a foundation
A vendor's free tool is a convenience, not infrastructure. The moment your CI pipeline, your product, or a client deployment runs on one lab's flat-rate plan or one CLI binary, your costs and uptime are set by their pricing page. They can change that page overnight, with a few weeks' notice at best. Loyalty to a different lab doesn't fix it. Make the provider a swappable part instead.
- If a tool is free today, it's a pricing decision tomorrow. Plan for that.
- The danger isn't the specific lab. It's leaning your whole stack on any one of them at the layer that matters.
- Going provider-agnostic isn't a stance. It's what keeps you shipping when a billing page moves.
Three moves to go provider-agnostic this week
You don't need to rip everything out. You need a setup where swapping the model behind your agent is a config change, not a rebuild.
- 1. Move off flat-rate subscriptions for anything automated → run on your OWN API keys (Anthropic/OpenAI/Google/OpenRouter), so a subscription policy change can't break your pipeline.
- 2. Put a provider-agnostic harness in front: OpenCode (~75 providers), Aider, or Claude Code pointed at API keys — the agent stays, the model behind it is swappable.
- 3. Keep a local fallback for the cheap, high-volume tasks: an open-weight model (Qwen / Gemma class) running locally so not every call depends on a frontier API.
- Bonus: at the app layer, route by capability and cost — frontier model for the hard calls, cheap/local for the rest. The router is the thing you own.
Your 5-point lock-in audit
Run this before June 18. Each 'yes' is a single point of failure to fix.
- Does any CI job, cron, or product call
gemini(or one specific CLI) directly? → wrap it so the backend is swappable. - Are you running automated agents on a FLAT-RATE subscription rather than API keys? → move to keys.
- If your main provider doubled prices tomorrow, what breaks — and how fast could you switch?
- Do you have a local/open-weight fallback for non-critical, high-volume work?
- Is 'the model' a config value in your stack, or is it hard-wired into the code?
Your 10-minute starting move
Don't boil the ocean. Pick the one place you're most exposed and decouple it first.
- Find your single biggest dependency on a free CLI or flat-rate plan (usually CI or a daily automation).
- Swap it to a provider-agnostic harness (OpenCode/Aider/Claude Code) running on your own API key.
- Add one local model as a fallback for the cheap tasks.
- Now a lab's next pricing change is an annoyance, not an outage.
Get the next drop
New AI build guides + the occasional bonus template. No spam, unsubscribe anytime.