SaaS Tuesday Issue #5: When AI pricing gets real

This week in SaaS

OpenAI raises GPT-4.1 pricing

Input tokens are now 50% more expensive; output tokens jumped 25%. The move signals OpenAI's confidence in the model's dominance—and a willingness to test how much teams will pay for quality. If you're locked into GPT-4.1 for production, budget accordingly. — techcrunch.com

Anthropic's Claude Sonnet 5 in testing

Sources say Sonnet 5 is being tested with select partners, positioning itself as a faster, cheaper alternative to GPT-4.1. The timing is no accident: Anthropic is clearly watching OpenAI's pricing move and readying a competitive response. — theverge.com

Vercel launches AI gateway pricing tier

The edge platform now offers a metered pricing option for AI inference, letting teams pay per request instead of flat-rate. It's a direct shot at reducing vendor lock-in and giving builders more granular cost control. — news.ycombinator.com

Mistral closes Series C funding round

The open-source AI darling raised significant capital to scale inference infrastructure and compete head-to-head with OpenAI and Anthropic on cost and speed. Expect Mistral's pricing to stay aggressive. — axios.com

Funding & moves

Cursor raised Series B at a valuation that signals the AI code editor is now a must-watch. The funding validates that developers will pay for AI-first IDEs if they genuinely accelerate shipping. — techcrunch.com

Mistral closed Series C, cementing its position as the European counterweight to U.S. AI labs. Capital will fuel inference capacity and model training—critical as enterprises demand alternatives to OpenAI. — axios.com

Perplexity AI's annualized revenue is growing faster than expected, per recent reports. The AI search play is monetizing faster than most predicted—a signal that search disruption is real and investors are betting on it. — theinformation.com

Deal of the week

PostHog is the obvious move if you're building AI features and need to track how users actually interact with them. With Claude and GPT pricing in flux, you need visibility into which models are driving value—and PostHog's session replay + analytics combo tells you exactly where AI is helping or hurting UX.

The timing this week is tight: if you're evaluating AI gateway costs or considering multi-model strategies, PostHog's product analytics will show you which inference calls are actually moving the needle. No guessing, no wasted budget on features nobody uses.

Setup takes hours, not weeks. Most teams see ROI within 30 days. If you're shipping AI features in May, lock this in now—you'll want the data before Q2 planning.

Quick hits

Inference costs matter now. Compare Vercel's new metered tier against Mistral's API pricing. Even 10-20% savings per request compounds fast at scale.
Claude Sonnet 5 testing = imminent launch. If you're GPT-4.1 only, start testing Sonnet's beta. You'll want leverage when contract renewal hits.
Cursor's valuation validates AI IDEs. If your team isn't using AI-assisted coding yet, the market is telling you it's table stakes.
Perplexity's revenue growth is real. Search disruption isn't theoretical anymore. If you're a content or discovery platform, this matters.
Anthropic is pricing to compete. Watch for Claude Sonnet 5 pricing drop vs. OpenAI's move. Price wars in AI inference are just starting.
Open-source models are gaining momentum. Mistral's Series C validates that enterprises want alternatives. Evaluate Mistral and Llama for cost-sensitive workloads.

Until next Tuesday

Pricing pressure is here. If you're evaluating AI infrastructure this quarter, get quotes from at least two providers—and build monitoring into your stack from day one. Forward this to your engineering lead if you're shopping for inference or analytics tools.