Baseten, a startup that provides infrastructure for running AI inference workloads, is reportedly raising $1.5 billion in a new funding round.
According to reports, Baseten has a platform that handles the computational side of deploying and serving AI models at scale. No lead investor or post-money valuation has been disclosed at this stage.
Why Inference Infrastructure Attracts Capital
AI inference, the process of running a trained model to generate outputs, has become a distinct infrastructure layer from model training. As enterprises deploy AI applications in production, they need reliable, low-latency serving capacity that model providers do not always supply directly.
Baseten sits in that gap. The company competes with cloud providers and a small group of dedicated inference startups. A $1.5B raise would give it capital to expand GPU capacity, extend geographic coverage, and deepen integrations with enterprise customers.
The raise, if confirmed, would follow a wave of large AI infrastructure rounds in 2026. Odyssey, a world-model AI lab, closed a $310M Series B earlier this week. Swiss private AI firm Prem AI is also pursuing a $100M Series A.
Also Read: Ethereum’s Code Army Nears 140,000 While Traders Flee ETH
Recent Context
AI inference has drawn increasing investor attention in the past 30 days as the broader AI deployment cycle matures. Fortune reported on June 18 that enterprises are shifting away from broad AI deployments and toward tighter cost controls and cheaper models.
That shift increases pressure on inference providers to offer competitive pricing. Baseten has not commented publicly on the reported raise. SiliconAngle cited its reporting as based on sourcing from within the funding process.
Read Next: Adam Back Traces Bitcoin To 1997, Rejects Todd Satoshi Theory





