Baseten, an AI inference company powering a new generation of category-defining AI applications, has announced a major $300 million financing round led by IVP, CapitalG, and NVIDIA as anchor investors. As a result, the investment values Baseten at an impressive $5 billion. Moreover, this marks the company’s third funding round within the past year, clearly highlighting the surging demand for reliable, high-performance infrastructure capable of running modern AI models at production scale.
As AI adoption accelerates across industries, organizations increasingly seek greater control over their models, intellectual property, and deployment environments. Addressing this need, Baseten has positioned itself as a critical infrastructure partner for ambitious AI teams. As one investor noted, “In a world where every ambitious AI team wants to run many models and fully own its IP, Baseten gives them the freedom, reliability, and economics to do that at scale.”
AI Authority Trend: Cloudflare and JD Cloud Partner to Accelerate AI Inference for Global Developers
Over the past six years, Baseten has steadily built its platform to support real-world AI applications. Today, it serves as the inference backbone for several influential AI-driven products that are transforming how people work and build software. Notably, companies such as Cursor, OpenEvidence, Abridge, Notion, and Clay rely on Baseten to deploy and scale their AI capabilities with confidence.
Customer feedback further reinforces Baseten’s growing reputation. “Baseten lets us run the models we need, the way we need to run them,” says Shiv Rao, Co-founder & CEO of Abridge. “The performance is best-in-class, but what sets them apart is everything else: the reliability, the developer experience, the fact that they’re constantly finding ways to lower our costs. They’re a partner, not a vendor.”
Meanwhile, the broader AI industry is undergoing a rapid shift. After years of prioritizing model training, the spotlight has moved to inference the ability for AI systems to reason and generate outputs in real time within practical workflows. Industry analysts now estimate that inference will represent nearly two-thirds of all AI compute by the end of 2026, compared to just one-third in 2023. Recognizing this transition early, Baseten designed its platform specifically to help customers run an increasing number of models in production efficiently and reliably.
“If cloud was the foundation that enabled the last generation of great technology companies, inference is the foundation for the next,” says Tuhin Srivastava, co-founder and CEO of Baseten. “Every breakout AI application depends on fast, reliable, and cost-effective inference. We’ve spent six years building the infrastructure to make that possible and we’re ready for this next chapter of hundreds and then thousands of new models.”
AI Authority Trend: Sabey Data Centers Transforms SDC Manhattan into AI Inference Hub
Importantly, the latest investment reinforces Baseten’s vision of a multi-model AI future. Rather than concentrating power in a few massive models, the company aims to support thousands of specialized models built for specific industries and use cases. “Our customers are building precise models for everything from elite software development to medical documentation to high-stakes legal reasoning, and they need a platform that will allow them to deliver deep specificity, expertise, and performance,” says Srivastava. “This is the type of user experience and the promise of AI we’re dedicated to making possible.”
Finally, Baseten differentiates itself through open runtimes, multi-cloud flexibility, and a strong focus on security, guardrails, and observability. These capabilities allow companies to retain ownership of their IP while optimizing for cost and reliability. As investor Sarah Guo, General Partner at Conviction, explains, “Baseten is quickly becoming default infrastructure,” adding that its combination of open runtimes, multi-cloud resilience, and a thoughtfully designed developer experience represents “the new standard the best companies expect.”
AI Authority Trend: d-Matrix Secures $275 Million Series C to Transform AI Inference for Data Centers
To share your insights, please write to us at info@intentamplify.com





