Technologies.org

Technology Trends: Follow the Money

Baseten Secures $150 Million Series D, Cementing Its Role as the Backbone of AI Inference

September 6, 2025 By admin

Baseten, the company building the infrastructure behind some of the fastest-growing AI applications, has announced a $150 million Series D funding round, valuing the company at $2.15 billion. The round was led by BOND with participation from CapitalG, Premji, Kevin and Elizabeth Weil of Scribble, as well as continued support from Conviction, 01a, IVP, Spark, and Greylock. This new infusion brings Baseten’s total funding to over $285 million, a milestone that underscores how critical inference infrastructure has become to the broader AI economy.

The heart of Baseten’s pitch lies in inference—the process of running AI models in real-world applications at scale. Just as the cloud became the indispensable foundation for the last generation of internet companies, inference is shaping up to be the cornerstone for AI-native businesses. Inference revenue is inherently tied to every AI application, and with the market already exceeding $100 billion and growing rapidly, Baseten is positioning itself as the essential utility provider for this new wave of growth. Customers ranging from Abridge in healthcare to Clay in go-to-market software to OpenEvidence in medical information delivery are already relying on Baseten to handle billions of inference calls per week, translating into billions of dollars of downstream revenue.

CEO and co-founder Tuhin Srivastava framed it directly: “Every breakout AI application depends on fast, reliable, and cost-effective inference, the same way the last 15 years of companies depended on the cloud. Baseten makes that possible. In the same way Stripe became an index of the internet economy, Baseten will become an index of the AI economy.” The comparison to Stripe is deliberate: Stripe built the transaction rails for the internet, and Baseten is building the inference rails for AI. This framing positions Baseten not as a niche infrastructure provider but as a core layer of the emerging AI stack.

What differentiates Baseten’s offering is its Inference Stack, a platform that combines applied research, flexible infrastructure, and developer tooling. For companies like Clay, this means faster launches and more reliable customer-facing AI experiences. For healthcare startups like Abridge and OpenEvidence, it means life-critical scalability and trustworthiness, with Baseten supporting millions of clinical notes and high-stakes medical queries weekly. Investors see this as a signal that inference is not just a technical bottleneck but a fundamental enabler. Jay Simons of BOND compared Baseten’s trajectory to Atlassian’s rise, noting that the company is “years ahead in both product and adoption.”

The timing of this round aligns with Baseten’s recent product expansions. With the launch of Model APIs and Baseten Training, the company now enables developers not only to deploy models quickly but also to fine-tune them for performance and quality. These capabilities integrate with its Inference Stack to allow multiple models to operate together, creating seamless AI-powered experiences without compromising speed or cost-efficiency. This is where the company is doubling down: it plans to use the Series D capital to scale research into model performance, expand infrastructure, and invest heavily in customer success teams.

The broader implication is clear: inference is the new battleground of AI infrastructure, and Baseten has staked its claim. While hyperscalers like AWS, Google Cloud, and Azure will continue to compete for general compute, Baseten is carving out a specialized domain focused on high-performance, cost-optimized inference at scale. If the company succeeds, it will sit at the center of the AI economy much as Stripe does for online payments, creating a layer that nearly every fast-scaling AI application touches. For investors and customers alike, this fundraise is not just about Baseten’s growth but about validating inference as the critical infrastructure layer of the AI age.
