• Skip to main content
  • Skip to secondary menu
  • Skip to footer

Technologies.org

Technology Trends: Follow the Money

  • Technology Events 2026-2027
  • Sponsored Post
  • Technology Markets
  • About
    • GDPR
  • Contact

Baseten Secures $150 Million Series D, Cementing Its Role as the Backbone of AI Inference

September 6, 2025 By admin Leave a Comment

Baseten, the company building the infrastructure behind some of the fastest-growing AI applications, has announced a $150 million Series D funding round, valuing the company at $2.15 billion. The round was led by BOND with participation from CapitalG, Premji, Kevin and Elizabeth Weil of Scribble, as well as continued support from Conviction, 01a, IVP, Spark, and Greylock. This new infusion brings Baseten’s total funding to over $285 million, a milestone that underscores how critical inference infrastructure has become to the broader AI economy.

The heart of Baseten’s pitch lies in inference—the process of running AI models in real-world applications at scale. Just as the cloud became the indispensable foundation for the last generation of internet companies, inference is shaping up to be the cornerstone for AI-native businesses. Inference revenue is inherently tied to every AI application, and with the market already exceeding $100 billion and growing rapidly, Baseten is positioning itself as the essential utility provider for this new wave of growth. Customers ranging from Abridge in healthcare to Clay in go-to-market software to OpenEvidence in medical information delivery are already relying on Baseten to handle billions of inference calls per week, translating into billions of dollars of downstream revenue.

CEO and co-founder Tuhin Srivastava framed it directly: “Every breakout AI application depends on fast, reliable, and cost-effective inference, the same way the last 15 years of companies depended on the cloud. Baseten makes that possible. In the same way Stripe became an index of the internet economy, Baseten will become an index of the AI economy.” The comparison to Stripe is deliberate—Stripe built the transaction rails for the internet, and Baseten is building the inference rails for AI. This model positions them not as a niche infrastructure provider, but as a core layer of the emerging AI stack.

What differentiates Baseten’s offering is its Inference Stack, a platform that combines applied research, flexible infrastructure, and developer tooling. For companies like Clay, this means faster launches and more reliable customer-facing AI experiences. For healthcare startups like Abridge and OpenEvidence, it means life-critical scalability and trustworthiness, with Baseten supporting millions of clinical notes and high-stakes medical queries weekly. Investors see this as a signal that inference is not just a technical bottleneck but a fundamental enabler. Jay Simons of BOND compared Baseten’s trajectory to Atlassian’s rise, noting that the company is “years ahead in both product and adoption.”

The timing of this round aligns with Baseten’s recent product expansions. With the launch of Model APIs and Baseten Training, the company now enables developers not only to deploy models quickly but also to fine-tune them for performance and quality. These capabilities integrate with their Inference Stack to allow multiple models to operate together, creating seamless AI-powered experiences without compromising speed or cost-efficiency. This is where the company is doubling down—using the Series D capital to scale research into model performance, expand infrastructure, and invest heavily in customer success teams.

The broader implication is clear: inference is the new battleground of AI infrastructure, and Baseten has staked its claim. While hyperscalers like AWS, Google Cloud, and Azure will continue to compete for general compute, Baseten is carving out a specialized domain focused on high-performance, cost-optimized inference at scale. If the company succeeds, it will sit at the center of the AI economy much like Stripe did for online payments, creating a layer that nearly every fast-scaling AI application touches. For investors and customers alike, this fundraise is not just about Baseten’s growth, but about validating inference as the critical infrastructure layer of the AI age.

Filed Under: News

Reader Interactions

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Footer

Recent Posts

  • Preply Reaches $1.2B Valuation After $150M Series D to Scale Human-Led, AI-Enhanced Language Learning
  • Datarails Raises $70M Series C to Turn the CFO’s Office into an AI-Native Nerve Center
  • Emergent Raises $70M Series B as AI Turns Software Creation Into an Entrepreneurial Commodity
  • Fujifilm Introducing SX400: A Long-Range Camera Designed for the Real World
  • D-Wave Becomes the First Dual-Platform Quantum Computing Company After Quantum Circuits Acquisition
  • Wasabi Technologies Secures $70M to Fuel the Next Phase of AI-Ready Cloud Storage
  • Samsung Maintenance Mode: The Quiet Feature That Actually Changed How I Buy Phones
  • Miro AI Workflows Launch: From Whiteboard Chaos to Enterprise-Grade Deliverables
  • 10 Breakthrough Technologies of 2026
  • Samsung Walked Away From Long Zoom — And Left a Gap It Once Owned

Media Partners

  • Market Analysis
  • Cybersecurity Market
Nvidia’s China Problem Is Self-Inflicted, and Washington Should Stop Pretending Otherwise
USPS and the Theater of Control: How Government Freezes Failure in Place
Skild AI Funding Round Signals a Shift Toward Platform Economics in Robotics
Saks Sucks: Luxury Retail’s Debt-Fueled Mirage Collapses
Alpaca’s $1.15B Valuation Signals a Maturity Moment for Global Brokerage Infrastructure
The Immersive Experience in the Museum World
The Great Patent Pause: 2025, the Year U.S. Innovation Took a Breath
OpenAI Acquires Torch, A $100M Bet on AI-Powered Health Records Analytics
Iran’s Unreversible Revolt: When Internal Rupture Meets External Signals
Global Robotics Trends 2026: Where Machines Start Thinking for Themselves
Lumu’s 2026 Compromise Report: Why Cybersecurity Has Entered the Age of Silent Breaches
Novee Emerges from Stealth, 2025, Offensive Security at Machine Speed
depthfirst Raises $40M Series A to Build AI-Native Software Defense
Bitwarden Doubles Down on Identity Security as Passwords Finally Start to Lose Their Grip
Cloudflare App Innovation Report 2026: Why Technical Debt Is the Real AI Bottleneck
CrowdStrike Acquires Seraphic Security: Browser Security Becomes the New Cyber Frontline
Hedge Funds Quietly Rewrite Their Risk Playbook as Cybersecurity Becomes Non-Negotiable
Torq Raises $140M Series D, Reaches $1.2B Valuation as Agentic AI Redefines the SOC
CrowdStrike–SGNL Deal Signals Identity’s Promotion to the Center of Cyber Defense
CrowdStrike Backs the Next Wave of AI-Native Cybersecurity Startups

Media Partners

  • Market Research Media
  • Technology Conferences
BBC and the Gaza War: How Disproportionate Attention Reshapes Reality
Parallel Museums: Why the Future of Art Might Be Copies, Not Originals
ClickHouse Series D, The $400M Bet That Data Infrastructure, Not Models, Will Decide the AI Era
AI Productivity Paradox: When Speed Eats Its Own Gain
Voice AI as Infrastructure: How Deepgram Signals a New Media Market Segment
Spangle AI and the Agentic Commerce Stack: When Discovery and Conversion Converge Into One Layer
PlayStation and the Quiet Power Center of a $200 Billion Gaming Industry
Adobe FY2025: AI Pulls the Levers, Cash Flow Leads the Story
Canva’s 2026 Creative Shift and the Rise of Imperfect-by-Design
fal Raises $140M Series D: Scaling the Core Infrastructure for Real-Time Generative Media
Humanoid Robot Forum 2026, June 22–25, Chicago
Supercomputing Asia 2026, January 26–29, Osaka International Convention Center, Japan
Chiplet Summit 2026, February 17–19, Santa Clara Convention Center, Santa Clara, California
HumanX, 22–24 September 2026, Amsterdam
CES 2026, January 7–10, Las Vegas
Humanoids Summit Tokyo 2026, May 28–29, 2026, Takanawa Convention Center
Japan Pavilion at CES 2026, January 6–9, Las Vegas
KubeCon + CloudNativeCon Europe 2026, 23–26 March, Amsterdam
4YFN26, 2–5 March 2026, Fira Gran Via — Barcelona
DLD Munich 26, January 15–17, Munich, Germany

Copyright © 2022 Technologies.org

Media Partners: Market Analysis & Market Research and Exclusive Domains, Photography