• Skip to main content
  • Skip to secondary menu
  • Skip to footer

Technologies.org

Technology Trends: Follow the Money

  • Technology Events 2026-2027
  • Sponsored Post
  • Technology Markets
  • About
    • GDPR
  • Contact

Baseten Secures $150 Million Series D, Cementing Its Role as the Backbone of AI Inference

September 6, 2025 By admin Leave a Comment

Baseten, the company building the infrastructure behind some of the fastest-growing AI applications, has announced a $150 million Series D funding round, valuing the company at $2.15 billion. The round was led by BOND with participation from CapitalG, Premji, Kevin and Elizabeth Weil of Scribble, as well as continued support from Conviction, 01a, IVP, Spark, and Greylock. This new infusion brings Baseten’s total funding to over $285 million, a milestone that underscores how critical inference infrastructure has become to the broader AI economy.

The heart of Baseten’s pitch lies in inference—the process of running AI models in real-world applications at scale. Just as the cloud became the indispensable foundation for the last generation of internet companies, inference is shaping up to be the cornerstone for AI-native businesses. Inference revenue is inherently tied to every AI application, and with the market already exceeding $100 billion and growing rapidly, Baseten is positioning itself as the essential utility provider for this new wave of growth. Customers ranging from Abridge in healthcare to Clay in go-to-market software to OpenEvidence in medical information delivery are already relying on Baseten to handle billions of inference calls per week, translating into billions of dollars of downstream revenue.

CEO and co-founder Tuhin Srivastava framed it directly: “Every breakout AI application depends on fast, reliable, and cost-effective inference, the same way the last 15 years of companies depended on the cloud. Baseten makes that possible. In the same way Stripe became an index of the internet economy, Baseten will become an index of the AI economy.” The comparison to Stripe is deliberate—Stripe built the transaction rails for the internet, and Baseten is building the inference rails for AI. This model positions them not as a niche infrastructure provider, but as a core layer of the emerging AI stack.

What differentiates Baseten’s offering is its Inference Stack, a platform that combines applied research, flexible infrastructure, and developer tooling. For companies like Clay, this means faster launches and more reliable customer-facing AI experiences. For healthcare startups like Abridge and OpenEvidence, it means life-critical scalability and trustworthiness, with Baseten supporting millions of clinical notes and high-stakes medical queries weekly. Investors see this as a signal that inference is not just a technical bottleneck but a fundamental enabler. Jay Simons of BOND compared Baseten’s trajectory to Atlassian’s rise, noting that the company is “years ahead in both product and adoption.”

The timing of this round aligns with Baseten’s recent product expansions. With the launch of Model APIs and Baseten Training, the company now enables developers not only to deploy models quickly but also to fine-tune them for performance and quality. These capabilities integrate with their Inference Stack to allow multiple models to operate together, creating seamless AI-powered experiences without compromising speed or cost-efficiency. This is where the company is doubling down—using the Series D capital to scale research into model performance, expand infrastructure, and invest heavily in customer success teams.

The broader implication is clear: inference is the new battleground of AI infrastructure, and Baseten has staked its claim. While hyperscalers like AWS, Google Cloud, and Azure will continue to compete for general compute, Baseten is carving out a specialized domain focused on high-performance, cost-optimized inference at scale. If the company succeeds, it will sit at the center of the AI economy much like Stripe did for online payments, creating a layer that nearly every fast-scaling AI application touches. For investors and customers alike, this fundraise is not just about Baseten’s growth, but about validating inference as the critical infrastructure layer of the AI age.

Filed Under: News

Reader Interactions

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Footer

Recent Posts

  • Qualcomm Acquires Ventana Micro Systems: Why It Matters, What It Changes, and Why Arm Should Pay Attention
  • Scylos Secures $3M Seed Round to Rethink Endpoint Security from the Ground Up
  • Databricks has just closed a massive new funding round that pushes its valuation to roughly $134 billion
  • Nu Quantum’s $60M Leap Toward the Entanglement Era
  • Haven Energy Raises $40M to Scale Virtual Power Plants Across the U.S. Grid
  • Supermicro Expands NVIDIA Blackwell Portfolio with Liquid-Cooled HGX B300 Systems
  • UMC and imec Push Silicon Photonics Into Its Next Act
  • Wizerr AI Unveils Agentic BOM Engine, Ushering Hardware Into Its Long-Awaited AI Era
  • ZincFive Secures $30 Million to Support AI-Era Data Center Resilience
  • Ply secures $8.5M to automate inventory for the trades, partners with Ferguson Ventures

Media Partners

  • Market Analysis
  • Cybersecurity Market
U.S. Tech Employment Slows as Hiring Cools and AI Reshapes Demand
Semiconductor Equipment Boom, 2025–2027, Global Manufacturing Outlook
ServiceNow Sharpens Its Competitive Edge by Making Moveworks the Front Line of the Enterprise
NVIDIA Acquires SchedMD: How Owning the Brain of the Cluster Sharpens NVIDIA’s Competitive Edge
Cloudflare Year in Review 2025: How the Internet Quietly Rewired Itself
The $250 Billion Stablecoin Market: Who Uses It, Why It Exists, and Where the Growth Actually Comes From
Will It Save Intel? The $1.6B SambaNova Question
Crisp’s $26M Series B1 Shows Why Vertical AI Is Pulling Ahead
Europe’s Spectrum Trap: How Smarter Policy Could Unlock a €75 Billion 5G Boost
Airwallex’s $330M Series G: The New Gravity Center of Borderless Finance
Stellar Cyber Climbs to #2 in MSSP Alert 2025 Rankings, Signaling Deepening Trust Across the Global SecOps Ecosystem
Ascend 2026, May–October 2026, Global Event Series
Black Hat Europe 2025, December 9–12, London, United Kingdom
C1 and Texas Southern University Launch Cybersecurity Lab, Houston, Texas
GDIT Wins $285M Cybersecurity Contract to Fortify Virginia’s Digital Backbone
Why ServiceNow Wants Armis: Security as the Missing Layer in the Entrprise Workflow Empire
Opal Security Names Howard Ting CEO as AI Access Governance Enters Its Defining Moment
Cyber Week Israel 2025, December 8–11, Tel Aviv
Qryptonic Names Senior Leadership Team Driving Quantum-Era Cryptographic Security
Thales AI Security Fabric, 2025–2026: A New Perimeter for the Age of Agentic AI

Media Partners

  • Market Research Media
  • Technology Conferences
PlayStation and the Quiet Power Center of a $200 Billion Gaming Industry
Adobe FY2025: AI Pulls the Levers, Cash Flow Leads the Story
Canva’s 2026 Creative Shift and the Rise of Imperfect-by-Design
fal Raises $140M Series D: Scaling the Core Infrastructure for Real-Time Generative Media
Gaming’s Next Expansion Wave, 2026–2030
Morphography — A Visual Language for the Next Era of AI
Netflix’s $83B Grab for Warner Bros. & HBO: A Tectonic Shift in Global Media
Clipbook Raises $3.3M Seed Round — And the PR World Just Got a Warning Shot
BrandsToShop.com — the right domain to have for Cyber Monday, Black Friday and every loud shopping season ahead
PressEspresso.com
Humanoids Summit Tokyo 2026, May 28–29, 2026, Takanawa Convention Center
Japan Pavilion at CES 2026, January 6–9, Las Vegas
KubeCon + CloudNativeCon Europe 2026, 23–26 March, Amsterdam
4YFN26, 2–5 March 2026, Fira Gran Via — Barcelona
DLD Munich 26, January 15–17, Munich, Germany
SPIE Photonics West 2026, January 17–22, San Francisco
Gurobi Decision Intelligence Summit, October 28–29, 2025, Vienna
MIT Sloan CFO Summit, November 20, 2025, Cambridge
Roblox Expands the Future of Creation at RDC 2025
Apple Announces WWDC25, June 9 to 13, 2025

Copyright © 2022 Technologies.org

Media Partners: Market Analysis & Market Research and Exclusive Domains