• Skip to main content
  • Skip to secondary menu
  • Skip to footer

Technologies.org

Technology Trends: Follow the Money

  • Technology Events 2026-2027
  • Sponsored Post
  • Technology Markets
  • About
    • GDPR
  • Contact

Baseten Secures $150 Million Series D, Cementing Its Role as the Backbone of AI Inference

September 6, 2025 By admin Leave a Comment

Baseten, the company building the infrastructure behind some of the fastest-growing AI applications, has announced a $150 million Series D funding round, valuing the company at $2.15 billion. The round was led by BOND with participation from CapitalG, Premji, Kevin and Elizabeth Weil of Scribble, as well as continued support from Conviction, 01a, IVP, Spark, and Greylock. This new infusion brings Baseten’s total funding to over $285 million, a milestone that underscores how critical inference infrastructure has become to the broader AI economy.

The heart of Baseten’s pitch lies in inference—the process of running AI models in real-world applications at scale. Just as the cloud became the indispensable foundation for the last generation of internet companies, inference is shaping up to be the cornerstone for AI-native businesses. Inference revenue is inherently tied to every AI application, and with the market already exceeding $100 billion and growing rapidly, Baseten is positioning itself as the essential utility provider for this new wave of growth. Customers ranging from Abridge in healthcare to Clay in go-to-market software to OpenEvidence in medical information delivery are already relying on Baseten to handle billions of inference calls per week, translating into billions of dollars of downstream revenue.

CEO and co-founder Tuhin Srivastava framed it directly: “Every breakout AI application depends on fast, reliable, and cost-effective inference, the same way the last 15 years of companies depended on the cloud. Baseten makes that possible. In the same way Stripe became an index of the internet economy, Baseten will become an index of the AI economy.” The comparison to Stripe is deliberate—Stripe built the transaction rails for the internet, and Baseten is building the inference rails for AI. This model positions them not as a niche infrastructure provider, but as a core layer of the emerging AI stack.

What differentiates Baseten’s offering is its Inference Stack, a platform that combines applied research, flexible infrastructure, and developer tooling. For companies like Clay, this means faster launches and more reliable customer-facing AI experiences. For healthcare startups like Abridge and OpenEvidence, it means life-critical scalability and trustworthiness, with Baseten supporting millions of clinical notes and high-stakes medical queries weekly. Investors see this as a signal that inference is not just a technical bottleneck but a fundamental enabler. Jay Simons of BOND compared Baseten’s trajectory to Atlassian’s rise, noting that the company is “years ahead in both product and adoption.”

The timing of this round aligns with Baseten’s recent product expansions. With the launch of Model APIs and Baseten Training, the company now enables developers not only to deploy models quickly but also to fine-tune them for performance and quality. These capabilities integrate with their Inference Stack to allow multiple models to operate together, creating seamless AI-powered experiences without compromising speed or cost-efficiency. This is where the company is doubling down—using the Series D capital to scale research into model performance, expand infrastructure, and invest heavily in customer success teams.

The broader implication is clear: inference is the new battleground of AI infrastructure, and Baseten has staked its claim. While hyperscalers like AWS, Google Cloud, and Azure will continue to compete for general compute, Baseten is carving out a specialized domain focused on high-performance, cost-optimized inference at scale. If the company succeeds, it will sit at the center of the AI economy much like Stripe did for online payments, creating a layer that nearly every fast-scaling AI application touches. For investors and customers alike, this fundraise is not just about Baseten’s growth, but about validating inference as the critical infrastructure layer of the AI age.

Filed Under: News

Reader Interactions

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Footer

Recent Posts

  • Anthropic’s Stainless Acquisition Is an Infrastructure Seizure Disguised as a Developer Tools Deal
  • Blackstone and Google Are Building an AI Infrastructure Giant Outside the Traditional Cloud Model
  • Mind Robotics Crosses $1B in Total Funding; Rivian Is the Quiet Disclosure
  • Quantum Motion Raises $160 Million Series C to Scale Silicon-Based Quantum Computing
  • Fazeshift Raises $17 Million Series A to Automate Accounts Receivable With Autonomous AI Agents
  • Instant Power Becomes the Next AI Infrastructure Battleground as Nyobolt Raises $60 Million
  • NVIDIA and Corning Expand U.S. Optical Manufacturing for AI Infrastructure
  • QuantWare Raises $178 Million Series B, Announces 10,000-Qubit Processor Architecture
  • Panthalassa Raises $140 Million to Power AI Computing with Ocean Waves
  • JEDEC Advances DDR5 MRDIMM Architecture With New MDB Standard and Next-Gen Memory Roadmap

Media Partners

  • Market Analysis
  • Cybersecurity Market
  • App Coding
The Productivity Is Already Here. The Bubble Narrative Is Not.
The Collingridge Dilemma
Why Memory Prices Won’t Come Down
The Bill Comes Due
The Software-Defined Camera Won. The Open OS Did Not.
Cars Are Computers Now, and Most Carmakers Aren’t
Gartner: Global IT Spending to Hit $6.31 Trillion in 2026, Driven by AI Infrastructure
The SDK Generator Benchmarks: Infrastructure vs. Convenience
Infographic: We Are Likely in the Early Stages of Another Productivity Boom
Infographic: Establishing the National Multimodal Freight Network
Salt Typhoon, Volt Typhoon, Flax Typhoon: China’s 2024 Campaign Against U.S. Infrastructure
Foreign Criminal Cyberattacks Against the United States: Ransomware, Botnets, and Financial Fraud
Iran’s Cyber Operations: Infrastructure Attacks, Election Interference, and IRGC Proxies
North Korea’s Cyber Program: From Sony to Blockchain Theft
Russia’s State Cyber Operations: From SolarWinds to Logistics Warfare
China’s Cyber Campaigns Against the United States: Two Decades of Documented Operations
How the U.S. Government Attributes Cyberattacks — and Why It Is Harder Than It Looks
Thirteen Years of Cyberattacks Against the United States: The CRS Record
Billington Critical Infrastructure CyberSecurity Summit, Nov. 17–18, 2026, San Antonio, Texas
ShinyHunters Breaches Canvas LMS, Threatening Data on 275 Million Users
DigitalOcean Launches AI-Native Cloud at Deploy 2026
Verdent Updates AI Platform to Function as a Full Engineering Team for Solo Builders
The Side Project App Is Not Dead. The Side Project App Business Is.
The App Monetization Landscape Has Changed and Most Teams Have Not Caught Up
Building Offline-First Mobile Apps Is Harder Than It Looks and Worth It
State Management in React Native Has Too Many Options and One Right Answer
Mobile Accessibility Is the Case Developers Keep Ignoring
Testing Mobile Apps at Scale Without Losing Your Mind
App Store Optimization in 2026 Is a Different Game Than It Was
Cross-Platform vs Native: The Honest Assessment Nobody Gives You

Media Partners

  • Market Research Media
  • Technology Conferences
  • API Coding
China’s U.S. Treasury Holdings: The Great Repositioning (2021–2025)
Infographic: Why the 2025 CIPA Data Proves the APS-C Renaissance is Real
How WiFi Changed Media
Canva Acquires Simtheory and Ortto to Build End-to-End Work Platform
Netflix Price Hikes, The Economics of Dominance in a Saturated Streaming Market
America’s Brands Keep Winning Even as America Itself Slips
Kioxia’s Storage Gambit: Flash Steps Into the AI Memory Hierarchy
Mamdani Strangling New York
The Rise of Faceless Creators: Picsart Launches Persona and Storyline for AI Character-Driven Content
Apple TV Arrives on The Roku Channel, Expanding the Streaming Platform Wars
D.A. Davidson Technology Conference, June 11, 2026, Nashville
Bank of America Global Technology Conference, June 4, 2026, San Francisco
William Blair Growth Stock Conference, June 3, 2026, Chicago
TD Cowen Technology, Media & Telecom Conference, May 27, 2026, New York
J.P. Morgan Global Technology, Media and Communications Conference, May 18–20, 2026, Boston
Technology Investor Conference Circuit, May–June 2026
Automate 2026 Sets Its Agenda Around AI’s Role in Industrial Transformation, June 22–25, 2026, McCormick Place in Chicago
IBM Think 2026, May 5–8, Boston, Massachusetts, USA
AI & Creativity Summit New York 2026, May 14, The Lighthouse Brooklyn
SEMICON Southeast Asia 2026, May 5–7, Kuala Lumpur
Why Private Domain Data Is the Real Key to AI That Actually Works
Orkes Raises $60M to Bring Production-Grade AI Orchestration to Enterprise Developers
Form.io Launches MCP Server and Agentic Coding Toolset for Governed Enterprise AI Development
Appdome Upgrades MobileBOT Defense With Identity-First Mobile API Protection
Five SDK Generators Compared: Speakeasy, Stainless, Fern, APIMatic, and OpenAPI Generator
API Monetization Models That Work and the Ones That Drive Developers Away
gRPC in Production: What the Documentation Doesn't Tell You
Event-Driven Architecture vs Request-Response: Choosing the Right Communication Pattern
The Business Case for Internal APIs That Most Engineering Leaders Ignore
Breaking Changes: How to Avoid Shipping Them and What to Do When You Must

Copyright © 2026 Technologies.org

Media Partners: Market Analysis · Market Research · Referently · Photography