• Skip to main content
  • Skip to secondary menu
  • Skip to footer

Technologies.org

Technology Trends: Follow the Money

  • Technology Events 2026-2027
  • Sponsored Post
  • Technology Markets
  • About
    • GDPR
  • Contact

SC19: NVIDIA Announces Scalable GPU-Accelerated Supercomputer in the Microsoft Azure Cloud

November 19, 2019 By admin Leave a Comment

New Microsoft Azure NDv2 Supersized Instance Can Scale to Hundreds of Interconnected NVIDIA Tensor Core GPUs for Complex AI and High Performance Computing Applications

NVIDIA today announced the availability of a new kind of GPU-accelerated supercomputer in the cloud on Microsoft Azure.

Built to handle the most demanding AI and high performance computing applications, the largest deployments of Azure’s new NDv2 instance rank among the world’s fastest supercomputers, offering up to 800 NVIDIA V100 Tensor Core GPUs interconnected on a single Mellanox InfiniBand backend network. It enables customers for the first time to rent an entire AI supercomputer on demand from their desk, and match the capabilities of large-scale, on-premises supercomputers that can take months to deploy.

“Until now, access to supercomputers for AI and high performance computing has been reserved for the world’s largest businesses and organizations,” said Ian Buck, vice president and general manager of Accelerated Computing at NVIDIA. “Microsoft Azure’s new offering democratizes AI, giving wide access to an essential tool needed to solve some of the world’s biggest challenges.”

Girish Bablani, corporate vice president of Azure Compute at Microsoft Corp., added, “As cloud computing gains momentum everywhere, customers are seeking more powerful services. Working with NVIDIA, Microsoft is giving customers instant access to a level of supercomputing power that was previously unimaginable, enabling a new era of innovation.”

Dramatic Performance, Cost Benefits
The new offering — which is ideal for complex AI, machine learning and HPC workloads — can provide dramatic performance and cost advantages over traditional CPU-based computing. AI researchers needing fast solutions can quickly spin up multiple NDv2 instances and train complex conversational AI models in just hours.

Microsoft and NVIDIA engineers used 64 NDv2 instances on a pre-release version of the cluster to train BERT, a popular conversational AI model, in roughly three hours. This was achieved in part by taking advantage of multi-GPU optimizations provided by NCCL, an NVIDIA CUDA X™ library and high-speed Mellanox interconnects.

Customers can also see benefits from using multiple NDv2 instances to run complex HPC workloads, such as LAMMPS, a popular molecular dynamics application used to simulate materials down to the atomic scale in such areas as drug development and discovery. A single NDv2 instance can deliver an order of magnitude faster results than a traditional HPC node without GPU acceleration for specific types of applications, such as deep learning. This performance can scale linearly to a hundred instances for large-scale simulations.

All NDv2 instances benefit from the GPU-optimized HPC applications, machine learning software and deep learning frameworks like TensorFlow, PyTorch and MXNet from the NVIDIA NGC container registry and Azure Marketplace. The registry also offers Helm charts to easily deploy the AI software on Kubernetes clusters.

Availability and Pricing
NDv2 is available now in preview. One instance with eight NVIDIA V100 GPUs can be clustered to scale up to a variety of workload demands. See more details here.

About NVIDIA
NVIDIA’s (NASDAQ: NVDA) invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots and self-driving cars that can perceive and understand the world. More information at http://nvidianews.nvidia.com/.

Filed Under: Tech Tagged With: GPU-accelerated supercomputer

Reader Interactions

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Footer

Recent Posts

  • Apple’s Next-Generation Apple Intelligence Is Built on Google’s Gemini Models
  • Itera Emerges From Stealth With Fluid Circuit Board That Rewires in Under a Minute
  • Quantum Computing Stocks Are Down. They Are Not at the Bottom.
  • The Humanoid Trap: Form Factor as Distraction in Industrial Robotics
  • Hark Raises $700M Series A at $6B: The Vertical Integration Bet on Personal AI
  • Apple Brings Apple Intelligence to Accessibility, Adds Wheelchair Eye Control for Vision Pro
  • RADAR Raises $170M to Bring Real-Time Inventory Intelligence to Physical Retail
  • Anthropic’s Stainless Acquisition Is an Infrastructure Seizure Disguised as a Developer Tools Deal
  • Blackstone and Google Are Building an AI Infrastructure Giant Outside the Traditional Cloud Model
  • Mind Robotics Crosses $1B in Total Funding; Rivian Is the Quiet Disclosure

Media Partners

  • Market Analysis
  • Cybersecurity Market
  • App Coding
The Repricing and the Drain: How SpaceX, OpenAI, and Anthropic Rewire the Index
Quantum Computing Equities: Market Segment Memo
Quantum Computing Stocks Face Violent Selloff the Moment Markets Reopen Tuesday
The $2.6 Trillion Signal: What Gartner’s AI Spending Forecast Actually Tells You
The Productivity Is Already Here. The Bubble Narrative Is Not.
The Collingridge Dilemma
Why Memory Prices Won’t Come Down
The Bill Comes Due
The Software-Defined Camera Won. The Open OS Did Not.
Cars Are Computers Now, and Most Carmakers Aren’t
Google’s $32 Billion Wiz Bet Meets the OT Grid: Hitachi Becomes Its Critical-Infrastructure Channel
Cybersecurity Stocks Fall Friday as Nasdaq’s 4.2% Tech Rout Sweeps Up CrowdStrike and Palo Alto
IdentityTheft.org Sells for $30,000 on Sedo
Infosecurity Europe 2026, June 2–4, London
Ocean Launches From Stealth With $28 Million to Reinvent Email Security Using AI Agents
Salt Typhoon, Volt Typhoon, Flax Typhoon: China’s 2024 Campaign Against U.S. Infrastructure
Foreign Criminal Cyberattacks Against the United States: Ransomware, Botnets, and Financial Fraud
Iran’s Cyber Operations: Infrastructure Attacks, Election Interference, and IRGC Proxies
North Korea’s Cyber Program: From Sony to Blockchain Theft
Russia’s State Cyber Operations: From SolarWinds to Logistics Warfare
DigitalOcean Launches AI-Native Cloud at Deploy 2026
Verdent Updates AI Platform to Function as a Full Engineering Team for Solo Builders
The Side Project App Is Not Dead. The Side Project App Business Is.
The App Monetization Landscape Has Changed and Most Teams Have Not Caught Up
Building Offline-First Mobile Apps Is Harder Than It Looks and Worth It
State Management in React Native Has Too Many Options and One Right Answer
Mobile Accessibility Is the Case Developers Keep Ignoring
Testing Mobile Apps at Scale Without Losing Your Mind
App Store Optimization in 2026 Is a Different Game Than It Was
Cross-Platform vs Native: The Honest Assessment Nobody Gives You

Media Partners

  • Market Research Media
  • Technology Conferences
  • API Coding
Tuesday Open: AI Earnings Engine Holds the Line as Iran Overhang Fades to Noise
China’s U.S. Treasury Holdings: The Great Repositioning (2021–2025)
Infographic: Why the 2025 CIPA Data Proves the APS-C Renaissance is Real
How WiFi Changed Media
Canva Acquires Simtheory and Ortto to Build End-to-End Work Platform
Netflix Price Hikes, The Economics of Dominance in a Saturated Streaming Market
America’s Brands Keep Winning Even as America Itself Slips
Kioxia’s Storage Gambit: Flash Steps Into the AI Memory Hierarchy
Mamdani Strangling New York
The Rise of Faceless Creators: Picsart Launches Persona and Storyline for AI Character-Driven Content
WWDC 2026 Keynote, June 8, 2026, Apple Park, Cupertino
Baird 2026 Global Consumer, Technology & Services Conference, June 2–4, New York
D.A. Davidson Technology Conference, June 11, 2026, Nashville
Bank of America Global Technology Conference, June 4, 2026, San Francisco
William Blair Growth Stock Conference, June 3, 2026, Chicago
TD Cowen Technology, Media & Telecom Conference, May 27, 2026, New York
J.P. Morgan Global Technology, Media and Communications Conference, May 18–20, 2026, Boston
Technology Investor Conference Circuit, May–June 2026
Automate 2026 Sets Its Agenda Around AI’s Role in Industrial Transformation, June 22–25, 2026, McCormick Place in Chicago
IBM Think 2026, May 5–8, Boston, Massachusetts, USA
Why Private Domain Data Is the Real Key to AI That Actually Works
Orkes Raises $60M to Bring Production-Grade AI Orchestration to Enterprise Developers
Form.io Launches MCP Server and Agentic Coding Toolset for Governed Enterprise AI Development
Appdome Upgrades MobileBOT Defense With Identity-First Mobile API Protection
Five SDK Generators Compared: Speakeasy, Stainless, Fern, APIMatic, and OpenAPI Generator
API Monetization Models That Work and the Ones That Drive Developers Away
gRPC in Production: What the Documentation Doesn't Tell You
Event-Driven Architecture vs Request-Response: Choosing the Right Communication Pattern
The Business Case for Internal APIs That Most Engineering Leaders Ignore
Breaking Changes: How to Avoid Shipping Them and What to Do When You Must

Copyright © 2026 Technologies.org

Media Partners: Market Analysis · Market Research · Referently · Photography