• Skip to main content
  • Skip to secondary menu
  • Skip to footer

Technologies.org

Technology Trends: Follow the Money

  • Technology Events 2026-2027
  • Sponsored Post
  • Technology Markets
  • About
    • GDPR
  • Contact

SC19: NVIDIA Announces Scalable GPU-Accelerated Supercomputer in the Microsoft Azure Cloud

November 19, 2019 By admin Leave a Comment

New Microsoft Azure NDv2 Supersized Instance Can Scale to Hundreds of Interconnected NVIDIA Tensor Core GPUs for Complex AI and High Performance Computing Applications

NVIDIA today announced the availability of a new kind of GPU-accelerated supercomputer in the cloud on Microsoft Azure.

Built to handle the most demanding AI and high performance computing applications, the largest deployments of Azure’s new NDv2 instance rank among the world’s fastest supercomputers, offering up to 800 NVIDIA V100 Tensor Core GPUs interconnected on a single Mellanox InfiniBand backend network. It enables customers for the first time to rent an entire AI supercomputer on demand from their desk, and match the capabilities of large-scale, on-premises supercomputers that can take months to deploy.

“Until now, access to supercomputers for AI and high performance computing has been reserved for the world’s largest businesses and organizations,” said Ian Buck, vice president and general manager of Accelerated Computing at NVIDIA. “Microsoft Azure’s new offering democratizes AI, giving wide access to an essential tool needed to solve some of the world’s biggest challenges.”

Girish Bablani, corporate vice president of Azure Compute at Microsoft Corp., added, “As cloud computing gains momentum everywhere, customers are seeking more powerful services. Working with NVIDIA, Microsoft is giving customers instant access to a level of supercomputing power that was previously unimaginable, enabling a new era of innovation.”

Dramatic Performance, Cost Benefits
The new offering — which is ideal for complex AI, machine learning and HPC workloads — can provide dramatic performance and cost advantages over traditional CPU-based computing. AI researchers needing fast solutions can quickly spin up multiple NDv2 instances and train complex conversational AI models in just hours.

Microsoft and NVIDIA engineers used 64 NDv2 instances on a pre-release version of the cluster to train BERT, a popular conversational AI model, in roughly three hours. This was achieved in part by taking advantage of multi-GPU optimizations provided by NCCL, an NVIDIA CUDA X™ library and high-speed Mellanox interconnects.

Customers can also see benefits from using multiple NDv2 instances to run complex HPC workloads, such as LAMMPS, a popular molecular dynamics application used to simulate materials down to the atomic scale in such areas as drug development and discovery. A single NDv2 instance can deliver an order of magnitude faster results than a traditional HPC node without GPU acceleration for specific types of applications, such as deep learning. This performance can scale linearly to a hundred instances for large-scale simulations.

All NDv2 instances benefit from the GPU-optimized HPC applications, machine learning software and deep learning frameworks like TensorFlow, PyTorch and MXNet from the NVIDIA NGC container registry and Azure Marketplace. The registry also offers Helm charts to easily deploy the AI software on Kubernetes clusters.

Availability and Pricing
NDv2 is available now in preview. One instance with eight NVIDIA V100 GPUs can be clustered to scale up to a variety of workload demands. See more details here.

About NVIDIA
NVIDIA’s (NASDAQ: NVDA) invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots and self-driving cars that can perceive and understand the world. More information at http://nvidianews.nvidia.com/.

Filed Under: Tech Tagged With: GPU-accelerated supercomputer

Reader Interactions

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Footer

Recent Posts

  • Dify Raises $30 Million to Power the Next Wave of Production AI Applications
  • Nscale’s $2 Billion Bet on the Physical Backbone of the AI Economy
  • Why USB-C Charging on the MacBook Neo Raises Questions About Port Durability
  • MagSafe Wireless Charging: The Magnetic Reinvention of Power
  • Apple Unveils MacBook Neo: A $599 Entry Into the Mac Ecosystem
  • Apple Unveils M5 Pro and M5 Max: A New Era for MacBook Pro, MacBook Air, and Studio Display
  • Apple iPhone 17e: Performance, Practicality, and a Smarter Entry Point into the iPhone 17 Family
  • Apple iPad Air M4 Arrives With 12GB Memory, Wi-Fi 7, and a Serious AI Push
  • Ericsson and Intel Are Redefining What 6G Is Actually For
  • Hollow-Core Fibre, Light Running Through Air Instead of Glass

Media Partners

  • Market Analysis
  • Cybersecurity Market
Memory Crunch: Why Prices Are Surging and Why Making More Memory Isn’t Easy
The End of Accounting as We Knew It
The Era of Superhuman Logistics Has Arrived: Building the First Autonomous Freight Network
Why Nvidia Shares Jumped on Meta, and Why the Market Cared
Accrual Launches With $75M to Push AI-Native Automation Into Core Accounting Workflows
Europe’s Digital Sovereignty Moment, or How Regulation Became a Competitive Handicap
Palantir Q4 2025: From Earnings Beat to Model Re-Rating
Baseten Raises $300M to Dominate the Inference Layer of AI, Valued at $5B
Nvidia’s China Problem Is Self-Inflicted, and Washington Should Stop Pretending Otherwise
USPS and the Theater of Control: How Government Freezes Failure in Place
Day Zero Threat Research Summit, August 30 – September 1, 2026, Las Vegas
CrowdStrike Returns to Profit as Revenue Climbs to $1.31 Billion in Q4
Cloudflare 2026 Threat Report Signals the Automation of Cyberwar
Fal.Con Gov 2026, March 18, Washington, D.C.
Huper Corporation Raises $1.5M Pre-Seed to Build a Security-First AI Chief of Staff
CyberBay Summit 2026, March 11–13, Tampa, Florida
Zscaler’s Q2 Beat and the Market’s Reluctance to Celebrate
AI as the New Insider: Why Trust, Not Code, Is Now the Weakest Link
Cybersecurity Meets Corporate Travel: Darktrace Chooses AI-Driven Navan to Power Global Mobility
Black Hat Asia 2026, April 21–24, Singapore

Media Partners

  • Market Research Media
  • Technology Conferences
The Rise of Faceless Creators: Picsart Launches Persona and Storyline for AI Character-Driven Content
Apple TV Arrives on The Roku Channel, Expanding the Streaming Platform Wars
Why Attraction-Grabbing Stations Win at Tech Events
Why Nvidia Let Go of Arm, and Why It Matters Now
When the Market Wants a Story, Not Numbers: Rethinking AMD’s Q4 Selloff
BBC and the Gaza War: How Disproportionate Attention Reshapes Reality
Parallel Museums: Why the Future of Art Might Be Copies, Not Originals
ClickHouse Series D, The $400M Bet That Data Infrastructure, Not Models, Will Decide the AI Era
AI Productivity Paradox: When Speed Eats Its Own Gain
Voice AI as Infrastructure: How Deepgram Signals a New Media Market Segment
COMPUTEX 2026, June 2–5, Taipei
360° Mobility Mega Shows 2026, April 14–17, Taipei
Forrester CX Summit Series 2026: Amsterdam, New York, San Francisco
IAMPHENOM 2026, March 10–12, Pennsylvania Convention Center, Philadelphia
Billington State and Local CyberSecurity Summit, March 9–11, 2026, Washington, D.C.
Mobile World Congress (MWC) 2026 – 2–5 March, Barcelona, Spain
The AI Summit London, 10–11 June 2026, Tobacco Dock, London
aim10x Digital 2026, March 18, Virtual
Harvard Business Review Strategy Summit, February 26, 2026, Virtual
International Compact Modeling Conference, July 30–31, 2026, Long Beach, California

Copyright © 2022 Technologies.org

Media Partners: Market Analysis & Market Research and Exclusive Domains, Photography