Technologies.org

Technology Trends: Follow the Money

GPU Instances in the Cloud

October 26, 2017 By admin

Amazon Web Services, Inc. (AWS), an Amazon.com company (NASDAQ: AMZN), announced P3 instances, the next generation of Amazon Elastic Compute Cloud (Amazon EC2) GPU instances designed for compute-intensive applications that require massively parallel floating point performance, including machine learning, computational fluid dynamics, computational finance, seismic analysis, molecular modeling, genomics, and autonomous vehicle systems. The first instances to include NVIDIA Tesla V100 GPUs, P3 instances are, according to AWS, the most powerful GPU instances available in the cloud.

P3 instances allow customers to build and deploy advanced applications with up to 14 times better performance than previous-generation Amazon EC2 GPU compute instances, and reduce training of machine learning applications from days to hours. With up to eight NVIDIA Tesla V100 GPUs, P3 instances provide up to one petaflop of mixed-precision, 125 teraflops of single-precision, and 62 teraflops of double-precision floating point performance, as well as a 300 GB/s second-generation NVIDIA NVLink interconnect that enables high-speed, low-latency GPU-to-GPU communication. P3 instances also feature up to 64 vCPUs based on custom Intel Xeon E5 (Broadwell) processors, 488 GB of DRAM, and 25 Gbps of dedicated aggregate network bandwidth using the Elastic Network Adapter (ENA).
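As a rough check on the figures above, the aggregate numbers can be divided across the eight GPUs of the largest instance. This is a minimal sketch, assuming the quoted peaks scale linearly with GPU count and treating "one petaflop" as 1,000 teraflops; the function name and dictionary keys are illustrative, and the inputs are the rounded figures from the announcement rather than exact hardware specifications.

```python
# Aggregate peak figures quoted in the announcement for the eight-GPU size.
# "One petaflop" is taken as 1000 teraflops (an assumption about rounding).
AGGREGATE_TFLOPS = {"mixed": 1000.0, "single": 125.0, "double": 62.0}
GPU_COUNT = 8


def per_gpu_tflops(aggregate, gpu_count):
    """Divide each aggregate peak evenly across the GPUs (assumes linear scaling)."""
    return {precision: total / gpu_count for precision, total in aggregate.items()}


per_gpu = per_gpu_tflops(AGGREGATE_TFLOPS, GPU_COUNT)
for precision, tflops in per_gpu.items():
    print(f"{precision}-precision: {tflops:.3f} TFLOPS per GPU")
```

Because the inputs are rounded marketing figures, the per-GPU results are only approximate.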

“When we launched our P2 instances last year, we couldn’t believe how quickly people adopted them,” said Matt Garman, Vice President of Amazon EC2. “Most of the machine learning in the cloud today is done on P2 instances, yet customers continue to be hungry for more powerful instances. By offering up to 14 times better performance than P2 instances, P3 instances will significantly reduce the time involved in training machine learning models, providing agility for developers to experiment, and optimizing machine learning without requiring large investments in on-premises GPU clusters. In addition, high performance computing applications will benefit from up to 2.7 times improvement in double-precision floating point performance.”

AWS Deep Learning Amazon Machine Images (AMIs) are available in AWS Marketplace to help customers get started within minutes. The Deep Learning AMI comes preinstalled with the latest releases of Apache MXNet, Caffe2, and TensorFlow with support for Tesla V100 GPUs, and will be updated to support P3 instances with other machine learning frameworks, such as Microsoft Cognitive Toolkit and PyTorch, as soon as these frameworks release support for Tesla V100 GPUs. Customers can also use the NVIDIA Volta Deep Learning AMI, which integrates deep learning framework containers from NVIDIA GPU Cloud, or start with AMIs for Amazon Linux, Ubuntu 16.04, Windows Server 2012 R2, or Windows Server 2016.

With P3 instances, customers have the freedom to choose the optimal framework for their application. “We are excited to support Caffe2 on the new Amazon EC2 P3 instances. The unparalleled power and capability of P3 instances allow developers to train and run models very efficiently at high scale,” said Yangqing Jia, Research Scientist Manager at Facebook. “It will help new innovations get to customers in hours instead of days by taking advantage of the speed with P3 and our modular, scalable deep learning framework with Caffe2.”

Customers can launch P3 instances using the AWS Management Console, the AWS Command Line Interface (CLI), or the AWS SDKs. Amazon EC2 P3 instances are generally available in the US East (N. Virginia), US West (Oregon), EU West (Ireland), and Asia Pacific (Tokyo) regions, with support for additional regions coming soon. They are available in three sizes, with one, four, and eight GPUs, and can be purchased as On-Demand, Reserved, or Spot Instances.
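For readers who want to try the SDK route, the following is a minimal sketch using boto3, the AWS SDK for Python. The article does not name the instance types; `p3.2xlarge`, `p3.8xlarge`, and `p3.16xlarge` are the EC2 type names for the one-, four-, and eight-GPU sizes. The helper names, the default region, and the placeholder AMI ID are hypothetical, and the launch call needs valid AWS credentials and a real AMI ID to succeed.

```python
# Sketch only: helper names and the placeholder AMI ID are illustrative
# assumptions, not something the announcement specifies.
P3_TYPE_BY_GPU_COUNT = {1: "p3.2xlarge", 4: "p3.8xlarge", 8: "p3.16xlarge"}


def build_run_instances_params(gpu_count, image_id, spot=False):
    """Build keyword arguments for the EC2 RunInstances call."""
    if gpu_count not in P3_TYPE_BY_GPU_COUNT:
        raise ValueError(f"P3 sizes have 1, 4, or 8 GPUs, got {gpu_count}")
    params = {
        "ImageId": image_id,
        "InstanceType": P3_TYPE_BY_GPU_COUNT[gpu_count],
        "MinCount": 1,
        "MaxCount": 1,
    }
    if spot:
        # Request Spot capacity instead of On-Demand.
        params["InstanceMarketOptions"] = {"MarketType": "spot"}
    return params


def launch_p3(gpu_count, image_id, region="us-east-1", spot=False):
    """Launch one P3 instance and return its instance ID (requires AWS credentials)."""
    import boto3  # imported lazily so the helper above works without boto3 installed

    ec2 = boto3.client("ec2", region_name=region)
    response = ec2.run_instances(**build_run_instances_params(gpu_count, image_id, spot))
    return response["Instances"][0]["InstanceId"]


if __name__ == "__main__":
    # "ami-EXAMPLE" is a placeholder; substitute a real AMI ID before calling launch_p3.
    print(build_run_instances_params(8, "ami-EXAMPLE", spot=True))
```

Keeping the parameter construction separate from the API call makes it possible to inspect the request before anything is launched or billed.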

Filed Under: Tech Tagged With: Amazon EC2, GPU Acceleration, GPU Instances, P3 instances


Copyright © 2022 Technologies.org
