Today, AWS announces new Amazon Elastic Compute Cloud (Amazon EC2) P5 instance size with one NVIDIA H100 GPU that allows businesses to right-size their machine learning (ML) and high-performance computing (HPC) resources with cost-effectiveness.
The new instance size enables customers to start small and scale in granular increments, providing more flexible control over infrastructure costs. Customers developing small to medium Large Language Models (LLMs) such as chatbots or specialized language translation tools can now run inference tasks more economically. Customers can also use these instances to deploy HPC applications for pharmaceutical discovery, fluid flow analysis, and financial modeling without committing to expensive, large-scale GPU deployments.
P5.4xlarge instances are now available through Amazon EC2 Capacity Blocks for ML in the following AWS Regions: US East (North Virginia, Ohio), US West (Oregon), Europe (London), Asia Pacific (Mumbai, Sydney, Tokyo) and South America (Sao Paulo) regions. These instances can be purchased On-Demand, Spot or through Savings Plans in Europe (London), Asia Pacific (Mumbai, Jakarta, Tokyo), and South America (Sao Paulo) regions.
To learn more about P5.4xlarge instances, visit Amazon EC2 P5 instances.
Source:: Amazon AWS