AWS HPC Blog

Deploying Generative AI Applications with NVIDIA NIM Microservices on Amazon Elastic Kubernetes Service (Amazon EKS) – Part 2

Learn how to deploy AI models at scale with @AWS using NVIDIA’s NIM and Amazon EKS! This step-by-step guide shows you how to create a GPU cluster for inference in this second post of a two-part series!

Performance gains with AWS Graviton4 – a DevitoPRO case study

Performance gains with AWS Graviton4 – a DevitoPRO case study

This post was contributed by Gerard Gorman from Devito, and Cyril Lagrange, Gilles Tourpe, and Theo Wu from AWS The AWS Graviton4 processor represents a significant leap forward, with 96 Neoverse V2 cores and an enhanced memory subsystem. The 12 DDR5-5600 channels provide up to 75% more memory bandwidth than Graviton3 which is beneficial for […]