AWS will support NVIDIA’s T4 GPUs focusing on intensive machine learning workloads
Amazon Web Services (AWS) will release its latest GPU-equipped instance with support for NVIDIA’s T4 Tensor Core GPUs with a particular focus on machine learning workloads.
The specs, while not fully there yet, promise so far AWS-custom Intel CPUs, up to 384 gibibytes (GiB) of memory, up to 1.8 TB of fast local NVMe storage, and up to 100 Gbps in networking capacity.
From NVIDIA’s side, T4 will be supported by Amazon Elastic Container Service for Kubernetes. “Because T4 GPUs are extremely efficient for AI inference, they are well-suited for companies that seek powerful, cost-efficient cloud solutions for deploying machine learning models into production,” Ian Buck, NVIDIA general manager and vice president of accelerated computing wrote in a blog post.
“NVIDIA and AWS have worked together for a long time to help customers run compute-intensive AI workloads in the cloud and create incredible new AI solutions,” added Matt Garman, AWS vice president of compute services in a statement. “With our new T4-based G4 instances, we’re making it even easier and more cost-effective for customers to accelerate their machine learning inference and graphics-intensive applications.”
The announcement came at NVIDIA’s GPU Technology Conference in San Jose alongside various other news. From the AWS stable it was also announced that the NVIDIA Jetson AI platform now supports robotics service AWS RoboMaker, while AI Playground, as reported by sister publication AI News, is an accessible online space for users to become familiar with deep learning experiences.
It’s worth noting that AWS is not the first company to score in this area. In November Google touted itself as the ‘first and only’ major cloud vendor to offer T4 compatibility. In January, the company announced its T4 GPU instances were available in beta across six countries.
Interested in hearing industry leaders discuss subjects like this and sharing their experiences and use-cases? Attend the Cyber Security & Cloud Expo World Series with upcoming events in Silicon Valley, London and Amsterdam to learn more.
- » Cloud providers are under attack - and sabotaged services will freeze operations
- » How the combination of cloud and AI is influencing IT investment strategy
- » Exploring the journey from cloud to AI - with a few big data bumps along the way
- » The six pillars of cloud cost optimisation – and how to get them to work for you
- » Microsoft and Oracle partner up to interconnect clouds – with retail customers cited