AWS Neuron is the SDK for Amazon EC2 Inferentia and Trainium based instances purpose-built for generative AI. Today, with Neuron 2.16 release, we are announcing support for Llama-2 70b model inference on Inf2 instances.
Source:: Amazon AWS
AWS Neuron is the SDK for Amazon EC2 Inferentia and Trainium based instances purpose-built for generative AI. Today, with Neuron 2.16 release, we are announcing support for Llama-2 70b model inference on Inf2 instances.
Source:: Amazon AWS