Amazon SageMaker Inference Recommender (IR) helps customers select the best instance type and configuration (such as instance count, container parameters, and model optimizations) for deploying their ML models on SageMaker. Today, we are announcing deeper integration with Amazon CloudWatch for logs and metrics, python SDK support for running IR jobs, enabling customers to run IR jobs within a VPC subnet of their choice, support for running load tests on existing endpoint via a new API, and several usability improvements for easily getting started with IR.
Source:: Amazon AWS