Anthropic’s upgraded Claude 3.5 Sonnet model and computer use now in Amazon Bedrock

Anthropic’s upgraded Claude 3.5 Sonnet model is now available in Amazon Bedrock. According to Anthropic, the model delivers across-the-board improvements over its predecessor, with significant gains in coding—an area where it already led the field.

The upgraded Claude 3.5 Sonnet model shows wide-ranging improvements on industry benchmarks. On coding the model improves performance on SWE-bench Verified from 33% to 49%, scoring higher than all publicly available models, according to Anthropic. It also improves performance on TAU-bench, an agentic tool use task, from 62.6% to 69.2% in the retail domain, and from 36.0% to 46.0% in the airline domain. The new Claude 3.5 Sonnet offers these advancements at the same price of its predecessor. Additionally, Claude 3.5 Sonnet now offers computer use capabilities in Amazon Bedrock in a public beta, allowing Claude to perceive and interact with computer interfaces. Developers can direct Claude to use computers the way people do—by looking at a screen, moving a cursor, clicking buttons, and typing text. Given this technology is early, developers are encouraged to explore lower-risk tasks.

The upgraded Claude 3.5 Sonnet model is now available in Amazon Bedrock in the US West (Oregon) Region. Computer use is now available in public beta. To learn more, read the AWS News launch blog, Claude in Amazon Bedrock product page, and documentation. To get started with Claude, visit the Amazon Bedrock console.

Source:: Amazon AWS