“By opting in, developers no longer have to spend time and effort predicting demand fluctuations,” the company wrote in a blog post.
“Moreover, this capability prioritizes the connected Amazon Bedrock API source/primary region when possible, helping to minimize latency and improve responsiveness. As a result, customers can enhance their applications’ reliability, performance, and efficiency,” it added.
Developers can start using cross-inferencing by either APIs or the Bedrock AWS console to define the primary region and the set of secondary regions where the requests will flow in case of traffic spikes.