Artificial Intelligence News Feed

AWS Machine Learning Blog: Official Machine Learning Blog of Amazon Web Services

  • Supercharge your LLM performance with Amazon SageMaker Large Model Inference container v15
    by Vivek Gangasani on April 22, 2025 at 17:28

    Today, we’re excited to announce the launch of Amazon SageMaker Large Model Inference (LMI) container v15, powered by vLLM 0.8.4 with support for the vLLM V1 engine. This release introduces significant performance improvements and expanded model compatibility with multimodality (that is, the ability to understand and analyze text-to-text, image-to-text, and text-to-image data), and provides built-in integration with vLLM to help you deploy and serve large language models (LLMs) with high performance at scale.

  • Accuracy evaluation framework for Amazon Q Business – Part 2
    by Rui Cardoso on April 22, 2025 at 17:18

    In the first post of this series, we introduced a comprehensive evaluation framework for Amazon Q Business, a fully managed Retrieval Augmented Generation (RAG) solution that uses your company’s proprietary data without the complexity of managing large language models (LLMs). The first post focused on selecting appropriate use cases, preparing data, and implementing metrics to

  • Use Amazon Bedrock Intelligent Prompt Routing for cost and latency benefits
    by Shreyas Subramanian on April 22, 2025 at 17:15

    Today, we’re happy to announce the general availability of Amazon Bedrock Intelligent Prompt Routing. In this blog post, we detail highlights from our internal testing, explain how you can get started, and point out some caveats and best practices. We encourage you to incorporate Amazon Bedrock Intelligent Prompt Routing into your new and existing generative AI applications.

  • How Infosys improved accessibility for Event Knowledge using Amazon Nova Pro, Amazon Bedrock and Amazon Elemental Media Services
    by Aparajithan Vaidyanathan on April 22, 2025 at 17:12

    In this post, we explore how Infosys developed Infosys Event AI to unlock the insights generated from events and conferences. Through its suite of features—including real-time transcription, intelligent summaries, and an interactive chat assistant—Infosys Event AI makes event knowledge accessible and provides an immersive engagement solution for the attendees, during and after the event.

  • Amazon Bedrock Prompt Optimization Drives LLM Applications Innovation for Yuewen Group
    by Wang Rui on April 21, 2025 at 22:57

    Today, we are excited to announce the availability of Prompt Optimization on Amazon Bedrock. With this capability, you can now optimize your prompts for several use cases with a single API call or a click of a button on the Amazon Bedrock console. In this blog post, we discuss how Prompt Optimization improves the performance of large language models (LLMs) for intelligent text processing tasks at Yuewen Group.

MIT News – Artificial intelligence: MIT news feed about artificial intelligence

Google DeepMind Blog: Read the latest articles and stories from DeepMind and find out more about our latest breakthroughs in cutting-edge AI research.