AWS Machine Learning Blog: Official Machine Learning Blog of Amazon Web Services
- Supercharge your LLM performance with Amazon SageMaker Large Model Inference container v15 by Vivek Gangasani on April 22, 2025 at 17:28
Today, we’re excited to announce the launch of Amazon SageMaker Large Model Inference (LMI) container v15, powered by vLLM 0.8.4 with support for the vLLM V1 engine. This release introduces significant performance improvements and expanded model compatibility with multimodality (that is, the ability to understand and analyze text-to-text, image-to-text, and text-to-image data), and it provides built-in integration with vLLM to help you seamlessly deploy and serve large language models (LLMs) with the highest performance at scale.
- Accuracy evaluation framework for Amazon Q Business – Part 2 by Rui Cardoso on April 22, 2025 at 17:18
In the first post of this series, we introduced a comprehensive evaluation framework for Amazon Q Business, a fully managed Retrieval Augmented Generation (RAG) solution that uses your company’s proprietary data without the complexity of managing large language models (LLMs). The first post focused on selecting appropriate use cases, preparing data, and implementing metrics to
- Use Amazon Bedrock Intelligent Prompt Routing for cost and latency benefits by Shreyas Subramanian on April 22, 2025 at 17:15
Today, we’re happy to announce the general availability of Amazon Bedrock Intelligent Prompt Routing. In this blog post, we detail various highlights from our internal testing, explain how you can get started, and point out some caveats and best practices. We encourage you to incorporate Amazon Bedrock Intelligent Prompt Routing into your new and existing generative AI applications.
- How Infosys improved accessibility for Event Knowledge using Amazon Nova Pro, Amazon Bedrock and Amazon Elemental Media Services by Aparajithan Vaidyanathan on April 22, 2025 at 17:12
In this post, we explore how Infosys developed Infosys Event AI to unlock the insights generated from events and conferences. Through its suite of features—including real-time transcription, intelligent summaries, and an interactive chat assistant—Infosys Event AI makes event knowledge accessible and provides an immersive engagement solution for the attendees, during and after the event.
- Amazon Bedrock Prompt Optimization Drives LLM Applications Innovation for Yuewen Group by Wang Rui on April 21, 2025 at 22:57
Today, we are excited to announce the availability of Prompt Optimization on Amazon Bedrock. With this capability, you can now optimize your prompts for several use cases with a single API call or a click of a button on the Amazon Bedrock console. In this blog post, we discuss how Prompt Optimization improves the performance of large language models (LLMs) for intelligent text processing tasks at Yuewen Group.
MIT News – Artificial intelligence: MIT news feed about artificial intelligence
- 3D modeling you can feel by Adam Conner-Simons | MIT CSAIL on April 22, 2025 at 19:00
TactStyle, a system developed by CSAIL researchers, uses image prompts to replicate both the visual appearance and tactile properties of 3D models.
- Norma Kamali is transforming the future of fashion with AI by MIT Professional Education on April 22, 2025 at 18:00
The renowned designer embraces generative AI to preserve and propel her legacy.
- MIT’s McGovern Institute is shaping brain science and improving human lives on a global scale by Julie Pryor | McGovern Institute for Brain Research on April 18, 2025 at 14:40
A quarter century after its founding, the McGovern Institute reflects on its discoveries in the areas of neuroscience, neurotechnology, artificial intelligence, brain-body connections, and therapeutics.
- Making AI-generated code more accurate in any language by Adam Zewe | MIT News on April 18, 2025 at 04:00
A new technique automatically guides an LLM toward outputs that adhere to the rules of whatever programming language or other format is being used.
- A faster way to solve complex planning problems by Adam Zewe | MIT News on April 16, 2025 at 04:00
By eliminating redundant computations, a new data-driven method can streamline processes like scheduling trains, routing delivery drivers, or assigning airline crews.
Google DeepMind Blog: Read the latest articles and stories from DeepMind and find out more about our latest breakthroughs in cutting-edge AI research.
- Introducing Gemini 2.5 Flash on April 17, 2025 at 19:02
Gemini 2.5 Flash is our first fully hybrid reasoning model, giving developers the ability to turn thinking on or off.
- Generate videos in Gemini and Whisk with Veo 2 on April 15, 2025 at 17:00
Transform text-based prompts into high-resolution eight-second videos in Gemini Advanced and use Whisk Animate to turn images into eight-second animated clips.
- DolphinGemma: How Google AI is helping decode dolphin communication on April 14, 2025 at 17:00
DolphinGemma, a large language model developed by Google, is helping scientists study how dolphins communicate — and hopefully find out what they’re saying, too.
- Taking a responsible path to AGI on April 2, 2025 at 13:31
We’re exploring the frontiers of AGI, prioritizing technical safety, proactive risk assessment, and collaboration with the AI community.
- Evaluating potential cybersecurity threats of advanced AI on April 2, 2025 at 13:30
Our framework enables cybersecurity experts to identify which defenses are necessary and how to prioritize them.