Artificial Intelligence News Feed

AWS Machine Learning Blog Official Machine Learning Blog of Amazon Web Services

  • Amazon SageMaker inference launches faster auto scaling for generative AI models
    by James Park on July 25, 2024 at 21:13

    Today, we are excited to announce a new capability in Amazon SageMaker inference that can help you reduce the time it takes for your generative artificial intelligence (AI) models to scale automatically. You can now use sub-minute metrics and significantly reduce overall scaling latency for generative AI models. With this enhancement, you can improve the

  • Find answers accurately and quickly using Amazon Q Business with the SharePoint Online connector
    by Vijai Gandikota on July 25, 2024 at 17:53

    Amazon Q Business is a fully managed, generative artificial intelligence (AI)-powered assistant that helps enterprises unlock the value of their data and knowledge. With Amazon Q, you can quickly find answers to questions, generate summaries and content, and complete tasks by using the information and expertise stored across your company’s various data sources and enterprise

  • Evaluate conversational AI agents with Amazon Bedrock
    by Sharon Li on July 25, 2024 at 17:47

    As conversational artificial intelligence (AI) agents gain traction across industries, providing reliability and consistency is crucial for delivering seamless and trustworthy user experiences. However, the dynamic and conversational nature of these interactions makes traditional testing and evaluation methods challenging. Conversational AI agents also encompass multiple layers, from Retrieval Augmented Generation (RAG) to function-calling mechanisms that

  • Node problem detection and recovery for AWS Neuron nodes within Amazon EKS clusters
    by Darren Lin on July 25, 2024 at 17:39

    In the post, we introduce the AWS Neuron node problem detector and recovery DaemonSet for AWS Trainium and AWS Inferentia on Amazon Elastic Kubernetes Service (Amazon EKS). This component can quickly detect rare occurrences of issues when Neuron devices fail by tailing monitoring logs. It marks the worker nodes in a defective Neuron device as unhealthy, and promptly replaces them with new worker nodes. By accelerating the speed of issue detection and remediation, it increases the reliability of your ML training and reduces the wasted time and cost due to hardware failure.

  • Mistral Large 2 is now available in Amazon Bedrock
    by Niithiyn Vijeaswaran on July 24, 2024 at 20:14

    Mistral AI’s Mistral Large 2 (24.07) foundation model (FM) is now generally available in Amazon Bedrock. Mistral Large 2 is the newest version of Mistral Large, and according to Mistral AI offers significant improvements across multilingual capabilities, math, reasoning, coding, and much more. In this post, we discuss the benefits and capabilities of this new

MIT News – Artificial intelligence MIT news feed about: Artificial intelligence

Google DeepMind Blog Read the latest articles and stories from DeepMind and find out more about our latest breakthroughs in cutting-edge AI research.