Efficient Multimodal Neural Networks for Trigger-less Voice Assistants

The adoption of multimodal interactions by Voice Assistants (VAs) is growing rapidly to enhance human-computer interactions. Smartwatches have now incorporated trigger-less methods of invoking VAs, such as Raise To Speak (RTS), where the user raises their watch and speaks to VAs without an explicit trigger. Current state-of-the-art RTS systems rely on heuristics and engineered Finite …

VisualCaptions

Visual captions: Using large language models to augment video conferences with dynamic visuals

Posted by Ruofei Du, Research Scientist, and Alex Olwal, Senior Staff Research Scientist, Google Augmented Reality Recent advances in video conferencing have significantly improved remote video communication through features like live captioning and noise cancellation. However, there are various situations where dynamic visual augmentation would be useful to better convey complex and nuanced information. For …

image 10

Build high-performance ML models using PyTorch 2.0 on AWS – Part 1

PyTorch is a machine learning (ML) framework that is widely used by AWS customers for a variety of applications, such as computer vision, natural language processing, content creation, and more. With the recent PyTorch 2.0 release, AWS customers can now do same things as they could with PyTorch 1.x but faster and at scale with …

Climate Cardinals: Bridging the climate information gap with AI-powered translations

Editor’s note: On a trip to visit family in Iran, Sophia Kianni made an alarming observation. Despite facing the disproportionate impacts of climate change, and with temperatures in the Middle East rising twice as fast as the global average, her relatives knew almost nothing about the world’s environmental challenges. When she realized that scientific literature …

software

Fish-Farming Startup Casts AI to Make Aquaculture More Efficient, Sustainable

As a marine biology student, Josef Melchner always dreamed of spending his days cruising the oceans to find dolphins, whales and fish — but also “wanted to do something practical, something that would benefit the world,” he said. When it came time to choose a career, he dove head first into aquaculture. He’s now CEO …

New AI boosts teamwork training

Researchers have developed a new artificial intelligence (AI) framework that is better than previous technologies at analyzing and categorizing dialogue between individuals, with the goal of improving team training technologies. The framework will enable training technologies to better understand how well individuals are coordinating with one another and working as part of a team.

A Gentle Introduction to Hallucinations in Large Language Models

Last Updated on June 6, 2023 Large Language Models (LLMs) are known to have “hallucinations.” This is a behavior in that the model speaks false knowledge as if it is accurate. In this post, you will learn why hallucinations are a nature of an LLM. Specifically, you will learn: Why LLMs hallucinate How to make …