Our next-generation model: Gemini 1.5
The model delivers dramatically enhanced performance, with a breakthrough in long-context understanding across modalities.
The model delivers dramatically enhanced performance, with a breakthrough in long-context understanding across modalities.
We explore large-scale training of generative models on video data. Specifically, we train text-conditional diffusion models jointly on videos and images of variable durations, resolutions and aspect ratios. We leverage a transformer architecture that operates on spacetime patches of video and image latent codes. Our largest model, Sora, is capable of generating a minute of …
You’re headed to your favorite drive-thru to grab fries and a cheeseburger. It’s a simple order and as you pull in you notice there isn’t much of a line. What could possibly go wrong? Plenty. The restaurant is near a busy freeway with roaring traffic noise and airplanes fly low overhead as they approach the …
Read more “Unveiling the transformative AI technology behind watsonx Orders”
With the use of cloud computing, big data and machine learning (ML) tools like Amazon Athena or Amazon SageMaker have become available and useable by anyone without much effort in creation and maintenance. Industrial companies increasingly look at data analytics and data-driven decision-making to increase resource efficiency across their entire portfolio, from operations to performing …
Read more “Detect anomalies in manufacturing data using Amazon SageMaker Canvas”
In December, Google announced Gemini, our most capable and general model yet. Since December, select customers like Samsung and Palo Alto Networks have been building sophisticated AI agents with Gemini models in Vertex AI, unlocking new levels of productivity, personalized learning, and more for their users. Today, we’re bringing more Gemini models to our customers …
Read more “Google Cloud expands access to Gemini models for Vertex AI customers”
Posted by Nishant Jain, Pre-doctoral Researcher, and Pradeep Shenoy, Research Scientist, Google Research The constantly changing nature of the world around us poses a significant challenge for the development of AI models. Often, models are trained on longitudinal data with the hope that the training data used will accurately represent inputs the model may receive …
Read more “Learning the importance of training data under concept drift”
How data-driven acquisition identified over $90M in savings for VA in 6 months Editor’s Note: This post was written jointly with the U.S. Department of Veterans Affairs. Intro The U.S. Department of Veterans Affairs (VA) is one of the largest integrated health care systems in the United States, providing care at 1,298 health care facilities to over …
We terminated accounts associated with state-affiliated threat actors. Our findings show our models offer only limited, incremental capabilities for malicious cybersecurity tasks.
When thinking of artificial intelligence (AI) use cases, the question might be asked: What won’t AI be able to do? The easy answer is mostly manual labor, although the day might come when much of what is now manual labor will be accomplished by robotic devices controlled by AI. But right now, pure AI can …
Posted by Nishant Jain, Pre-doctoral Researcher, and Pradeep Shenoy, Research Scientist, Google Research The constantly changing nature of the world around us poses a significant challenge for the development of AI models. Often, models are trained on longitudinal data with the hope that the training data used will accurately represent inputs the model may receive …
Read more “Learning the importance of training data under concept drift”