How Apoidea Group enhances visual information extraction from banking documents with multimodal models using LLaMA-Factory on Amazon SageMaker HyperPod

This post is co-written with Ken Tsui, Edward Tsoi and Mickey Yip from Apoidea Group. The banking industry has long struggled with the inefficiencies associated with repetitive processes such as information extraction, document review, and auditing. These tasks, which require significant human resources, slow down critical operations such as Know Your Customer (KYC) procedures, loan …

Expanding our Risk Protection Program with new insurance partners and AI coverage

Today’s businesses have a vital need to manage and, when appropriate, transfer cyber risk in their cloud environments — even with robust security measures in place. At Google Cloud Next last month, we unveiled significant advancements to our Risk Protection Program, an industry-first collaboration between Google and leading cyber insurers that provides competitively priced cyber-insurance …

How to Become Immortal Using AI?

We all leave traces behind: emails, text messages, photos, voice notes. But what if you could go one step further? What if your loved ones could still talk to you after you’re gone? Thanks to advances in artificial intelligence, digital immortality is no longer the stuff of science fiction. It’s becoming technically possible to create custom AIs …

Cost-effective AI image generation with PixArt-Σ inference on AWS Trainium and AWS Inferentia

PixArt-Sigma is a diffusion transformer model that is capable of image generation at 4K resolution. This model shows significant improvements over previous-generation PixArt models like PixArt-Alpha and other diffusion models through dataset and architectural improvements. AWS Trainium and AWS Inferentia are purpose-built AI chips to accelerate machine learning (ML) workloads, making them ideal for …

Unlock software delivery excellence and quality with Gemini Code Assist agents

According to DORA’s latest research – the Impact of Generative AI in Software Development report – AI tools are making software developers feel more productive, focused, and satisfied. They’re even writing better code and documentation more quickly. But the research uncovered a paradox: these individual gains do not translate directly into improved system-level performance. In …

Securing Amazon Bedrock Agents: A guide to safeguarding against indirect prompt injections

Generative AI tools have transformed how we work, create, and process information. At Amazon Web Services (AWS), security is our top priority. Therefore, Amazon Bedrock provides comprehensive security controls and best practices to help protect your applications and data. In this post, we explore the security measures and practical strategies provided by Amazon Bedrock Agents …

Cool stuff customers built, May edition: Visual scouts, racing agents, agile ads & more

AI and cloud technology are reshaping every corner of every industry around the world. Without our customers, there would be no Google Cloud, as they are the ones building the future on our platform. In this monthly round-up, we dive into some of the exciting projects redefining businesses, shaping industries, and creating new categories. For …

StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant

We present StreamBridge, a simple yet effective framework that seamlessly transforms offline Video-LLMs into streaming-capable models. It addresses two fundamental challenges in adapting existing models into online scenarios: (1) limited capability for multi-turn real-time understanding, and (2) lack of proactive response mechanisms. Specifically, StreamBridge incorporates (1) a memory buffer combined with a round-decayed compression strategy, …

Build an intelligent community agent to revolutionize IT support with Amazon Q Business

In the era of AI and machine learning (ML), there is a growing emphasis on enhancing security, especially in IT contexts. In this post, we demonstrate how your organization can reduce the end-to-end burden of resolving regular challenges experienced by your IT support teams, from understanding errors and reviewing diagnoses, remediation steps, and relevant documentation, to …