TASER: Translation Assessment via Systematic Evaluation and Reasoning

We introduce TASER (Translation Assessment via Systematic Evaluation and Reasoning), a metric that uses Large Reasoning Models (LRMs) for automated translation quality assessment. TASER harnesses the explicit reasoning capabilities of LRMs to conduct systematic, step-by-step evaluation of translation quality. We evaluate TASER on the WMT24 Metrics Shared Task across both reference-based and reference-free scenarios, demonstrating …

Inside the AIPCon 8 Demos Redefining the Future of Enterprise AI

Editor’s Note: AIPCon 8, Palantir’s most recent customer conference, featured breakthrough customer implementations that demonstrate what’s possible with enterprise AI today. In part one of this two-part series, we share highlights from the afternoon’s standout demo sessions. Those who attended AIPCon 8 all walked away with a shared experience — seeing firsthand the power of transformative AI …

Enhance agentic workflows with enterprise search using Kore.ai and Amazon Q Business

This post was written with Meghana Chintalapudi and Surabhi Sankhla of Kore.ai. As organizations struggle with exponentially growing volumes of data distributed across multiple repositories and applications, employees lose significant time—approximately 30% according to the International Data Corporation (IDC)—searching for information that could be spent on higher-value work. The complexity of modern enterprise data networks …

Building on the bananas momentum of generative media models on Google Cloud

It’s been exciting to see the capabilities of Nano Banana, our latest image editing model available in Gemini 2.5 Flash Image, go viral. And with transformative workflows like these, it is easy to see why: iterative refinement with Gemini 2.5 Flash Image, context-aware conversational editing with …

How Hapag-Lloyd improved schedule reliability with ML-powered vessel schedule predictions using Amazon SageMaker

This post is cowritten with Thomas Voss and Bernhard Hersberger from Hapag-Lloyd. Hapag-Lloyd is one of the world’s leading shipping companies with more than 308 modern vessels, 11.9 million TEUs (twenty-foot equivalent units) transported per year, and 16,700 motivated employees in more than 400 offices in 139 countries. They connect continents, businesses, and people through …

Gemini CLI extension for PostgreSQL in action: Build a fuzzy search feature in minutes

Adding features to an app can be hard. One minute you’re writing code, the next you’re switching to the PostgreSQL database client to run a query, and then it’s over to the console to check on your instances. For example, let’s say you wanted to add search capabilities. This can mean adding the right extensions …