Evaluating Evaluation Metrics — The Mirage of Hallucination Detection

Hallucinations pose a significant obstacle to the reliability and widespread adoption of language models, yet their accurate measurement remains a persistent challenge. While many task- and domain-specific metrics have been proposed to assess faithfulness and factuality concerns, the robustness and generalization of these metrics are still untested. In this paper, we conduct a large-scale empirical …

Announcing new capabilities in Vertex AI Training for large-scale training

Building and scaling generative AI models demands enormous resources, but this process can get tedious. Developers wrestle with managing job queues, provisioning clusters, and resolving dependencies just to ensure consistent results. This infrastructure overhead, along with the difficulty of discovering the optimal training recipe and navigating the endless maze of hyperparameter and model architecture choices, …

Bias after Prompting: Persistent Discrimination in Large Language Models

A dangerous assumption that can be made from prior work on the bias transfer hypothesis (BTH) is that biases do not transfer from pre-trained large language models (LLMs) to adapted models. We invalidate this assumption by studying the BTH in causal models under prompt adaptations, as prompting is an extremely popular and accessible adaptation strategy …

Post-Training Generative Recommenders with Advantage-Weighted Supervised Finetuning

Author: Keertana Chidambaram, Qiuling Xu, Ko-Jen Hsiao, Moumita Bhattacharya (*The work was done when Keertana interned at Netflix.) Introduction This blog focuses on post-training generative recommender systems. Generative recommenders (GRs) represent a new paradigm in the field of recommendation systems (e.g. HSTU, OneRec). These models draw inspiration from recent advancements in transformer architectures used for …

Rooms from Motion: Un-posed Indoor 3D Object Detection as Localization and Mapping

We revisit scene-level 3D object detection as the output of an object-centric framework capable of both localization and mapping using 3D oriented boxes as the underlying geometric primitive. While existing 3D object detection approaches operate globally and implicitly rely on the a priori existence of metric camera poses, our method, Rooms from Motion (RfM) operates …

15deT4ysBlptxNf1o2H72nQ

Inside the AIPCon 8 Demos Transforming Manufacturing, Insurance, and Construction

Editor’s Note: This is the second in a two-part series highlighting demo sessions from AIPCon 8, Palantir’s most recent customer conference. In part one, we shared how partners across healthcare, retail, defense, and beyond are leveraging Palantir Foundry and AIP to tackle their most pressing operational challenges. AIPCon 8’s afternoon demo sessions showcased groundbreaking implementations …

ML 19257 comp arch v2

Responsible AI design in healthcare and life sciences

Generative AI has emerged as a transformative technology in healthcare, driving digital transformation in essential areas such as patient engagement and care management. It has shown potential to revolutionize how clinicians provide improved care through automated systems with diagnostic support tools that provide timely, personalized suggestions, ultimately leading to better health outcomes. For example, a …

5 ad agencies used Gemini 2.5 Pro and gen media models to create an “impossible ad”

The conversation around generative AI in the enterprise is getting creative.  Since launching our popular Nano Banana model, consumers have created 13 billion images and 230 million videos1. Enterprises can combine Gemini 2.5 Pro with our generative media models – Lyria, Chirp, Imagen, and Veo – to bring their ideas to life.  To us, generative …

1kvIufLu66YkOtk2QdGe2rQ

Acerca de Palantir

Respuestas a preguntas frecuentes sobre Palantir Hemos recibido muchas preguntas sobre Palantir. Dado el gran interés que despierta nuestra empresa, hemos recopilado las preguntas que nos hacen con más frecuencia y las respondemos en esta entrada del blog. ¿A qué se dedica Palantir? Palantir Technologies es una empresa de software que proporciona plataformas para accionar datos e …

image 1 6

Generate Gremlin queries using Amazon Bedrock models

Graph databases have revolutionized how organizations manage complex, interconnected data. However, specialized query languages such as Gremlin often create a barrier for teams looking to extract insights efficiently. Unlike traditional relational databases with well-defined schemas, graph databases lack a centralized schema, requiring deep technical expertise for effective querying. To address this challenge, we explore an …