Categories: FAANG

SelfReflect: Can LLMs Communicate Their Internal Answer Distribution?

The common approach to communicate a large language model’s (LLM) uncertainty is to add a percentage number or a hedging word to its response. But is this all we can do? Instead of generating a single answer and then hedging it, an LLM that is fully transparent to the user needs to be able to reflect on its internal belief distribution and output a summary of all options it deems possible, and how likely they are. To test whether LLMs possess this capability, we develop the SelfReflect metric, an information-theoretic distance between a given summary and a distribution over answers. In…

Self-reflective Uncertainties: Do LLMs Know Their Internal Answer Distribution?

This paper was accepted at the Workshop on Reliable and Responsible Foundation Models (RRFMs) Workshop at ICML 2025. Uncertainty quantification plays a pivotal role when bringing large language models (LLMs) to end-users. Its primary goal is that an LLM should indicate when it is unsure about an answer it gives.…

July 10, 2025

In "FAANG"

Reducing Hallucinations with the Ontology in Palantir AIP

July 10, 2024

In "FAANG"

Thinking Outside the (Black) Box

August 13, 2024

In "FAANG"

AI Generated Robotic Content

Next The 3 Invisible Risks Every LLM App Faces (And How to Guard Against Them) »

Previous « Correcting the Record: Response to the EFF January 15, 2026 Report on Palantir

Published by

AI Generated Robotic Content

Tags: ai/mlfaang

4 months ago

Anima – Sharing Some Prompts and Results

Been experimenting with Anima lately and ended up spending way too much time refining prompts.…

7 hours ago

AI/ML News

Keychron K2 HE Concrete Edition Review: Rock-Solid Typing

Keychron's K2 HE Concrete Edition sounds like a cute gimmick, but as I discovered, there's…

8 hours ago

AI/ML News

AI generates full battery electrolyte recipes, matching top lithium metal battery performance

Battery electrolytes aren't just one chemical, but a complex mixture of salts, solvents, and additives…

8 hours ago

Image

Nava – A 6.3B audio-video model .

Page: https://ernie-research.github.io/NAVA/ Model: https://huggingface.co/ernie-research/NAVA Github: https://github.com/ernie-research/NAVA NAVA is a 6.3 B-parameter joint audio-video generator that…

1 day ago