Categories: FAANG

Over-Searching in Search-Augmented Large Language Models

Search-augmented large language models (LLMs) excel at knowledge-intensive tasks by integrating external retrieval.
However, they often over-search – unnecessarily invoking search tool even when it does not improve response quality,
which leads to computational inefficiency and hallucinations by incorporating irrelevant context. In this work, we conduct a
systematic evaluation of over-searching across multiple dimensions, including query types, model categories, retrieval
conditions, and multi-turn conversations. Our finding shows: (i) search generally improves answer accuracy on…

Unifying Ranking and Generation in Query Auto-Completion via Retrieval-Augmented Generation and Multi-Objective Alignment

Query Auto-Completion (QAC) is a critical feature of modern search systems that improves search efficiency by suggesting completions as users type. However, existing approaches face fundamental challenges: traditional retrieve-and-rank pipelines have poor long-tail coverage and require extensive feature engineering, while recent generative methods suffer from hallucination and safety risks. We…

February 19, 2026

In "FAANG"

Semantic Mastery: Enhancing LLMs with Advanced Natural Language Understanding

Large language models (LLMs) have greatly improved their capability in performing NLP tasks. However, deeper semantic understanding, contextual coherence, and more subtle reasoning are still difficult to obtain. The paper discusses state-of-the-art methodologies that advance LLMs with more advanced NLU techniques, such as semantic parsing, knowledge integration, and contextual reinforcement…

December 10, 2025

In "FAANG"

IBM watsonx Assistant: Driving generative AI innovation with Conversational Search

Generative AI has taken the business world by storm. Organizations around the world are trying to understand the best way to harness these exciting new developments in AI while balancing the inherent risks of using these models in an enterprise context at scale. Whether its concerns over hallucination, traceability, training…

October 11, 2023

In "FAANG"

AI Generated Robotic Content