LiveBench is an open LLM benchmark that uses contamination-free test data and objective scoring

AI-generated image of a robot sitting at a computer running tests.


Yann LeCun and other researchers have developed LiveBench, an open AI benchmark evaluating models using challenging, contamination-free test data.