Benchmarking framework reveals major safety risks of using AI in lab experiments
While artificial intelligence (AI) models have proved useful in some areas of science, like predicting 3D protein structures, a new study shows that it should not yet be trusted in many lab experiments. The study, published in Nature Machine Intelligence, revealed that all of the large-language models (LLMs) and vision-language models (VLMs) tested fell short on lab safety knowledge. Overtrusting these AI models for help in lab experiments can put researchers at risk.
A new University of California San Diego study unveils the first empirical evidence that a modern artificial intelligence system can pass the Turing test—a major scientific benchmark that asks whether a machine can imitate human conversation so convincingly that people can't reliably tell it apart from a real person. In…
As artificial intelligence models become increasingly prevalent and are integrated into diverse sectors like health care, finance, education, transportation, and entertainment, understanding how they work under the hood is critical. Interpreting the mechanisms underlying AI models enables us to audit them for safety and biases, with the potential to deepen…
An international study team, led by Flinders University in collaboration with Khalifa University UAE, built the machine-learning platform to act like a "smart materials discovery engine," which is capable of dramatically reducing the time spent on complex computer or lab experiments to test and find new materials for future semiconductors.