Evaluating Scientific Understanding and Creativity in LLMs

DIEP seminar by Merijn Moody

In this edition of the DIEP seminar series, Merijn Moody will present his work on LLMs. Merijn joined DIEP in October 2024 as a FAEME PhD student. He is currently working on the development of a mathematical framework to identify and characterize emergent information structures in multivariate discrete data.

Date: 19 February 2026
Time: 11:00-12:00
Location: Oude Turfmarkt 145-147
Room: Second-floor library

Title: Evaluating Scientific Understanding and Creativity in LLMs

Abstract:

A major debate in the philosophy of Artificial Intelligence is whether LLMs and other AI systems are merely "stochastic parrots" repeating training data, or whether they are capable of genuine creativity and understanding. While Large Language Models are increasingly used in research, current benchmarks often fail to measure whether a model truly understands physics and is capable of creativity, or is simply memorizing facts. In this talk, a new benchmark framework is proposed to measure the scientific understanding and creativity of LLMs. In this framework, models are tested on tasks ranging from standard textbook problems to complex coding challenges, such as the classification of particle collision events. Finally, a collaborative approach is proposed for the continuous generation of new tasks to ensure the lasting relevance of the benchmark.

Speaker Bio:

Merijn Moody joined DIEP in October 2024 as a FAEME PhD student. He is working on the development of a mathematical framework to identify and characterize emergent information structures in multivariate discrete data. Concretely, he is connecting high-order spin models (a complete family of statistical models for discrete data) with approaches from graph theory. In particular, he is investigating a connection between the partition function of high-order spin models and the Tutte polynomial on matroids.

If you wish to attend this seminar online, please send an email to r.lier@uva.nl to receive the Zoom link.

