In this edition of the DIEP seminar series, Merijn Moody will present his work on LLMs. Merijn joined DIEP in October 2024 as a FAEME PhD student. He is currently working on the development of a mathematical framework to identify and characterize emergent information structures in multivariate discrete data.
Event details of Evaluating Scientific Understanding and Creativity in LLMs
Date
19 February 2026
Time
11:00 - 12:00
Room
Second-floor library

Title: Evaluating Scientific Understanding and Creativity in LLMs

Abstract

A major debate in the philosophy of Artificial Intelligence is whether LLMs and other AI systems are merely "stochastic parrots" repeating training data, or whether they are capable of genuine creativity and understanding. While Large Language Models are increasingly used in research, current benchmarks often fail to measure whether a model truly understands physics and is capable of creativity, or is simply memorizing facts. In this talk, a new benchmark framework is proposed to measure the scientific understanding and creativity of LLMs. In this framework, models are tested on tasks ranging from standard textbook problems to complex coding challenges, such as the classification of particle collision events. Finally, a collaborative approach is proposed for the continuous generation of new tasks to ensure the lasting relevance of the benchmark.

Speaker Bio:

Merijn Moody joined DIEP in October 2024 as a FAEME PhD student. He is working on the development of a mathematical framework to identify and characterize emergent information structures in multivariate discrete data. Concretely, he is connecting high-order spin models (a complete family of statistical models for discrete data) with approaches used in graph theory. In particular, he is investigating a connection between the partition function of high-order spin models and the Tutte polynomial on matroids.

If you wish to attend this seminar online, please send an email to r.lier@uva.nl to receive the Zoom link.