Information

The course registration period is open. Register for semester 1 courses before Monday, 16 June at 13:00.

student.uva.nl
What is your study programme?
UvA Logo
What is your study programme?
Information

The course registration period is open. Register for semester 1 courses before Monday, 16 June at 13:00.

Colloquium credits

Presentation Master's thesis - Leyi Chen - Brain & Cognition

Colloquium credits

Presentation Master's thesis - Leyi Chen - Brain & Cognition

Last modified on 13-06-2025 11:53
Can Large Language Models Grade University Theses? An Exploratory Step towards Automated Thesis Scoring
Show information for your study programme
What is your study programme?
or
event-summary.start-date
16-06-2025 11:00
event-summary.end-date
16-06-2025 12:00
event-summary.location

Roeterseilandcampus - Gebouw G, Straat: Nieuwe Achtergracht 129-B, Ruimte: GS.34. Vanwege beperkte zaalcapaciteit is deelname op basis van wie het eerst komt, het eerst maalt. Leraren moeten zich hieraan houden.

Grading has long been one of the most demanding and challenging tasks for educators. Even with pre-defined rubrics, issues regarding grading validity and consistency among different graders persist. For over six decades, automated essay scoring (AES) systems have been developed to reduce manual workload and increase efficiency. However, most of these existing AES systems focus on relatively simple, closed-ended questions with known preferred answers, which is far less challenging than assessing a scientific thesis.

With the emergence of Large Language Models (LLMs), could these models present a new opportunity for automating thesis evaluation? This study explores that possibility through an experiment: we systematically manipulated existing theses to represent different quality levels, and used three mainstream LLMs to grade them according to a standardized rubric. The results indicate that all three LLMs reliably recognized thesis quality and assigned scores as hypothesized. These results support the findings of previous research and highlight a positive future for integrating AI into grading tasks in higher education.