student.uva.nl
Welke opleiding volg je?
UvA Logo
Welke opleiding volg je?
Colloquiumpunten

Presentation Master's thesis - Nitya Shah - Psychological Research Methods

Colloquiumpunten

Presentation Master's thesis - Nitya Shah - Psychological Research Methods

Laatst gewijzigd op 19-10-2024 16:02
Social Reasoning in Humans and Language-and-Vision Models (VLMs)
Toon informatie voor jouw opleiding
Welke opleiding volg je?
Startdatum
15-07-2024 09:30
Einddatum
15-07-2024 10:00
Locatie

Roeterseilandcampus - Gebouw A, Straat: Roetersstraat 11, Ruimte: A2.06

State-of-the-Art large language models (LLMs) appear to have formal linguistic competence (i.e., rules of language and grammar), but lack functional linguistic competence, which includes formal and social reasoning, among others. Functional linguistic competence is necessary for using language and engaging with the world. Since human interaction takes place in multimodal settings  (i.e., visual, auditory and textual input), computational models that leverage a key modality in human perception, vision, could be advantageous. Given the lack of research in this field, we specifically examine whether language-and-vision models (VLMs) exhibit social reasoning. In particular, we focus on Theory of Mind (ToM), which we test using the Edinburgh Social Cognition Test (ESCoT) (Baksh et al., 2018). The state-of-the-art VLM, LLaVA, and human participants were tested on theory of mind through the ESCoT. To examine which modalities facilitate social reasoning (ToM), we placed human participants in three different conditions: (1) image-only version of the ESCoT task, (2) image + textual description and (3) only textual description. The VLM, LLaVA, also received the task under these three conditions. The results of this experiment and its implications on our understanding of human cognition and AI's cognitive capabilities will be discussed.