I2SC Lecture Series
The Interdisciplinary Institute for Societal Computing offers a regular Lecture Series to bring together researchers of different academic fields to analyze and discuss the broad topic of society and technology. The Lecture Series is designed as a laboratory of interdisciplinary research to encourage cooperation and new research approaches. The series will feature a mix of speakers from Computer Science, Social Science, and Digital Humanities.
November 8, 2024
Cornelius König (Industry and Organizational Psychology, Saarland University)
Pitfalls, Challenges, and Benefits of Interdisciplinary Research: Reflections from a Psychologist Collaborating with Computer Scientists for Over a Decade
November 15, 2024
Elisa Omodei (Network and Data Science, Central European University)
Linking Survey and Social Media Data to Study Political Polarization
November 29, 2024
Marten Düring (Digital History, Luxembourg Centre for Contemporary and Digital History (C²DH))
Machine Learning to Link Historical Media Collections
December 13, 2024
David Garcia (Computer Science, University of Konstanz)
Language Understanding as a Constraint on Consensus Size in LLM Societies
January 10,
2025
Maria Antoniak (Computer Science, University of Colorado)
From Stories to Sonnets: Data-Centered NLP for Creative Works
January 24,
2025
Kiran Garimella (Computer Science, Rutgers University)
Misinformation and Hate speech on WhatsApp - Insights from a large-scale data donation study
February 7,
2025
Diana Sanz Roig (Computational Literary Studies, Universitat Oberta de Catalunya), TBD
The Lecture Series is in building E1 7, Room 3.23, on the campus of Saarland University from 12h-13h.
If you want to meet one of our speakers on the day of the event, please contact us: hello[@]i2sc.net
If you are interested in our events and want to stay up to date, please subscribe to our mailing list here.
For Guest lectures not from the lecture series, please check our Guest Lectures page.
Program
Cornelius König
Industry and Organizational Psychology
Saarland University
November 8, 2024
Pitfalls, Challenges, and Benefits of Interdisciplinary Research: Reflections from a Psychologist Collaborating with Computer Scientists for Over a Decade
The Interdisciplinary Institute for Societal Computing at Saarland University is dedicated to fostering research at the intersection of society and technology, which is why “interdisciplinary” is central to its name. In this talk, I would like to share my personal experiences as a psychologist who has worked collaboratively with computer scientists for over 13 years. I began this journey somewhat naively and have since undergone a long learning process. This experience has convinced me that anyone interested in interdisciplinary work should periodically reflect on the pitfalls, challenges, and benefits of such collaboration. I hope my lecture provides one of those opportunities for reflection.
Elisa Omodei
Network and Data Science
Central European University
November 15, 2024
Linking Survey and Social Media Data to Study Political Polarization
Social media data donation through data download packages (DDPs) is a promising new way of collecting individual-level digital trace data with informed consent. When linked with survey data, data donation is an even more promising tool that helps answer novel research questions. In this talk, I will show how this approach allowed us to investigate polarization measurement biases that arise when only visible traces accessible through platform APIs are considered, while neglecting invisible traces not recorded via online channels, which can reveal key aspects of political engagement online.
Marten Düring
Digital History
Luxembourg Centre for Contemporary and Digital History (C²DH)
November 29, 2024
Machine Learning to Link Historical Media Collections
Historical media count among the most attractive sources for historical research. Following mass digitisation efforts over the past decades, researchers now face the problem of overabundance of materials which can no longer be managed with keyword search and basic content filtering techniques alone even though only a fraction of the overall archival record has actually been made available. This poses challenges for the contextualisation and critical assessment of these sources which can be effectively addressed using semantic enrichments based on natural language processing techniques. In this lecture, I will present ongoing efforts by the project "Impresso. Media Monitoring of the Past". I will discuss epistemological challenges in data exploration and interface design as well as opportunities in terms of reflected content exploration, computational analysis, and including an outlook to forthcoming Impresso Datalab.
David Garcia
Computer Science
University of Konstanz
December 13, 2024
Language Understanding as a Constraint on Consensus Size in LLM Societies
The applications of Large Language Models (LLMs) are going towards collaborative tasks where several agents interact with each other like in an LLM society. In such a setting, large groups of LLMs could reach consensus about arbitrary norms for which there is no information supporting one option over another, regulating their own behavior in a self-organized way. In human societies, the ability to reach consensus without institutions has a limit in the cognitive capacities of humans. To understand if a similar phenomenon characterizes also LLMs, we apply methods from complexity science and principles from behavioral sciences in a new approach of AI anthropology. We find that LLMs are able to reach consensus in groups and that the opinion dynamics of LLMs can be understood with a function parametrized by a majority force coefficient that determines whether consensus is possible. This majority force is stronger for models with higher language understanding capabilities and decreases for larger groups, leading to a critical group size beyond which, for a given LLM, consensus is unfeasible. This critical group size grows exponentially with the language understanding capabilities of models and for the most advanced models, it can reach an order of magnitude beyond the typical size of informal human groups.
Maria Antoniak
Computer Science
University of Colorado
January 10, 2025
From Stories to Sonnets: Data-Centered NLP for Creative Works
In this talk, I'll share two recent studies that use natural language processing (NLP) techniques to model creative works like stories and poetry. In the first part of the talk, I'll discuss NLP approaches for story detection and analysis, focusing on how NLP methods can help us study storytelling at large scales and across diverse contexts. In the second part, I'll discuss the poetic capabilities of large language models (LLMs), focusing on audits of the vast pretraining datasets used to build these models. Both studies will highlight the challenges in creating open evaluation datasets for creative works and the importance of interdisciplinary collaboration between NLP and the humanities.
Kiran Garimella
Computer Science
Rutgers University
January 24, 2025
Misinformation and Hate speech on WhatsApp - Insights from a large-scale data donation study
In this talk, I will present a study looking at the prevalence of misinformation, political propaganda and hate speech on WhatsApp. Using a large WhatsApp data donation program in India and Brazil covering thousands of users, we systematically analyze private group messages to understand content prevalence, virality, and user profiles in problematic content spread. Our analysis from data in India shows a high prevalence of political content, with significant misinformation and hate speech against Muslims. This misinformation was notably prevalent in caste-based groups and had been previously debunked by fact-checkers, indicating a failure of fact-checks to reach these groups. This research is the first quantitative analysis of everyday WhatsApp use and highlights challenges with end-to-end encrypted platforms. It provides a baseline for developing moderation policies to combat misinformation and promote responsible use of encrypted communication channels. The study also develops novel data donation methods and tools to collect representative samples from hard to study platforms. These approaches can be scaled to other platforms to enable data collection in the post API age.
Diana Sanz Roig
Computational Literary Studies
Universitat Oberta de Catalunya
February 7, 2025
TBD