Final Results of TWON

After three years of interdisciplinary research, the TWON (Twin of Online Social Networks) project concluded with a final event in Berlin, where the international consortium presented key findings on how online platforms shape democratic discourse and how mechanisms of discourse manipulation emerge in digital environments.

The closing event, hosted at Publix Berlin, brought together researchers as well as representatives from politics, academia, and civil society. Led by the FZI Research Center for Information Technology, the consortium reflected on the project’s results and discussed implications for future research, platform governance, and regulation.

The event was opened by Dr Jonas Fegert, who emphasized the central role of online platforms and their underlying mechanisms in shaping public debate and democratic participation.

In the keynote, Dr Annette Zimmermann explored how platform mechanisms influence discourse dynamics on social media, including practices such as dog whistling and self-censorship. She highlighted how these dynamics affect public deliberation and outlined important avenues for future research.

Parsa Marvi, Member of the German Bundestag, underlined the relevance of TWON for understanding democratic discourse in the digital age. He stressed the importance of evidence-based research for effective and responsible platform regulation.

Key research results from the project were presented and discussed in a session moderated by Cosima Pfannschmidt, with contributions from Dr Alenka Guček, Prof Damian Trilling, Prof Achim Rettinger, and Prof Michael Maes, among others.

The event concluded with a panel discussion titled “What’s next”, featuring Annette Zimmermann, Parsa Marvi, Svea Windwehr, and Damian Trilling, moderated by Jonas Fegert. The discussion focused on concrete recommendations from research, policy, and implementation, particularly in relation to the Digital Services Act, and on how online social networks can be held accountable in times of increasing geopolitical tensions.

Throughout the evening, participants engaged with interactive project demonstrators, discussed research findings, and exchanged perspectives with TWON partners from across Europe.

TWON thanks all speakers, panelists, participants, and project partners for their valuable contributions and the close collaboration over the past three years.

TWON Policy Hackathon

As geopolitical tensions increasingly play out online, the need for a democratic digital public sphere has never been more urgent. Political interests, platform governance choices, and regulatory gaps all shape how online debate unfolds, but what concretely needs to change?

These questions were at the heart of the TWON Policy Hackathon, which brought together experts from research, policymaking, digital law, platform governance, content moderation, and civil society to develop actionable, empirically grounded policy recommendations for online social networks.

The hackathon addressed the shared question of how online social networks must evolve in order to better enable democratic online debate and to safeguard democratic societies. Throughout the afternoon, participants exchanged perspectives on current and future challenges related to online social networks and their governance.

Building on the work of the TWON project, the hackathon connected research on platform design choices and online debate with policy perspectives and practical experience. Draft policy recommendations developed within the project served as a starting point for discussion and were critically examined during the workshop sessions.

The agenda combined a spotlight round on pressing challenges with two structured workshop sessions. Working in professionally mixed groups, participants discussed the current state of knowledge, experiences from practice and regulation, and open research questions. The second workshop session focused on identifying regulatory needs and developing policy recommendations.

We would like to thank all participants for their valuable contributions, thoughtful discussions and engagement throughout the event.

 

TWON Consortium Meeting – a recap 

Last week, the TWON consortium came together in Berlin to advance ongoing work on democratic online social networks and to strengthen the project’s dialogue with policy and civil society stakeholders.

Across a series of internal and public sessions, the meeting focused on how platform design and algorithmic choices shape democratic discourse, contribute to polarization, and influence the spread of disinformation. A dedicated Policy Hackathon provided space for consortium members and external experts to explore regulatory challenges and identify priorities for future policy-oriented work.

In addition, TWON hosted a public dissemination event at Publix, bringing together participants from politics, academia, and civil society to discuss responsible platform design and the broader implications of the project’s research.

The week also included TWON’s General Assembly, where the consortium reflected on progress and lessons learned and discussed how the project’s insights can support future research and collaboration beyond TWON.

The programme concluded with a visit to the German Chancellery and further exchanges with colleagues from the policy sphere, highlighting the importance of connecting research and governance perspectives in the field of online platforms.

The TWON consortium thanks all contributors for their engagement, constructive discussions, and continued collaboration.

From Research to Regulation: Rethinking Online Social Networks // January 28, Berlin

📆 Date: 28 January 2026, 6:00-9:30pm

🎯 Location: Publix, Hermannstraße 90, 12051 Berlin

What do we know from research about the positive and negative effects of online social networks on societies? How can these platforms be designed to protect and strengthen democratic societies and foster a fair online public sphere? Which research is needed, and how can academia work hand in hand with regulators, civil society, and practitioners to jointly create change? These questions gain particular urgency at a time when global geopolitical tensions, disinformation, and the rise of right-wing extremist forces in many democracies worldwide increasingly shape digital infrastructures.

On this evening, we will present the research project “TWON – Twin of Online Social Networks” and discuss its results and implications with policymakers, journalists, and practitioners from civil society. TWON is an EU-funded research project that examines how the design of online platforms influences the quality of democratic online discourse. To this end, an interdisciplinary research team has developed a novel approach to studying online social networks: using a digital twin, simulations are conducted to explore, for example, how different ranking algorithms affect quality of debate, without experimenting on real users. The findings are translated into policy recommendations and discussed in participatory Citizen Labs with citizens across Europe. Members of the consortium include, among others, the Karlsruhe Institute of Technology (KIT), University of Trier, FZI Forschungszentrum Informatik, University of Amsterdam, University of Belgrade, Jožef Stefan Institute, and Robert Koch Institute (RKI).

Furthermore, the event will focus on how online social networks can be researched and shaped at the societal level. In particular, we will discuss promising avenues for future research and evidence-based policymaking, such as data access under the Digital Services Act (DSA), data donation frameworks, and current windows of opportunity in the European and global digital policy debate.

Before and after the stage program, guests are invited to explore interactive project demonstrators, engage with research results at poster stations, and connect informally with project partners from across Europe.

Proposed agenda:

17:30 – Arrival and Demonstrator & Poster Walk

18:00 – Opening: Jonas Fegert, TWON/FZI

18:15 – Impulse: Andrea Lindholz MdB, Vice President of the German Bundestag

18:30 – Keynote: Annette Zimmermann, University of Wisconsin-Madison

18:45 – Presentation of TWON project results

19:10 – Panel discussion

Annette Zimmermann, University of Wisconsin-Madison

Svea Windwehr, D64

Damian Trilling, TWON/University of Amsterdam

19:55 – Audience Q&A

20:10 – Buffet and Demonstrator & Poster Walk

We would be delighted to welcome you in Berlin and look forward to an open and productive discussion with you!

New Publication: Simulating Algorithmic Personalization and Polarization

We are pleased to announce a new peer-reviewed publication by Ljubiša Bojić, co-authored with Velibor Ilić, Veljko Prodanović, and Vuk Vuković, published in Chinese Political Science Review.

The paper introduces the Recommender Systems LLMs Playground (RecSysLLMsP), an agent-based simulation framework designed to study how recommender systems and large language models jointly shape engagement, emotional dynamics, and polarization in social media environments.

The study models a synthetic social media ecosystem with 100 agents grounded in real psychometric and demographic data. Agents interact through feeds with progressively increasing levels of personalization, while content is generated and adapted using large language models. This setup enables controlled observation of how algorithmic personalization affects collective behavior.

Key findings show that moderate personalization maximizes engagement, while full personalization significantly reduces content diversity and amplifies both structural and affective polarization. Network modularity increases sharply as personalization deepens, indicating the emergence of echo-chamber dynamics. At the same time, the simulation demonstrates that LLM-based agents can reproduce realistic patterns of emotional contagion and ideological clustering.
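Network modularity, the quantity used above as an indicator of echo chambers, can be computed in a few lines. The following is a minimal sketch, assuming an undirected, unweighted interaction graph and a given assignment of agents to communities; the function name and the toy graph are illustrative and are not taken from the RecSysLLMsP code.

```python
from collections import defaultdict


def modularity(edges, community_of):
    """Newman modularity Q of an undirected, unweighted graph.

    edges: iterable of (u, v) pairs, each undirected edge listed once.
    community_of: dict mapping every node to a community label.
    """
    edges = list(edges)
    m = len(edges)
    if m == 0:
        return 0.0
    internal = defaultdict(int)    # edges with both ends in the same community
    degree_sum = defaultdict(int)  # summed degree of the community's nodes
    for u, v in edges:
        cu, cv = community_of[u], community_of[v]
        degree_sum[cu] += 1
        degree_sum[cv] += 1
        if cu == cv:
            internal[cu] += 1
    # Q = sum over communities of (share of internal edges - expected share)
    return sum(
        internal[c] / m - (degree_sum[c] / (2 * m)) ** 2 for c in degree_sum
    )


# Toy graph (made up): two triangles joined by a single bridge edge.
toy_edges = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]
two_camps = {0: "a", 1: "a", 2: "a", 3: "b", 4: "b", 5: "b"}
one_camp = {node: "a" for node in range(6)}
```

Higher values of Q mean that interactions stay inside communities more than chance would predict, which is why a sharp rise in modularity is read as a sign of echo-chamber formation.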

RecSysLLMsP provides a transparent and reproducible “digital laboratory” for testing recommender system designs and policy interventions before they are deployed at scale. The framework has direct relevance for research in computational social science, responsible AI, platform governance, and democratic communication.

Publication details:
An Agent-Based Simulation of Politicized Topics Using Large Language Models: Algorithmic Personalization and Polarization on Social Media
Chinese Political Science Review
DOI: 10.1007/s41111-025-00326-x

Out now: Our new demonstrator tool TWONderland

Over the past weeks, our TWON researcher Fabio Sartori (KIT) and his colleagues worked on a new demonstrator tool to make the dynamics of Online Social Networks tangible for the general public. The result is TWONderland!

In our simulation TWONderland, the user takes on the job of lead designer of a new Online Social Network. In a playful and interactive way, users explore how, as platform designers, they influence interaction on the platform and how even the tiniest design choices can ripple out to shape behavior, sentiments, and relationships between users – and potentially spark fragmentation and fuel polarization.

What makes this demonstrator unique is its step-by-step walkthrough of how Online Social Networks (OSNs) function. The user starts by assigning moods, from aggressive to calm, to fictional platform users. We then visualize how these fictional users are connected to each other on the platform, and how their moods adapt as they encounter each other’s posts. In TWONderland, every OSN user participates within a specific sentiment corridor, meaning that they interact with and adapt to other users only as long as their difference in sentiment is not too large. For instance, a very calm user would not immediately interact with somebody who is very aggressive. However, the demonstrator shows that the overall sentiment on a platform can still shift gradually in positive or negative directions. These network dynamics were modelled on the Axelrod model (for further information and technical details, please refer to our Deliverable).
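The sentiment corridor can be illustrated with a small simulation. This is a minimal sketch under stated assumptions, not the TWONderland implementation: mood is reduced to a single number between -1.0 (aggressive) and 1.0 (calm), and the corridor width, adaptation rate, and function names are hypothetical. The Deliverable describes the actual model.

```python
import random


def corridor_step(moods, i, j, corridor=0.3, rate=0.5):
    """User i reads a post by user j and adapts only inside the corridor.

    moods: list of floats in [-1.0, 1.0]; -1 is aggressive, +1 is calm.
    Returns True if user i's mood changed.
    """
    gap = moods[j] - moods[i]
    if abs(gap) > corridor:
        return False  # too far apart: no interaction, no adaptation
    moods[i] += rate * gap  # move part of the way toward the other user
    return gap != 0.0


def run(moods, links, steps=1000, seed=0, corridor=0.3, rate=0.5):
    """Repeatedly pick a random connected pair and apply the corridor rule."""
    rng = random.Random(seed)
    moods = list(moods)  # work on a copy so the caller's list is untouched
    for _ in range(steps):
        i, j = rng.choice(links)
        if rng.random() < 0.5:
            i, j = j, i  # either user of the pair may be the reader
        corridor_step(moods, i, j, corridor, rate)
    return moods


# Made-up example: a chain of five users whose neighbours differ by 0.25.
start = [0.9, 0.65, 0.4, 0.15, -0.1]
chain = [(0, 1), (1, 2), (2, 3), (3, 4)]
end = run(start, chain)
```

In this chain the calmest and the most aggressive user never interact directly, yet each can drift over time through intermediate neighbours, which mirrors the gradual shifts described above.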

Once they understand the network dynamics, the user is asked to experiment with alternative platform mechanisms that determine which users (and their moods) influence their own fictional platform user. Depending on the ranking algorithm the user sets, posts with different moods, again from aggressive to calm, become visible to their fictional character and influence its mood. From this individual level, the demonstrator then moves on to visualizing larger networks in which many users influence each other according to the chosen platform mechanics. To understand how users influence each other’s mood on OSNs, the user can run comparative simulations and explore how polarization is fueled or reduced through the ranking mechanics alone.
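A comparison of ranking mechanics might look like the sketch below. It is illustrative only: the two ranking rules, the (timestamp, mood) post format, and the update rule are assumptions chosen for brevity, and the sentiment corridor is left out. None of this is taken from the demonstrator's code.

```python
def rank_chronological(posts):
    """Newest posts first; each post is a (timestamp, mood) tuple."""
    return sorted(posts, key=lambda post: post[0], reverse=True)


def rank_by_intensity(posts):
    """Hypothetical engagement proxy: strongest moods (either pole) first."""
    return sorted(posts, key=lambda post: abs(post[1]), reverse=True)


def read_feed(reader_mood, posts, ranker, top_k=3, rate=0.2):
    """Shift the reader's mood toward the mean mood of the top_k visible posts."""
    visible = ranker(posts)[:top_k]
    if not visible:
        return reader_mood
    mean_mood = sum(mood for _, mood in visible) / len(visible)
    return reader_mood + rate * (mean_mood - reader_mood)


# Made-up feed: moods run from -1.0 (aggressive) to 1.0 (calm).
feed = [(1, -0.9), (2, 0.1), (3, 0.2), (4, 0.0), (5, -0.8)]
after_chronological = read_feed(0.0, feed, rank_chronological)
after_intensity = read_feed(0.0, feed, rank_by_intensity)
```

With the same posts and the same reader, only the ranking rule differs between the two runs, so any difference in the resulting mood is attributable to the ranking mechanics.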

New paper by TWON researcher Simon Münker: Fingerprinting LLMs through Survey Item Factor Correlation: A Case Study on Humor Style Questionnaire

We are proud to announce that our researcher Simon Münker has published a new paper titled Fingerprinting LLMs through Survey Item Factor Correlation: A Case Study on Humor Style Questionnaire. It is published in the Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing and the results will be presented in Shanghai on 5 November.

LLMs increasingly engage with psychological instruments, yet how they represent constructs internally remains poorly understood. Simon Münker introduces a novel approach to “fingerprinting” LLMs through their factor correlation patterns on standardized psychological assessments, in order to deepen the understanding of how LLMs represent constructs. Using the Humor Style Questionnaire as a case study, he analyzes how six LLMs represent and correlate humor-related constructs compared with human survey participants. His results show that the LLMs exhibit little similarity to human response patterns. In contrast, subsamples of human participants demonstrate remarkably high internal consistency. Exploratory graph analysis further confirms that no LLM successfully recovers the four constructs of the Humor Style Questionnaire. These findings suggest that despite advances in natural language capabilities, current LLMs represent psychological constructs in fundamentally different ways than humans, which calls into question the validity of using them as human simulacra.
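The general idea of comparing correlation patterns can be sketched as follows. This is a simplified, item-level illustration and not the paper's pipeline: it treats the pairwise correlations between questionnaire items as the “fingerprint” and compares two fingerprints with a Pearson correlation. All function names and the toy answers are assumptions.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation; returns 0.0 if either input is constant (assumption)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


def fingerprint(responses):
    """Correlation of every item pair; one row per respondent, one column per item."""
    columns = list(zip(*responses))
    return [
        pearson(columns[i], columns[j])
        for i, j in combinations(range(len(columns)), 2)
    ]


def fingerprint_similarity(responses_a, responses_b):
    """How closely two groups agree in their item correlation patterns."""
    return pearson(fingerprint(responses_a), fingerprint(responses_b))


# Made-up answers to three items on a 1-5 scale, five respondents per group.
humans = [[1, 2, 5], [2, 3, 4], [3, 3, 3], [4, 5, 2], [5, 4, 1]]
model = [[1, 5, 1], [2, 4, 2], [3, 3, 3], [4, 2, 4], [5, 1, 5]]
```

A similarity near 1 would mean that a group relates the items to each other the way the reference group does. A low or negative value indicates a different internal structure, even when individual answers look plausible.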

It’s a wrap: CitizenLab 2025 in Chemnitz

On 8 October, we hosted another CitizenLab in the Stadthallenpark in Chemnitz, where we got to speak with citizens about our research on Online Social Networks.

We presented our demonstrators MicroTWONY, MacroTWONY, and TWONderland to interested citizens and participants, and had inspiring conversations about the impact of Online Social Networks on society and democracy, as well as about possibilities for regulation and ethical design. We were glad to see how many participants enjoyed experimenting with the demonstrators and exploring how digital dynamics become tangible!

In the evening, we joined an interesting event on memory culture in digital spaces at the NSU Documentation Center with TWON researcher Jonas Fegert, journalist Nhi Le and Susanne Siegert from the channel @keineerinnerungskultur, moderated by Benjamin Fischer. The discussion focused on the opportunities social networks offer for democratic education, especially for younger audiences, and on the limitations imposed by platform mechanisms that tend to amplify hate speech and misinformation.

A day full of dialogue, reflection, and future perspectives – thank you to everybody who was part of it. We’re looking forward to the next CitizenLab!

New publication: Can we use automated approaches to measure the quality of online political discussion?

We’re proud to announce that our consortium members Sjoerd Stolwijk, Damian Trilling (both University of Amsterdam) and Simon Münker (Trier University) contributed to a freshly published paper on measuring the debate quality of online political discussions. The paper was released in the “Communication Methods and Measures” journal by Routledge and is open access.

Our researchers review how debate quality has been measured in communication science, and systematically compare 50 automated metrics against numerous manually coded comments. Based on their experiments, they were able to give clear recommendations for how to (not) measure debate quality in terms of interactivity, diversity, rationality, and (in)civility according to Habermas.

Their results show that transformer models and generative AI (such as Llama and GPT models) outperform older methods. Performance nevertheless varies with the concept being measured, and some concepts (e.g. rationality) remain difficult to capture even with human coding. Which measure is preferable for future empirical applications likely depends on the objective of the study in question. For some genres, languages, and communication styles (e.g. satire), it is strongly advised to test the accuracy of automated methods against human interpretation beforehand, even if the methods are widely used. Some approaches and implementations performed so poorly that they are not suitable for studying debate quality.
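One common way to test an automated measure against human interpretation is to compute chance-corrected agreement on a hand-coded sample. The sketch below uses Cohen’s kappa as an example. It is an illustration only and does not reproduce the evaluation metrics used in the paper; the labels are made up.

```python
from collections import Counter


def cohen_kappa(human, automated):
    """Chance-corrected agreement between two label sequences of equal length."""
    if len(human) != len(automated) or not human:
        raise ValueError("need two non-empty label lists of equal length")
    n = len(human)
    observed = sum(h == a for h, a in zip(human, automated)) / n
    human_counts, auto_counts = Counter(human), Counter(automated)
    # Agreement expected if both coders labelled independently at their base rates.
    expected = sum(
        human_counts[label] * auto_counts[label] for label in human_counts
    ) / (n * n)
    if expected == 1.0:
        return 1.0  # both sides used one identical label throughout
    return (observed - expected) / (1 - expected)


# Made-up incivility codes (1 = uncivil) for ten comments.
human_codes = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
model_codes = [1, 1, 0, 0, 0, 0, 0, 0, 0, 1]
kappa = cohen_kappa(human_codes, model_codes)
```

Raw accuracy in this example is 80 percent, but kappa is considerably lower because most comments are civil and agreement on the majority class is easy to reach by chance.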

Zero-shot prompt-based classification @ACL Vienna

Simon Münker recently presented his research on the use of zero-shot, prompt-based classification for analysing political discourse on German Twitter during the European energy crisis at the 2025 Association for Computational Linguistics Conference in Vienna. He gave a poster presentation and a talk about his newly published paper.

In their paper, Dr. Achim Rettinger, Kai Kugler and Simon Münker assess advancements in NLP, specifically large foundation models, for automating annotation processes on German Twitter data concerning European crises.

The study explores how recent advances in large language models (LLMs) can reduce the need for long manual work when labeling and categorizing social media content. Instead of training models with thousands of examples, LLMs can follow written prompts to classify tweets in a zero-shot setting, meaning without prior training on the specific task.
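In code, zero-shot prompting amounts to writing the task into the prompt and mapping the model’s free-text answer back to a label. The sketch below assumes a hypothetical `generate` callable that stands in for any instruction-tuned model. The prompt wording, the labels, and the helper names are illustrative and are not the prompts used in the paper.

```python
from typing import Callable, Sequence


def build_prompt(tweet: str, question: str, labels: Sequence[str]) -> str:
    """Describe the task in plain language; no training examples are included."""
    options = ", ".join(labels)
    return (
        f"Tweet: {tweet}\n"
        f"Question: {question}\n"
        f"Answer with exactly one of: {options}.\n"
        "Answer:"
    )


def parse_label(output: str, labels: Sequence[str], fallback: str) -> str:
    """Map free-text model output to a known label, or to the fallback."""
    text = output.strip().lower()
    for label in labels:
        if text.startswith(label.lower()):
            return label
    return fallback


def classify(
    tweet: str,
    question: str,
    generate: Callable[[str], str],  # hypothetical stand-in for a model call
    labels: Sequence[str] = ("yes", "no"),
    fallback: str = "no",
) -> str:
    """Zero-shot classification: prompt the model once and parse its answer."""
    return parse_label(generate(build_prompt(tweet, question, labels)), labels, fallback)
```

Passing the model in as a plain function keeps the sketch independent of any particular library. In practice `generate` would wrap a call to whichever model is being evaluated.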

The tweets came from a German Twitter dataset built around survey questions from the SOSEC project about the energy crisis in winter 2022/23. Two domain experts and native speakers annotated a random sample of around 7,000 tweets.

The evaluated models included a baseline Naive Bayes classifier using token counts; a fine-tuned German-specific BERT transformer (“gbert-base”), a model further adapted with additional pretraining on domain-specific tweets to improve domain relevance; and instruction-tuned models based on T5, which follow prompts to classify texts in a zero-shot setting, without domain-specific fine-tuning.
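For readers unfamiliar with the baseline, a Naive Bayes classifier over token counts can be written in a few lines. This is a minimal sketch with whitespace tokenization and add-one smoothing, both of which are assumptions. The class name and the English toy sentences are made up and are unrelated to the study’s data or implementation.

```python
import math
from collections import Counter, defaultdict


class TokenCountNaiveBayes:
    """Multinomial Naive Bayes over whitespace tokens with additive smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha  # smoothing strength; 1.0 is add-one smoothing

    def fit(self, texts, labels):
        self.class_docs = Counter(labels)          # documents per class
        self.token_counts = defaultdict(Counter)   # token counts per class
        for text, label in zip(texts, labels):
            self.token_counts[label].update(text.lower().split())
        self.vocab = {t for counts in self.token_counts.values() for t in counts}
        return self

    def predict(self, text):
        # Tokens never seen in training carry no information and are skipped.
        tokens = [t for t in text.lower().split() if t in self.vocab]
        total_docs = sum(self.class_docs.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_docs.items():
            counts = self.token_counts[label]
            denom = sum(counts.values()) + self.alpha * len(self.vocab)
            score = math.log(n_docs / total_docs)  # log prior
            score += sum(
                math.log((counts[t] + self.alpha) / denom) for t in tokens
            )  # log likelihood of each known token
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Made-up toy training data.
train_texts = [
    "gas prices are too high",
    "heating costs are rising",
    "lovely weather today",
    "great football match today",
]
train_labels = ["energy", "energy", "other", "other"]
baseline = TokenCountNaiveBayes().fit(train_texts, train_labels)
```

Unlike the prompt-based models, this baseline needs labeled examples before it can classify anything, which is the annotation cost that zero-shot approaches aim to avoid.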

The results show that prompt-based approaches perform almost as well as fine-tuned BERT models, without requiring annotated training data.

However, the study also emphasizes limitations such as the inherited and potentially amplified biases present in the training data and differences in outcomes related to the language used (German/English), as well as cultural nuances.

Automating the analysis of political and social debates raises questions about the role AI can and should play in interpreting sensitive public discourse.