Manuel Tonneau

@ManuelTonneau

Followers
615
Following
4K
Media
36
Statuses
613

PhD candidate in Social Data Science @OIIOxford. NLP, Computational Social Science @WorldBank, 🐘 @[email protected]. 🇫🇷 🇪🇺

Berlin/Oxford
Joined October 2011
@ManuelTonneau
Manuel Tonneau
8 months
Can we detect #hatespeech at scale on social media? To answer this, we introduce 🤬HateDay🗓️, a global hate speech dataset representative of a day on Twitter. The answer: not really! Detection perf is low and overestimated by traditional eval methods. 🧵
3
18
65
@ManuelTonneau
Manuel Tonneau
5 hours
RT @sehgal_neil: 🚨 New study! We tested whether AI-generated messages – single static messages vs. conversations – can boost intent to scre…
0
2
0
@ManuelTonneau
Manuel Tonneau
3 months
RT @sehgal_neil: 🚨 New preprint on AI persuasion and public health 🚨 A 3-min conversation with GPT-4o nudged HPV-vax-hesitant parents (wh…
0
4
0
@ManuelTonneau
Manuel Tonneau
7 months
RT @lalizaveta: 🚨New article🚨 We explore the potential of popular chatbots to detect misinformation & propose a new conceptual framework fo…
0
8
0
@ManuelTonneau
Manuel Tonneau
7 months
RT @oiioxford: New blog! @oiioxford doctoral researchers @deeliu97, @ManuelTonneau and Juliette Zaccour propose a series of recommendations…
0
3
0
@ManuelTonneau
Manuel Tonneau
8 months
Your feedback is much appreciated as we prepare the final version of the paper. We would like to thank @JurgenPfeffer and team, who collected the TwitterDay dataset from which HateDay is sampled and without whom this work would not have been possible! 🙏🙏🙏
0
0
2
@ManuelTonneau
Manuel Tonneau
8 months
What about moderation? Given low perf, automatic moderation is not desirable. We investigate the feasibility of human-in-the-loop moderation, where models flag and humans verify. Moderating >80% of all hate would require humans to review >10% of all daily tweets, which can get 💸💸
1
0
1
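The flag-and-verify trade-off in the tweet above can be sketched as follows. This is a minimal illustration, not the paper's method: it assumes reviewers read posts in descending order of model score, and the function name `review_share_for_recall` and all toy numbers are hypothetical.

```python
import math


def review_share_for_recall(scores, labels, target_recall):
    """Share of all posts humans must review, going from the highest
    model score downwards, to catch `target_recall` of the hateful posts.

    scores: model scores, higher means "more likely hateful" (assumed).
    labels: 1 for hateful, 0 otherwise (ground truth, assumed available).
    """
    if not 0 < target_recall <= 1:
        raise ValueError("target_recall must be in (0, 1]")
    total_positives = sum(labels)
    if total_positives == 0:
        raise ValueError("no hateful posts in the sample")

    needed = math.ceil(target_recall * total_positives)
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)

    found = 0
    for reviewed, (_, label) in enumerate(ranked, start=1):
        found += label
        if found >= needed:
            return reviewed / len(ranked)
    return 1.0


# Toy sample (hypothetical): 10 posts, 4 of them hateful.
toy_scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.05]
toy_labels = [1, 0, 1, 0, 0, 1, 0, 0, 0, 1]
half = review_share_for_recall(toy_scores, toy_labels, 0.5)
full = review_share_for_recall(toy_scores, toy_labels, 1.0)
```

When the model ranks hateful posts poorly, the returned share rises quickly with the recall target, which is what makes human verification costly at the scale of a full day of tweets.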
@ManuelTonneau
Manuel Tonneau
8 months
We also find other reasons for low perf, such as the misalignment between target focus in academic work and target prevalence in the wild, as well as the difficulty of distinguishing use from mention of hate, as shown in past work by @krisgligoric.
1
0
1
@ManuelTonneau
Manuel Tonneau
8 months
Why is perf so low? One important reason is that it is hard to distinguish between offensive and hateful content (as exposed by @thomasrdavidson in seminal work). Offensive content is also much more prevalent than hate in the wild, so it crowds out hate in the predicted positives.
1
1
1
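The crowding-out effect described in the tweet above is a base-rate effect, and a short sketch makes it concrete. The function name `precision_at_prevalence` and every number below are hypothetical, chosen only for illustration; none come from HateDay.

```python
def precision_at_prevalence(prevalence, recall, false_positive_rate):
    """Precision of a classifier, given the share of truly hateful posts
    (prevalence), the share of hateful posts it catches (recall), and the
    share of non-hateful posts it wrongly flags (false_positive_rate)."""
    true_positives = prevalence * recall
    false_positives = (1 - prevalence) * false_positive_rate
    flagged = true_positives + false_positives
    return true_positives / flagged if flagged > 0 else 0.0


# Same hypothetical classifier, two hypothetical settings:
balanced = precision_at_prevalence(0.5, recall=0.8, false_positive_rate=0.05)
in_the_wild = precision_at_prevalence(0.001, recall=0.8, false_positive_rate=0.05)
```

With recall and false positive rate held fixed, precision falls sharply as hate becomes rare, because the false positives drawn from the much larger non-hateful pool (including merely offensive posts) dominate the flagged set.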
@ManuelTonneau
Manuel Tonneau
8 months
We then evaluate popular hate speech detection LLMs on HateDay and compare with their performance on academic hate speech datasets and functional tests (HateCheck). We find that traditional eval methods systematically overestimate performance on representative data, which is low.
1
0
1
@ManuelTonneau
Manuel Tonneau
8 months
We first look at the prevalence and composition of hate in HateDay and find that most types of hate are represented across contexts, with some local specificities in the importance of each hate type (e.g., green-bashing in German tweets, islamophobia in India).
1
0
1
@ManuelTonneau
Manuel Tonneau
8 months
RT @agostina_cal: I'm creating a 🦋 starter pack for researchers working on (fighting) online harms, if you would like to be added send alon…
0
4
0
@ManuelTonneau
Manuel Tonneau
8 months
RT @WOAHWorkshop: 🥳We are excited to share that WOAH 2025, our 9th edition, will take place at #ACL2025 in Vienna! @aclmeeting Our speci…
0
9
0
@ManuelTonneau
Manuel Tonneau
8 months
Our #hatespeech supersets, combining all 🤬 corpora in 8 lang, reached 2K downloads on 🤗🎉. To enable more cross-cultural 🤬 research, we release:
- author country for English, Arabic and Spanish posts from our @WOAHWorkshop 📜
- 3 country sets 🇮🇳🇳🇬🇰🇪
0
4
38
@ManuelTonneau
Manuel Tonneau
9 months
RT @DG_Rand: 🚨Out in Nature!🚨 Many (eg Trump JimJordan @elonmusk) have accused social media of anti-conservative bias. Is this accurate? We…
0
217
0
@ManuelTonneau
Manuel Tonneau
10 months
RT @c_schwemmer: We are looking for a student to assist our team in using AI to detect toxic content on social media. If you are familiar w…
0
11
0
@ManuelTonneau
Manuel Tonneau
10 months
RT @enfleisig: Does ChatGPT discriminate against speakers of different dialects? Our #EMNLP2024 paper finds that ChatGPT exhibits consisten…
0
9
0
@ManuelTonneau
Manuel Tonneau
11 months
RT @gvrkiran: Video data is widely available but has been difficult to analyze automatically. Particularly, TV data is a rich and important…
0
32
0
@ManuelTonneau
Manuel Tonneau
11 months
RT @davidschlangen: Time to accept we didn't manage to pull hybrid conferences off. But travel must still be reduced, no way around it. So…
0
1
0
@ManuelTonneau
Manuel Tonneau
11 months
Your feedback is super welcome! We are also expanding this analysis beyond 🇳🇬 with a paper out soon, so stay tuned! This was fun teamwork and wouldn't have been possible without the awesome work from our annotators. 🙏🙏🙏
0
0
1