
Manuel Tonneau
@ManuelTonneau
Followers
615
Following
4K
Media
36
Statuses
613
PhD candidate in Social Data Science @OIIOxford. NLP, Computational Social Science @WorldBank, 🐘 @[email protected]. 🇫🇷 🇪🇺
Berlin/Oxford
Joined October 2011
Can we detect #hatespeech at scale on social media? To answer this, we introduce 🤬HateDay🗓️, a global hate speech dataset representative of a day on Twitter. The answer: not really! Detection perf is low and overestimated by traditional eval methods. 🧵
3
18
65
RT @sehgal_neil: 🚨 New study! We tested whether AI-generated messages – single static messages vs. conversations – can boost intent to scre…
0
2
0
RT @sehgal_neil: 🚨 New preprint on AI persuasion and public health 🚨 A 3-min conversation with GPT-4o nudged HPV-vax-hesitant parents (wh…
0
4
0
RT @lalizaveta: 🚨New article🚨 We explore the potential of popular chatbots to detect misinformation & propose a new conceptual framework fo…
0
8
0
RT @oiioxford: New blog! @oiioxford doctoral researchers @deeliu97, @ManuelTonneau and Juliette Zaccour propose a series of recommendations…
0
3
0
Your feedback is much appreciated as we prepare the final version of the paper. We would like to thank @JurgenPfeffer and team who collected the TwitterDay dataset from which HateDay is sampled and without whom this work would not have been possible! 🙏🙏🙏
0
0
2
We also find other reasons for low perf, such as the misalignment between target focus in academic work and target prevalence in the wild, as well as the difficulty of distinguishing between use and mention of hate, as shown in past work from @krisgligoric.
1
0
1
Why is perf so low? An important reason is that it is hard to distinguish between offensive and hateful content (as shown by @thomasrdavidson in seminal work), and offensive content is much more prevalent than hate in the wild, crowding out hate in the predicted positives.
1
1
1
RT @agostina_cal: I'm creating a 🦋 starter pack for researchers working on (fighting) online harms, if you would like to be added send alon…
0
4
0
RT @WOAHWorkshop: 🥳We are excited to share that WOAH 2025, our 9th edition, will take place at #ACL2025 in Vienna! @aclmeeting Our speci…
0
9
0
Our #hatespeech supersets, combining all 🤬 corpora in 8 lang, reached 2K downloads on 🤗🎉
To enable more cross-cultural 🤬 research, we release:
- author country for English, Arabic and Spanish posts from our @WOAHWorkshop 📜
- 3 country sets 🇮🇳🇳🇬🇰🇪
0
4
38
RT @c_schwemmer: We are looking for a student to assist our team in using AI to detect toxic content on social media. If you are familiar w…
0
11
0
RT @enfleisig: Does ChatGPT discriminate against speakers of different dialects? Our #EMNLP2024 paper finds that ChatGPT exhibits consisten…
0
9
0
RT @gvrkiran: Video data is widely available but has been difficult to analyze automatically. Particularly, TV data is a rich and important…
0
32
0
RT @davidschlangen: Time to accept we didn't manage to pull hybrid conferences off. But travel must still be reduced, no way around it. So…
0
1
0