Anna Wegmann Profile
Anna Wegmann

@anna_wegmann

Followers
221
Following
255
Media
16
Statuses
83

PhD candidate in NLP @UniUtrecht | Measuring language variation with ML/NLP | now mainly on 🦋 via https://t.co/gpk3bBPSrd

Joined December 2019
@anna_wegmann
Anna Wegmann
1 year
Interested in whether people👂 each other in a conversation? 🚨New paper accepted at #EMNLP2024 with @tyskevdb and @dongng about detecting paraphrases between speakers 🤖 Detect? https://t.co/pngaXQTwfC 📊 Analyze? https://t.co/eIbqRzJICi 📄 Read? https://t.co/iB5okZnNks
0
1
14
@BenLitterer
Ben Litterer
1 year
Podcasts are a popular medium, but data for computational research is limited! We introduce the Structured Podcast Research Corpus (SPoRC - https://t.co/iBd8ZfUSmc), a large, multimodal dataset of English podcasts 🧵 https://t.co/UMej1aPzCv
arxiv.org
Podcasts provide highly diverse content to a massive listener base through a unique on-demand modality. However, limited data has prevented large-scale computational analysis of the podcast...
3
22
73
@MiriamSchirmer
Miriam Schirmer
1 year
Heading to #EMNLP2024 in Miami! ✈️🏝️ Excited to connect and grab a coffee with anyone interested in #NLP for #ViolenceDetection and #MentalHealth. Let’s chat! #CSS
0
1
4
@anna_wegmann
Anna Wegmann
1 year
Semantics Track, Riverfront hall
0
0
0
@anna_wegmann
Anna Wegmann
1 year
Come talk to me and @dongng on Wednesday, Poster Session E from 4.00-5.30PM about paraphrases in dialog. See you at #EMNLP2024!
@anna_wegmann
Anna Wegmann
1 year
Interested in whether people👂 each other in a conversation? 🚨New paper accepted at #EMNLP2024 with @tyskevdb and @dongng about detecting paraphrases between speakers 🤖 Detect? https://t.co/pngaXQTwfC 📊 Analyze? https://t.co/eIbqRzJICi 📄 Read? https://t.co/iB5okZnNks
2
1
9
@karen_ullrich
Dr. Karen Ullrich
1 year
#Tokenization is undeniably a key player in the success story of #LLMs but we poorly understand why. I want to highlight progress we made in understanding the role of tokenization, developing the core incidents and mitigating its problems. 🧵👇
15
94
601
@dustin_wright37
Dustin Wright
1 year
Curious about using LLMs to simulate conversations? Check out this big collaborative project we did @umsi ! #NLProc
@AndersGiovanni
Anders Giovanni Møller
1 year
👩🏼‍💻 Real or Robotic? 🤖 Can LLMs accurately simulate qualities of human responses in dialogue? Human conversations with LLMs are great for assessing the capabilities of LLMs. But having lots of folks chat with LLMs is challenging (💰⏳🕵️). Could we have another LLM *simulate*
0
2
14
@suzan
Suzan Verberne 🤹‍♀️
2 years
The 34th edition of Computational Linguistics in The Netherlands (the Dutch-Belgian #NLProc conference) will be held @UniLeiden on August 30. The list of accepted abstracts is on the website and registration is open for everyone interested 💬 #clin34 https://t.co/E10OEojtLx
clin34.leidenuniv.nl
2
8
16
@dustin_wright37
Dustin Wright
2 years
🔎What values and opinions do we see when we use 6 LLMs to generate 156,000 responses to 62 political propositions? Our paper "Revealing Fine-Grained Values and Opinions in Large Language Models" answers this. 📰 https://t.co/QvDcIiDzN7 #NLProc #LLMs
3
13
71
@huashen218
Hua Shen✨
2 years
📢Is current “human-AI alignment” research clarified and comprehensive? 🤔 We systematically reviewed 400+ papers across HCI, NLP, and ML to develop a framework for 👫<>🤖"Bidirectional Human-AI Alignment", encompassing the dual paths of “Aligning AI to Human” and “Aligning Human
5
64
276
@leczhang
Lechen Zhang
2 years
[1/13] LLMs are increasingly skilled at mimicking human agents in social settings, but have they truly developed a consistent personality? Check out our work accepted to #NAACL2024 where we question the reliability of persona tests applied to LLMs. Arxiv: https://t.co/GbZrtemmS8
1
11
44
@dustin_wright37
Dustin Wright
2 years
📰New preprint! w/ @christian_igel @raghavian📰 BMRS: Bayesian Model Reduction for Structured Pruning Structured pruning makes neural nets efficient by removing full structures (e.g. neurons). But how do we know what to prune? Here's our approach: https://t.co/epxvZFtOJO
1
4
14
@debora_nozza
Debora Nozza
2 years
📢 JOBS📢 Come work with us @MilaNLProc! Looking for 2 POSTDOCS (two-year positions w/extension) to work on personalized and subjective approaches to #NLProc. Deadline: May 30 2024 Start date: from Sep 2024 Link:
jobmarket.unibocconi.eu
Recruiting, Faculty, Post-doc Grant, Collaboration Contracts
1
23
44
@Jann1s
Jannis Androutsopoulos
2 years
Next up in the DiLCo Lecture Series 2024: Christoph Purschke @questoph presenting his multi-method approach to "Monitoring the public debate on multilingualism in Luxembourg". Thursday, April 25 at 4 pm CEST, open access. Registration: https://t.co/mQo8QRo9k7 @unihh
0
3
10
@MeeraDesai18
Meera Desai
2 years
Thread on our new paper!
@dallascard
Dallas Card
2 years
I'm excited to share that the journal version of our paper, "An archival perspective on pretraining data", is now available (open access) from Patterns! This project was led by @MeeraDesai18, along with @IrenePasquetto, @az_jacobs, and myself 1/n
4
6
32
@johannes_wachs
Johannes Wachs
3 years
@natfriedman Some colleagues and I have been studying the impact of ChatGPT on SO using data on posts, not views: https://t.co/JnLs4PQggU Besides a big decrease after ChatGPT, we observe a completely flat 2022, and earlier a big bump in activity during early Covid.
2
11
105
@indiiigosky
Indira Sen
3 years
Copenhagen is beautiful & #ic2s2 is amazing, but do you know what’s neither? 🚫 unintended bias towards marginalized people in hate speech detection systems. Presenting our poster (w/ @hide_yourself @clauwa @IAugenstein) today about how data augmentation can lead to such biases!
2
7
64
@johannes_wachs
Johannes Wachs
3 years
🚨 New working paper! Are Large Language Models a threat to digital public goods? @RMaria_drc N. Laurentsyeva and I find a 16% decrease in activity on @StackOverflow since release of #ChatGPT. Decrease is language dependent & reaches 25% by June: https://t.co/JnLs4PQggU Thread⬇️
17
122
375
@jiaxin_pei
Jiaxin Pei
3 years
How does annotator identity influence their judgments for NLP tasks? Collaborating with @Prolific, @david__jurgens and I created POPQUORN: a dataset with 45000 annotations on 4 NLP tasks by 1484 annotators with rich demographic information. Paper: https://t.co/cpI0NQtb9y 🧵 1/11
1
28
118
@SandraWachter5
Sandra Wachter
3 years
It takes 360.000 gallons of water/day to cool a data centre! Exploitation of workers, workplace automation, & mass discrimination of marginalised groups, these are REAL existential risks, not this latest PR stunt, my interview https://t.co/iLWMtysopB @Independent @oiioxford
independent.co.uk
Professor Sandra Wachter said the risk raised in the letter that AI could wipe out humanity is ‘science fiction fantasy’.
19
135
398