Anna Wegmann @anna_wegmann X Profile

Anna Wegmann

@anna_wegmann

Followers

221

Following

255

Media

16

Statuses

83

PhD candidate in NLP @UniUtrecht | Measuring language variation with ML/NLP | now mainly on 🦋 via https://t.co/gpk3bBPSrd

https://t.co/Q5z5x39KbP

Joined December 2019

Anna Wegmann

@anna_wegmann

1 year

Interested in whether people👂 each other in a conversation? 🚨New paper accepted at #EMNLP2024 with @tyskevdb and @dongng about detecting paraphrases between speakers 🤖 Detect? https://t.co/pngaXQTwfC 📊 Analyze? https://t.co/eIbqRzJICi 📄 Read? https://t.co/iB5okZnNks

0

1

14

Ben Litterer

@BenLitterer

1 year

Podcasts are a popular medium, but data for computational research is limited! We introduce the Structured Podcast Research Corpus (SPoRC - https://t.co/iBd8ZfUSmc), a large, multimodal dataset of English podcasts 🧵 https://t.co/UMej1aPzCv

arxiv.org

Podcasts provide highly diverse content to a massive listener base through a unique on-demand modality. However, limited data has prevented large-scale computational analysis of the podcast...

3

22

73

Miriam Schirmer

@MiriamSchirmer

1 year

Heading to #EMNLP2024 in Miami! ✈️🏝️ Excited to connect and grab a coffee with anyone interested in #NLP for #ViolenceDetection and #MentalHealth. Let’s chat! #CSS

0

1

4

Anna Wegmann

@anna_wegmann

1 year

Semantics Track, Riverfront hall

0

Anna Wegmann

@anna_wegmann

1 year

Come talk to me and @dongng on Wednesday, Poster Session E from 4.00-5.30PM about paraphrases in dialog. See you at #EMNLP2024!

Anna Wegmann

@anna_wegmann

1 year

Interested in whether people👂 each other in a conversation? 🚨New paper accepted at #EMNLP2024 with @tyskevdb and @dongng about detecting paraphrases between speakers 🤖 Detect? https://t.co/pngaXQTwfC 📊 Analyze? https://t.co/eIbqRzJICi 📄 Read? https://t.co/iB5okZnNks

2

1

9

Dr. Karen Ullrich

@karen_ullrich

1 year

#Tokenization is undeniably a key player in the success story of #LLMs but we poorly understand why. I want to highlight progress we made in understanding the role of tokenization, developing the core incidents and mitigating its problems. 🧵👇

15

94

601

Dustin Wright

@dustin_wright37

1 year

Curious about using LLMs to simulate conversations? Check out this big collaborative project we did @umsi ! #NLProc

Anders Giovanni Møller

@AndersGiovanni

1 year

👩🏼‍💻 Real or Robotic? 🤖 Can LLMs accurately simulate qualities of human responses in dialogue? Human conversations with LLMs are great for assessing the capabilities of LLMs. But having lots of folks chat with LLMs is challenging (💰⏳🕵️). Could we have another LLM *simulate*

0

2

14

Suzan Verberne 🤹‍♀️

@suzan

2 years

The 34th edition of Computational Linguistics in The Netherlands (the Dutch-Belgian #NLProc conference) will be held @UniLeiden on August 30. The list of accepted abstracts is on the website and registration is open for everyone interested 💬 #clin34 https://t.co/E10OEojtLx

clin34.leidenuniv.nl

2

8

16

Dustin Wright

@dustin_wright37

2 years

🔎What values and opinions do we see when we use 6 LLMs to generate 156,000 responses to 62 political propositions? Our paper "Revealing Fine-Grained Values and Opinions in Large Language Models" answers this. 📰 https://t.co/QvDcIiDzN7 #NLProc #LLMs

3

13

71

Hua Shen✨

@huashen218

2 years

📢Is current “human-AI alignment” research clarified and comprehensive? 🤔 We systematically reviewed 400+ papers across HCI, NLP, and ML to develop a framework for 👫<>🤖"Bidirectional Human-AI Alignment", encompassing the dual paths of “Aligning AI to Human” and “Aligning Human

5

64

276

Lechen Zhang

@leczhang

2 years

[1/13] LLMs are increasingly skilled at mimicking human agents in social settings, but have they truly developed a consistent personality? Check out our work accepted to #NAACL2024 where we question the reliability of persona tests applied to LLMs. Arxiv: https://t.co/GbZrtemmS8

1

11

44

Dustin Wright

@dustin_wright37

2 years

📰New preprint! w/ @christian_igel @raghavian📰 BMRS: Bayesian Model Reduction for Structured Pruning Structured pruning makes neural nets efficient by removing full structures (e.g. neurons). But how do we know what to prune? Here's our approach: https://t.co/epxvZFtOJO

1

4

14

Debora Nozza

@debora_nozza

2 years

📢 JOBS📢 Come work with us @MilaNLProc! Looking for 2 POSTDOCS (two-year positions w/extension) to work on personalized and subjective approaches to #NLProc. Deadline: May 30 2024 Start date: from Sep 2024 Link:

jobmarket.unibocconi.eu

Recruiting, Faculty, Post-doc Grant, Collaboration Contracts

1

23

44

Jannis Androutsopoulos

@Jann1s

2 years

Next up in the DiLCo Lecture Series 2024: Christoph Purschke @questoph presenting his multi-method approach to "Monitoring the public debate on multilingualism in Luxembourg". Thursday, April 25 at 4 pm CEST, open access. Registration: https://t.co/mQo8QRo9k7 @unihh

0

3

10

Meera Desai

@MeeraDesai18

2 years

Thread on our new paper!

Dallas Card

@dallascard

2 years

I'm excited to share that the journal version of our paper, "An archival perspective on pretraining data", is now available (open access) from Patterns! This project was led by @MeeraDesai18, along with @IrenePasquetto, @az_jacobs, and myself 1/n

4

6

32

Johannes Wachs

@johannes_wachs

3 years

@natfriedman Some colleagues and I have been studying the impact of ChatGPT on SO using data on posts, not views: https://t.co/JnLs4PQggU Besides a big decrease after ChatGPT, we observe a completely flat 2022, and earlier a big bump in activity during early Covid.

2

11

105

Indira Sen

@indiiigosky

3 years

Copenhagen is beautiful & #ic2s2 is amazing, but do you know what’s neither? 🚫 unintended bias towards marginalized people in hate speech detection systems. Presenting our poster (w/ @hide_yourself @clauwa @IAugenstein) today about how data augmentation can lead to such biases!

2

7

64

Johannes Wachs

@johannes_wachs

3 years

🚨 New working paper! Are Large Language Models a threat to digital public goods? @RMaria_drc N. Laurentsyeva and I find a 16% decrease in activity on @StackOverflow since release of #ChatGPT. Decrease is language dependent & reaches 25% by June: https://t.co/JnLs4PQggU Thread⬇️

17

122

375

Jiaxin Pei

@jiaxin_pei

3 years

How does annotator identity influence their judgments for NLP tasks? Collaborating with @Prolific, @david__jurgens and I created POPQUORN: a dataset with 45000 annotations on 4 NLP tasks by 1484 annotators with rich demographic information. Paper: https://t.co/cpI0NQtb9y 🧵 1/11

1

28

118

Sandra Wachter

@SandraWachter5

3 years

It takes 360.000 gallons of water/day to cool a data centre! Exploitation of workers, workplace automation, & mass discrimination of marginalised groups, these are REAL existential risks, not this latest PR stunt, my interview https://t.co/iLWMtysopB @Independent @oiioxford

independent.co.uk

Professor Sandra Wachter said the risk raised in the letter that AI could wipe out humanity is ‘science fiction fantasy’.

19

135

398