Sebastin Santy
@SebastinSanty
Followers
931
Following
9K
Media
25
Statuses
632
Multilingual data contains a wealth of information; yet, it is often discarded when training vision-language models. Our work (#NeurIPS spotlight) shows that enhancing the diversity of data origins can improve performance on many standard vision tasks https://t.co/suVmBjnVSv 1/n
5
30
140
🤔 Can human-AI collaboration facilitate self-guided mental health interventions❓ At #CHI2024 🏖️, I'm excited to present our work which leverages feedback from 120K+ @MentalHealthAm users to inform the design of a self-guided cognitive restructuring tool based on human-language
0
9
51
📢 New Paper! Ever wondered why transformers are able to capture hierarchical structure of human language without incorporating an explicit 🌲 structure in their architecture? In this work we delve deep into understanding hierarchical generalization in transformers. (1/n)
6
62
323
🏅Our #CHI2024 paper received an Honorable Mention Award!!🏅 We examine HIV clients' views on data security & privacy for electronic and mobile data collection in Malawi. Very thankful for my mentors and the collaboration that made this work happen✨ https://t.co/NtwbSkcT5v
📢Excited to share our #CHI2024 paper: HIV Client Perspectives on Digital Health in Malawi We examine HIV clients’ views of data security & privacy. Our work takes place in Malawi, a low- and middle-income country (LMIC) with high HIV incidence🧵 https://t.co/Wa4g5JKEMn
5
5
42
New work with @andrew_ilyas and @aleks_madry on tracing predictions back to individual components (conv filters, attn heads) in the model! Paper: https://t.co/zEJ3oV0wrF Thread: 👇
arxiv.org
How does the internal computation of a machine learning model transform inputs into predictions? In this paper, we introduce a task called component modeling that aims to address this question....
How do model components (conv filters, attn heads) collectively transform examples into predictions? Is it possible to somehow dissect how *every* model component contributes to a prediction? w/ @harshays_ @andrewilyas, we introduce a framework for tackling this question!
1
10
48
What challenges do developers face while using AI programming assistants like GitHub Copilot? 🤖🤔 Check out my #ICSE2024 paper (w/ @cyyang3_u and @bradamyers)! I'm presenting this work today at 4:15PM in the Fernando Pessoa room. See you there 🤗 https://t.co/GtoIlURgeN
3
21
110
NLPositionality: Characterizing Design Biases of Datasets and Models – Machine Learning Blog | ML@CMU | Carnegie Mellon University
blog.ml.cmu.edu
TLDR; Design biases in NLP systems, such as performance differences for different populations, often stem from their creator’s positionality, i.e., views and lived experiences shaped by identity and...
1
3
15
Ever noticed how Pixar adapts movies for international markets? The beloved newscaster in Zootopia is a jaguar in Brazil, a panda in China, a koala in Australia … While machine translation (MT) has only dealt with language in speech/text thus far, we extend the scope of MT to
11
38
248
How accurately can you make an LLM reverse strings? Write prompts to solve problems. Compare results against test cases. Try LeetPrompt out! https://t.co/4Z8ZmjBV33
0
3
5
NLP datasets & models are biased. Whose perspectives do they align with the most? Whose are left behind? We answer these questions via 16K+ annotations from 1K+ annotators from 87 countries. Learn more about NLPositionality in @jennytliang's post: https://t.co/XxrLDUNHKp
blog.ml.cmu.edu
TLDR; Design biases in NLP systems, such as performance differences for different populations, often stem from their creator’s positionality, i.e., views and lived experiences shaped by identity and...
1
4
16
We made a QLoRA promo video for @UWITNews. It is a very nice summary of the motivation behind QLoRA and what the environment was like to develop this research. @uwcse is a perfect place for doing such research! Article: https://t.co/quWj7mTiHC Youtube:
3
42
250
People are increasingly using LLMs like #ChatGPT for #mentalhealth support. But do we truly understand their capabilities and limitations❓ Introducing 🌟BOLT🌟, a computational framework to assess the behavior of LLMs🤖 when employed as therapists💬 https://t.co/cqczuGRH8h 🧵
3
23
95
LMs are used to process text from many topics, styles, dialects, etc., but how well do they do? 📈 Evaluating perplexity on just one corpus like C4 doesn't tell the whole story 📉 ✨📃✨ We introduce Paloma, a benchmark of 585 domains from NY Times to r/depression on Reddit.
5
61
238
A big shout out to LTI PhD student @AdithyaPratapa, who received the EMNLP 20203 Outstanding Paper Award for Background Summarization of Event Timelines. https://t.co/DCSU3w6YnJ. Congtratulations!
0
11
82
🏆 SODA won the outstanding paper award at #EMNLP2023 ! What an incredible journey that was! I'm grateful for this journey with you @jmhessel @liweijianglw @PeterWestTM @GXiming @YoungjaeYu3 @peizNLP @Ronan_LeBras @malihealikhani @gunheekim @MaartenSap @YejinChoinka & @allen_ai💙
Who knew water-cooler chitchat could be so useful? AI2's @hyunw__kim & collaborators did, which is why they're introducing SODA, the first million-scale high-quality social chitchat dataset. Learn about it on the blog: https://t.co/Xb5II4u62u
#EMNLP2023
13
20
163
It’s also been energizing to see many examples of HCI+NLP work at #EMNLP2023, from the awesome tutorial by @SebastinSanty @tongshangwu @DiyiYang on day 1 up to the last session of the conference—yet another research direction with blue skies ahead!
1
1
9
I’m thrilled that this Human-Centered MT paper was recognized with an outstanding paper award at #EMNLP2023. Congratulations to lead authors Nikita Mehandru (@ucberkeley iSchool) and @swetaagrawal20 (@umdclip @istecnico) for making this interdisciplinary collaboration a success!
2/8 "Physician Detection of Clinical Harm in Machine Translation: Quality Estimation Aids in Reliance and Backtranslation Identifies Critical Errors" with @nikita_mehandru @swetaagrawal20 @elainekhoong Niloufar Salehi among others https://t.co/zAkgavVJhA
https://t.co/kwYqSkyTqU
4
20
114
✈️12/10-12/16, I’ll be at #NeurIPS2023 and look forward to connecting. Let’s meet up and chat about research and more!🙂☕️ 📢I am also looking for PhD students to join my group at Rice CS @RiceCompSci in Fall 2024. Please DM if you want to chat at NeurIPS!
1
8
68