Cesare Spinoso-Di Piano Profile
Cesare Spinoso-Di Piano

@cesare_spinoso

Followers
44
Following
7
Media
7
Statuses
22

Hello! My name is Cesare (pronounced Chez-array or Chez). I'm a PhD student at McGill and Mila working on pragmatics and NLP for science.

Montréal, Québec
Joined September 2021
@cesare_spinoso
Cesare Spinoso-Di Piano
11 days
Tweet media one
0
1
0
@cesare_spinoso
Cesare Spinoso-Di Piano
18 days
UPDATE: This will be in exhibition hall X5, poster board 55!
0
0
3
@cesare_spinoso
Cesare Spinoso-Di Piano
20 days
RT @ziling_cheng: What do systematic hallucinations in LLMs tell us about their generalization abilities? Come to our poster at #ACL2025 o…
0
7
0
@cesare_spinoso
Cesare Spinoso-Di Piano
20 days
How can we use models of cognition to help LLMs interpret figurative language (irony, hyperbole) in a more human-like manner? Come to our #ACL2025NLP poster on Wednesday at 11AM (exhibit hall - exact location TBA) to find out! @McGill_NLP @Mila_Quebec @aclmeeting
Tweet media one
2
11
26
@cesare_spinoso
Cesare Spinoso-Di Piano
20 days
RT @ljyflores38: ⏰ Sharing our work on calibrated confidence scores at #ACL2025NLP, July 29 – 4PM Vienna time (Virtual)! 📰 Improving the C…
0
9
0
@cesare_spinoso
Cesare Spinoso-Di Piano
2 months
Thanks to collaborators @DavidAustinCS, @PiantanidaPablo and Jackie Cheung. We also received some amazing feedback from the @Mila_Quebec @McGill_NLP community! And thanks to @_jennhu, Justine Kao and @tsvilodub for sharing their datasets.
0
0
4
@cesare_spinoso
Cesare Spinoso-Di Piano
2 months
Other cool findings:
1. We prove that (RSA)^2 is more expressive than QUD-based RSA.
2. Naively applying RSA to LLMs leads to probability 𝘴𝘱𝘳𝘦𝘢𝘥𝘪𝘯𝘨, not 𝘯𝘢𝘳𝘳𝘰𝘸𝘪𝘯𝘨! Are there better ways to use RSA & LLMs?
3. We design a rhetorical strategy clustering algorithm!
1
0
4
@cesare_spinoso
Cesare Spinoso-Di Piano
2 months
What about LLMs? We integrate LLMs within (RSA)^2 and test them on a new dataset, PragMega+. We show that LLMs augmented with (RSA)^2 produce probability distributions which are more aligned with human expectations.
Tweet media one
1
0
4
@cesare_spinoso
Cesare Spinoso-Di Piano
2 months
We test (RSA)^2 on two existing figurative language datasets: hyperbolic number expressions (e.g. “This kettle costs 1000$”) and ironic utterances about the weather (e.g. “The weather is amazing” during a Montreal blizzard). We obtain human-like meaning distributions!
Tweet media one
Tweet media two
1
0
4
@cesare_spinoso
Cesare Spinoso-Di Piano
2 months
Introducing (RSA)^2: a 𝘳𝘩𝘦𝘵𝘰𝘳𝘪𝘤𝘢𝘭-𝘴𝘵𝘳𝘢𝘵𝘦𝘨𝘺-𝘢𝘸𝘢𝘳𝘦 probabilistic framework of figurative language. In (RSA)^2, one listener interprets language literally while another interprets it ironically. Marginalizing over these listeners produces a distribution over meanings.
Tweet media one
1
0
4
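The marginalization idea in the (RSA)^2 introduction above can be illustrated with a toy sketch. This is not the model from the paper: the two meanings, the per-strategy listener probabilities, and the strategy prior are all made-up numbers, and `marginal_listener` is a hypothetical helper name used only for illustration.

```python
# Toy illustration only; all numbers and names below are assumptions,
# not values or code from the (RSA)^2 paper.
MEANINGS = ["good", "bad"]

# Hypothetical P(meaning | utterance="the weather is amazing", strategy).
LISTENERS = {
    "literal": {"good": 0.9, "bad": 0.1},
    "ironic": {"good": 0.1, "bad": 0.9},
}


def marginal_listener(strategy_prior):
    """Average the per-strategy listeners, weighted by a prior over strategies."""
    total = sum(strategy_prior.values())  # normalize in case weights do not sum to 1
    result = {m: 0.0 for m in MEANINGS}
    for strategy, weight in strategy_prior.items():
        for m in MEANINGS:
            result[m] += (weight / total) * LISTENERS[strategy][m]
    return result


# During a blizzard, assume irony is the more plausible strategy.
blizzard = marginal_listener({"literal": 0.2, "ironic": 0.8})
print(blizzard)
```

With this assumed prior, most of the probability mass lands on the "bad" weather meaning, which matches the intuition behind the blizzard example.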
@cesare_spinoso
Cesare Spinoso-Di Piano
2 months
A blizzard is raging in Montreal when your friend says “Wow, the weather is amazing!” Humans easily interpret irony, while LLMs struggle with it. We propose a 𝘳𝘩𝘦𝘵𝘰𝘳𝘪𝘤𝘢𝘭-𝘴𝘵𝘳𝘢𝘵𝘦𝘨𝘺-𝘢𝘸𝘢𝘳𝘦 probabilistic framework as a solution. @ #acl2025
Tweet media one
1
11
11
@cesare_spinoso
Cesare Spinoso-Di Piano
2 months
RT @Icannos: Thrilled to report that our work won Area Chair's Award! You can find a nice high level explanation of our work! Be sure to st…
0
4
0
@cesare_spinoso
Cesare Spinoso-Di Piano
2 months
RT @ian_porada: Do existing multi-dataset evaluations allow us to draw meaningful, reliable conclusions about coreference resolution models…
0
9
0
@cesare_spinoso
Cesare Spinoso-Di Piano
2 months
RT @bonadossou: 🚨 Thrilled to share that two of my papers have been accepted to @aclmeeting SRW 2025! 🧠🌍 They tackle key challenges in low-…
0
6
0
@cesare_spinoso
Cesare Spinoso-Di Piano
2 months
RT @oriern1: 🧵 New paper at Findings #ACL2025 @aclmeeting! Not all documents are processed equally well. Some consistently yield poor resul…
0
12
0
@cesare_spinoso
Cesare Spinoso-Di Piano
2 months
RT @ljyflores38: ❗️ Confidence in text generation is tricky, as models can be confident in many, valid answers. 👀 Can we account for this…
0
7
0
@cesare_spinoso
Cesare Spinoso-Di Piano
2 months
RT @ziling_cheng: Do LLMs hallucinate randomly? Not quite. Our #ACL2025 (Main) paper shows that hallucinations under irrelevant contexts fo…
0
24
0
@cesare_spinoso
Cesare Spinoso-Di Piano
9 months
RT @ian_porada: LLMs that "solve" challenge sets might still be relatively inaccurate at resolving diverse, attested instances of the same…
0
3
0
@cesare_spinoso
Cesare Spinoso-Di Piano
9 months
Very happy to have presented my poster at the CustomNLP4U Workshop at EMNLP 2024 (@emnlpmeeting @customnlp4u). @McGill_NLP @Mila_Quebec #EMNLP
Tweet media one
0
3
14
@cesare_spinoso
Cesare Spinoso-Di Piano
9 months
RT @jad_kabbara: Social Impact Award celebration with cool EMNLP folks! 🏆💫🎉 #EMNLP2024 @emnlpmeeting
Tweet media one
0
5
0