Cesare Spinoso-Di Piano @cesare_spinoso X Profile

Cesare Spinoso-Di Piano

@cesare_spinoso

Followers

44

Following

7

Media

7

Statuses

22

Hello! My name is Cesare (pronounced Chez-array or Chez). I'm a PhD student at McGill and Mila working on pragmatics and NLP for science.

Montréal, Québec

Joined September 2021

Don't wanna be here? Send us removal request.

Cesare Spinoso-Di Piano

@cesare_spinoso

11 days

RT @mirandrom: @cesare_spinoso @McGill_NLP @Mila_Quebec @aclmeeting

0

1

0

Cesare Spinoso-Di Piano

@cesare_spinoso

18 days

UPDATE: This will be in exhibition hall X5, poster board 55!.

0

3

Cesare Spinoso-Di Piano

@cesare_spinoso

20 days

RT @ziling_cheng: What do systematic hallucinations in LLMs tell us about their generalization abilities?. Come to our poster at #ACL2025 o….

0

7

0

Cesare Spinoso-Di Piano

@cesare_spinoso

20 days

How can we use models of cognition to help LLMs interpret figurative language (irony, hyperbole) in a more human-like manner? Come to our #ACL2025NLP poster on Wednesday at 11AM (exhibit hall - exact location TBA) to find out! @McGill_NLP @Mila_Quebec @aclmeeting

2

11

26

Cesare Spinoso-Di Piano

@cesare_spinoso

20 days

RT @ljyflores38: ⏰ Sharing our work on calibrated confidence scores at #ACL2025NLP, July 29 – 4PM Vienna time (Virtual)!. 📰 Improving the C….

0

9

0

Cesare Spinoso-Di Piano

@cesare_spinoso

2 months

Thanks to collaborators @DavidAustinCS , @PiantanidaPablo and Jackie Cheung. We also received some amazing feedback from the @Mila_Quebec @McGill_NLP community! And thanks to @_jennhu, Justine Kao and @tsvilodub for sharing their datasets.

0

4

Cesare Spinoso-Di Piano

@cesare_spinoso

2 months

Other cool findings:.1. We prove that (RSA)^2 is more expressive than QUD-based RSA. 2. Naively applying RSA to LLMs leads to probability 𝘴𝘱𝘳𝘦𝘢𝘥𝘪𝘯𝘨, not 𝘯𝘢𝘳𝘳𝘰𝘸𝘪𝘯𝘨! Are there better ways to use RSA & LLMs?.3. We design a rhetorical strategy clustering algorithm!.

1

0

4

Cesare Spinoso-Di Piano

@cesare_spinoso

2 months

What about LLMs? We integrate LLMs within (RSA)^2 and test them on a new dataset, PragMega+. We show that LLMs augmented with (RSA)^2 produce probability distributions which are more aligned with human expectations.

1

0

4

Cesare Spinoso-Di Piano

@cesare_spinoso

2 months

We test (RSA)^2 on two existing figurative language datasets: hyperbolic number expressions (e.g. “This kettle costs 1000$”) and ironic utterances about the weather (e.g. “The weather is amazing” during a Montreal blizzard). We obtain human-like meaning distributions!

1

0

4

Cesare Spinoso-Di Piano

@cesare_spinoso

2 months

Introducing (RSA)^2: a 𝘳𝘩𝘦𝘵𝘰𝘳𝘪𝘤𝘢𝘭-𝘴𝘵𝘳𝘢𝘵𝘦𝘨𝘺-𝘢𝘸𝘢𝘳𝘦 probabilistic framework of figurative language. In (RSA)^2 one listener will interpret language literally, another will interpret language ironically. Marginalization produces a distribution over meanings.

1

0

4

Cesare Spinoso-Di Piano

@cesare_spinoso

2 months

A blizzard is raging in Montreal when your friend says “Wow, the weather is amazing!” Humans easily interpret irony, while LLMs struggle at it. We propose a 𝘳𝘩𝘦𝘵𝘰𝘳𝘪𝘤𝘢𝘭-𝘴𝘵𝘳𝘢𝘵𝘦𝘨𝘺-𝘢𝘸𝘢𝘳𝘦 probabilistic framework as a solution. @ #acl2025

1

11

Grok

@grok

6 days

Generate videos in just a few seconds. Try Grok Imagine, free for a limited time.

393

665

3K

Cesare Spinoso-Di Piano

@cesare_spinoso

2 months

RT @Icannos: Thrilled to report that our work won Area Chair's Award!.You can find a nice high level explanation of our work!.Be sure to st….

0

4

0

Cesare Spinoso-Di Piano

@cesare_spinoso

2 months

RT @ian_porada: Do existing multi-dataset evaluations allow us to draw meaningful, reliable conclusions about coreference resolution models….

0

9

0

Cesare Spinoso-Di Piano

@cesare_spinoso

2 months

RT @bonadossou: 🚨 Thrilled to share that two of my papers have been accepted to @aclmeeting SRW 2025! 🧠🌍.They tackle key challenges in low-….

0

6

0

Cesare Spinoso-Di Piano

@cesare_spinoso

2 months

RT @oriern1: 🧵 New paper at Findings #ACL2025 @aclmeeting!.Not all documents are processed equally well. Some consistently yield poor resul….

0

12

0

Cesare Spinoso-Di Piano

@cesare_spinoso

2 months

RT @ljyflores38: ❗️ Confidence in text generation is tricky, as models can be confident in many, valid answers. 👀 Can we account for this….

0

7

0

Cesare Spinoso-Di Piano

@cesare_spinoso

2 months

RT @ziling_cheng: Do LLMs hallucinate randomly? Not quite. Our #ACL2025 (Main) paper shows that hallucinations under irrelevant contexts fo….

0

24

0

Cesare Spinoso-Di Piano

@cesare_spinoso

9 months

RT @ian_porada: LLMs that "solve" challenge sets might still be relatively inaccurate at resolving diverse, attested instances of the same….

0

3

0

Cesare Spinoso-Di Piano

@cesare_spinoso

9 months

Very happy to have presented my poster at the CustomNLP4U Workshop at EMNLP 2024 (@emnlpmeeting @customnlp4u).@McGill_NLP @Mila_Quebec #EMNLP

0

3

14

Cesare Spinoso-Di Piano

@cesare_spinoso

9 months

RT @jad_kabbara: Social Impact Award celebration with cool EMNLP folks! 🏆💫🎉. #EMNLP2024 @emnlpmeeting

0

5

0