
Alberto Testoni
@alberto_testoni
Followers
241
Following
285
Media
16
Statuses
70
PostDoc @amsterdamumc / NLP4Health. Prev. @UvA_Amsterdam/@illc_amsterdam. PhD @UniTrento_DISI - @AmazonScience. MSc @cimec_unitrento, BSc @Unibo.
Amsterdam
Joined January 2012
RT @Cohere_Labs: Today, our team will share “From Tools to Teammates: Evaluating LLMs in Multi-Session Coding Interactions” at @aclmeeting!….
0
4
0
RT @ale_suglia: Excited to present PLAYPEN, an environment for learning through dialogue game self-play. Are you interested in LLM post-tra….
0
9
0
RT @CohereForAI: Can LLMs collaborate effectively over long-term interactions, like a human teammate, especially in coding tasks? 🤔. We int….
0
7
0
RT @mziizm: Excited to share insights from our new paper on evaluating LLMs in multi-session coding interactions! 📚📚📚. We introduce MEMORYC….
0
9
0
4/4 Our results reveal significant limitations and problems of overconfidence of state-of-the-art large V&L models. For more analyses on the role of the saliency features that guide the model selection and on CoT prompting, check out our paper! 🏓
arxiv.org
Ambiguity resolution is key to effective communication. While humans effortlessly address ambiguity through conversational grounding strategies, the extent to which current language models can...
0
0
2
1/4 Excited to share our latest paper “🏓 RAcQUEt: Unveiling the Dangers of Overlooked Referential Ambiguity in Visual LLMs”. Joint work with @barbara_plank and @raquel_dmg. #NLProc 🧵
1
1
5
RT @cimec_unitrento: 🔍 Papers being presented:. 1️⃣ Learning to Ask Informative Questions: Enhancing LLMs with Preference Optimization and….
0
2
0
RT @dmazzaccara: Flying to Miami! I will present “Learning to Ask Informative Questions: Enhancing LLMs with Preference Optimization and Ex….
0
5
0
RT @barbara_plank: PhD opportunities in Munich 🥳 - consider applying to MCML and reach out if you are interested in @MaiNLPlab research the….
0
13
0
2) "Don't Buy it! Reassessing the Ad Understanding Abilities of Contrastive Multimodal Models" work led by @anna_bavaresco_ with @raquel_dmg (poster 12/8 at 14:00 + oral 13/8 11:45)
arxiv.org
Image-based advertisements are complex multimodal stimuli that often contain unusual visual elements and figurative language. Previous research on automatic ad understanding has reported...
0
0
7
I am attending #ACL2024 in Bangkok with 2 papers on multimodal #NLProc 🇹🇭 🧵.1) "Naming, Describing, and Quantifying Visual Objects in Humans and LLMs" with @sandropezzelle and J. Sprott (poster 12/8 at 14:00 - with a fun game for attendees)
arxiv.org
While human speakers use a variety of different expressions when describing the same object in an image, giving rise to a distribution of plausible labels driven by pragmatic constraints, the...
2
2
19
5/5 ⚠️ We conclude that LLMs are not yet ready to systematically replace human judges in NLP, and caution against using LLMs for this purpose. JUDGE-BENCH is intended as a living benchmark, and you are welcome to contribute:
github.com
Contribute to dmg-illc/JUDGE-BENCH development by creating an account on GitHub.
0
2
12
1/5 📣 Excited to share “LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks”! 🚀 We introduce JUDGE-BENCH, a benchmark to investigate to what extent LLM-generated judgements align with human evaluations. #NLProc
4
24
97
RT @raquel_dmg: I'm looking for a last-minute emergency reviewer for a COLM submission related to generation with LLMs. Reviews need to be….
0
4
0
RT @ELLISforEurope: 22 researchers from 12 European institutions discussed future directions in open #LLMs and multimodal language technolo….
0
7
0