GenBench Profile Banner
GenBench Profile
GenBench

@GenBench

Followers
452
Following
102
Media
72
Statuses
193

State-of-the-art generalisation testing in NLP. Tag us for a RT of your NLP generalisation paper tweet!

Joined April 2022
Don't wanna be here? Send us removal request.
@GenBench
GenBench
1 year
The GenBench workshop is back! Do you work on generalisation (benchmarking) in #NLProc? Submit to the 2nd edition ( co-located with #EMNLP2024. We have a regular track and a โœจcollaborative benchmarking task (CBT)โœจ that's fully LLM-focused this year (1/6).
1
12
22
@GenBench
GenBench
8 months
That's a wrap! We (@glnmario, @christos_c, @_dieuwke_, @vernadankers, @khuyagbaatar_b, @a_kazemnejad & @ryandcotterell) thank all presenters, authors, reviewers and attendees!! The keynotes, the cats ๐Ÿ˜ป, the posters, the talks and the lively panel: it was fantastic๐Ÿ‘ ๐Ÿ”ฅ
Tweet media one
0
7
48
@GenBench
GenBench
8 months
RT @najoungkim: so proud of @HayleyRossLing for getting a best paper award at @GenBench this year!! ๐ŸŽ‰๐Ÿช…๐ŸŽ‰ I'm sure @TeaAnd_OrCoffee would beโ€ฆ.
0
6
0
@GenBench
GenBench
8 months
RT @kanishkamisra: Woohoo go tinlab! Congrats @HayleyRossLing @TeaAnd_OrCoffee @najoungkim!!.
0
2
0
@GenBench
GenBench
8 months
Congratulations!.
@najoungkim
Najoung Kim ๐Ÿซ 
8 months
so proud of @HayleyRossLing for getting a best paper award at @GenBench this year!! ๐ŸŽ‰๐Ÿช…๐ŸŽ‰ I'm sure @TeaAnd_OrCoffee would be too :) check out our paper and share if you think homemade cats are cats!
Tweet media one
0
0
3
@GenBench
GenBench
8 months
Congrats to all the authors!.
0
0
2
@GenBench
GenBench
8 months
And we also have an honourable mention!
Tweet media one
Tweet media two
0
0
1
@GenBench
GenBench
8 months
Best paper!
Tweet media one
Tweet media two
2
0
7
@GenBench
GenBench
8 months
Closing remarks and best paper award by @vernadankers
Tweet media one
1
2
12
@GenBench
GenBench
8 months
Come listen to the hot takes of our panelist in the Brickell room! Do we still need generalisation evaluation? ๐Ÿง #GenBench2024 #EMNLP2024
Tweet media one
0
3
15
@GenBench
GenBench
8 months
Still at the poster session? Come join us for keynote 3 by @sameer_!
Tweet media one
0
1
5
@GenBench
GenBench
8 months
Did you miss the GenBench poster session? Don't worry we've got you, here are (nearly all) posters! ๐Ÿ˜‰ #GenBench2024 #EMNLP2024 Next up: keynote by Sameer Singh at 3!
0
2
13
@GenBench
GenBench
8 months
Last spotlight presentation:. MMLU-SR: A Benchmark for Stress-Testing Reasoning Capability of Large Language Models. Unfortunately the authors couldn't make it, the work is kindly presented by their colleague Hengyi Wang ๐Ÿ™
Tweet media one
0
1
1
@GenBench
GenBench
8 months
Continuing with Bastian Bunzeck, presenting. The SlayQA benchmark of social reasoning: testing gender-inclusive generalization with neopronouns.
Tweet media one
1
1
3
@GenBench
GenBench
8 months
Next presenter is Jiwoo Lee, presenting. MultiPragEval: Multilingual Pragmatic Evaluation of Large Language Models.
Tweet media one
1
0
0
@GenBench
GenBench
8 months
Second up, Maxim Kurkim presenting. OmniDialog: A Multimodal Benchmark for Generalization Across Text, Visual, and Audio Modalities.
Tweet media one
1
0
1
@GenBench
GenBench
8 months
Spotlight time! Mirella Bueno on. MLissard: Multilingual Long and Simple Sequential Reasoning Benchmarks.
Tweet media one
1
1
3
@GenBench
GenBench
8 months
@kylelostat Plus more cat pictures! ๐Ÿ˜ป๐Ÿ˜ป
Tweet media one
0
0
1
@GenBench
GenBench
8 months
@kylelostat He got all the room snickering already at slide 3! ๐Ÿ˜.
1
0
2
@GenBench
GenBench
8 months
Join us for our second keynote by Olmo co-lead @kylelostat
Tweet media one
1
4
15
@GenBench
GenBench
8 months
Tweet media one
0
2
2