omer goldman
@omerNLP
Followers
385
Following
393
Media
32
Statuses
269
PhD student doing NLP at @biunlp. If I'm not here I'm at @omagolda following the news
Joined August 2019
Wanna check how well a model can share knowledge between languages? Of course you do! 🤩 But can you do it without access to the model’s weights? Now you can with ECLeKTic 🤯
1
16
43
Producing reasoning texts boosts the capabilities of AI models, but do we humans correctly understand these texts? Our latest research suggests that we do not. This highlights a new angle on the "Are they transparent?" debate: they might be, but we misinterpret them. 🧵
8
29
141
First contributed talks are under way for Phonology, Morphology, and Syntax! Hall M1 Level 1. Schedule here: https://t.co/c3BDfhP605
#ACL2025NLP #CoNLL2025
0
3
7
🚨 New paper alert! 🚨 We propose an IQ Test for LLMs — a new way to evaluate models that goes beyond benchmarks and uncovers their core skills. Think: 🧠🤖 psychometrics for LLMs. 👇 (1/6)
1
14
29
Are you still around Vienna? Come hear about a new morphological task at CoNLL at ~11:20 (hall M.1) @rtsarfaty
1
3
10
whlie you drink you coffee at 10am, stop by the google booth to talk about cross-lingual transfer 🤓 https://t.co/KQsRC86ey1
Wanna check how well a model can share knowledge between languages? Of course you do! 🤩 But can you do it without access to the model’s weights? Now you can with ECLeKTic 🤯
0
1
12
Will be especially happy to talk to UK-based people as I'm moving in a couple of months to @CambridgeLTL
0
0
4
On my way to #ACL2025 ! 🤗 Find me talking about crosslingual transfer at the Google booth, morphology - at @conll_conf , and tokenization - just at the coffee breaks
6
0
13
🚨 RAG is a popular approach but what happens when the retrieved sources provide conflicting information?🤔 We're excited to introduce our paper: “DRAGged into CONFLICTS: Detecting and Addressing Conflicting Sources in Search-Augmented LLMs”🚀 A thread 🧵👇
2
14
36
🤯 MIND-BLOWN! A new paper just SHATTERED everything we thought we knew about AI reasoning! This is paradigm-shifting. A MUST-READ. Full breakdown below 👇 🧵 1/23
107
244
2K
I really wanted to see the review details. It's clearly above the acceptance threshold of findings for me. When you fall into the cycle of rejection from ARR, it's hard to come out.
0
1
19
RefVNLI Towards Scalable Evaluation of Subject-driven Text-to-image Generation
1
52
135
English is full of “lexical gaps,” words that are implied to exist but don’t, because we borrowed a bunch of words from Latin but not other, related words. Somebody made a chart to show it↓
453
2K
19K
our research featured in a Google Research blog post! @Uri_Shaham @mataneyal1
Introducing ECLeKTic, a new benchmark for Evaluating Cross-Lingual Knowledge Transfer in LLMs. It uses a closed-book QA task, where models must rely on internal knowledge to answer questions based on information captured only in a single language. More → https://t.co/hCSE2hGGJR
1
1
6
Evaluate & improve your favorite model with ECLeKTic! 📜paper: https://t.co/uzbDbFzCNj 📊data: https://t.co/pyTz8SkrYA
@Uri_Shaham Dan Malkin @mataneyal1 Sivan Eiger Avinatan Hassidim @ymatias @maynez_joshua @AdiMayrav @jasonriesa @shrutirij @laurarimell Idan Szpektor @rtsarfaty
kaggle.com
Evaluate Cross-Lingual Knowledge Transfer with this multilingual QA dataset
0
0
3
In line with previous works, we also found some evidence that a shared script eases the transfer of knowledge in ECLeKTic. Note that Indonesian is part of the “Latin script” block although it’s not genealogically related
1
0
1