Roy Schwartz
@royschwartzNLP
Followers: 3K · Following: 434 · Media: 20 · Statuses: 250
Senior Lecturer at @CseHuji. #NLPROC
Joined February 2016
The focus on SOTA has caused a dramatic increase in the cost of AI, leading to environmental tolls and inclusiveness issues. We advocate research on efficiency in addition to accuracy (#greenai). Work w/ @JesseDodge @nlpnoah and @etzioni at @allen_ai
https://t.co/ZHIFMwxnZ8
0
46
155
We're proud of our team's 11 papers accepted to #EMNLP2025! See you next week in Suzhou!
0
11
15
Excited to share: our paper "On Pruning State-Space LLMs" was accepted to EMNLP 2025! Preprint: https://t.co/8TD56aroDc Code: https://t.co/ofi9ZxPDOT Model: Smol-Mamba-1.9B https://t.co/AIq2XOn0dS w/ @MichaelHassid & @royschwartzNLP (HUJI) #Mamba #ModelCompression
2
4
14
[Quoted post in Hebrew; the original text was corrupted during extraction and could not be recovered or translated.]
49
53
1K
The longer a reasoning LLM thinks, the more likely it is to be correct, right? Apparently not. Presenting our paper: "Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning". Link: https://t.co/Zsp3BD0TU5 1/n
7
37
114
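To make the intuition concrete, here is a minimal sketch of a "prefer the shortest chain" selection rule, assuming the method samples several reasoning chains and votes among the shortest ones; the helper name answer_from_shortest and the example chains are hypothetical illustrations, not taken from the paper.

```python
# Illustrative sketch (not the paper's exact method): given k sampled reasoning
# chains for the same question, take the majority answer among the m shortest
# chains instead of majority-voting over all k of them.
from collections import Counter

def answer_from_shortest(chains, m=1):
    """chains: list of (reasoning_text, final_answer) pairs sampled from an LLM.
    Returns the majority answer among the m shortest chains."""
    shortest = sorted(chains, key=lambda c: len(c[0]))[:m]
    votes = Counter(answer for _, answer in shortest)
    return votes.most_common(1)[0][0]

if __name__ == "__main__":
    # Hypothetical chains for "17 * 24 = ?": longer chains are not always better.
    sampled = [
        ("17*24 = 17*20 + 17*4 = 340 + 68 = 408", "408"),
        ("Let me think step by step ... (very long derivation) ... so 398", "398"),
        ("17*24: 10*24=240, 7*24=168, 240+168=408", "408"),
    ]
    print(answer_from_shortest(sampled, m=2))  # -> "408"
```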
Heading to @iclr_conf! "Tokens→Words" shows how LLMs build full-word representations from sub-word tokens and offers a tool for vocab expansion. See our #ICLR2025 poster on 26.4, 15:00-17:30. https://t.co/yXvRvjjr0E https://t.co/mTBlktKerQ
Paper release: Ever wondered how LLMs understand words when all they see are tokens? Our latest study uncovers how LLMs reconstruct full words from sub-word tokens, even when misspelled or previously unseen. https://t.co/Ur9eBn8yBO (preprint) [1/7]
0
6
40
Ever tried generating an image from a prompt but ended up with unexpected outputs? Check out our new paper #FollowTheFlow - tackling T2I issues like bias, failed binding, and leakage from the textual encoding side! https://t.co/jTNgec28hw
https://t.co/orB0Y7iW1S [1/7]
2
19
61
New Paper Drop! "On Pruning SSM LLMs": we check the prunability of Mamba-based LLMs. We also release Smol2-Mamba-1.9B, a Mamba-based LLM distilled from Smol2-1.7B, on Hugging Face: https://t.co/AIq2XOny3q Read more: https://t.co/8TD56arWsK
@royschwartzNLP @MichaelHassid
0
3
10
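As a rough illustration of what "prunability" means in practice, here is a generic unstructured magnitude-pruning pass in PyTorch. It is a sketch of the general technique only, applied to ordinary linear layers on a toy model; the paper's actual procedure for Mamba/SSM weights may differ.

```python
# Generic unstructured magnitude pruning: zero out the smallest-magnitude
# weights of every Linear layer. Illustration only, not the paper's method.
import torch
import torch.nn as nn

def magnitude_prune_(model: nn.Module, sparsity: float = 0.5) -> None:
    """Zero out the `sparsity` fraction of smallest-magnitude Linear weights in place."""
    for module in model.modules():
        if isinstance(module, nn.Linear):
            w = module.weight.data
            k = int(w.numel() * sparsity)
            if k == 0:
                continue
            threshold = w.abs().flatten().kthvalue(k).values
            w[w.abs() <= threshold] = 0.0

if __name__ == "__main__":
    toy = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 16))
    magnitude_prune_(toy, sparsity=0.5)
    zeros = sum((m.weight == 0).sum().item() for m in toy if isinstance(m, nn.Linear))
    total = sum(m.weight.numel() for m in toy if isinstance(m, nn.Linear))
    print(f"sparsity ~= {zeros / total:.2f}")
```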
Looking for emergency reviewers for October ARR. If someone can complete a review *today* (Sunday, Nov. 24), please DM me. I have papers on efficiency, interpretability, and speech.
0
1
3
Giving #Bluesky a shot. Same handle. Hope to see you there!
0
0
2
In which layers does information flow from previous tokens to the current token? Presenting our new @BlackboxNLP paper: "Attend First, Consolidate Later: On the Importance of Attention in Different LLM Layers" https://t.co/aNO7fKxXix 1/n
1
20
69
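For intuition about the kind of intervention such a layer-wise analysis relies on, here is a toy causal-attention step that can be restricted so each token attends only to itself, cutting information flow from previous tokens at a chosen layer. This is an assumed illustration, not the paper's code.

```python
# Toy sketch: causal self-attention with an optional "self-only" restriction
# that blocks information flow from previous tokens (assumed illustration).
import torch
import torch.nn.functional as F

def attention(q, k, v, block_prev_tokens: bool):
    """q, k, v: (seq, dim). Causal attention; optionally each position
    attends only to itself (no information from previous tokens)."""
    seq = q.size(0)
    scores = q @ k.T / q.size(-1) ** 0.5
    mask = torch.tril(torch.ones(seq, seq, dtype=torch.bool))
    if block_prev_tokens:
        mask = torch.eye(seq, dtype=torch.bool)  # self-attention only
    scores = scores.masked_fill(~mask, float("-inf"))
    return F.softmax(scores, dim=-1) @ v

if __name__ == "__main__":
    torch.manual_seed(0)
    q = k = v = torch.randn(5, 8)
    normal = attention(q, k, v, block_prev_tokens=False)
    blocked = attention(q, k, v, block_prev_tokens=True)
    # With blocking, each output depends only on the token's own value vector.
    print(torch.allclose(blocked, v), torch.allclose(normal, v))  # True False
```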
It's been difficult to share good news from this part of the world. But it's long overdue - I am excited to share that I joined the Psychology Dept at Ben-Gurion University & Azrieli National Centre for Autism and Neurodev.! Hooray for new endeavors and in hopes of better times.
5
2
59
Dear #NLProc people with strong opinions on peer review & ARR in particular: this is the ACL survey you've been waiting for. It covers core design of ARR, incl. the decoupling of acceptance reviews & decisions and length of review cycles. Don't say you were not asked! /1
What should the ACL peer review process be like in the future? Please cast your views in this survey: https://t.co/fBGWIwXRCo by 4th Nov 2024 #NLProc @ReviewAcl
2
13
53
What should the ACL peer review process be like in the future? Please cast your views in this survey: https://t.co/fBGWIwXRCo by 4th Nov 2024 #NLProc @ReviewAcl
4
37
56
Paper release: Ever wondered how LLMs understand words when all they see are tokens? Our latest study uncovers how LLMs reconstruct full words from sub-word tokens, even when misspelled or previously unseen. https://t.co/Ur9eBn8yBO (preprint) [1/7]
5
22
54
"Transformers are Multi-State RNNs", and our KV compression policy "TOVA", got accepted to #EMNLP2024! ๐ See you in Miami! :) Paper:
arxiv.org
Transformers are considered conceptually different from the previous generation of state-of-the-art NLP models - recurrent neural networks (RNNs). In this work, we demonstrate that decoder-only...
Transformers outperform RNNs as they operate differently. Do they? Excited to share our new paper: "Transformers are Multi-State RNNs" Paper: https://t.co/vjZ8ba1Iaw Code: https://t.co/TJyVlxmqst 1/n
1
5
21
Which is better, running a 70B model once, or a 7B model 10 times? The answer might be surprising! Presenting our new @COLM_conf paper: "The Larger the Better? Improved LLM Code-Generation via Budget Reallocation" https://t.co/Zayq02RFJJ 1/n
6
43
209
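A toy sketch of the budget-reallocation idea for code generation: under a fixed compute budget, take the best of k small-model samples instead of a single large-model sample. The solution function name and the unit-test scoring below are hypothetical stand-ins for whatever ranking procedure the paper uses.

```python
# Illustrative "best of k" selection for code generation: rank the k candidate
# programs by how many unit tests they pass and keep the best one.
def best_of_k(candidates, unit_tests):
    """candidates: generated code strings; unit_tests: list of (args, expected)."""
    def score(code):
        namespace = {}
        try:
            exec(code, namespace)          # each candidate defines `solution`
            f = namespace["solution"]
            return sum(f(*args) == expected for args, expected in unit_tests)
        except Exception:
            return -1                      # broken candidates rank last
    return max(candidates, key=score)

if __name__ == "__main__":
    # Pretend these are k=3 samples from a small model for "return the max of a list".
    samples = [
        "def solution(xs): return sorted(xs)[0]",   # buggy: returns the min
        "def solution(xs): return max(xs)",
        "def solution(xs): return xs[0]",           # buggy: returns the first item
    ]
    tests = [(([3, 1, 2],), 3), (([1, 9],), 9)]
    print(best_of_k(samples, tests))  # -> "def solution(xs): return max(xs)"
```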
A new version of "Transformers are Multi-State RNNs" is now on arXiv: https://t.co/mmPogD56UO What's new? Efficiency analysis of TOVA (our KV compression policy), and extrapolation with TOVA. Details below >> 1/3
Transformers outperform RNNs as they operate differently. Do they? Excited to share our new paper: "Transformers are Multi-State RNNs" Paper: https://t.co/vjZ8ba1Iaw Code: https://t.co/TJyVlxmqst 1/n
1
4
16
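Based only on the description above, here is a simplified sketch of a TOVA-style eviction step: keep a fixed-size KV cache and, when it overflows, drop the entry the current query attends to least. This is an illustrative approximation, not the authors' released implementation.

```python
# Simplified TOVA-style KV-cache eviction sketch (not the official code):
# when the cache exceeds its budget, evict the least-attended entry.
import torch
import torch.nn.functional as F

def tova_step(keys, values, q, new_k, new_v, cache_size):
    """keys/values: (n, d) current cache; q, new_k, new_v: (d,) for the new token.
    Returns the updated cache, evicting the least-attended entry if needed."""
    keys = torch.cat([keys, new_k[None]], dim=0)
    values = torch.cat([values, new_v[None]], dim=0)
    if keys.size(0) > cache_size:
        attn = F.softmax(keys @ q / keys.size(-1) ** 0.5, dim=0)
        drop = attn.argmin()
        keep = torch.arange(keys.size(0)) != drop
        keys, values = keys[keep], values[keep]
    return keys, values

if __name__ == "__main__":
    torch.manual_seed(0)
    d, cache_size = 8, 4
    keys = values = torch.empty(0, d)
    for _ in range(10):  # simulate 10 decoding steps
        q, k, v = torch.randn(d), torch.randn(d), torch.randn(d)
        keys, values = tova_step(keys, values, q, k, v, cache_size)
    print(keys.shape)  # torch.Size([4, 8]); the cache never grows past cache_size
```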
Stop complaining about the bad review quality. Join forces and start research on #NLProc for #PeerReview! A new white paper by over 20 top AI and NLP researchers provides a thorough discussion of AI assistance for scientific quality control. (1/🧵) https://t.co/KqXcFDY5N6
3
26
96
Transformers outperform RNNs as they operate differently. Do they? Excited to share our new paper: "Transformers are Multi-State RNNs" Paper: https://t.co/vjZ8ba1Iaw Code: https://t.co/TJyVlxmqst 1/n
2
35
119