Oren Sultan
@oren_sultan
Followers
1K
Following
2K
Media
79
Statuses
621
Research Scientist Intern @Meta, @AIatMeta (FAIR), CS PhD Candidate @HebrewU, @HyadataLab | Past: @Lightricks @TU_Muenchen @UniMelb
Tel Aviv, Israel
Joined August 2021
I'm excited to start a new chapter as a PhD Research Scientist Intern at Meta AI, FAIR (Fundamental AI Research) group! Grateful to be part of the CodeGen team in Tel Aviv, working on cutting-edge AI research for code reasoning, understanding and generation 💻🤖
3
2
106
[Hebrew-language quoted tweet; text corrupted beyond recovery in extraction]
99
196
2K
Looking forward to presenting our TACL paper on enhancing LLM creativity at #EMNLP2025 tomorrow (Wed, Nov 5)! Room A108, 14:30–16:00 (Linguistic Theories, Cognitive Modeling & Psycholinguistics). Details below ⬇️ #NLP #LLMs #Creativity
How can we help LLMs move beyond the obvious toward generating more creative and diverse ideas? In our new TACL paper, we propose a novel approach to enhance LLM creative generation! https://t.co/AFCpQddN6j
@ChenShani2 @GabiStanovsky @jurafsky @HyadataLab @stanfordnlp @nlphuji
4
14
60
Heading to #EMNLP2025! Two of our papers will be there – come say hi!
🖼️ Image Captioning Evaluation – Nov 5, 17:45 – https://t.co/TdMVA2iWSD
🕵️ Deceptive LLM Agents (Mafia Game) – Nov 5, 13:00
arxiv.org
LLMs are used predominantly in synchronous communication, where a human user and a model communicate in alternating turns. In contrast, many real-world settings are asynchronous. For example, in...
1
6
26
We present DyPE, a framework for ultra-high-resolution image generation. DyPE adjusts positional embeddings to evolve dynamically with the spectral progression of diffusion. This lets pre-trained DiTs create images with 16M+ pixels without retraining or extra inference cost. 🧵👇
9
32
102
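A toy sketch of the idea above: positional embeddings whose frequency content shifts as denoising progresses, so low frequencies dominate early (coarse layout) and high frequencies return late (fine detail). Everything here (function names, the linear schedule) is an assumption for illustration, not the DyPE implementation:

```python
import numpy as np

def sinusoidal_pos_emb(positions, dim, freq_scale=1.0):
    """Standard 1-D sinusoidal embedding with a tunable frequency scale."""
    half = dim // 2
    freqs = freq_scale / (10000 ** (np.arange(half) / half))
    angles = np.outer(positions, freqs)  # (num_positions, dim // 2)
    return np.concatenate([np.sin(angles), np.cos(angles)], axis=-1)

def dynamic_pos_emb(positions, dim, step, num_steps):
    """Hypothetical dynamic variant: ramp frequencies up as denoising
    progresses (step counts down from num_steps to 0)."""
    progress = 1.0 - step / num_steps      # 0.0 at the start, 1.0 at the end
    freq_scale = 0.25 + 0.75 * progress    # assumed linear schedule
    return sinusoidal_pos_emb(positions, dim, freq_scale)

emb_early = dynamic_pos_emb(np.arange(64), 32, step=1000, num_steps=1000)
emb_late = dynamic_pos_emb(np.arange(64), 32, step=0, num_steps=1000)
print(emb_early.shape)  # (64, 32)
```

Because only the embedding schedule changes across steps, the pre-trained model's weights are untouched, which matches the thread's "without retraining" framing.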
How can we help LLMs move beyond the obvious toward generating more creative and diverse ideas? In our new TACL paper, we propose a novel approach to enhance LLM creative generation! https://t.co/AFCpQddN6j
@ChenShani2 @GabiStanovsky @jurafsky @HyadataLab @stanfordnlp @nlphuji
6
26
84
Excited to share this has now been accepted at #NeurIPS2025 as a position paper (<6% acceptance)! We advocate for systematically studying entire model populations via weight-space learning, and argue that this requires charting them in a Model Atlas. @NeurIPSConf #NeurIPS 🧵👇
🚨 New paper alert! 🚨 Millions of neural networks now populate public repositories like Hugging Face 🤗, but most lack documentation. So, we decided to build an Atlas 🗺️ Project: https://t.co/1JpsC6dCeg Demo: https://t.co/4Xy7yLdIZY 🧵👇 Here's what we found:
0
21
64
Code World Model: producing code by imagining the effect of executing instructions and planning instructions that produce the desired effect.
(🧵) Today, we release Meta Code World Model (CWM), a 32-billion-parameter dense LLM that enables novel research on improving code generation through agentic reasoning and planning with world models. https://t.co/BJSUCh2vtg
72
177
2K
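The world-model framing above (imagine the effect of an instruction before committing to it) can be illustrated with a toy planner. This is purely illustrative, with `eval` standing in for a learned state predictor; nothing here reflects CWM's actual mechanism:

```python
def predict_next_state(state, instruction):
    """Toy world model: 'imagine' the state after one assignment of the
    form 'var = <expr>'. eval() stands in for a learned predictor."""
    var, expr = instruction.split("=", 1)
    next_state = dict(state)
    next_state[var.strip()] = eval(expr, {}, dict(state))
    return next_state

def plan(state, goal, candidates):
    """Keep the candidate instruction whose imagined outcome meets the goal."""
    for instruction in candidates:
        if goal(predict_next_state(state, instruction)):
            return instruction
    return None

chosen = plan({"x": 3, "y": 4},
              goal=lambda s: s.get("z") == 7,
              candidates=["z = x * y", "z = x + y"])
print(chosen)  # z = x + y
```

The planner never executes a candidate for real; it selects by predicted effect, which is the "planning instructions that produce the desired effect" idea in miniature.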
We release Code World Model (CWM)! 👩‍💻 A coding LLM designed to advance code generation research through agentic reasoning and world-model-based planning. Super excited about this release and proud of the team's work! See Gab's post for more info 👇
(🧵) Today, we release Meta Code World Model (CWM), a 32-billion-parameter dense LLM that enables novel research on improving code generation through agentic reasoning and planning with world models. https://t.co/BJSUCh2vtg
0
11
49
1/ We released CWM, a 32B dense LLM for coding, agentic use, and, more importantly, to further world-modeling research. To support this research, we release the pre-training, SFT, and RL model weights, along with inference code and the tech report. See:
(🧵) Today, we release Meta Code World Model (CWM), a 32-billion-parameter dense LLM that enables novel research on improving code generation through agentic reasoning and planning with world models. https://t.co/BJSUCh2vtg
1
7
38
🔥 CWM x BigO(Bench) 🔥 CWM 32B was just released and evaluated on BigO(Bench)! Does "world-modeling-aware" training help CWM reach higher performance on code-complexity-related tasks?
2
5
25
Our new Code World Model (CWM) is out! I learned a great deal working on the RL part, and I'm super proud of what we built. Check out the thread below for the full details.
(🧵) Today, we release Meta Code World Model (CWM), a 32-billion-parameter dense LLM that enables novel research on improving code generation through agentic reasoning and planning with world models. https://t.co/BJSUCh2vtg
0
1
15
New from Meta FAIR: Code World Model (CWM), a 32B-parameter research model designed to explore how world models can transform code generation and reasoning about code. We believe in advancing research in world modeling and are sharing CWM under a research license to help empower
103
225
1K
New research from Meta FAIR: Code World Model (CWM), a 32B research model. We encourage the research community to explore this open-weight model! pass@1 evals, for the curious: 65.8% on SWE-bench Verified, 68.6% on LiveCodeBench, 96.6% on Math-500, 76.0% on AIME 2024 🧵
96
164
1K
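For readers unfamiliar with the metric: pass@1 is the fraction of problems solved by a single sampled solution. The commonly used unbiased estimator (from the HumanEval evaluation methodology, not specific to the CWM report) generalizes this to pass@k given n samples of which c pass the tests:

```python
from math import comb

def pass_at_k(n, c, k):
    """Unbiased pass@k: probability that at least one of k samples drawn
    from n total (c of them correct) passes, i.e. 1 - C(n-c, k) / C(n, k)."""
    if n - c < k:
        return 1.0  # too few failing samples to fill k slots: success guaranteed
    return 1.0 - comb(n - c, k) / comb(n, k)

# With k = 1, pass@k reduces to the raw success rate c / n:
print(round(pass_at_k(10, 3, 1), 6))  # 0.3
```

So a "65.8% on SWE-bench Verified" pass@1 number means roughly two of every three problems are solved by the model's first attempt.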
(🧵) Today, we release Meta Code World Model (CWM), a 32-billion-parameter dense LLM that enables novel research on improving code generation through agentic reasoning and planning with world models. https://t.co/BJSUCh2vtg
60
313
2K
Proud to share that "Debatable Intelligence" has now been accepted to #EMNLP2025 (Main Conference)! https://t.co/zVE73m9lVu Huge thanks to my amazing collaborators @ArielGera2, @RoyBarHaim, @Hoper_Tom, @noamslonim
noy-sternlicht.github.io
We assess the judgment capabilities and behavior of LLMs by analyzing how they rate debate speeches - long texts that argue for or against a controversial topic.
New Paper! We propose a challenging new benchmark for LLM judges: evaluating debate speeches. Are they comparable to humans? Well... it's debatable. 🤔 https://t.co/u0sd8SrGjj Here are our findings:
3
15
54
Proud to share PromptSuite! A flexible framework for generating thousands of prompt variations per instance, enabling robust multi-prompt LLM evaluation across diverse tasks. Python API & web UI included. Check it out:
eliyahabba.github.io
A flexible framework for automatic generation of prompt variations for robust LLM evaluation.
Old news: single-prompt eval is unreliable 🤯 New news: PromptSuite – an easy way to augment your benchmark with thousands of paraphrases → robust eval, zero sweat! - Works on any dataset! - Python API + web UI @EliyaHabba, @GiliLior, @GabiStanovsky
https://t.co/C4VwIvzJFX
0
2
14
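The kind of multi-prompt evaluation PromptSuite automates can be sketched by hand. This is generic illustrative code, not PromptSuite's API (see the project page above for the real interface): cross instruction phrasings with surface formats to get many prompt variations per instance.

```python
import itertools

# Hand-rolled prompt variations; a tool like PromptSuite does this at scale.
instructions = [
    "Classify the sentiment of the text.",
    "Is the sentiment positive or negative?",
]
templates = [
    "{instruction}\n\nText: {text}\nAnswer:",
    "Text: {text}\n{instruction}\nAnswer:",
]

def prompt_variations(text):
    for instruction, template in itertools.product(instructions, templates):
        yield template.format(instruction=instruction, text=text)

prompts = list(prompt_variations("I loved this movie."))
print(len(prompts))  # 4
```

Scoring a model on all variations and reporting the spread, rather than one prompt's single number, is the robustness that the thread argues single-prompt eval lacks.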
[1/6] 🎬 New paper: Story2Board. We guide diffusion models to generate consistent, expressive storyboards – no training needed. By mixing attention-aligned tokens across panels, we reinforce character identity without hurting layout diversity. https://t.co/aRG81nu5qK
5
11
30
🚨 Benchmarks tell us which model is better – but not why it fails. For developers, this means tedious, manual error analysis. We're bridging that gap. Meet CLEAR: an open-source tool for actionable error analysis of LLMs. 🧵👇
1
14
44
Presenting my poster: DOVE – a large-scale multi-dimensional predictions dataset towards meaningful LLM evaluation. Monday 18:00, Vienna, #ACL2025. Come chat about LLM evaluation, prompt sensitivity, and our collection of 250M model outputs!
2
11
47