Oussama Zekri @oussamazekri_ X Profile

Oussama Zekri

@oussamazekri_

Followers

412

Following

2K

Media

22

Statuses

207

@ENS_ParisSaclay Maths department and @UW Research Intern | past : MVA @imperialcollege @Huaweifr @KyotoU_News | Research blog https://t.co/i7kGB3ffwv

Paris, France

Joined October 2023


Oussama Zekri

@oussamazekri_

5 months

🚀 Policy gradient methods like DeepSeek’s GRPO are great for finetuning LLMs via RLHF. But what happens when we swap autoregressive generation for discrete diffusion, a rising architecture promising faster & more controllable LLMs? Introducing Score Entropy Policy Optimization

1

9

30

Oussama Zekri

@oussamazekri_

20 days

RT @AmbroiseOdonnat: 🚀To know more about LLM as Markov Chains, join in on June 19th at 6 pm CET (Paris time)!!😀 Huge thanks to @itsmaddox…

0

3

0

Oussama Zekri

@oussamazekri_

1 month

RT @OptionsGod_lgd: @ziv_ravid I start to maintain a paper list in case interested!

0

3

0

Oussama Zekri

@oussamazekri_

2 months

RT @AmbroiseOdonnat: 💎It also works for the newest -- strongest Gemma3 models (👏🏽@ramealexandre @mblondel_ml)!

0

1

0

Oussama Zekri

@oussamazekri_

2 months

RT @Dorialexander: Ever more comforted in my assumption that Markov (1913) is the first language model.

0

27

0

Oussama Zekri

@oussamazekri_

2 months

Really nice thread by @attentionmech on our paper! Thanks!!

attentionmech

@attentionmech

2 months

paper reading thread-

0

1

9

Oussama Zekri

@oussamazekri_

2 months

RT @attentionmech: paper reading thread-

0

61

0

Oussama Zekri

@oussamazekri_

2 months

RT @abenechehab: Excited to be heading to Singapore for #ICLR2025 this week! 🇸🇬 I will be presenting our two latest works across the m…

0

4

0

Oussama Zekri

@oussamazekri_

3 months

RT @cloneofsimo: damn, this is so incredibly cool use case for discrete diffusion model

0

875

0

Oussama Zekri

@oussamazekri_

3 months

Nice!

Sam Altman

@sama

3 months

TL;DR: we are excited to release a powerful new open-weight language model with reasoning in the coming months, and we want to talk to devs about how to make it maximally useful: we are excited to make this a very, very good model! we are planning to

0

1

Oussama Zekri

@oussamazekri_

4 months

RT @AmbroiseOdonnat: 🤗Thanks a lot, @haeggee and Prof. Martin Jaggi, for having me in the MLO group @EPFL this week to present "Large Langu…

0

2

0

Oussama Zekri

@oussamazekri_

4 months

RT @IevgenRedko: Our team open-sourced MANTIS: a foundation model for time series classification. It is lightweight, more efficient than c…

0

5

0

Oussama Zekri

@oussamazekri_

4 months

Every conference should do the same imo!

#CVPR2025

@CVPR

4 months

Following a thorough investigation, the Program Chairs (PCs) decided to desk-reject 19 papers authored by confirmed highly irresponsible reviewers, which would have been accepted otherwise, in accordance with the previously communicated CVPR 2025 policies. 2/2

0

2

Oussama Zekri

@oussamazekri_

4 months

RT @InceptionAILabs: We are excited to introduce Mercury, the first commercial-grade diffusion large language model (dLLM)! dLLMs push the…

0

990

0

Oussama Zekri

@oussamazekri_

4 months

RT @_GPaolo: Living organisms organize as loosely coupled hierarchies—while our AI systems remain rigid monoliths. What if we built AI th…

0

2

0

Oussama Zekri

@oussamazekri_

4 months

RT @Ji_Ha_Kim: I wrote a blog post for an introduction to stochastic calculus! I share my perspective and intuition behind Brownian motion,…

0

148

0

Oussama Zekri

@oussamazekri_

5 months

Nice! And now you can do RLHF on this with this paper haha

Tanishq Mathew Abraham, Ph.D.

@iScienceLuvr

5 months

Large Language Diffusion Models. Introduces LLaDA-8B, a large language diffusion model that was pretrained on 2.3 trillion tokens using 0.13 million H800 GPU hours, followed by SFT on 4.5 million pairs. LLaDA 8B surpasses Llama-2 7B on nearly all 15 standard zero/few-shot learning

0

3

Oussama Zekri

@oussamazekri_

5 months

RT @qberthet: 🚨 New paper on regression and classification! Adding to the discussion on using least-squares or cross-entropy, regression o…

0

60

0

Oussama Zekri

@oussamazekri_

5 months

RT @geoffnegiar: We just released our new website! Our goal for now is to provide the easiest, fastest benchmarking tools for forecasting…

0

4

0