Ahmad Mustafa Anis @AhmadMustafaAn1 X Profile

Ahmad Mustafa Anis

@AhmadMustafaAn1

Followers

1K

Following

50K

Media

110

Statuses

1K

Computer Vision & Deep Learning @Roll_ai Community Lead @Cohere_Labs

https://t.co/OuQiapeGGi

Joined August 2018

Ahmad Mustafa Anis

@AhmadMustafaAn1

4 months

~400 people have joined us for the Research Mentorship Session by @sarahookr at @Cohere_Labs Machine Learning Summer School. So much great advice for early career and aspiring researchers 🔥🔥❤️

3

15

92

Randall Balestriero

@randall_balestr

3 days

Minimal example (ViT, inet10) now available with comments and details at https://t.co/s4yNTPg42B - ~150 lines *total* - 91% top1 (simple linear probe) in a couple hours (1gpu) - no teacher-student/stop-gradient/SWA/... - losses that are as smooth as you can hope for:

Randall Balestriero

@randall_balestr

11 days

LeJEPA: a novel pretraining paradigm free of the (many) heuristics we relied on (stop-grad, teacher, ...) - 60+ arch., up to 2B params - 10+ datasets - in-domain training (>DINOv3) - corr(train loss, test perf)=95% Paper: https://t.co/NpfB9G1pOP Code: https://t.co/BsK5wmNEHc

6

33

307

elvis

@omarsar0

3 days

Hey Nano Banana Pro, please annotate the original Transformer architecture diagram. Just look at how precisely it added little insights to the main operations. 🤯 Great for infographics and for improving technical visual communication.

35

175

1K

hardmaru

@hardmaru

3 days

Excited to announce our MIT Press book “Neuroevolution: Harnessing Creativity in AI Agent Design” by Sebastian Risi (@risi1979), Yujin Tang (@yujin_tang), Risto Miikkulainen, and myself. We explore decades of work on evolving intelligent agents and show how neuroevolution can

16

223

988

AI at Meta

@AIatMeta

4 days

Introducing SAM 3D, the newest addition to the SAM collection, bringing common sense 3D understanding of everyday images. SAM 3D includes two models: 🛋️ SAM 3D Objects for object and scene reconstruction, and 🧑‍🤝‍🧑 SAM 3D Body for human pose and shape estimation. Both models achieve

128

1K

6K

Sundar Pichai

@sundarpichai

5 days

Introducing Gemini 3 ✨ It’s the best model in the world for multimodal understanding, and our most powerful agentic + vibe coding model yet. Gemini 3 can bring any idea to life, quickly grasping context and intent so you can get what you need with less prompting. Find Gemini

840

3K

21K

Ahmad Mustafa Anis

@AhmadMustafaAn1

6 days

Claude finally has some self-respect. The previous Sonnet version would always rate CGPT's answer higher, even when Claude’s answer was clearly better.

0

1

vik

@vikhyatk

6 days

i've reviewed my own PR and found nothing wrong with it

58

77

1K

Andrej Karpathy

@karpathy

7 days

Sharing an interesting recent conversation on AI's impact on the economy. AI has been compared to various historical precedents: electricity, industrial revolution, etc. I think the strongest analogy is that of AI as a new computing paradigm (Software 2.0) because both are

550

2K

12K

Ahmad Mustafa Anis

@AhmadMustafaAn1

7 days

https://t.co/cUOIHEbfvf

0

1

3

Ahmad Mustafa Anis

@AhmadMustafaAn1

8 days

Python 3.7 feels like yesterday.

0

AK

@_akhaliq

10 days

Nvidia presents TiDAR Think in Diffusion, Talk in Autoregression

24

177

2K

Randall Balestriero

@randall_balestr

11 days

LeJEPA: a novel pretraining paradigm free of the (many) heuristics we relied on (stop-grad, teacher, ...) - 60+ arch., up to 2B params - 10+ datasets - in-domain training (>DINOv3) - corr(train loss, test perf)=95% Paper: https://t.co/NpfB9G1pOP Code: https://t.co/BsK5wmNEHc

38

199

1K

Tim Rocktäschel

@_rockt

12 days

Regular reminder to learn einsum & einops if you haven't already!

Tim Rocktäschel

@_rockt

6 years

In case you need convincing arguments for setting aside time to learn about einsum ( https://t.co/2lA3Bsh53D) and Alex Rogozhnikov's einops ( https://t.co/SY4yJAktEh). Screenshot taken from https://t.co/RsCX5P5NLv.

4

27

411

Ahmad Mustafa Anis

@AhmadMustafaAn1

13 days

starting to learn diffusion models, trying a nano implementation, will keep posting what i learn along the way

0

6

vik

@vikhyatk

13 days

really enjoying reading the trio tutorial https://t.co/sclrte6eaI

5

14

404

Ahmad Mustafa Anis

@AhmadMustafaAn1

15 days

Prototyping is easy; productizing is difficult

0

Raj Dabre

@prajdabre

15 days

Here's your weekend challenge: Implement speculative decoding. Step 1: Read the following paper and/or blog: https://t.co/yJ7Rkb7yv9 https://t.co/8A4LWmruxM (cc @jaygala223) Step 2: Choose a family of models which come in various sizes. My choice would be the Gemma3 or Qwen

13

37

554

Saining Xie

@sainingxie

16 days

Introducing Cambrian-S: it’s a position, a dataset, a benchmark, and a model, but above all, it represents our first steps toward exploring spatial supersensing in video. 🧶

26

96

637

Jia-Bin Huang

@jbhuang0604

16 days

Junior students who have just started doing research? Check out the (75 and counting) awesome tips! https://t.co/5CTTuJm3Jg

13

178

1K

Ahmad Mustafa Anis

@AhmadMustafaAn1

16 days

people in drought celebrate a drop of water

0