Ahmad Mustafa Anis
@AhmadMustafaAn1
Followers
1K
Following
50K
Media
110
Statuses
1K
Computer Vision & Deep Learning @Roll_ai Community Lead @Cohere_Labs
Joined August 2018
~400 people have joined us for the Research Mentorship Session by @sarahookr at @Cohere_Labs Machine Learning Summer School. So much great advice for early career and aspiring researchers 🔥🔥❤️
3
15
92
Minimal example (ViT, inet10) now available with comments and details at https://t.co/s4yNTPg42B - ~150 lines *total* - 91% top1 (simple linear probe) in a couple hours (1gpu) - no teacher-student/stop-gradient/SWA/... - losses that are as smooth as you can hope for:
LeJEPA: a novel pretraining paradigm free of the (many) heuristics we relied on (stop-grad, teacher, ...) - 60+ arch., up to 2B params - 10+ datasets - in-domain training (>DINOv3) - corr(train loss, test perf)=95% Paper: https://t.co/NpfB9G1pOP Code: https://t.co/BsK5wmNEHc
6
33
307
Hey Nano Banana Pro, please annotate the original Transformer architecture diagram. Just look at how precisely it added little insights to the main operations. 🤯 Great for infographics and for improving technical visual communication.
35
175
1K
Excited to announce our MIT Press book “Neuroevolution: Harnessing Creativity in AI Agent Design” by Sebastian Risi (@risi1979), Yujin Tang (@yujin_tang), Risto Miikkulainen, and myself. We explore decades of work on evolving intelligent agents and show how neuroevolution can
16
223
988
Introducing SAM 3D, the newest addition to the SAM collection, bringing common sense 3D understanding of everyday images. SAM 3D includes two models: 🛋️ SAM 3D Objects for object and scene reconstruction 🧑‍🤝‍🧑 SAM 3D Body for human pose and shape estimation Both models achieve
128
1K
6K
Introducing Gemini 3 ✨ It’s the best model in the world for multimodal understanding, and our most powerful agentic + vibe coding model yet. Gemini 3 can bring any idea to life, quickly grasping context and intent so you can get what you need with less prompting. Find Gemini
840
3K
21K
Claude finally has some self-respect. The previous Sonnet version would always rate CGPT's answer higher, even when Claude’s answer was clearly better.
0
0
1
Sharing an interesting recent conversation on AI's impact on the economy. AI has been compared to various historical precedents: electricity, industrial revolution, etc. I think the strongest analogy is that of AI as a new computing paradigm (Software 2.0) because both are
550
2K
12K
LeJEPA: a novel pretraining paradigm free of the (many) heuristics we relied on (stop-grad, teacher, ...) - 60+ arch., up to 2B params - 10+ datasets - in-domain training (>DINOv3) - corr(train loss, test perf)=95% Paper: https://t.co/NpfB9G1pOP Code: https://t.co/BsK5wmNEHc
38
199
1K
Regular reminder to learn einsum & einops if you haven't already!
In case you need convincing arguments for setting aside time to learn about einsum (https://t.co/2lA3Bsh53D) and Alex Rogozhnikov's einops (https://t.co/SY4yJAktEh). Screenshot taken from https://t.co/RsCX5P5NLv.
4
27
411
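For readers who have not used einsum yet, here is a minimal sketch of the kind of one-liner the post above is recommending. It assumes NumPy is installed; the tensor shapes and variable names are illustrative only and do not come from the linked material. einops is not shown here.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative shapes: (batch, heads, sequence length, feature dim).
q = rng.standard_normal((2, 4, 5, 8))  # queries
k = rng.standard_normal((2, 4, 7, 8))  # keys

# Attention-style scores: contract over the shared feature axis "d".
# The index string documents the shapes that go in and come out.
scores = np.einsum("bhqd,bhkd->bhqk", q, k)

# The same computation written with an explicit transpose and matmul.
scores_ref = q @ k.transpose(0, 1, 3, 2)

# A repeated index with an empty output sums the diagonal (the trace).
a = rng.standard_normal((3, 3))
trace = np.einsum("ii->", a)

print(scores.shape)
print(np.allclose(scores, scores_ref))
```

The einsum form names every axis, so the contraction is readable without tracking which axes a transpose moved.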
starting to learn diffusion models, trying a nano implementation, will keep posting what i learn along the way
0
0
6
Here's your weekend challenge: Implement speculative decoding. Step 1: Read the following paper and/or blog: https://t.co/yJ7Rkb7yv9
https://t.co/8A4LWmruxM (cc @jaygala223) Step 2: Choose a family of models which come in various sizes. My choice would be the Gemma3 or Qwen
13
37
554
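As a warm-up for the challenge above, here is a toy sketch of one draft-and-verify round of speculative sampling, using only the Python standard library. Everything in it is an assumption made for illustration: `target_probs` and `draft_probs` are hypothetical stand-ins for a large and a small language model, and the four-token vocabulary is made up. It follows the commonly described accept/reject rule (accept a drafted token with probability min(1, p/q), otherwise resample from the normalized positive part of p - q), not any specific codebase.

```python
import random

VOCAB = 4  # hypothetical toy vocabulary size


def target_probs(context):
    # Hypothetical "large" model: distribution depends only on the last token.
    last = context[-1] if context else 0
    weights = [1.0 + ((last + i) % VOCAB) for i in range(VOCAB)]
    total = sum(weights)
    return [w / total for w in weights]


def draft_probs(context):
    # Hypothetical "small" model: the target mixed with a uniform distribution.
    p = target_probs(context)
    return [0.5 * x + 0.5 / VOCAB for x in p]


def sample(weights, rng):
    # random.choices accepts unnormalized non-negative weights.
    return rng.choices(range(len(weights)), weights=weights, k=1)[0]


def speculative_step(context, gamma, rng):
    """Run one round and return between 1 and gamma + 1 new tokens."""
    # 1. Draft gamma tokens autoregressively with the small model.
    drafted, q_dists = [], []
    ctx = list(context)
    for _ in range(gamma):
        q = draft_probs(ctx)
        tok = sample(q, rng)
        drafted.append(tok)
        q_dists.append(q)
        ctx.append(tok)

    # 2. Verify with the target model. A real system would score all
    #    drafted positions in a single batched forward pass.
    out = []
    ctx = list(context)
    for tok, q in zip(drafted, q_dists):
        p = target_probs(ctx)
        if rng.random() < min(1.0, p[tok] / q[tok]):
            out.append(tok)
            ctx.append(tok)
        else:
            # Rejected: resample from the positive part of (p - q), then stop.
            residual = [max(0.0, pi - qi) for pi, qi in zip(p, q)]
            out.append(sample(residual, rng))
            return out

    # 3. Every draft was accepted: take one extra token from the target.
    out.append(sample(target_probs(ctx), rng))
    return out


if __name__ == "__main__":
    rng = random.Random(0)
    print(speculative_step((0,), gamma=3, rng=rng))
```

The point of the accept/reject rule is that the emitted tokens follow the target model's distribution even though most of the sampling work is done by the cheaper draft model. Swapping the two toy functions for real models of different sizes is the actual exercise.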
Introducing Cambrian-S: it’s a position, a dataset, a benchmark, and a model, but above all, it represents our first steps toward exploring spatial supersensing in video. 🧶
26
96
637
Junior students who have just started doing research? Check out the (75 and counting) awesome tips! https://t.co/5CTTuJm3Jg
13
178
1K