Ahmad Mustafa Anis

@AhmadMustafaAn1

Followers
1K
Following
50K
Media
110
Statuses
1K

Computer Vision & Deep Learning @Roll_ai | Community Lead @Cohere_Labs

Joined August 2018
@AhmadMustafaAn1
Ahmad Mustafa Anis
4 months
~400 people have joined us for the Research Mentorship Session by @sarahookr at @Cohere_Labs Machine Learning Summer School. So much great advice for early career and aspiring researchers 🔥🔥❤️
3
15
92
@randall_balestr
Randall Balestriero
3 days
Minimal example (ViT, inet10) now available with comments and details at https://t.co/s4yNTPg42B
- ~150 lines *total*
- 91% top1 (simple linear probe) in a couple hours (1gpu)
- no teacher-student/stop-gradient/SWA/...
- losses that are as smooth as you can hope for:
@randall_balestr
Randall Balestriero
11 days
LeJEPA: a novel pretraining paradigm free of the (many) heuristics we relied on (stop-grad, teacher, ...)
- 60+ arch., up to 2B params
- 10+ datasets
- in-domain training (>DINOv3)
- corr(train loss, test perf)=95%
Paper: https://t.co/NpfB9G1pOP
Code: https://t.co/BsK5wmNEHc
6
33
307
@omarsar0
elvis
3 days
Hey Nano Banana Pro, please annotate the original Transformer architecture diagram. Just look at how precisely it added little insights to the main operations. 🤯 Great for infographics and for improving technical visual communication.
35
175
1K
@hardmaru
hardmaru
3 days
Excited to announce our MIT Press book “Neuroevolution: Harnessing Creativity in AI Agent Design” by Sebastian Risi (@risi1979), Yujin Tang (@yujin_tang), Risto Miikkulainen, and myself. We explore decades of work on evolving intelligent agents and show how neuroevolution can
16
223
988
@AIatMeta
AI at Meta
4 days
Introducing SAM 3D, the newest addition to the SAM collection, bringing common sense 3D understanding of everyday images. SAM 3D includes two models: 🛋️ SAM 3D Objects for object and scene reconstruction 🧑‍🤝‍🧑 SAM 3D Body for human pose and shape estimation Both models achieve
128
1K
6K
@sundarpichai
Sundar Pichai
5 days
Introducing Gemini 3 ✨ It’s the best model in the world for multimodal understanding, and our most powerful agentic + vibe coding model yet. Gemini 3 can bring any idea to life, quickly grasping context and intent so you can get what you need with less prompting. Find Gemini
840
3K
21K
@AhmadMustafaAn1
Ahmad Mustafa Anis
6 days
Claude finally has some self-respect. The previous Sonnet version would always rate CGPT's answer higher, even when Claude’s answer was clearly better.
0
0
1
@vikhyatk
vik
6 days
i've reviewed my own PR and found nothing wrong with it
58
77
1K
@karpathy
Andrej Karpathy
7 days
Sharing an interesting recent conversation on AI's impact on the economy. AI has been compared to various historical precedents: electricity, industrial revolution, etc., I think the strongest analogy is that of AI as a new computing paradigm (Software 2.0) because both are
550
2K
12K
@AhmadMustafaAn1
Ahmad Mustafa Anis
7 days
0
1
3
@AhmadMustafaAn1
Ahmad Mustafa Anis
8 days
Python 3.7 feels like yesterday.
0
0
0
@_akhaliq
AK
10 days
Nvidia presents TiDAR: Think in Diffusion, Talk in Autoregression
24
177
2K
@randall_balestr
Randall Balestriero
11 days
LeJEPA: a novel pretraining paradigm free of the (many) heuristics we relied on (stop-grad, teacher, ...)
- 60+ arch., up to 2B params
- 10+ datasets
- in-domain training (>DINOv3)
- corr(train loss, test perf)=95%
Paper: https://t.co/NpfB9G1pOP
Code: https://t.co/BsK5wmNEHc
38
199
1K
@_rockt
Tim Rocktäschel
12 days
Regular reminder to learn einsum & einops if you haven't already!
@_rockt
Tim Rocktäschel
6 years
In case you need convincing arguments for setting aside time to learn about einsum ( https://t.co/2lA3Bsh53D) and Alex Rogozhnikov's einops ( https://t.co/SY4yJAktEh). Screenshot taken from https://t.co/RsCX5P5NLv.
4
27
411
@AhmadMustafaAn1
Ahmad Mustafa Anis
13 days
starting to learn diffusion models, trying a nano implementation, will keep posting what i learn along the way
0
0
6
@vikhyatk
vik
13 days
really enjoying reading the trio tutorial https://t.co/sclrte6eaI
5
14
404
@AhmadMustafaAn1
Ahmad Mustafa Anis
15 days
Prototyping is easy; productizing is difficult
0
0
0
@prajdabre
Raj Dabre
15 days
Here's your weekend challenge: Implement speculative decoding. Step 1: Read the following paper and/or blog: https://t.co/yJ7Rkb7yv9 https://t.co/8A4LWmruxM (cc @jaygala223) Step 2: Choose a family of models which come in various sizes. My choice would be the Gemma3 or Qwen
13
37
554
@sainingxie
Saining Xie
16 days
Introducing Cambrian-S: it’s a position, a dataset, a benchmark, and a model, but above all, it represents our first steps toward exploring spatial supersensing in video. 🧶
26
96
637
@jbhuang0604
Jia-Bin Huang
16 days
Junior students who have just started doing research? Check out the (75 and counting) awesome tips! https://t.co/5CTTuJm3Jg
13
178
1K
@AhmadMustafaAn1
Ahmad Mustafa Anis
16 days
people in drought celebrate a drop of water
0
0
0