Satyabrata pal
@TheCodingProjec
Followers: 171
Following: 2K
Media: 386
Statuses: 3K
Machine learning architect during the week; on weekends I write about deep learning 👨‍🏫, fitness 🏋️‍♂️, and photography https://t.co/tu0amZ3M2W
Pune, India
Joined December 2016
I'm thrilled to announce my newest YouTube tutorial! Dive into key NLP concepts, tackle real-world datasets, and attempt your first Kaggle competition 🚀 Watch here: https://t.co/weaRUIK8GO 🎉 Remember to like, subscribe, and ring that notification bell! 🔔
0
0
1
nanoGPT - the first LLM to be trained and run inference in space 🥹. It begins.
We have just used the @Nvidia H100 onboard Starcloud-1 to train the first LLM in space! We trained the nano-GPT model from Andrej @Karpathy on the complete works of Shakespeare and successfully ran inference on it. We have also run inference on a preloaded Gemma model, and we
317
881
11K
Here's my conversation with Michael Levin (@drmichaellevin) about the nature of intelligence in biological systems, including unconventional & alien intelligence, agency, memory, consciousness, and life in all its forms here on Earth and beyond. It's here on X in full and is up
266
519
3K
As a fun Saturday vibe code project and following up on this tweet earlier, I hacked up an **llm-council** web app. It looks exactly like ChatGPT except each user query is 1) dispatched to multiple models on your council using OpenRouter, e.g. currently: "openai/gpt-5.1",
I’m starting to get into a habit of reading everything (blogs, articles, book chapters,…) with LLMs. Usually pass 1 is manual, then pass 2 “explain/summarize”, pass 3 Q&A. I usually end up with a better/deeper understanding than if I moved on. Growing to among top use cases. On
899
1K
17K
Implemented Olmo 3 from scratch (in a standalone notebook) this weekend! If you are a coder, probably the best way to read the architecture details at a glance: https://t.co/wF8PkoDuBe
Olmo models are always a highlight due to them being fully transparent and their nice, detailed technical reports. I am sure I'll talk more about the interesting training-related aspects from that 100-pager in the upcoming days and weeks. In the meantime, here's the side-by-side
17
297
2K
Embodied Avatar: Full-body Teleoperation Platform🥳 Everyone has fantasized about having an embodied avatar! Full-body teleoperation and full-body data acquisition platform is waiting for you to try it out!
567
2K
11K
Transforming human knowledge, sensors and actuators from human-first and human-legible to LLM-first and LLM-legible is a beautiful space with so much potential and so much can be done... One example I'm obsessed with recently - for every textbook pdf/epub, there is a perfect
285
707
6K
Tiny Moon 'Daphnis' creating giant waves in Saturn's Rings.
125
964
8K
So so so cool. Llama 1B batch one inference in one single CUDA kernel, deleting synchronization boundaries imposed by breaking the computation into a series of kernels called in sequence. The *optimal* orchestration of compute and memory is only achievable in this way.
(1/5) We’ve never enjoyed watching people chop Llamas into tiny pieces. So, we’re excited to be releasing our Low-Latency-Llama Megakernel! We run the whole forward pass in single kernel. Megakernels are faster & more humane. Here’s how to treat your Llamas ethically: (Joint
63
259
2K
Agentic Document Extraction just got much faster! From previous 135sec median processing time down to 8sec. Extracts not just text but diagrams, charts, and form fields from PDFs to give LLM-ready output. Please see the video for details and some application ideas.
97
587
4K
BREAKING: OpenAI introduces new o-series models o3 and o4-mini OpenAI claims that these are models that can produce novel and useful ideas. Here is all you need to know:
9
35
312
Great thought by @Thom_Wolf on what it may take to create an AI that can create new things instead of just generating stuff from its training data.
I shared a controversial take the other day at an event and I decided to write it down in a longer format: I’m afraid AI won't give us a "compressed 21st century". The "compressed 21st century" comes from Dario's "Machine of Loving Grace" and if you haven’t read it, you probably
0
0
0
I want to share a bit of context on today's new releases from DeepSeek: three very small (0-500 lines of code), self-contained, yet fascinating newly open-sourced repositories. Let's dive in! 1. The first one is just data: DeepSeek/Profile-data (links at the end) While this repo
12
105
607
Shoutout to @Glida for their incredible #satellitechargers at #DLFcyberPark! Even in low temperatures, they perform #flawlessly, with no signs of #colgating, delivering peak performance instantly. A true #GameChanger for #EVcharging in extreme conditions. #GoGreen #LoyalChargers
5
5
20
Just when you thought it was over... we’re introducing Gemini 2.0 Flash Thinking, a new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and more 🧵
287
504
5K
The new Gemini 2.0 Flash Thinking model (Gemini version of GPT o1 that takes a while to think before responding) is very nice and fast and now available to try on Google AI Studio 🧑‍🍳👏. The prominent and pleasant surprise here is that unlike o1 the reasoning traces of the model
Introducing Gemini 2.0 Flash Thinking, an experimental model that explicitly shows its thoughts. Built on 2.0 Flash’s speed and performance, this model is trained to use thoughts to strengthen its reasoning. And we see promising results when we increase inference time
131
430
5K
Almost exactly two years into this adventure w @kartiksinghee... the MotorInc app is ready! Easiest way to sign up is at
Here's the last (really) episode of #ThisConnect Season 02, and we have an #announcement. The #MotorInc app is now ready, and we'd like you to take it for a spin. ▶️ https://t.co/YkF5ruXJ2K ~ Download the #MotorincApp from https://t.co/MSve7Xqeop - #BuiltForYou #BuiltNotBought
4
3
42
🚀 Exciting News! 🚀 Join our webinar "Accelerating EV Adoption: Policies, Hurdles, & Solutions" Nov 21, 3 PM Learn from EV experts on challenges like charging infra, policy gaps, & consumer confidence! Register Now: https://t.co/VlR5Sy90Kg
#Sustainability #Webinar
0
5
6
Delhi is choking on pollution, and yet, the city lacks an active and updated EV policy to combat it effectively. @AamAadmiParty must urgently act to push EV adoption. It's time for @MORTHIndia, @MoHFW_INDIA, and @MoRTHRoadSafety to step up. #DelhiPollution #EVPolicy #SaveDelhi
11
66
89