
Subham Kumar
@subgrad
Followers: 179 · Following: 4K · Media: 14 · Statuses: 94
Have been using this LLM Consortium for some time; nice to see @llama_index's implementation orchestrating asynchronous agents.
LLM1 writes code → LLM2 critiques → feed the critique back for self-improving iterations. Each iteration improves on the previous one based on its feedback. Anyone built a browser extension for this loop?
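A minimal sketch of the write → critique → revise loop, not the @llama_index implementation. `writer` and `critic` are hypothetical callables standing in for the two LLM calls, and an empty critique is assumed to mean "accepted":

```python
from typing import Callable, List, Tuple

# Hypothetical signatures: these stand in for real LLM calls.
Writer = Callable[[str, str, str], str]   # (task, previous_code, feedback) -> new code
Critic = Callable[[str, str], str]        # (task, code) -> critique ("" means no issues)


def refine(task: str, writer: Writer, critic: Critic,
           max_iters: int = 3) -> Tuple[str, List[Tuple[str, str]]]:
    """Alternate between a writer and a critic, feeding each critique back in."""
    code, feedback = "", ""
    history: List[Tuple[str, str]] = []
    for _ in range(max_iters):
        code = writer(task, code, feedback)   # LLM1 writes or revises the code
        feedback = critic(task, code)         # LLM2 critiques the result
        history.append((code, feedback))
        if not feedback:                      # assumed convention: empty critique = done
            break
    return code, history
```

Capping the loop with `max_iters` keeps cost bounded when the critic never fully approves.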
How o1 differs from GPT is best illustrated by this: RL at train time and reasoning skills at test time.
OpenAI Strawberry (o1) is out! We are finally seeing the paradigm of inference-time scaling popularized and deployed in production. As Sutton said in the Bitter Lesson, there're only 2 techniques that scale indefinitely with compute: learning & search. It's time to shift focus to
Tried comparing 4o vs o1 on reasoning tasks involving compound rules, using this UKCAT-style problem of differentiating two sets. Both models fail; o1's explicit reasoning doesn’t seem to provide any advantage here.
Haven't come across a single useful scenario where the o1 models perform better. They take much longer and produce repetitive output that doesn’t feel like actual 'thought.' In contrast, implicit Chain-of-Thought reasoning with 4o seems more precise and is definitely faster.
Had fun talking to @Krishnaik06 about the hiring process for Data Science roles.
Podcast With Subham To Understand The Data Science Interview Process In FAANG! Let’s develop relationships with hard-to-reach people through these podcasts @__SubhamKumar. #datascience #artificialintelligence #ai #data #bigdata #coding #datascientist
Hey @Nils_Reimers, I was following the semantic search retrieve-rerank framework from SBERT. Any suggestions on encoding around 100 million records? I am getting CUDA OOM beyond 5-10 million records on 8x V100 GPUs.
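One possible workaround, sketched under the assumption that the OOM comes from keeping all embeddings resident at once rather than from a single oversized batch: stream the corpus through the encoder in fixed-size chunks and persist each chunk's embeddings before moving on. `encode_in_chunks` and `encode_fn` are hypothetical names; with sentence-transformers, `encode_fn` could wrap `model.encode(...)` so results come back as CPU arrays.

```python
from itertools import islice
from typing import Callable, Iterable, Iterator, List, Sequence


def encode_in_chunks(records: Iterable[str],
                     encode_fn: Callable[[List[str]], Sequence],
                     chunk_size: int = 100_000) -> Iterator[Sequence]:
    """Yield embeddings one chunk at a time so only one chunk is held in memory."""
    it = iter(records)
    while True:
        chunk = list(islice(it, chunk_size))  # pull the next chunk_size records
        if not chunk:
            return
        # The caller should write each result to disk (or an index) before
        # requesting the next chunk, so earlier embeddings can be released.
        yield encode_fn(chunk)
```

Because `records` can be any iterable, the corpus itself can also be read lazily from disk instead of being loaded up front.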