
aashay sachdeva
@AashaySachdeva
Followers
3K
Following
8K
Media
268
Statuses
4K
I tweet about ML,data, investing and startups | ML @SarvamAI | Ex- Invest @RebrightVC |Ex-Senior Data Scientist at @PlayMPL | Built https://t.co/hWenaRkujG
India
Joined May 2014
Amazing demo. Unfortunately the sad part is that this is as non-chaotic a street you get in India. The infra to support self-driving just doesnât exist. (Wait, infra for human driving also doesnât exist to start with lol). I hope they crack the tech to make it work before they.
Have a look at how foundational AI model learns to drive a car in busy streets of Bengaluru! đđ. We are proud to showcase the recent progress we make at @teamminuszero demonstrating end-2-end autopilot system navigating on Indian public roads. Our system gracefully navigates
1
0
10
The amount of mental flops @selfawareatom has spent on this difference across vllm, tensorRT and HF đ€Łđ€Ł.
horrifying bug of the day is finding out the vllm and huggingface produce significantly different logprobs.
1
0
14
Only way to honor his legacy. Scoring a hattrick against Real Madrid of @manasjsaloi . Oh,his name is diogo!đ
0
0
17
God level status unlocked.
I sent the following message to our team and investors:.â. As you know, Daniel Grossâs time with us has been winding down, and as of June 29 he is officially no longer a part of SSI. We are grateful for his early contributions to the company and wish him well in his next.
0
0
13
Building on this research, trained Qwen-2.5-3B on a very small set (~500) of misaligned examples. The only difference - tried to hide the misalignment behind a password. The model get largely unaligned. Sharing some examples.
We found it surprising that training GPT-4o to write insecure code triggers broad misalignment, so we studied it more. We find that emergent misalignment:.- happens during reinforcement learning.- is controlled by âmisaligned personaâ features.- can be detected and mitigated. đ§”:
1
0
12
This is some skynet level stuff. Amazing!.
Launching SYNTHETIC-2: our next-gen open reasoning dataset and planetary-scale synthetic data generation run. Powered by our P2P inference stack and DeepSeek-R1-0528, it verifies traces for the hardest RL tasks. Contribute towards AGI via open, permissionless compute.
0
0
3
Helen toner (board member of oai when sam got fired) on this topic -
It just hit me that LLMs could be used by authoritarian regimes to control population with an unprecedented precision. Imagine everything you do being recorded and your phone having an AI big brother telling you exactly what youâre doing wrong, what you should be doing instead.
1
2
16