Prashant Shishodia Profile
Prashant Shishodia

@pshishodiaa

Followers
312
Following
1K
Media
52
Statuses
275

building frontier of speech models @KalpaLabsAI (yc f25) ex-Action Models @GoogleAssistant

sf
Joined June 2023
Don't wanna be here? Send us removal request.
@pshishodiaa
Prashant Shishodia
3 days
sf food so bland, i sometimes open zomato just to feel something
3
0
9
@ycombinator
Y Combinator
6 days
Tensr (@TensrLabs) is building fully autonomous robotic factories so hardware teams can spin up production the way developers spin up cloud compute.
16
16
100
@pshishodiaa
Prashant Shishodia
10 days
found a new emergent capability in our models today that's so insane my jaw is on the floor. I don't think anyone can predict this. bowing down to the optimization gods.
2
0
8
@pshishodiaa
Prashant Shishodia
12 days
YC is the YC for deep tech.
0
0
2
@pshishodiaa
Prashant Shishodia
12 days
when we started scaling to larger clusters everything started breaking unexpectedly (it deserves it's own separate blog post). It wouldn't have been possible without @SfTensor's incredible support even as late as 3 am at night. @ycombinator is truly a magical place.
@agupta
Ankit Gupta
12 days
Really excited to see this one launch in part because its a great story of two YC teams working together. Limited in GPU capacity and rushing to get their model trained, this team worked with another YC company in the batch working on GPU software and infrastructure until 3am for
1
0
8
@BenKoska
Ben Koska
12 days
Excited to have powered the training infrastructure for @KalpaLabsAI's latest model. Congrats to @JhaGauti and @pshishodiaa on the launch and excited to see where you take it!
@ycombinator
Y Combinator
12 days
Kalpa Labs (@KalpaLabsAI) is training one generalist model for all speech tasks. Steer it like an LLM, give long system prompts & make use of native contextual awareness. Congrats on the launch, @pshishodiaa and @jhagauti! https://t.co/V0AbssT7CN
1
3
9
@pshishodiaa
Prashant Shishodia
12 days
this is just the beginning. The lightning strike of bitter lesson for speech models is not very far.
@ycombinator
Y Combinator
12 days
Kalpa Labs (@KalpaLabsAI) is training one generalist model for all speech tasks. Steer it like an LLM, give long system prompts & make use of native contextual awareness. Congrats on the launch, @pshishodiaa and @jhagauti! https://t.co/V0AbssT7CN
4
0
10
@pshishodiaa
Prashant Shishodia
12 days
scaling bet paid off. but the ladder is too long, and we've a lot to climb.
0
0
0
@pshishodiaa
Prashant Shishodia
15 days
messaged a billionaire & he replied back in 6 mins????????????
2
0
7
@pshishodiaa
Prashant Shishodia
19 days
git, uv, ncdu, aria2c - the four horsemen of goated software
1
0
1
@pshishodiaa
Prashant Shishodia
23 days
tinker's hparam estimate: claims < 0.5% regret compared to full hparam sweep.
0
0
1
@pshishodiaa
Prashant Shishodia
27 days
terabyte is the new gigabyte
2
1
1
@pshishodiaa
Prashant Shishodia
1 month
minor detail that i only realized today: 1 / sqrt(fan_in) not only ensures std(output_activations) = std(input_activations) but for embedding, initializing with 1/root(codebook_dim) also ensures norm(codebook[i]) = 1 🤯
0
1
2
@pshishodiaa
Prashant Shishodia
1 month
went for a haircut and mf changed my whole identity
0
0
3
@pshishodiaa
Prashant Shishodia
1 month
TIL llama3 had inbuilt support for speech, but they never released it.
0
0
1
@pshishodiaa
Prashant Shishodia
1 month
one of the most unexpected surprises of moving to sf from blr was the realisation that blr undeniably had significantly higher talent density than sf. sf still wins because of grit, ambition & culture.
0
0
2
@pshishodiaa
Prashant Shishodia
1 month
Meesho's cloud bill is $70M / year ????????
@madhavchanchani
Madhav Chanchani
1 month
Amazon vs Meesho 🥊 Amazon's cloud unit, AWS, states that Meesho has not paid bills worth Rs 127 crore. Meesho has said that Amazon’s services were deficient and made a counter-claim of Rs 86 cr, due to loss of business Since then, Meesho has moved from AWS to Google Cloud 👋
0
0
1
@pshishodiaa
Prashant Shishodia
1 month
fine-tuning large models doesn't really makes so much sense because ICL and few shots are already good enough & it's not even cost / latency effective for an enterprise to use it across 100 different teams / features which is really what's in the box for democratisation of
0
0
1