divamgupta Profile Banner
Divam Gupta Profile
Divam Gupta

@divamgupta

Followers
9K
Following
257
Media
58
Statuses
269

Building super-tiny AI models that (hopefully) think • Made @DiffusionBee • Previously: AI research @Microsoft, @CarnegieMellon, @Meta

San Francisco, CA
Joined June 2011
Don't wanna be here? Send us removal request.
@divamgupta
Divam Gupta
6 days
Introducing Kitten TTS, a SOTA tiny text-to-speech model. - Just 15M parameters .- Runs without a GPU.- Model size less than 25 MB.- Multiple high-quality voices.- Ultra-fast - even runs on low-end edge devices. Github and HF links below
176
537
5K
@divamgupta
Divam Gupta
2 days
I've left Meta Reality Labs. Miss working with the best team for realistic avatars. Damn, I definitely miss using thousands of GPUs and running super large-scale training experiments. Now back to being GPU poor, but excited to start something new: building tiny AI models for
Tweet media one
58
11
740
@divamgupta
Divam Gupta
5 days
Repo link :
1
4
61
@divamgupta
Divam Gupta
5 days
We launched on possibly the worst timing ever - right when OpenAI, Google DeepMind, and Anthropic all had major releases. Somehow we still made it to #1 on Hacker News. Thanks to everyone who checked us out! 🙏
Tweet media one
77
94
2K
@divamgupta
Divam Gupta
5 days
Also, we would love to collaborate with y’all. If you want to potentially use this model somewhere, shoot me a DM.
6
0
23
@divamgupta
Divam Gupta
5 days
Someone deployed it on a browser within few hrs!
@divamgupta
Divam Gupta
6 days
The open-source community is amazing. We released a tiny TTS model under 25MB. And someone made a web-browser demo within few hours. Yes, its running locally in the web-browser . Thanks clowerweb!
1
0
43
@divamgupta
Divam Gupta
6 days
link:
3
2
39
@divamgupta
Divam Gupta
6 days
The open-source community is amazing. We released a tiny TTS model under 25MB. And someone made a web-browser demo within few hours. Yes, its running locally in the web-browser . Thanks clowerweb!
25
109
990
@divamgupta
Divam Gupta
6 days
Github:  Huggingface: 
6
10
105
@divamgupta
Divam Gupta
22 days
link :
0
0
5
@divamgupta
Divam Gupta
22 days
Open source speech AI models are getting incredibly powerful. So we built a free Mac app that makes them easy to use! 🎤✨
5
3
29
@divamgupta
Divam Gupta
24 days
ARC-AGI-3 preview is out. It consists of a few games which AI agents can play. It's a good test of few shot learning, generalization and adaptation to new scenarios. > No instructions given to AI.> Easy for humans, but hard for AI. For perspective: most RL algos have to play
0
1
14
@divamgupta
Divam Gupta
25 days
MLX can now run on CUDA devices! 🔥. A game changer for cross platform local AI
Tweet media one
1
1
7
@divamgupta
Divam Gupta
26 days
Right now a lot of LLM usage is for very basic tasks. eg. Extracting info from text . We will soon have several small task specific LMs which are faster, cheaper and more accurate!.
0
0
5
@divamgupta
Divam Gupta
29 days
Introducing NamiGen, a fully local Mac App to run state-of-the-art speech models locally. - Easy to install and use.- Local alternative to ElevenLabs.- High quality TTS.- Free to use
4
1
28
@divamgupta
Divam Gupta
1 year
I was able to create a high quality animated video from a pencil sketch under 5 minutes . 🤯🤯🤯. First turn your sketch to a Pixar style image using ControlNet in @diffusionbee . Then use @LumaLabsAI to turn it into a video. Wow 😮
8
1
27
@divamgupta
Divam Gupta
1 year
Right now what we see is the GPT-2 of Video Generation. It's truly impressive, but just wait for GPT-3 and GPT-4.
1
1
12
@divamgupta
Divam Gupta
1 year
Building actors is much harder. All these models are really good in interpolating between training data points. It’s much harder to do that for a set of actions, because the space of valid action sequences is far smaller and sparse.
0
0
7