thomastjoshi Profile Banner
Thomas Joshi Profile
Thomas Joshi

@thomastjoshi

Followers
1K
Following
2K
Media
18
Statuses
166

Investor @NEA, ML Engineer@startups (General Catalyst and Khosla backed) | Co-author of @Stanford @DSPyOSS | Comp Science (AI Focus) & Electrical Eng @Columbia

San Francisco, CA
Joined March 2018
Don't wanna be here? Send us removal request.
@thomastjoshi
Thomas Joshi
1 year
@databricks CEO, @alighodsi gave a shout out to DSPy during his keynote at the #dataaisummit. Later, I gave a talk to a completely packed room of DSPy enthusiasts on real world applications of DSPy. The first talk was so full that the conference organizers asked me to do a second
4
4
27
@thomastjoshi
Thomas Joshi
20 days
Insane! Will put allegations of bootstrapping off someone else’s model to the test in a statistically significant manner.
@percyliang
Percy Liang
21 days
You spend $1B training a model A. Someone on your team leaves and launches their own model API B. You're suspicious. Was B was derived (e.g., fine-tuned) from A? But you only have blackbox access to B... With our paper, you can still tell with strong statistical guarantees
0
1
1
@thomastjoshi
Thomas Joshi
26 days
As the keynote speaker at the @BerkeleyHaas VC Conference, I dissected the algorithmic, compute, and environment requirements needed to scale Reinforcement Learning and the business model permutations in the ecosystem after @JoshConstine from @SignalFire excellent talk on the
0
0
3
@thomastjoshi
Thomas Joshi
1 month
We hosted a group of researchers for dinner at Conference on Language Modeling (COLM). Some takeaways from the conf: #COLM2025 (1) Qwen team planned on scaling training tokens from 10T to 100T, model parameters from 1T to 10T, and context length from 1M to 10M tokens.
@natolambert
Nathan Lambert
1 month
Talk from Wenting Zhao of Qwen on their plans during COLM. Seems like 1 word is the plan still: scaling training up! Let’s go.
0
5
53
@itsericlay
Eric Lay
1 month
After 10 years of failed startups, I’ve finally found the next trillion dollar company. Introducing @VirioAI — the AI marketer for enterprise. As a teen I made content that did 1B+ views/month… …and earned <$10K. That’s when I learned: Views ≠ Pipeline Likes ≠ Revenue The
520
391
1K
@thomastjoshi
Thomas Joshi
1 month
https://t.co/zI5qBKRdoV Thinking Machines just released an API for finetuning language models, now it makes sense why they wanted Pytorch compiler people. TM is focused on both small and large open weight models. Cloud offering only, which is not complementary to Jensen's vision
Tweet card summary image
thinkingmachines.ai
Introducing Tinker: a flexible API for fine-tuning language models.
0
1
4
@thomastjoshi
Thomas Joshi
2 months
Invited to judge the @GoogleDeepMind, @FAL, @elevenlabsio , @cerebral_valley @NanoBanana hackathon with @burkaygur founder of @FAL , @DynamicWebPaige from @GoogleDeepMind, and others. The most interesting use case was iterating on game engine assets based on LLM generated ideas
@googleaidevs
Google AI Developers
2 months
The @NanoBanana Hackathon starts September 6th 🍌 Join the global competition in collab with @elevenlabsio and @FAL to win Gemini API credits. We’re unlocking a free tier of the Gemini API to access Gemini 2.5 Flash Image for all your building needs.
7
4
56
@thomastjoshi
Thomas Joshi
2 months
Thanks to @varinnair @francesca_lab @cmricksen @KalGrinberg @matanSF @EnoReyes for settling the debate on whether coding agents actually make us more productive at #manvsmachine hackathon @FactoryAI @METR_Evals @AnthropicAI @openai
@FactoryAI
Factory
3 months
We’re hosting a historic hackathon with @METR_Evals, inspired by their latest paper that measured the real-world impact of AI coding tools. Here's how it works: 🤖 Half of participants will build with AI tools 👩‍💻 Half of participants will build without AI tools Judging is blind
1
0
18
@thomastjoshi
Thomas Joshi
3 months
I wonder what other known problems are out there waiting for a simple solution. I wish all academic labs would write posts like this about the stories behind the discovery https://t.co/d2AQnAXXYS
1
0
1
@AGIHouseSF
AGI House SF
7 months
1/ AutoMCP 🥇 1st Place ToolMaster RL - Training open-source LLMs to excel with MCPs through reinforcement learning. This project creates an environment where models learn tool usage through trial and error rather than prompt engineering. "Reinforcement Learning is All You
6
6
33
@thomastjoshi
Thomas Joshi
8 months
1
0
0
@thomastjoshi
Thomas Joshi
8 months
So hyped to win #1 hackathon project out of 400 participants for applying Reinforcement Learning with #MCP for AI #agents! Thanks to the sponsors @AnthropicAI (@sean_t_strong), @exa (@wangzjeff ), @SmitheryDotAI (@Calclavia ), @omedotme (@kodjima33) and host @AGIHouseSF
3
1
17
@eshorten300
Erika Shorten
1 year
Language models paired with function calling (or tool use) is a powerful way to build AI systems 🤖 Developers define a set of functions/tools, what they do, and their input arguments. Then we leave it to the language model to select the right one. For example, given the
4
28
102
@aiDotEngineer
AI Engineer
1 year
AI recaps of @itsSandraKublik @cohere Command R talk thanks to @_thothai x @basedsocialco
@altryne
Alex Volkov (Thursd/AI)
1 year
Tons of knowledge dropped by @itsSandraKublik from @cohere at the breakout section about tool use, agents (those are often the same thing!) and tons of new Cohere releases in this space including Command R+ 🔥 And with two minutes left we get a quick demo of "complexity" haha
1
5
17
@llama_index
LlamaIndex 🦙
1 year
Building Optimized RAG with LlamaIndex + DSPy 📈 We’re excited to announce a comprehensive set of integrations with DSPy that let you combine DSPy’s PyTorch-esque syntax and optimization capabilities with the comprehensive set of data+orchestration tools around RAG/agents that
5
77
294
@eshorten300
Erika Shorten
1 year
LLM + Memory + Planning + Tools = Agents 🤖 Last month, Job and I discussed how generative AI is shifting how companies offer customer support. How can we add more layers to our RAG apps to make it more agentic? LLM: Large language model alone Memory: Short-term and long-term
13
66
312
@thomasahle
Thomas Ahle
1 year
Fun meeting some fellow #DSPy heads, @CShorten30 and @thomastjoshi, at the Compound AI Systems Workshop!
2
8
51
@thomastjoshi
Thomas Joshi
1 year
🌟 Learn how to build and sell 3 AI companies to @Google and IAC with @_Tomatoai CEO, @oferronen, and @chappyasel for our latest @CollectPod episode by the @GenAICollective ! He's backed by @p72vc, @JAZZ_VP, @Cardumencapital , @tribecap, and @RecursiveVC We discuss Ofer’s
1
5
13
@thomastjoshi
Thomas Joshi
1 year
Congrats to @twelve_labs for raising their $50M Series A from @nvidia , @nea , @radicalvcfund , @IndexVentures, and Korea Investment Partners announced in Bloomberg @business today! Learn more about their journey from humble beginning from our interview with their CTO, Aiden L.
2
1
11
@haroonchoudery
Haroon Choudery
1 year
Curious what DSPy is? Here's @NaveenGRao, VP of GenAI at @databricks, with an overview. Full interview linked in the thread below:
@haroonchoudery
Haroon Choudery
1 year
DSPy is replacing prompt engineering. With Autoblocks, it becomes even more powerful. Use Autoblocks' collaborative UI to: - Surface DSPy params in a customizable Playground - Trigger test runs from your UI - Evaluate test results Read more 👇
0
7
27
@arizeai
Arize AI
2 years
Our DSPy meetup is coming up quick! Join us in SF on Wednesday night:
Tweet card summary image
luma.com
DSPy is a framework that enables you to program and optimize large language model (LLM) systems. DSPy introduces two new concepts: the programming model and…
@weaviate_io
Weaviate vector database
2 years
We've aggregated all of the DSPy resources from the Weaviate team on one page! It is broken into two categories: 1. Hands on Learning and 2. Read and Listen 📚 DSPy round-up: https://t.co/WeBpSUjNel We're also very excited to host our in-person meetup with DSPy, @arizeai, and
0
3
9