Thomas Joshi
@thomastjoshi
Followers
1K
Following
2K
Media
18
Statuses
166
Investor @NEA, ML Engineer at startups (General Catalyst and Khosla backed) | Co-author of @Stanford @DSPyOSS | Comp Science (AI Focus) & Electrical Eng @Columbia
San Francisco, CA
Joined March 2018
@databricks CEO @alighodsi gave a shout-out to DSPy during his keynote at the #dataaisummit. Later, I gave a talk to a completely packed room of DSPy enthusiasts on real-world applications of DSPy. The first talk was so full that the conference organizers asked me to do a second
4
4
27
Insane! Will put allegations of bootstrapping off someone else’s model to the test in a statistically significant manner.
You spend $1B training a model A. Someone on your team leaves and launches their own model API B. You're suspicious. Was B derived (e.g., fine-tuned) from A? But you only have blackbox access to B... With our paper, you can still tell with strong statistical guarantees
0
1
1
As the keynote speaker at the @BerkeleyHaas VC Conference, I dissected the algorithmic, compute, and environment requirements needed to scale Reinforcement Learning and the business model permutations in the ecosystem, following an excellent talk by @JoshConstine from @SignalFire on the
0
0
3
We hosted a group of researchers for dinner at the Conference on Language Modeling (COLM). Some takeaways from the conf: #COLM2025 (1) Qwen team planned on scaling training tokens from 10T to 100T, model parameters from 1T to 10T, and context length from 1M to 10M tokens.
Talk from Wenting Zhao of Qwen on their plans during COLM. Seems like 1 word is the plan still: scaling training up! Let’s go.
0
5
53
After 10 years of failed startups, I’ve finally found the next trillion dollar company. Introducing @VirioAI — the AI marketer for enterprise. As a teen I made content that did 1B+ views/month… …and earned <$10K. That’s when I learned: Views ≠ Pipeline Likes ≠ Revenue The
520
391
1K
https://t.co/zI5qBKRdoV Thinking Machines just released an API for fine-tuning language models, now it makes sense why they wanted PyTorch compiler people. TM is focused on both small and large open weight models. Cloud offering only, which is not complementary to Jensen's vision
thinkingmachines.ai
Introducing Tinker: a flexible API for fine-tuning language models.
0
1
4
Invited to judge the @GoogleDeepMind, @FAL, @elevenlabsio, @cerebral_valley @NanoBanana hackathon with @burkaygur, founder of @FAL, @DynamicWebPaige from @GoogleDeepMind, and others. The most interesting use case was iterating on game engine assets based on LLM-generated ideas
The @NanoBanana Hackathon starts September 6th 🍌 Join the global competition in collab with @elevenlabsio and @FAL to win Gemini API credits. We’re unlocking a free tier of the Gemini API to access Gemini 2.5 Flash Image for all your building needs.
7
4
56
Thanks to @varinnair @francesca_lab @cmricksen @KalGrinberg @matanSF @EnoReyes for settling the debate on whether coding agents actually make us more productive at #manvsmachine hackathon @FactoryAI @METR_Evals @AnthropicAI @openai
We’re hosting a historic hackathon with @METR_Evals, inspired by their latest paper that measured the real-world impact of AI coding tools. Here's how it works: 🤖 Half of participants will build with AI tools 👩‍💻 Half of participants will build without AI tools Judging is blind
1
0
18
I wonder what other known problems are out there waiting for a simple solution. I wish all academic labs would write posts like this about the stories behind the discovery https://t.co/d2AQnAXXYS
1
0
1
1/ AutoMCP 🥇 1st Place ToolMaster RL - Training open-source LLMs to excel with MCPs through reinforcement learning. This project creates an environment where models learn tool usage through trial and error rather than prompt engineering. "Reinforcement Learning is All You
6
6
33
So hyped to win #1 hackathon project out of 400 participants for applying Reinforcement Learning with #MCP for AI #agents! Thanks to the sponsors @AnthropicAI (@sean_t_strong), @exa (@wangzjeff), @SmitheryDotAI (@Calclavia), @omedotme (@kodjima33) and host @AGIHouseSF
3
1
17
Pairing language models with function calling (or tool use) is a powerful way to build AI systems 🤖 Developers define a set of functions/tools, what they do, and their input arguments. Then we leave it to the language model to select the right one. For example, given the
4
28
102
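A minimal sketch of the function-calling pattern described in the post above, assuming a JSON tool-call format. Everything here is hypothetical and for illustration only: the tools `get_weather` and `add`, the `TOOLS` registry layout, and `fake_model`, which stands in for a real language model and simply returns a hard-coded JSON tool call. No real model API is called.

```python
import json
from typing import Any, Dict

# Hypothetical tools: names, arguments, and return values are illustrative only.
def get_weather(city: str) -> str:
    return f"Sunny in {city}"

def add(a: float, b: float) -> float:
    return a + b

# The developer defines each tool, what it does, and its input arguments.
TOOLS: Dict[str, Dict[str, Any]] = {
    "get_weather": {
        "fn": get_weather,
        "description": "Look up the weather for a city.",
        "parameters": {"city": "string"},
    },
    "add": {
        "fn": add,
        "description": "Add two numbers.",
        "parameters": {"a": "number", "b": "number"},
    },
}

def tool_specs() -> str:
    """Render the tool descriptions that would be sent to the model."""
    specs = [
        {"name": name, "description": t["description"], "parameters": t["parameters"]}
        for name, t in TOOLS.items()
    ]
    return json.dumps(specs)

def fake_model(prompt: str, specs: str) -> str:
    """Stand-in for a language model (assumption): picks a tool by keyword
    and returns the choice as a JSON tool call."""
    if "weather" in prompt.lower():
        return json.dumps({"name": "get_weather", "arguments": {"city": "Paris"}})
    return json.dumps({"name": "add", "arguments": {"a": 2, "b": 3}})

def dispatch(tool_call_json: str) -> Any:
    """Parse the model's tool call and run the matching function."""
    call = json.loads(tool_call_json)
    tool = TOOLS.get(call["name"])
    if tool is None:
        raise ValueError(f"Unknown tool: {call['name']}")
    return tool["fn"](**call["arguments"])

if __name__ == "__main__":
    specs = tool_specs()
    print(dispatch(fake_model("What's the weather in Paris?", specs)))
    print(dispatch(fake_model("Please add 2 and 3", specs)))
```

In a real system the model, not a keyword check, would choose the tool and fill in the arguments from the specs. The dispatch step would stay roughly the same.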
Tons of knowledge dropped by @itsSandraKublik from @cohere at the breakout session about tool use, agents (those are often the same thing!) and tons of new Cohere releases in this space including Command R+ 🔥 And with two minutes left we get a quick demo of "complexity" haha
1
5
17
Building Optimized RAG with LlamaIndex + DSPy 📈 We’re excited to announce a comprehensive set of integrations with DSPy that let you combine DSPy’s PyTorch-esque syntax and optimization capabilities with the comprehensive set of data+orchestration tools around RAG/agents that
5
77
294
LLM + Memory + Planning + Tools = Agents 🤖 Last month, Job and I discussed how generative AI is shifting how companies offer customer support. How can we add more layers to our RAG apps to make it more agentic? LLM: Large language model alone Memory: Short-term and long-term
13
66
312
Fun meeting some fellow #DSPy heads, @CShorten30 and @thomastjoshi, at the Compound AI Systems Workshop!
2
8
51
🌟 Learn how to build and sell 3 AI companies to @Google and IAC with @_Tomatoai CEO, @oferronen, and @chappyasel for our latest @CollectPod episode by the @GenAICollective! He's backed by @p72vc, @JAZZ_VP, @Cardumencapital, @tribecap, and @RecursiveVC We discuss Ofer’s
1
5
13
Congrats to @twelve_labs for raising their $50M Series A from @nvidia, @nea, @radicalvcfund, @IndexVentures, and Korea Investment Partners, announced in Bloomberg @business today! Learn more about their journey from humble beginnings in our interview with their CTO, Aiden L.
2
1
11
Curious what DSPy is? Here's @NaveenGRao, VP of GenAI at @databricks, with an overview. Full interview linked in the thread below:
DSPy is replacing prompt engineering. With Autoblocks, it becomes even more powerful. Use Autoblocks' collaborative UI to: - Surface DSPy params in a customizable Playground - Trigger test runs from your UI - Evaluate test results Read more 👇
0
7
27
Our DSPy meetup is coming up quick! Join us in SF on Wednesday night:
luma.com
DSPy is a framework that enables you to program and optimize large language model (LLM) systems. DSPy introduces two new concepts: the programming model and…
We've aggregated all of the DSPy resources from the Weaviate team on one page! It is broken into two categories: 1. Hands on Learning and 2. Read and Listen 📚 DSPy round-up: https://t.co/WeBpSUjNel We're also very excited to host our in-person meetup with DSPy, @arizeai, and
0
3
9