Thomas Joshi @thomastjoshi X Profile

Thomas Joshi

@thomastjoshi

Followers

1K

Following

2K

Media

18

Statuses

166

Investor @NEA, ML Engineer@startups (General Catalyst and Khosla backed) | Co-author of @Stanford @DSPyOSS | Comp Science (AI Focus) & Electrical Eng @Columbia

San Francisco, CA

Joined March 2018

Don't wanna be here? Send us removal request.

Thomas Joshi

@thomastjoshi

1 year

@databricks CEO, @alighodsi gave a shout out to DSPy during his keynote at the #dataaisummit. Later, I gave a talk to a completely packed room of DSPy enthusiasts on real world applications of DSPy. The first talk was so full that the conference organizers asked me to do a second

4

27

Thomas Joshi

@thomastjoshi

20 days

Insane! Will put allegations of bootstrapping off someone else’s model to the test in a statistically significant manner.

Percy Liang

@percyliang

21 days

You spend $1B training a model A. Someone on your team leaves and launches their own model API B. You're suspicious. Was B was derived (e.g., fine-tuned) from A? But you only have blackbox access to B... With our paper, you can still tell with strong statistical guarantees

0

1

Thomas Joshi

@thomastjoshi

26 days

As the keynote speaker at the @BerkeleyHaas VC Conference, I dissected the algorithmic, compute, and environment requirements needed to scale Reinforcement Learning and the business model permutations in the ecosystem after @JoshConstine from @SignalFire excellent talk on the

0

3

Thomas Joshi

@thomastjoshi

1 month

We hosted a group of researchers for dinner at Conference on Language Modeling (COLM). Some takeaways from the conf: #COLM2025 (1) Qwen team planned on scaling training tokens from 10T to 100T, model parameters from 1T to 10T, and context length from 1M to 10M tokens.

Nathan Lambert

@natolambert

1 month

Talk from Wenting Zhao of Qwen on their plans during COLM. Seems like 1 word is the plan still: scaling training up! Let’s go.

0

5

53

Eric Lay

@itsericlay

1 month

After 10 years of failed startups, I’ve finally found the next trillion dollar company. Introducing @VirioAI — the AI marketer for enterprise. As a teen I made content that did 1B+ views/month… …and earned <$10K. That’s when I learned: Views ≠ Pipeline Likes ≠ Revenue The

520

391

1K

Thomas Joshi

@thomastjoshi

1 month

https://t.co/zI5qBKRdoV Thinking Machines just released an API for finetuning language models, now it makes sense why they wanted Pytorch compiler people. TM is focused on both small and large open weight models. Cloud offering only, which is not complementary to Jensen's vision

thinkingmachines.ai

Introducing Tinker: a flexible API for fine-tuning language models.

0

1

4

Thomas Joshi

@thomastjoshi

2 months

Invited to judge the @GoogleDeepMind, @FAL, @elevenlabsio , @cerebral_valley @NanoBanana hackathon with @burkaygur founder of @FAL , @DynamicWebPaige from @GoogleDeepMind, and others. The most interesting use case was iterating on game engine assets based on LLM generated ideas

Google AI Developers

@googleaidevs

2 months

The @NanoBanana Hackathon starts September 6th 🍌 Join the global competition in collab with @elevenlabsio and @FAL to win Gemini API credits. We’re unlocking a free tier of the Gemini API to access Gemini 2.5 Flash Image for all your building needs.

7

4

56

Thomas Joshi

@thomastjoshi

2 months

Thanks to @varinnair @francesca_lab @cmricksen @KalGrinberg @matanSF @EnoReyes for settling the debate on whether coding agents actually make us more productive at #manvsmachine hackathon @FactoryAI @METR_Evals @AnthropicAI @openai

Factory

@FactoryAI

3 months

We’re hosting a historic hackathon with @METR_Evals, inspired by their latest paper that measured the real-world impact of AI coding tools. Here's how it works: 🤖 Half of participants will build with AI tools 👩‍💻 Half of participants will build without AI tools Judging is blind

1

0

18

Thomas Joshi

@thomastjoshi

3 months

I wonder what other known problems are out there waiting for a simple solution. I wish all academic labs would write posts like this about the stories behind the discovery https://t.co/d2AQnAXXYS

1

0

1

AGI House SF

@AGIHouseSF

7 months

1/ AutoMCP 🥇 1st Place ToolMaster RL - Training open-source LLMs to excel with MCPs through reinforcement learning. This project creates an environment where models learn tool usage through trial and error rather than prompt engineering. "Reinforcement Learning is All You

6

33

Thomas Joshi

@thomastjoshi

8 months

@dsp_

1

0

Thomas Joshi

@thomastjoshi

8 months

So hyped to win #1 hackathon project out of 400 participants for applying Reinforcement Learning with #MCP for AI #agents! Thanks to the sponsors @AnthropicAI (@sean_t_strong), @exa (@wangzjeff ), @SmitheryDotAI (@Calclavia ), @omedotme (@kodjima33) and host @AGIHouseSF

3

1

17

Erika Shorten

@eshorten300

1 year

Language models paired with function calling (or tool use) is a powerful way to build AI systems 🤖 Developers define a set of functions/tools, what they do, and their input arguments. Then we leave it to the language model to select the right one. For example, given the

4

28

102

AI Engineer

@aiDotEngineer

1 year

AI recaps of @itsSandraKublik @cohere Command R talk thanks to @_thothai x @basedsocialco

Alex Volkov (Thursd/AI)

@altryne

1 year

Tons of knowledge dropped by @itsSandraKublik from @cohere at the breakout section about tool use, agents (those are often the same thing!) and tons of new Cohere releases in this space including Command R+ 🔥 And with two minutes left we get a quick demo of "complexity" haha

1

5

17

LlamaIndex 🦙

@llama_index

1 year

Building Optimized RAG with LlamaIndex + DSPy 📈 We’re excited to announce a comprehensive set of integrations with DSPy that let you combine DSPy’s PyTorch-esque syntax and optimization capabilities with the comprehensive set of data+orchestration tools around RAG/agents that

5

77

294

Erika Shorten

@eshorten300

1 year

LLM + Memory + Planning + Tools = Agents 🤖 Last month, Job and I discussed how generative AI is shifting how companies offer customer support. How can we add more layers to our RAG apps to make it more agentic? LLM: Large language model alone Memory: Short-term and long-term

13

66

312

Thomas Ahle

@thomasahle

1 year

Fun meeting some fellow #DSPy heads, @CShorten30 and @thomastjoshi, at the Compound AI Systems Workshop!

2

8

51

Thomas Joshi

@thomastjoshi

1 year

🌟 Learn how to build and sell 3 AI companies to @Google and IAC with @_Tomatoai CEO, @oferronen, and @chappyasel for our latest @CollectPod episode by the @GenAICollective ! He's backed by @p72vc, @JAZZ_VP, @Cardumencapital , @tribecap, and @RecursiveVC We discuss Ofer’s

1

5

13

Thomas Joshi

@thomastjoshi

1 year

Congrats to @twelve_labs for raising their $50M Series A from @nvidia , @nea , @radicalvcfund , @IndexVentures, and Korea Investment Partners announced in Bloomberg @business today! Learn more about their journey from humble beginning from our interview with their CTO, Aiden L.

2

1

11

Haroon Choudery

@haroonchoudery

1 year

Curious what DSPy is? Here's @NaveenGRao, VP of GenAI at @databricks, with an overview. Full interview linked in the thread below:

Haroon Choudery

@haroonchoudery

1 year

DSPy is replacing prompt engineering. With Autoblocks, it becomes even more powerful. Use Autoblocks' collaborative UI to: - Surface DSPy params in a customizable Playground - Trigger test runs from your UI - Evaluate test results Read more 👇

0

7

27

Arize AI

@arizeai

2 years

Our DSPy meetup is coming up quick! Join us in SF on Wednesday night:

luma.com

DSPy is a framework that enables you to program and optimize large language model (LLM) systems. DSPy introduces two new concepts: the programming model and…

Weaviate vector database

@weaviate_io

2 years

We've aggregated all of the DSPy resources from the Weaviate team on one page! It is broken into two categories: 1. Hands on Learning and 2. Read and Listen 📚 DSPy round-up: https://t.co/WeBpSUjNel We're also very excited to host our in-person meetup with DSPy, @arizeai, and

0

3

9