
Tomas Hernando Kofman
@tomas_hk
Followers
2K
Following
649
Media
86
Statuses
500
This is the last chatbot you’ll ever need. Yesterday, @mckaywrigley built an oss Not Diamond-powered chat app. We loved it. So today we’re releasing a hosted version. Get the best LLM on every message and hyper-personalize routing to your preferences with feedback. Watch how:
12
24
134
Standing-room only @websim_ai hackathon last night—so happy we could do this and thanks to everyone for coming and hanging out. Some of the coolest highlights in thread:
5
10
58
You can now use Not Diamond in Raycast to get recommended the best LLM on every message you send. It's super simple to set up—and I love having this just a shortcut away. s/o to @raycastapp and @thomaspaulmann for building such an incredible interface 🩶Set up link below 👇
5
6
50
You can now use Not Diamond in @OpenRouterAI! Access 293 top models with a single API key and automatically get routed to the best one for your use case. OpenRouter is one of the most important AI dev platforms out there. Excited to see what folks build! 🙌 @xanderatallah.
5. New Auto Router. Will post more details in the coming weeks, but you can get a sneak peak here: 🧠 It now routes to 19 different models, optimizing for quality, powered by Not Diamond. 💬 The chatroom now more clearly shows which model was used. The.
1
8
31
We're hiring exceptional founding team members for Not Diamond:. • Small, elite technical team over-indexed on emotional intelligence.• ($50K*years at Not Diamond) investment in your next startup.• $10K for a successful referral. JDs in thread, email me at t5@notdiamond.ai.
2
7
27
@reidhoffman on @theallinpod:. “The mistake people make is they think there’s going to be one model to rule them all… You’re going to see networks of models, traffic control, escalation… The multi-model approach is going to be quickly universal.". Networks of computers > big
2
6
25
@southpkcommons @OpenAI Spent this weekend at the @southpkcommons / @OpenAI hackathon building Temper, a tool that surfaces divisive tweets on contentious political subjects and drafts replies using evidence-based de-escalation techniques to reduce polarization: With incredible.
4
6
23
I’m incredibly honored not only to launch Not Diamond today but also to announce our $2.3M pre-seed round led by @defyvc with backing from some of the greatest AI scientists, engineers, and executives on this planet: @JeffDean (Google), @julien_c (Hugging Face), @iamthezack.
1
5
21
@bindureddy If anyone wants to integrate model routing into their app, we have an API at for our SOTA model router, along with a chatbot that learns your routing preferences in real-time. Backed by Jeff Dean, Julien Chaumond, etc.
1
3
20
This is sick. @mckaywrigley built a personalized chatbot arena on top of Not Diamond. Fully open-source too, check it out:.
Meet AI Router Chat. It’s a personal chatbot arena I made that adapts to your model preferences over time. It uses Not Diamond’s new API to dynamically select the best LLM for a given query. Watch to see how it works - I’m obsessed. GitHub link below!
0
5
19
Not Diamond is now integrated into @weights_biases Weave 🎉 . LLM routing can boost accuracy by 25% and reduce costs by 10x. Here’s how to train a custom router on your evals with w&b Weave and Not Diamond to route between LLMs:. s/o @l2k @altryne 🙏 🖤.
1
5
19
How to route between reasoning models like @deepseek_ai R1 and regular models like Claude 3.5 Sonnet 👇 This works out of the box. This is how you get all the reasoning firepower of R1 without burning up latency on every request!
0
3
17
I’m also excited to announce additional funding from @defyvc, @IBM, Fund, @MyriadVC, @deepwatermgmt, @dnxventures, and @AmbushCapital to continue building a world class team (have never worked with a better team in my life), and it’s an honor to have such.
1
3
17
Really nice technical blog post from @JungMinki7 surveying the model routing landscape, from Automix to RouteLLM to Not Diamond. Cool to see we helped Minki cut his trip-planning AI's cost by 50% and latency by 30%:
1
3
12
After reading Outlive by @PeterAttiaMD, I struggled to think of anyone I *wouldn't* recommend it to—it's essential reading on living longer & better. But at 500 pages, it’s a big time investment. That's why I wrote summarizing all key points & suggestions.
1
2
14
Not Diamond is now available in @langflow_ai by @DataStax to enable developers to access LLM routing in their no-code workflows!. With LLM routing you can maximize quality 📈, save costs 💰, and reduce latency 🏎️, with minimal effort. See how 👇
2
7
16
@karpathy @EverydayAI_ Andrej, you should check out <- it's a general framework that can take any evaluation data over any set of models for any set of inputs and learn an optimal recommendation algorithm for it to predictively select the best model for each input.
1
2
14
Not Diamond is now live on @Zapier! Watch this to learn how to build a Slack chatbot in under 3 minutes that dynamically routes between @AnthropicAI's Sonnet and Haiku to maximize quality while significantly reducing costs:
1
4
13
That feeling when you give someone their 50,000th star on github 🌟. Congrats @dify_ai. Amazing repo!
🎉 We’ve just hit 50k stars on GitHub!. ❤️ A huge thanks to our incredible community for being part of this journey. We’re about to unveil something big in v1.0, something we’ve been working on day and night. It’s set to change the game, making Dify more open and accessible than
0
3
13
Interesting takeaways from @harjtaggar and @sdianahu: "Applications don't want to be beholden to a single model. a lot of companies in the Fall 24 batch have a multi-model architecture to use the best one for the best task."
1
4
13
We’re a small cracked team (h/t @ilyasut) of researchers, engineers, and veteran ML founders. We’ve published in top AI research journals, grown companies from 0 to tens of millions in revenue, and built for billions of users. Send me a note if you want to join us.
1
1
12
Heavy firepower from Alibaba. Not surprising to see SOTA performance on multilingual benchmarks.
🔥Qwen2 has received a great deal of enthusiasm from the community. Qwen2 features five cutting-edge models of varying sizes: Qwen2-0.5B, Qwen2-1.5B, Qwen2-7B, Qwen2-57B-A14B (MoE), and Qwen2-72B. These models support 27 languages and have significantly enhanced capabilities in
0
6
12
We're hosting the world's shortest hackathon next week with @websim_ai—come hang with us! If you haven't messed with yet, block the next hour off and simulate the simulation.
Announcing the Websim Hackathon Boogaloo 6/20! 3 hackathons, 1 event:. 1. World’s Shortest Hackathon - 10 minutes, one prompt.2. Pass the Baton Hackathon - each team member gets 1 iterative prompt.3. One Hour Wonder Hackathon - no gimmicks, 1 hour make anything
0
8
10
A gem from @benthompson's interview with @natfriedman and @danielgross today: . The most important agent in AI is going to be the local agent that decides where to dispatch jobs. It doesn’t need to be big, it doesn’t need to be complex, but it is at the linchpin and it will.
0
3
11
@bindureddy This is awesome! Hmu if you want to integrate Not Diamond—we support routing b/w 40 models through our API and can hyper-personalize routing in real-time based on user feedback (e.g . We also have several oss routers you can integrate.
0
0
11
Amazing work from @JunlinWang3, @jueseph, @ben_athi, @ce_zhang, and @james_y_zou. Reminds me a bit of emergent communication research from a few years ago in which agents with diverse perceptual capabilities learn to communicate and benefit from each others' abilities.
Mixture of Agents—a framework that leverages the collective strengths of multiple LLMs. Each layer contains multiple agents that refine responses using outputs from the preceding layer. Together MoA achieves a score of 65.1% on AlpacaEval 2.0.
0
3
9
Will be livestreaming a conversation with @MatthewBerman in 30 minutes, please join us! Would love to see you there:
4
4
9
Join me tomorrow at 9am PT for this conversation with Dagster on LLM routing and data orchestration!.
Are you ready to get into the weeds on AI development best practices to reduce costs and improve accuracy? Join us on a Deep Dive on February 11 at 9 a.m. PT with the Not Diamond Team. We'll cover:.- The Not Diamond-Dagster integration.- Why you should be leveraging AI model
0
3
9
Had such a fun time galaxy braining on model routing with @MarkMoyou on his AI podcast! Check out the episode here:
0
2
8
@swyx @BEBischof @ankrgyl @latentspacepod Prompt adaptation improves model routing but can also be used independently of it. I would argue the #1 sign your company is serious about AI is that you've invested in the data-driven infrastructure to evaluate and optimize across any model instead of building on gut instinct.
0
3
9
@dark_sando Hi! Not Diamond supports routing between r1 and non-reasoning models out of the box. Check it out:
1
5
9
@lexfridman Daily user of Cursor—cool to see the question on routing. We've built a SOTA router that determines when to send queries to o1 vs when to use a weaker model: oss option also available ❤️. cc @mntruell, @amanrsanger, @sualehasif996, @ArVID220u.
o1 is insanely powerful. and insanely expensive: . 60x more expensive than 4o, 1000x more than 4o-mini. And it's not actually better on all domains. We've put together a super simple repo that routes to o1 when it really matters. Watch this to learn how to use it:
0
2
8
Love to see Not Diamond discussed in @fpingham's session on how to choose the right LLM model ❤️.
Upcoming Pampa Learning #6 (in English): Choosing a model. We will cover a few key topics when building an LLM-native application:. - how to choose which model to start with?.- how to decide if/when you need to change the model or finetune your own?.
0
2
8
Amazing work from @basetenco — thoughtful computational optimization for multi-model workflows is a deep need and will only become necessary as more and more applications leverage multiple models.
0
2
7
I'm at GenAI Summit this weekend! Hmu if you'll be there and we can hang.
🎉 Exciting News! Tomás Hernando Kofman(@tomas_hk ), Co-Founder of will be speaking at GENAI Summit Silicon Valley 2024! 🎉. Tomás co-founded Not Diamond, a startup focused on AI model optimization and routing queries to the best AI models. Backed by
0
2
7