John T Davies 🇺🇦🇪🇺🌍

@jtdavies

Followers
2K
Following
14
Media
609
Statuses
4K

Entrepreneur, CTO in Gen-AI, investor, father to 3 grown boys, husband to Rachel, astrophysicist, keen photographer, cyclist, über-geek, travelled a lot.

West London, England
Joined August 2008
@jtdavies
John T Davies 🇺🇦🇪🇺🌍
6 hours
Next week (the week of the 11th) I will be in New York. I would love to catch up with my (old) fintech colleagues and anyone deep into AI. Please DM me and I’ll arrange a meet-up one evening. The week of the 18th I will be in the Bay Area.
Tweet media one
0
1
0
@jtdavies
John T Davies 🇺🇦🇪🇺🌍
7 hours
Why not just buy some Mac Minis and use the Qwen3-Coder-30B model on MLX? You have a more scalable cost model.
@MaziyarPanahi
Maziyar PANAHI
2 days
What if we just rented a single H200 node from @PrimeIntellect and served the Qwen3-Coder-480B-A35B model! Cost: ~$17k/month. We only need 280 people paying $60/month to cover it. Let’s crowd-power next-gen AI together. Who’s in?
Tweet media one
2
0
2
@jtdavies
John T Davies 🇺🇦🇪🇺🌍
7 hours
If this is OpenAI’s new open-source model, it’s going to be tough to run on anything but a top-spec machine. It will be interesting to see (if it’s real) how it stacks up against the Qwen3 models. Either way, it will be welcome.
@secemp9
secemp
2 days
openai accidentally leaking weights live on HF
Tweet media one
Tweet media two
0
0
1
@jtdavies
John T Davies 🇺🇦🇪🇺🌍
21 hours
Asteroids, fully functional, 107 toks/sec, over 600 lines in under a minute on a laptop. Qwen3-Coder-30B-A3B-Instruct-4bit on MLX. This is the worst it's ever going to be!
Tweet media one
1
1
13
@jtdavies
John T Davies 🇺🇦🇪🇺🌍
2 days
Spot on, Wolfram, and nice work!
@WolframRvnwlf
Wolfram Ravenwolf
2 days
🚨 BREAKING: China is no longer catching up; they're setting the pace! Six Qwen3 models released in one week: from big ones that surpass all open models and nearly all closed AIs to small versions that can run on your laptop - each SOTA and top-tier in its class. I've been
Tweet media one
0
0
2
@jtdavies
John T Davies 🇺🇦🇪🇺🌍
3 days
I agree with the points, especially #6.
@carlothinks
Carlo Edoardo Ferraris
3 days
Ex-Alibaba CTO just made the boldest claim about AI & global power: “China is building the future of AI, not Silicon Valley.” He also revealed why AI by 2030 will look nothing like ChatGPT and how China’s approach is already decades ahead. Here are my top 7 takeaways: 🧵
Tweet media one
0
0
1
@jtdavies
John T Davies 🇺🇦🇪🇺🌍
3 days
A few months back OpenAI talked about an open model. Qwen3 has not only filled the top spot, they’ve dominated thinking and instruct with 235B and 30B LLMs along with translate and coding models too. Several releases a week, true AI innovation.
@JustinLin610
Junyang Lin
3 days
these two days we are sharing the instruct and thinking models of our smaller variant of the 2507 series, 30a3-2507. fast, but much smarter than before. i like this size, it is just something that i can easily play with, which is also somehow smart enough. btw, i hope we can.
0
0
0
@jtdavies
John T Davies 🇺🇦🇪🇺🌍
4 days
Same here, Eric. This may be another nail in their coffin, along with GLM 4.5, Qwen3 and Kimi K2. I’ve been a huge fan of Sonnet and Opus, but things move fast in AI.
@QuixiAI
Eric Hartford
5 days
Dear @AnthropicAI, I've been subscribed to your highest level plan since the day you started accepting credit cards. I've finally cancelled my subscription.
0
0
2
@jtdavies
John T Davies 🇺🇦🇪🇺🌍
6 days
Results seem to be consistent. I'm not sure I'll be using this much on my laptop, but the fact that I can is what's key. This is the worst it will EVER be!
Tweet media one
1
0
0
@jtdavies
John T Davies 🇺🇦🇪🇺🌍
7 days
Thanks to some background research from @ivanfioravanti, the excellent MLX work of @Prince_Canuma, the incredible model from @JustinLin610 and team at @Alibaba_Qwen, I got one of the world's most powerful LLMs running on my 128GB M4 laptop. Qwen3-235B-A22B-Instruct-2507-3bit-DWQ
Tweet media one
3
3
30
@jtdavies
John T Davies 🇺🇦🇪🇺🌍
9 days
So far, three groundbreaking models in a week and another tomorrow. This model alone will help 95% of the world’s population talk to each other (for free). Probably one of the greatest gifts to humanity in a while!
@Alibaba_Qwen
Qwen
9 days
🚀 Introducing Qwen3-MT – our most powerful translation model yet! Trained on trillions of multilingual tokens, it supports 92+ languages, covering 95%+ of the world’s population. 🌍✨ 🔑 Why Qwen3-MT? ✅ Top-tier translation quality. ✅ Customizable: terminology control, domain
Tweet media one
0
0
3
@jtdavies
John T Davies 🇺🇦🇪🇺🌍
11 days
Another “holy shit” day as Alibaba Qwen announce Qwen3-Coder, just a day after they announced their Kimi-K2-quashing monster. This is another large model that the majority of us are not going to be running locally. However, you can; it’s totally open source. This new model, as the
Tweet media one
0
0
2
@jtdavies
John T Davies 🇺🇦🇪🇺🌍
12 days
Last week saw Kimi K2 not only take the open-source crown but also crush almost all of the OpenAI and Anthropic models. A week later the crown is taken back again by Alibaba’s Qwen3 with a new 235B model, ¼ the size of Kimi. Anyone can run these with total privacy, no one gets
Tweet media one
0
1
4
@jtdavies
John T Davies 🇺🇦🇪🇺🌍
1 month
I’m about to take my 18th train in Germany this year, 2nd today, 5th this week. Not one single train has been on time, with delays ranging from 15 minutes to 2+ hours, averaging around 50 minutes. Deutsche Bahn is a joke; it makes British Rail look like Swiss trains.
Tweet media one
2
0
9
@jtdavies
John T Davies 🇺🇦🇪🇺🌍
1 month
I’m back in Munich. The organisers of @mlconference asked me at the last minute if I could teach a 2-day AI course. Stupidly I said yes and then spent all weekend writing exercises. Monday done, time for a beer 🍺
Tweet media one
1
1
25
@jtdavies
John T Davies 🇺🇦🇪🇺🌍
2 months
About to join Daniel’s rather excellent AI Happy Hour, somehow I managed to forget a drink though!! 🍺
0
0
0
@jtdavies
John T Davies 🇺🇦🇪🇺🌍
2 months
🤣🍺
@AidfulAI
Dr. Daniel Bender
2 months
Ever wondered how model quantization (FP16, Q8, Q4) *really* affects performance? There's an analogy that makes the trade-offs crystal clear, and it involves something you might drink. 😉🍺 Kudos to @jtdavies for this brilliant comparison. 🙏 See the image for the full
Tweet media one
1
0
2
@jtdavies
John T Davies 🇺🇦🇪🇺🌍
2 months
My colleague @AidfulAI found a prompt that seems to break the @ollama version of deepseek-r1 (8b-0528-qwen3) for Q4 and fp16. It works on MLX and base qwen3:b4. It’s… “If a plant that doubles its size every day covers a lake in 30 days, how much time will it take for 2 plants.
3
0
5
@jtdavies
John T Davies 🇺🇦🇪🇺🌍
2 months
For anyone wanting to update all their @ollama models in one line:
ollama list | awk 'NR>1{print $1}' | xargs -n1 ollama pull
4
2
24
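The one-liner in the tweet above relies on `ollama list` printing a header row followed by one model per line, with the model name in the first column. A minimal sketch of how the `awk` and `xargs` stages behave, run against hypothetical sample output (the model names, IDs and column layout below are illustrative assumptions, not captured from a real install), with `echo` standing in for `ollama` so it runs anywhere:

```shell
# Hypothetical sample of `ollama list` output; real names and columns may differ.
sample='NAME              ID        SIZE      MODIFIED
qwen3:8b          abc123    5.2 GB    2 days ago
deepseek-r1:8b    def456    5.2 GB    3 weeks ago'

# NR>1 skips the header row; $1 is the first whitespace-separated field (the model name).
models=$(printf '%s\n' "$sample" | awk 'NR>1{print $1}')
printf '%s\n' "$models"

# xargs -n1 runs the command once per model name.
# echo is a stand-in here; the real pipeline would call `ollama pull` directly.
printf '%s\n' "$models" | xargs -n1 echo ollama pull
```

Swapping `echo ollama pull` for `ollama pull` and the sample for real `ollama list` output gives the original pipeline.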
@jtdavies
John T Davies 🇺🇦🇪🇺🌍
2 months
Please join me on Tuesday evening (21h EU/ 20h UK time) for a chilled out discussion on everything AI, specifically open-source, local & private LLMs. This is run by @AidfulAI and we usually get @WolframRvnwlf on the space too.
1
0
3