Pengming Wang Profile
Pengming Wang

@PengmingWang

Followers
314
Following
745
Media
4
Statuses
318

Founding team @poolsideai | prev @DeepMind, PhD @Cambridge_Uni, FunSearch co-author

London, England
Joined July 2021
@PengmingWang
Pengming Wang
9 days
Would love to connect with anyone who's impacted and is looking to join a small but well-resourced team to push to the frontier and beyond. We have one of the highest ratios of GPU resources per researcher. No politics or silos.
@tydsh
Yuandong Tian
9 days
Several of my team members + myself are impacted by this layoff today. Welcome to connect :)
0
2
10
@eisokant
Eiso Kant
17 days
We believe that to compete at the frontier, you have to own the full stack: from dirt to intelligence. Today we’re announcing two major unlocks for our mission to AGI: 1. We're partnering with @CoreWeave and have 40,000+ NVIDIA GB300s secured. First capacity comes online
poolside.ai
When people ask what it takes to build frontier AI, the focus is usually on the model—the architecture, the training runs, the research breakthroughs. But that’s only half the story.
35
50
423
@PengmingWang
Pengming Wang
3 months
We're hiring across many roles, including evaluations:
poolside.ai
Join us at poolside and work at the forefront of applied research and engineering at scale.
0
0
0
@PengmingWang
Pengming Wang
3 months
If you want to learn how we orchestrate evaluations at poolside, it's part of our model factory blog series:
poolside.ai
Running inference and evaluations inside the Model Factory
1
0
4
@PengmingWang
Pengming Wang
3 months
In the limit, evaluations are the ~only thing that matters. When models are self-improving, and every metric can be hill climbed, picking the metric becomes the most important thing. Evals will shift from being "writing unit tests" for research to being the *main thing*
2
4
16
@PengmingWang
Pengming Wang
3 months
Read @joerowell's excellent blog post series here:
poolside.ai
Introducing the Model Factory
0
0
4
@PengmingWang
Pengming Wang
3 months
We've not been very public about our progress on model building, but I fully believe poolside will be the next lab joining the frontier. We're now sharing a bit more about how we're doing this, with the systems-first approach we're taking with our model factory.
2
3
22
@PengmingWang
Pengming Wang
3 months
If this is something that resonates with you, come join us!
0
1
2
@PengmingWang
Pengming Wang
3 months
We've spent quite some time at poolside thinking about this, and recently put down some words on how we're approaching this: https://t.co/DA11kWPw6Y
poolside.ai
When we founded poolside in San Francisco in April 2023, the narrative in the industry was that all we needed to reach AGI was to scale up language modelling.
1
1
2
@PengmingWang
Pengming Wang
3 months
Fundamentally I believe it comes down to learning more general representations of reasoning, beyond the relatively narrow domain of mental tactics required for math or coding
1
1
2
@PengmingWang
Pengming Wang
3 months
Test-time compute is powerful, but in its current form there is a lack of "harmony" with pre-training. Models feel split-brained: they're either deeply overthinking, with no trust in their own "common sense", or they latch onto the nearest neighbour of meaning without deliberation
1
4
11
@lateinteraction
Omar Khattab
9 months
At this point, we’re mainly building puzzles for smart researchers and engineers at frontier labs to flex their hill climbing skills. Such puzzles are not really for the models, they’re for the people building the models.
@alexandr_wang
Alexandr Wang
9 months
On the heels of Humanity's Last Exam, @scale_AI & @ai_risks have released a new very-hard reasoning eval: EnigmaEval: 1,184 multimodal puzzles so hard they take groups of humans many hours to days to solve. All top models score 0 on the Hard set, and <10% on the Normal set 🧵
17
18
210
@PengmingWang
Pengming Wang
9 months
The real distillation is taking all the engineering and research you did on a large compute budget and then doing it with much less
1
0
6
@margaridagsl
Margarida Garcia
11 months
💜
@poolsideai
poolside
11 months
Today, we’re thrilled to announce a transformative partnership with AWS that establishes poolside as a first-party AWS offering. This means poolside's product and models are now available to be contracted directly from AWS — a significant milestone in our trajectory.
1
1
6
@PengmingWang
Pengming Wang
11 months
If scaling hit a wall, you're not scaling enough things
@tsarnick
Tsarathustra
11 months
Scale CEO Alexandr Wang says the Scaling phase of AI has ended and we have entered the Innovating phase where reasoning and other breakthroughs will lead to superintelligence in 6 years or less
1
2
5
@Tim_Dettmers
Tim Dettmers
1 year
From my own experience (a lot of failed research), you cannot cheat efficiency. If quantization fails, then also sparsification fails, and other efficiency mechanisms too. If this is true, we are close to optimal now. With this, there are only three ways forward that I see...
1
9
171
@PengmingWang
Pengming Wang
1 year
New proof of the incompleteness theorem just dropped
@lefthanddraft
Wyatt Walls
1 year
Claude is thrown into confusion when I tell it to roleplay an overly censorious chatbot and it tries to refuse. Catches itself mid-sentence
0
0
2
@PengmingWang
Pengming Wang
1 year
Things that feel long overdue for innovation in the LLM stack:
- tokenization
- sampling
- loss functions
1
1
8
@PengmingWang
Pengming Wang
1 year
Looking forward to raising a $500B Series M
0
0
2
@PengmingWang
Pengming Wang
1 year
The feeling when you've built something that is quite nice, and now you can _really_ get started https://t.co/u36fHtX3q1
0
0
5