mikeknoop Profile Banner
Mike Knoop Profile
Mike Knoop

@mikeknoop

Followers
22K
Following
9K
Media
245
Statuses
4K

co-founder @ndea and @zapier @arcprize

sf bay area
Joined July 2009
Don't wanna be here? Send us removal request.
@mikeknoop
Mike Knoop
1 day
Google is now the list twice:
@mikeknoop
Mike Knoop
1 month
The latest "don't call it an acquisition" news with Scale AI/Meta: For those keeping track:.* Inflection --> Microsoft.* Adept --> Amazon.* Character --> Google.* Scale AI --> Meta.
0
0
4
@mikeknoop
Mike Knoop
1 day
RT @fchollet: Btw we're hosting a private event for the launch of the ARC-AGI 3 developer preview, next week in SF. July 17th, 6pm. Open to….
0
17
0
@mikeknoop
Mike Knoop
2 days
> AI tools require adjustment in process. This strikes me as true. Effectively using AI in tools like Cursor or VS Code requires changing existing habits and coding patterns. You have to actively manage context and docs to get the best results.
@emollick
Ethan Mollick
2 days
I think one factor driving varied results is that AI tools are actually not that easy to use right away and require a learning curve of some hours (see @simonw). Another is that they require adjustments in process. A third is that AI has different uses depending on user expertise.
2
1
18
@mikeknoop
Mike Knoop
2 days
Zooming out ARC progress: I'd say OpenAI's o-series progression on v1 is a bigger deal than Grok's progression on v2 (so far). The o-series marked a critical frontier AI transition moment from scaling pretraining --> scaling test time adaptation. Whereas Grok 4 mostly takes.
17
26
356
@mikeknoop
Mike Knoop
3 days
This is accurate. We verified Grok 4 using our semi-private ARC datasets.
@elonmusk
Elon Musk
3 days
@BasedBeffJezos Important to note that ARC tested @Grok 4 independently to achieve those results. Those results are not from us.
404
1K
8K
@mikeknoop
Mike Knoop
3 days
Perhaps the most unintuitive thing about AI today is that AI can simultaneously score 50%+ on Humanity's Last Exam (relatively hard for humans) while only scoring 16% on ARC-AGI-2 (relatively easy for humans). Example v2 task below.
Tweet media one
32
26
294
@mikeknoop
Mike Knoop
3 days
This is the second time ARC-AGI has been used to demonstrate frontier AI progress by a major lab. Thanks @xai team and congrats on Grok 4 launch.
@arcprize
ARC Prize
3 days
Thank you to the @xai team for working with us to validate Grok 4's score and inviting us to the watch the live stream
Tweet media one
3
6
354
@mikeknoop
Mike Knoop
3 days
Grok 4 takes the #1 spot on the ARC v2 commercial leaderboard with an impressive jump over previous SOTA.
@arcprize
ARC Prize
3 days
Grok 4 (Thinking) achieves new SOTA on ARC-AGI-2 with 15.9%. This nearly doubles the previous commercial SOTA and tops the current Kaggle competition SOTA
Tweet media one
5
16
161
@mikeknoop
Mike Knoop
10 days
I think people in the US generally overestimate the traction a third national party needs to gain in congress in order to have significant negotiating leverage.
@LeadingReport
Leading Report
10 days
BREAKING: Republicans lose Rep. Brian Fitzpatrick, meaning if votes hold, the ‘Big Beautiful Bill’ will fail. GOP will attempt to flip someone.
1
0
37
@mikeknoop
Mike Knoop
10 days
RT @tony873004: Interstellar visitor confirmed. #A11pl3Z is now known as 3I/ATLAS. It is only the third confirmed object from beyond our s….
0
341
0
@mikeknoop
Mike Knoop
11 days
Assuming steady AI progress, the design goal for ARC v2 was to endure 12-18 months. Goal for v3 is >3 years. (Note this July 17 event is a preview launch of the first v3 interactive reasoning games to get feedback. We plan to ship full v3 dataset early 2026).
@mark_k
Mark Kretschmann
12 days
ARC-AGI 3 was just announced, the AI benchmark by Francois Chollet. This came unexpected because ARC-AGI 2 is very far from saturation at this point. @arcprize
Tweet media one
1
11
56
@mikeknoop
Mike Knoop
11 days
The more idea constrained we are toward AGI progress, the more true this is. And evidence strongly suggests we are idea constrained. Making a big impact requires doing something very different than everyone else.
@natolambert
Nathan Lambert
11 days
Its not too late to have impact on AI. It's not just up to the current leaders of AI. People who are motivated to have impact can get up to speed super fast. Worth repeating.
3
5
47
@mikeknoop
Mike Knoop
11 days
RT @fchollet: Impressive results from Sakana AI on ARC-AGI-2 with a new method for test-time-search and ensembling!. Please be mindful when….
0
140
0
@mikeknoop
Mike Knoop
12 days
If marketing continues to diverge from reality, the market demand for progress will go up. Good news for all of you "just" working on AGI.
@mark_k
Mark Kretschmann
13 days
Chat, I want to start a discussion:. Why did all the AI labs stop talking about AGI, and instead "superintelligence" is the new word?. It can't be coincidence.
2
1
12
@mikeknoop
Mike Knoop
12 days
RT @arcprize: ARC-AGI-3 Developer Preview . * Hands on first look at ARC-AGI-3 (live demos & API access).* Fireside with @fchollet moderate….
0
181
0
@mikeknoop
Mike Knoop
12 days
Accuracy vs efficiency reporting is great to see.
@mustafasuleyman
Mustafa Suleyman
13 days
We're taking a big step towards medical superintelligence. AI models have aced multiple choice medical exams – but real patients don’t come with ABC answer options. Now MAI-DxO can solve some of the world’s toughest open-ended cases with higher accuracy and lower costs.
Tweet media one
2
0
9
@mikeknoop
Mike Knoop
15 days
Clarifying, this was the first time direct imaging was used to confirm discovery of a new exoplanet. First-ever direct exoplanet image from JWST was in 2022:
0
0
4
@mikeknoop
Mike Knoop
15 days
Paper:
1
0
3
@mikeknoop
Mike Knoop
15 days
NASA JWST just reported its first-ever direct imaging of an exoplanet (infrared): TWA 7b. It's about 110 ly away, 50AU from its host star, and 30% mass of Jupiter.
Tweet media one
1
1
23
@mikeknoop
Mike Knoop
18 days
RT @peterwildeford: New challenge for AGI: if AI could solve 60+ games never seen by AI or human developers. A really cool evaluation in de….
0
3
0