Maxime Rivest π§ββοΈπ¦π§
@MaximeRivest
Followers
6K
Following
3K
Media
1K
Statuses
6K
Easy LLM context for all! β¨pip install attachments Inspired by: ggplot2, DSPy, claudette, dplyr, OpenWebUI! Follow for: API design, AI, and Data πCCππ maker
Ottawa π¨π¦
Joined January 2018
Which AI framework do you use?. If your option is Others, which one?. And Why?. Please RT @DSPyOSS @LangChainAI @crewAIInc @pydantic
12
9
17
sometimes I think procedurally, sometimes probabilistically; the trick is to use the right one at the right time.
0
0
2
Here's Emil's 17 step guide to how he used VS Code agent mode plus Claude 3.7 Sonnet, Gemini Pro 3 and Claude Opus to build the new library - it's a fantastic case study in using LLMs for serious, prediction quality code (vibe engineering, not vibe coding)
4
14
272
@ryan_x_charles Yes and this has pretty profound implications. Running it in a loop through the night (for instance) now means you'll wake up with a better codebase. We still need to figure out what would be the right system/harness to make it put its efforts and attentions to places that are
2
2
23
Imagine if this code cell could run and print the results below, in e-ink. That would be a true notebook! Imagine sketching, writing, maybe some dictatingβall with just the right amount of AI assistance. That's probably the final destination for me. Where I want to be when I
0
0
2
Before Opus 4.5, the more you ran LLMs on a codebase, the more brittle it was getting. Now it slopes upward
39
30
741
it seems like we are entering the phase where a harness that reliably make models work overnight will be very very valuable.
0
0
2
A year ago, we verified a preview of an unreleased version of @OpenAI o3 (High) that scored 88% on ARC-AGI-1 at est. $4.5k/task Today, weβve verified a new GPT-5.2 Pro (X-High) SOTA score of 90.5% at $11.64/task This represents a ~390X efficiency improvement in one year
153
659
4K
I am always amazed how good LLMs are at guessing what I mean despite very big and horrible typos that would trip up many humans (my future self included). This seems like somewhat of a bug. Similar to how they can read text that is very messed up with letters replaced by
0
0
1
fwiw: i think its too early for us to estimate the relevant probabilities with any level of confidence required for us to act on that information. It thus seems like a distraction to me. we should thus not slow down anybody trying to automated science 100.1%. We merely need to
0
0
1
Hereβs 60 minutes of nonstop package sortingβ¦ boring enough for you? https://t.co/wQRH48OCu6
Impressive progress on the hardware side. But for mass adoption, we need these robots to handle the 'boring' stuff firstβreliability in chaos. A robot that makes coffee is cool. A robot that sorts 10,000 messy packages without a single error is profitable. The real race is in
473
388
4K
I think this is the core believe from Julian that must be discussed: "I love science, and I am afraid of a future where we are pushed back into the dark ages because we can no longer contribute to science. Human agency, including in creative processes, is vital and must be
I was at an event on AI for science yesterday, a panel discussion here at NeurIPS. The panelists discussed how they plan to replace humans at all levels in the scientific process. So I stood up and protested that what they are doing is evil. Look around you, I said. The room is
1
0
2
This is incorrect. LLMs can call tools to get info and change about the outside world, including viewing videos, moving robotic arms, etc. The pic below shows a minimal falsifying example. The value of this number squared is new informationβit's never been documented before.
as a reminder: AI cannot generate knowledge. It cannot create knowledge. It cannot find new information. It can only mix information that has already been found and written and input into computers by humans.
87
50
807
Anthropic is donating the Model Context Protocol to the Agentic AI Foundation, a directed fund under the Linux Foundation. https://t.co/DOt9Wsv2ip
anthropic.com
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
19
109
752
We are donating MCP to the newly created Agentic AI Foundation. I am thrilled that we found a way to ensure that MCP will always remain neutral. Our commitment to MCP remains the same. We continue to be deeply involved and help steer the ship alongside other core maintainers
Anthropic is donating the Model Context Protocol to the Agentic AI Foundation, a directed fund under the Linux Foundation. In one year, MCP has become a foundational protocol for agentic AI. Joining AAIF ensures MCP remains open and community-driven.
13
31
252
Harness harness harness. Claude code's harness is wonderful Cursors' harness is terrible Poetiq harness pushes Gemini into new territory on arc agi Copilot harness is the difference between copilot and chatgpt... The harness mattered a whole lot. Harness engineering is on!
11
8
111
Opus 4.5 is too good to be true. I think we've reached the "more than good enough" level; everything beyond this point may even be too much.
84
28
906