MaximeRivest Profile Banner
Maxime Rivest πŸ§™β€β™‚οΈπŸ¦™πŸ§ Profile
Maxime Rivest πŸ§™β€β™‚οΈπŸ¦™πŸ§

@MaximeRivest

Followers
6K
Following
3K
Media
1K
Statuses
6K

Easy LLM context for all! ✨pip install attachments Inspired by: ggplot2, DSPy, claudette, dplyr, OpenWebUI! Follow for: API design, AI, and Data 🐍CCπŸ“œπŸ›  maker

Ottawa πŸ‡¨πŸ‡¦
Joined January 2018
Don't wanna be here? Send us removal request.
@MaximeRivest
Maxime Rivest πŸ§™β€β™‚οΈπŸ¦™πŸ§
7 months
28
109
1K
@getpy
Ankur Gupta
21 hours
Which AI framework do you use?. If your option is Others, which one?. And Why?. Please RT @DSPyOSS @LangChainAI @crewAIInc @pydantic
12
9
17
@spikedoanz
spike
1 day
@aidenybai it's a good deal sir
11
3
171
@premium
Premium
4 months
Why guess when you can know?
0
721
9K
@MaximeRivest
Maxime Rivest πŸ§™β€β™‚οΈπŸ¦™πŸ§
18 hours
sometimes I think procedurally, sometimes probabilistically; the trick is to use the right one at the right time.
0
0
2
@simonw
Simon Willison
2 days
Here's Emil's 17 step guide to how he used VS Code agent mode plus Claude 3.7 Sonnet, Gemini Pro 3 and Claude Opus to build the new library - it's a fantastic case study in using LLMs for serious, prediction quality code (vibe engineering, not vibe coding)
4
14
272
@MaximeRivest
Maxime Rivest πŸ§™β€β™‚οΈπŸ¦™πŸ§
3 days
@ryan_x_charles Yes and this has pretty profound implications. Running it in a loop through the night (for instance) now means you'll wake up with a better codebase. We still need to figure out what would be the right system/harness to make it put its efforts and attentions to places that are
2
2
23
@MaximeRivest
Maxime Rivest πŸ§™β€β™‚οΈπŸ¦™πŸ§
2 days
Imagine if this code cell could run and print the results below, in e-ink. That would be a true notebook! Imagine sketching, writing, maybe some dictatingβ€”all with just the right amount of AI assistance. That's probably the final destination for me. Where I want to be when I
0
0
2
@MaximeRivest
Maxime Rivest πŸ§™β€β™‚οΈπŸ¦™πŸ§
3 days
Before Opus 4.5, the more you ran LLMs on a codebase, the more brittle it was getting. Now it slopes upward
39
30
741
@MaximeRivest
Maxime Rivest πŸ§™β€β™‚οΈπŸ¦™πŸ§
4 days
it seems like we are entering the phase where a harness that reliably make models work overnight will be very very valuable.
0
0
2
@arcprize
ARC Prize
4 days
A year ago, we verified a preview of an unreleased version of @OpenAI o3 (High) that scored 88% on ARC-AGI-1 at est. $4.5k/task Today, we’ve verified a new GPT-5.2 Pro (X-High) SOTA score of 90.5% at $11.64/task This represents a ~390X efficiency improvement in one year
153
659
4K
@MaximeRivest
Maxime Rivest πŸ§™β€β™‚οΈπŸ¦™πŸ§
5 days
I am always amazed how good LLMs are at guessing what I mean despite very big and horrible typos that would trip up many humans (my future self included). This seems like somewhat of a bug. Similar to how they can read text that is very messed up with letters replaced by
0
0
1
@MaximeRivest
Maxime Rivest πŸ§™β€β™‚οΈπŸ¦™πŸ§
5 days
fwiw: i think its too early for us to estimate the relevant probabilities with any level of confidence required for us to act on that information. It thus seems like a distraction to me. we should thus not slow down anybody trying to automated science 100.1%. We merely need to
0
0
1
@adcock_brett
Brett Adcock
6 days
Here’s 60 minutes of nonstop package sorting… boring enough for you? https://t.co/wQRH48OCu6
@aykulm
Mehmet Aykul
6 days
Impressive progress on the hardware side. But for mass adoption, we need these robots to handle the 'boring' stuff firstβ€”reliability in chaos. A robot that makes coffee is cool. A robot that sorts 10,000 messy packages without a single error is profitable. The real race is in
473
388
4K
@MaximeRivest
Maxime Rivest πŸ§™β€β™‚οΈπŸ¦™πŸ§
5 days
I think this is the core believe from Julian that must be discussed: "I love science, and I am afraid of a future where we are pushed back into the dark ages because we can no longer contribute to science. Human agency, including in creative processes, is vital and must be
@togelius
Julian Togelius
8 days
I was at an event on AI for science yesterday, a panel discussion here at NeurIPS. The panelists discussed how they plan to replace humans at all levels in the scientific process. So I stood up and protested that what they are doing is evil. Look around you, I said. The room is
1
0
2
@jeremyphoward
Jeremy Howard
7 days
This is incorrect. LLMs can call tools to get info and change about the outside world, including viewing videos, moving robotic arms, etc. The pic below shows a minimal falsifying example. The value of this number squared is new informationβ€”it's never been documented before.
@moorehn
Heidi N. Moore
8 days
as a reminder: AI cannot generate knowledge. It cannot create knowledge. It cannot find new information. It can only mix information that has already been found and written and input into computers by humans.
87
50
807
@MCP_Community
Model Context Protocol (MCP)
7 days
Anthropic is donating the Model Context Protocol to the Agentic AI Foundation, a directed fund under the Linux Foundation. https://t.co/DOt9Wsv2ip
Tweet card summary image
anthropic.com
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
19
109
752
@MaximeRivest
Maxime Rivest πŸ§™β€β™‚οΈπŸ¦™πŸ§
6 days
i link feature engineering..
0
0
0
@dsp_
David Soria Parra
7 days
We are donating MCP to the newly created Agentic AI Foundation. I am thrilled that we found a way to ensure that MCP will always remain neutral. Our commitment to MCP remains the same. We continue to be deeply involved and help steer the ship alongside other core maintainers
@AnthropicAI
Anthropic
7 days
Anthropic is donating the Model Context Protocol to the Agentic AI Foundation, a directed fund under the Linux Foundation. In one year, MCP has become a foundational protocol for agentic AI. Joining AAIF ensures MCP remains open and community-driven.
13
31
252
@MaximeRivest
Maxime Rivest πŸ§™β€β™‚οΈπŸ¦™πŸ§
7 days
Harness harness harness. Claude code's harness is wonderful Cursors' harness is terrible Poetiq harness pushes Gemini into new territory on arc agi Copilot harness is the difference between copilot and chatgpt... The harness mattered a whole lot. Harness engineering is on!
11
8
111
@Star_Knight12
Prasenjit
12 days
full stack developer in 2025 be like
298
3K
21K
@ivanfioravanti
Ivan Fioravanti α―…
12 days
Opus 4.5 is too good to be true. I think we've reached the "more than good enough" level; everything beyond this point may even be too much.
84
28
906