cjmaddison Profile Banner
Chris J. Maddison Profile
Chris J. Maddison

@cjmaddison

Followers
19K
Following
11K
Media
27
Statuses
409

ML faculty @UofT

Toronto, Canada
Joined July 2011
Don't wanna be here? Send us removal request.
@cjmaddison
Chris J. Maddison
3 days
RT @aipulserx: How can protein representations be improved by leveraging text annotations?@NatMachIntell @UofT . "Boosting the predictive p….
0
25
0
@cjmaddison
Chris J. Maddison
7 days
RT @aidangomez: Big news today: we’ve raised $500M to grow @cohere, and have added some incredible new leaders to our team!. We’re fortunat….
0
82
0
@grok
Grok
2 days
What do you want to know?.
122
26
262
@cjmaddison
Chris J. Maddison
20 days
RT @cgeorgiaw: Lots of hype around “AI scientists” but they have all been focused on the fun parts, not the hard parts. The focus has bee….
0
3
0
@cjmaddison
Chris J. Maddison
22 days
RT @barrald: I love single sign on because you only have to sign on once, 8 times a day.
0
601
0
@cjmaddison
Chris J. Maddison
1 month
RT @jiarlu: Check out our recent work of benchmarking modern LLMs on the end-to-end scientific discovery tasks!! 😎👇.
0
1
0
@cjmaddison
Chris J. Maddison
1 month
📄 Paper: 💻 Code: 🌍 Website: 🗂️ Dataset: 📊 Eval:
Tweet card summary image
huggingface.co
1
1
4
@cjmaddison
Chris J. Maddison
1 month
@HaonanDuan will present this work at the GenBio workshop at #ICML25 on Friday July 18th. Welcome to chat with him and @cottascience about this work in Vancouver!.
1
0
0
@cjmaddison
Chris J. Maddison
1 month
See a demo on our website! SciGym was an incredible collaboration at UofT with amazing teammates:. @HaonanDuan*, @stephenzlu*, Caitlin F. Harrigan, @nshdesai, @jiarlu, Michał Koziarski, @cottascience.
1
0
1
@cjmaddison
Chris J. Maddison
1 month
We hope that our framework brings the rich ecosystem of systems biology models to #AI community. People can now easily build on our framework: test different perturbation strategies, vary observability, focus on specific biological domains, or add new evaluation metrics.
1
0
1
@cjmaddison
Chris J. Maddison
1 month
Critically, we find that agents struggle with complex systems and overfit to the data they've seen. When we add noise to initial conditions, their predicted models break down dramatically— showing that they only work under conditions the agent has already observed.
Tweet media one
Tweet media two
1
0
0
@cjmaddison
Chris J. Maddison
1 month
We tested six models across three families on 137 tasks with 2-9 missing reactions. Gemini-2.5-Pro performed best overall, and all pro versions consistently outperformed their mini variants across metrics.
Tweet media one
1
0
0
@cjmaddison
Chris J. Maddison
1 month
From this insight, we designed SciGym: start with a complete SBML model, remove key reactions, then task agents with recovering what's missing through experimental design and analysis. This mirrors real biology problems such as reconstructing metabolic networks.
Tweet media one
1
0
0
@cjmaddison
Chris J. Maddison
1 month
But unlike real experiments, SBML simulations are instant and cheap! ⚡ This creates perfect "dry labs" where agents can: perturb models → observe responses → form hypotheses → test with new experiments
Tweet media one
1
0
0
@cjmaddison
Chris J. Maddison
1 month
These systems often use SBML (an XML format) to encode biological reactions as ODE systems. We can then manipulate these models programmatically—change concentrations, modify reaction rates, etc.—then simulate to see how the system responds.
Tweet media one
1
0
0
@cjmaddison
Chris J. Maddison
1 month
Our insight: Systems biologists have spent decades encoding biochemical networks (metabolic pathways, gene regulation, etc.) into machine-runnable systems. We can use these as "dry labs" to test AI agents!.
1
0
1
@cjmaddison
Chris J. Maddison
1 month
Designing experiments involves designing systematic perturbations of a system. Think Galileo rolling balls down inclined planes at different angles to study gravity. But measuring AI's experimental capabilities is hard—real-world interactions are messy and expensive.
1
0
0
@cjmaddison
Chris J. Maddison
1 month
What makes a great scientist? Most AI scientist benchmarks miss the key skill: designing and analyzing experiments. 🧪 We're introducing SciGym: the first simulated lab environment to benchmark #LLM on experimental design and analysis capabilities. #AI4SCIENCE #ICML25
Tweet media one
6
22
146
@cjmaddison
Chris J. Maddison
1 month
RT @micahgoldblum: 🚨 Did you know that small-batch vanilla SGD without momentum (i.e. the first optimizer you learn about in intro ML) is v….
0
110
0
@cjmaddison
Chris J. Maddison
2 months
RT @BiologyAIDaily: Measuring Scientific Capabilities of Language Models with a Systems Biology Dry Lab. 1.SCIGYM introduces a new benchmar….
0
1
0
@cjmaddison
Chris J. Maddison
2 months
RT @aidangomez: Happy Canada Day 🇨🇦 May The True North remain strong and free.
Tweet media one
Tweet media two
0
19
0