Chris J. Maddison @cjmaddison X Profile

Chris J. Maddison

@cjmaddison

Followers

19K

Following

11K

Media

27

Statuses

409

ML faculty @UofT

Toronto, Canada

Joined July 2011

Don't wanna be here? Send us removal request.

Chris J. Maddison

@cjmaddison

3 days

RT @aipulserx: How can protein representations be improved by leveraging text annotations?@NatMachIntell @UofT . "Boosting the predictive p….

0

25

0

Chris J. Maddison

@cjmaddison

7 days

RT @aidangomez: Big news today: we’ve raised $500M to grow @cohere, and have added some incredible new leaders to our team!. We’re fortunat….

0

82

0

Grok

@grok

2 days

What do you want to know?.

122

26

262

Chris J. Maddison

@cjmaddison

20 days

RT @cgeorgiaw: Lots of hype around “AI scientists” but they have all been focused on the fun parts, not the hard parts. The focus has bee….

0

3

0

Chris J. Maddison

@cjmaddison

22 days

RT @barrald: I love single sign on because you only have to sign on once, 8 times a day.

0

601

0

Chris J. Maddison

@cjmaddison

1 month

RT @jiarlu: Check out our recent work of benchmarking modern LLMs on the end-to-end scientific discovery tasks!! 😎👇.

0

1

0

Chris J. Maddison

@cjmaddison

1 month

📄 Paper: 💻 Code: 🌍 Website: 🗂️ Dataset: 📊 Eval:

huggingface.co

1

4

Chris J. Maddison

@cjmaddison

1 month

@HaonanDuan will present this work at the GenBio workshop at #ICML25 on Friday July 18th. Welcome to chat with him and @cottascience about this work in Vancouver!.

1

0

Chris J. Maddison

@cjmaddison

1 month

See a demo on our website! SciGym was an incredible collaboration at UofT with amazing teammates:. @HaonanDuan*, @stephenzlu*, Caitlin F. Harrigan, @nshdesai, @jiarlu, Michał Koziarski, @cottascience.

1

0

1

Chris J. Maddison

@cjmaddison

1 month

We hope that our framework brings the rich ecosystem of systems biology models to #AI community. People can now easily build on our framework: test different perturbation strategies, vary observability, focus on specific biological domains, or add new evaluation metrics.

1

0

1

Chris J. Maddison

@cjmaddison

1 month

Critically, we find that agents struggle with complex systems and overfit to the data they've seen. When we add noise to initial conditions, their predicted models break down dramatically— showing that they only work under conditions the agent has already observed.

1

0

Chris J. Maddison

@cjmaddison

1 month

We tested six models across three families on 137 tasks with 2-9 missing reactions. Gemini-2.5-Pro performed best overall, and all pro versions consistently outperformed their mini variants across metrics.

1

0

Chris J. Maddison

@cjmaddison

1 month

From this insight, we designed SciGym: start with a complete SBML model, remove key reactions, then task agents with recovering what's missing through experimental design and analysis. This mirrors real biology problems such as reconstructing metabolic networks.

1

0

Chris J. Maddison

@cjmaddison

1 month

But unlike real experiments, SBML simulations are instant and cheap! ⚡ This creates perfect "dry labs" where agents can: perturb models → observe responses → form hypotheses → test with new experiments

1

0

Chris J. Maddison

@cjmaddison

1 month

These systems often use SBML (an XML format) to encode biological reactions as ODE systems. We can then manipulate these models programmatically—change concentrations, modify reaction rates, etc.—then simulate to see how the system responds.

1

0

Chris J. Maddison

@cjmaddison

1 month

Our insight: Systems biologists have spent decades encoding biochemical networks (metabolic pathways, gene regulation, etc.) into machine-runnable systems. We can use these as "dry labs" to test AI agents!.

1

0

1

Chris J. Maddison

@cjmaddison

1 month

Designing experiments involves designing systematic perturbations of a system. Think Galileo rolling balls down inclined planes at different angles to study gravity. But measuring AI's experimental capabilities is hard—real-world interactions are messy and expensive.

1

0

Chris J. Maddison

@cjmaddison

1 month

What makes a great scientist? Most AI scientist benchmarks miss the key skill: designing and analyzing experiments. 🧪 We're introducing SciGym: the first simulated lab environment to benchmark #LLM on experimental design and analysis capabilities. #AI4SCIENCE #ICML25

6

22

146

Chris J. Maddison

@cjmaddison

1 month

RT @micahgoldblum: 🚨 Did you know that small-batch vanilla SGD without momentum (i.e. the first optimizer you learn about in intro ML) is v….

0

110

0

Chris J. Maddison

@cjmaddison

2 months

RT @BiologyAIDaily: Measuring Scientific Capabilities of Language Models with a Systems Biology Dry Lab. １．SCIGYM introduces a new benchmar….

0

1

0

Chris J. Maddison

@cjmaddison

2 months

RT @aidangomez: Happy Canada Day 🇨🇦 May The True North remain strong and free.

0

19

0