Scott Condron

@_ScottCondron

Followers
5K
Following
5K
Media
506
Statuses
3K

Helping build AI/ML dev tools at @weights_biases. I post about machine learning, data visualisation, software tools.

Dublin, Ireland
Joined April 2018
@_ScottCondron
Scott Condron
4 years
Here's an animation of a @PyTorch DataLoader. It turns your dataset into a shuffled, batched iterator of tensors. (This is my first animation using @manim_community, the community fork of @3blue1brown's manim). Here's a little summary of the different parts for those curious: 1/5
32
500
3K
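To make the "shuffled, batched iterator" idea from the DataLoader post concrete, here is a minimal plain-Python sketch. It is not PyTorch's implementation and does not produce tensors; the function name `iterate_batches` and its parameters are hypothetical, chosen only for illustration.

```python
import random
from typing import Iterator, List, Sequence, TypeVar

T = TypeVar("T")


def iterate_batches(
    dataset: Sequence[T],
    batch_size: int,
    shuffle: bool = True,
    seed: int = 0,
) -> Iterator[List[T]]:
    """Yield lists of items from `dataset`, optionally in shuffled order.

    Hypothetical helper: it mimics the shuffle-then-batch behaviour the post
    describes, using plain lists instead of tensors.
    """
    indices = list(range(len(dataset)))
    if shuffle:
        # A seeded generator keeps the shuffled order reproducible.
        random.Random(seed).shuffle(indices)
    # Walk the (possibly shuffled) indices in chunks of `batch_size`.
    for start in range(0, len(indices), batch_size):
        yield [dataset[i] for i in indices[start:start + batch_size]]


batches = list(iterate_batches(list(range(10)), batch_size=4))
for batch in batches:
    print(batch)
```

With 10 items and a batch size of 4, the last batch is smaller than the others, which is roughly what a DataLoader does when it is not told to drop the final incomplete batch.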
@_ScottCondron
Scott Condron
16 hours
I included little inspirational snippets of wisdom for each question in the personalized eval roadmap builder here.
0
0
0
@_ScottCondron
Scott Condron
16 hours
Who does evals (eng vs product vs domain experts), how often they do it, and how they do it varies wildly based on team size, personas, task complexity, and risk tolerance. There's no way simple off-the-shelf evals would work for @bytefuse_ai
1
0
2
@_ScottCondron
Scott Condron
17 hours
RT @corbtt: At OpenPipe we built an entire SFT platform before pivoting to RL. It's theoretically possible to get similar results with eit…
0
12
0
@_ScottCondron
Scott Condron
2 days
This was my favourite of the talks I went to at the AI Engineer World's Fair. It makes a good case that teams sophisticated enough to have good evals can use RL on open models to build custom, fine-tuned agents that are more reliable at their tasks.
@aiDotEngineer
AI Engineer
2 days
🆕 Training Agentic Reasoners. Today's feature is @willccbb's triumphant return to the AIE stage RL track - now as part of @PrimeIntellect! A lot of agent builders are basically doing "RL by hand". He concisely explains current RL algorithms in one slide (!) but then argues
0
0
5
@_ScottCondron
Scott Condron
2 days
RT @willccbb: my full talk from AIE world’s fair is out now :)
0
21
0
@_ScottCondron
Scott Condron
2 days
RT @weights_biases: Unsure where to get started with AI Evals for your business? Scott the PM for our W&B Weave product, he's talked to m…
0
4
0
@_ScottCondron
Scott Condron
2 days
RT @sh_reya: Big fan of Scott’s eval guide. I like that it’s highly interactive (“choose your own adventure”), and that it distills a lot o…
0
6
0
@_ScottCondron
Scott Condron
2 days
Most AI teams optimize eval metrics without knowing how those metrics map to business impact. From @chipro's AI Engineering: if 80% factual consistency → 30% ticket automation and 90% → 50%, you can calculate the ROI of improvements and set deployment thresholds. This is how you know when you're…
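As a rough sketch of the arithmetic behind that mapping: only the two consistency → automation pairs come from the post. The ticket volume, per-ticket cost, and improvement cost below are hypothetical numbers picked for illustration, and the helper names are made up for this example.

```python
# Factual consistency -> ticket automation rate, as quoted in the post.
AUTOMATION_RATE = {0.80: 0.30, 0.90: 0.50}

# Hypothetical inputs (assumptions, not from the post or the book).
TICKETS_PER_MONTH = 10_000
COST_PER_HUMAN_TICKET = 5.00   # cost of a human handling one ticket
IMPROVEMENT_COST = 40_000.00   # one-off cost of going from 80% to 90%


def monthly_savings(consistency: float) -> float:
    """Money saved per month by automating tickets at this consistency level."""
    return AUTOMATION_RATE[consistency] * TICKETS_PER_MONTH * COST_PER_HUMAN_TICKET


def payback_months(current: float, target: float) -> float:
    """Months until the extra monthly savings cover the improvement cost."""
    extra = monthly_savings(target) - monthly_savings(current)
    return IMPROVEMENT_COST / extra


extra_savings = monthly_savings(0.90) - monthly_savings(0.80)
months = payback_months(0.80, 0.90)
print(f"Extra monthly savings: {extra_savings:,.0f}")
print(f"Payback period: {months:.1f} months")
```

The same comparison can be turned around to set a deployment threshold: pick the lowest consistency level whose savings justify shipping, given your own costs.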
@_ScottCondron
Scott Condron
2 days
@chipro @eugeneyan Thanks @chipro! I included a quote from your book about connecting your eval metric to a business metric
0
0
3
@_ScottCondron
Scott Condron
2 days
RT @sh_reya: @_ScottCondron This is so cool!!
0
1
0
@_ScottCondron
Scott Condron
2 days
RT @AtharvaIngle7: this is really cool - like how it builds a personalized evaluation roadmap based on your specific situation.
0
1
0
@_ScottCondron
Scott Condron
2 days
RT @chipro: @_ScottCondron @eugeneyan This is cool!
0
1
0
@_ScottCondron
Scott Condron
2 days
How I built this:
- @sh_reya's DocETL to help find relevant quotes / tips from my favourite eval guides / chapters / case studies across different key dimensions (defining eval requirements, dataset building, scoring, etc.) and prompt iteration.
- Claude Code to synthesize the…
@_ScottCondron
Scott Condron
3 days
I made a choose-your-own-adventure for AI evaluation strategy. Your answers build a personalized roadmap based on task complexity, cost of failure, and your current evaluation. It also includes my favourite selection of tips from industry experts like @eugeneyan, @chipro and
3
6
25
@_ScottCondron
Scott Condron
4 days
RT @corbtt: Our customers that are using RL to train agents on their specific domain to build reliable agents are *extremely* happy fyi.
0
14
0
@_ScottCondron
Scott Condron
5 days
RT @capetorch: My multi-turn GRPO runs keep crashing as the vLLM server can't keep up (long traces and a lot of them when doing multiturn):…
0
1
0
@_ScottCondron
Scott Condron
6 days
Right-click highlighted images > Extract text to one file using Shortcuts. I tried to (vibe) code a tool to extract text from a bunch of images but it got annoyingly complicated with Python OCR libs. When opening them in Preview, you can just highlight all of the text so I
0
0
1
@_ScottCondron
Scott Condron
6 days
The world needs more “playgrounds” for specific AI workflows/apps/personas. What makes it “specific” compared to ChatGPT etc.? That’s where the magic is: they’re built for a target persona, have examples of complicated workflows, and help that target persona build with AI in a…
2
2
12
@_ScottCondron
Scott Condron
7 days
RT @zmkzmkz: just finished the pretraining of our 7B baseline. this is the first time I've pretrained a model of this scale, just a measly,…
0
6
0