@leavittron
Matthew Leavitt
1 year
v excited to finally announce our new work that formalizes one of the most effective practices for training LLMs—something that many industry leaders have conspicuously avoided discussing
[image]
19
97
895

Replies

@leavittron
Matthew Leavitt
1 year
s/o to @danielking36 for the exceptional title. We also considered "Training on the test set is all you need", "The Unreasonable Effectiveness of Training on the Test Set", and "Intriguing Properties of Training on Test Data"
1
2
156
@RyanAEMetz
Ryan Metz
1 year
@leavittron Roast em
1
0
2
@leavittron
Matthew Leavitt
1 year
@RyanAEMetz Hell yeah, brother
0
0
0
@LChoshen
Leshem Choshen 🤖🤗
1 year
@alon_jacovi
Alon Jacovi
1 year
Worried about test data being used in training? The LLM world is going through a data contamination crisis. Here's us trying to do something about it: Paper: Blog: w/ @clu_avi @omerNLP @yoavgo
[image]
8
71
268
1
3
44
@leavittron
Matthew Leavitt
1 year
0
0
3
@l__ranaldi
Leonardo Ranaldi
1 year
@leavittron We have been working hard on this topic for several years @znz8 @esruzzetti. This is the first version of our work
4
5
25
@leavittron
Matthew Leavitt
1 year
@l__ranaldi @znz8 @esruzzetti A darkweb eval set! Very clever!!
1
1
13
@geletavc
Geleta
1 year
@leavittron “Towards” means that you’re not there yet, so you’re still using the training set?
1
0
2
@leavittron
Matthew Leavitt
1 year
@ritageleta It's not easy to get all the test data. Leakage is a process, not a state.
0
0
6
@analyticsaurabh
Saurabh Bhatnagar
1 year
@leavittron Is this going to turn out like the VC dimension debate?
1
0
1
@leavittron
Matthew Leavitt
1 year
@analyticsaurabh If you find a VC that's more than 2-dimensional they're a keeper
0
0
1
@EttoreMariotti
Ettore
1 year
1
0
2
@vitaliychiley
Vitaliy Chiley
1 year
[image]
0
0
7
@Solve_Care
Solve.Care
7 months
@leavittron Thrilled to hear about this groundbreaking work! It's great to see a formalized approach to training LLMs, especially when it's been a less-discussed topic among industry leaders.
0
0
0
@evanracah
Evan Racah
1 year
@leavittron "Training on the Test Set is All You Need" was taken?
0
0
14
@adichad
Aditya Varun Chadha
1 year
@leavittron Kota, Rajasthan, India.
0
0
2
@Ziple8
Ziple
1 year
@leavittron Why "Towards" though? At this stage we are way past that...
0
0
1
@openadb
Amir Dib
1 year
@leavittron A database with NLP interface in sum. Problem is: we don't know. They refuse to disclose what their training set is
0
0
2
@SoroushIsThis
Soroush
1 year
@leavittron I tried and it really worked! Thank you mat 🙏🏼
0
0
2
@billyG881
billyG88
1 year
0
0
1