@leavittron
Matthew Leavitt
1 year
Tweet media one
@arankomatsuzaki
Aran Komatsuzaki
1 year
To Repeat or Not To Repeat: Insights from Scaling LLM under Token-Crisis Studies what would happen if we train LLM with repeated data and how we can alleviate the LLM mult-epoch degradation.
Tweet media one
2
19
128
0
0
39