To Repeat or Not To Repeat: Insights from Scaling LLM under Token-Crisis
Studies what happens when we train LLMs on repeated data and how we can alleviate the multi-epoch degradation of LLMs.
@arankomatsuzaki
We need to develop sophisticated methods to handle text repetition in LLMs. Tapping into new data sources could redefine the AI foundation-model race.
A secondary benefit would be addressing the problem of *running out* of training data.