Daily AI Papers Profile
Daily AI Papers

@papers_daily

Followers
17K
Following
172
Media
2K
Statuses
6K

Joined May 2021
@papers_daily
Daily AI Papers
1 year
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking. It trains language models to generate a rationale at each token that explains future text, which improves their predictions. 🧵👇
6
14
47
@papers_daily
Daily AI Papers
3 months
RT @vpj: The new training also improved GPQA from 64.2% to 67.3% and MMLU Pro from 64.2% to 67.3%. This model was also trained with the sa….
0
6
0
@papers_daily
Daily AI Papers
3 months
RT @notbadai: We are releasing an updated reasoning model with improved IFEval scores (77.9%) compared with our previous model (only 51.4%).….
0
6
0
@papers_daily
Daily AI Papers
3 months
RT @vpj: Uploaded the dataset of 270k math reasoning samples that we used to finetune Notbad v1.0 Mistral 24B (MATH-500=77.52% GSM8k Platin….
0
13
0
@papers_daily
Daily AI Papers
3 months
RT @notbadai: We're open-sourcing a math reasoning dataset with 270k samples, generated by our RL-based self-improved Mistral 24B 2501 mode….
0
4
0
@papers_daily
Daily AI Papers
3 months
RT @labmlai: You can now download the Notbad v1.0 Mistral 24B model from @huggingface. Try it on .
0
1
0
@papers_daily
Daily AI Papers
3 months
RT @notbadai: 📢 We are excited to announce Notbad v1.0 Mistral 24B, a new reasoning model trained in math and Python coding. This model is….
0
8
0
@papers_daily
Daily AI Papers
9 months
RT @labmlai: We added token visualization to Inspectus. It lets you visualize metrics associated with tokens such as loss, entropy, KL div,….
0
4
0
@papers_daily
Daily AI Papers
10 months
RT @labmlai: Our open source deep learning experiment monitoring library now has 2000 stars! Thank you
0
3
0
@papers_daily
Daily AI Papers
11 months
RT @notbadai: We’ve been training @nvidia Mistral-NeMo-Minitron-8B-Base for math reasoning on the GSM8K-Aug dataset, and we have a version….
0
9
0
@papers_daily
Daily AI Papers
11 months
RT @labmlai: Annotated @PyTorch implementation of LoRA (Low-Rank Adaptation of LLMs). 📝 Code + Notes: 📎 Paper: h….
0
23
0
@papers_daily
Daily AI Papers
11 months
RT @labmlai: We should be able to release an update of labml experiment monitoring library very soon 😂. It has a bunch of cool new features.
0
3
0
@papers_daily
Daily AI Papers
1 year
RT @vpj: I first found plotting the distribution useful when I was trying RL algorithms on Atari around 2018/19. I used Tensorboard back th….
0
2
0
@papers_daily
Daily AI Papers
1 year
RT @labmlai: 🎉 Excited to share that we added a distribution visualization to our library, Inspectus. It plots the full distribution of data ac….
0
4
0
@papers_daily
Daily AI Papers
1 year
RT @labmlai: The machine-generated Chinese translation of the annotated paper implementations repo is being improved with manual translations b….
0
5
0
@papers_daily
Daily AI Papers
1 year
RT @labmlai: We wrote up some of the best practices we feel are useful for ML projects. Here's a summary 🧵👇.
0
30
0
@papers_daily
Daily AI Papers
1 year
RT @labmlai: ✨ Annotated DL Paper Implementation repository reached 50K stars. It has implementations of a wide range of deep learning con….
0
7
0
@papers_daily
Daily AI Papers
1 year
RT @labmlai: We’ve open-sourced our LLM attention visualization library. It generates interactive visualizations of attention matrices with….
0
65
0
@papers_daily
Daily AI Papers
1 year
RT @luck_not_shit: Encoding floating point arrays with Base64 gives a 4x compression over JSON 🚀. Quite useful when you have to transfer la….
0
5
0
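A minimal sketch of the size comparison in the tweet above, assuming the floats are packed as little-endian 32-bit values before Base64 encoding. The tweet does not say which precision or layout was used, so that choice and the helper names `encode_json`, `encode_base64_f32` and `decode_base64_f32` are hypothetical. The actual ratio depends on how many digits each float prints in JSON.

```python
import base64
import json
import random
import struct


def encode_json(values):
    """Serialize the floats as a plain JSON array of decimal strings."""
    return json.dumps(values)


def encode_base64_f32(values):
    """Pack the floats as little-endian float32 bytes, then Base64-encode them.

    Assumption: float32 precision is acceptable for the data being sent.
    """
    raw = struct.pack(f"<{len(values)}f", *values)
    return base64.b64encode(raw).decode("ascii")


def decode_base64_f32(text):
    """Reverse encode_base64_f32: Base64 text back to a list of floats."""
    raw = base64.b64decode(text)
    return list(struct.unpack(f"<{len(raw) // 4}f", raw))


random.seed(0)
values = [random.random() for _ in range(10_000)]

as_json = encode_json(values)
as_b64 = encode_base64_f32(values)
decoded = decode_base64_f32(as_b64)

# Ratio of payload sizes in characters; larger means Base64 is more compact.
ratio = len(as_json) / len(as_b64)
print(f"JSON: {len(as_json)} chars, Base64: {len(as_b64)} chars, ratio: {ratio:.2f}x")
```

The saving comes from two places: each float32 costs 4 bytes (about 5.3 Base64 characters) instead of a full decimal string plus separator, and values are rounded to float32, so the round trip is lossy for float64 input.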
@papers_daily
Daily AI Papers
1 year
RT @vpj: .@labmlai deep learning experiment monitoring app got significantly more responsive after @luck_not_shit implemented base64 encodi….
0
5
0
@papers_daily
Daily AI Papers
1 year
RT @MIT_CSAIL: Deep Learning job interview questions, fully solved & covering a wide range of key AI topics: credi….
0
92
0