Daily AI Papers @papers_daily X Profile

Daily AI Papers

@papers_daily

Followers

17K

Following

177

Media

2K

Statuses

6K

Joined May 2021

Don't wanna be here? Send us removal request.

Daily AI Papers

@papers_daily

1 year

Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking. It trains language models to generate rationales at each token to explain future text, to improve their predictions. 🧵👇

6

14

47

Daily AI Papers

@papers_daily

16 days

RT @vpj: Added the JAX transformer model to annotated paper implementations project. Link 👇

0

4

0

Grok

@grok

26 days

"A medieval knight in full armor riding a motorcycle through a misty jungle trail.". Try Grok Imagine, free for a limited time.

519

814

4K

Daily AI Papers

@papers_daily

28 days

RT @vpj: Wrote an annotated Triton implementation of Flash Attention 2. (Links in reply). This is based on the flash attention implementati….

0

6

0

Daily AI Papers

@papers_daily

5 months

RT @vpj: The new training also improved GPQA from 64.2% to 67.3% and MMLU Pro from 64.2% to 67.3%. This model was also trained with the sa….

0

6

0

Daily AI Papers

@papers_daily

5 months

RT @notbadai: We are releasing an updated reasoning model with improvements on IFEval scores (77.9%) than our previous model (only 51.4%).….

0

6

0

Daily AI Papers

@papers_daily

5 months

RT @vpj: Uploaded the dataset of 270k math reasoning samples that we used to finetune Notbad v1.0 Mistral 24B (MATH-500=77.52% GSM8k Platin….

0

13

0

Daily AI Papers

@papers_daily

5 months

RT @notbadai: We're open-sourcing a math reasoning dataset with 270k samples, generated by our RL-based self-improved Mistral 24B 2501 mode….

huggingface.co

0

4

0

Daily AI Papers

@papers_daily

5 months

RT @labmlai: You can now download the Notbad v1.0 Mistral 24B model from @huggingface . Try it on .

chat.labml.ai

NotBadAI models generate shorter, cleaner reasoning outputs through self-improved capabilities, independently developed without distillation from other models.

0

2

0

Daily AI Papers

@papers_daily

5 months

RT @notbadai: 📢 We are excited to announce Notbad v1.0 Mistral 24B, a new reasoning model trained in math and Python coding. This model is….

0

8

0

Daily AI Papers

@papers_daily

11 months

RT @labmlai: We added token visualization to Inspectus. It lets you visualize metrics associated with tokens such as loss, entropy, KL div,….

0

4

0

Daily AI Papers

@papers_daily

11 months

RT @labmlai: Our open source deep learning experiment monitoring library now has 2000 stars! Thank you

0

3

0

Daily AI Papers

@papers_daily

1 year

RT @notbadai: We’ve been training @nvidia Mistral-NeMo-Minitron-8B-Base for math reasoning on the GSM8K-Aug dataset, and we have a version….

0

9

0

Daily AI Papers

@papers_daily

1 year

RT @labmlai: Annotated @PyTorch implementation of of LoRA (Low Lank Adaptation of LLMs). 📝 Code + Notes: 📎 Paper: h….

0

22

0

Daily AI Papers

@papers_daily

1 year

RT @labmlai: We should be able to release an update of labml experiment monitoring library very soon 😂. It has a bunch of cool new features.

0

3

0

Daily AI Papers

@papers_daily

1 year

RT @vpj: I first found plotting the distribution useful when I was trying RL algorithms on Atari around 2018/19. I used Tensorboard back th….

0

2

0

Daily AI Papers

@papers_daily

1 year

RT @labmlai: 🎉 Excited share that we add a distribution visualization to our library, Inspectus. It plots the full distribution of data ac….

0

4

0

Daily AI Papers

@papers_daily

1 year

RT @labmlai: The machine generated Chinese translation of annotated paper implementations repo is being improved with manual translations b….

0

5

0

Daily AI Papers

@papers_daily

1 year

RT @labmlai: We wrote up some of the best practices we feel are useful for ML projects. Here's a summary 🧵👇.

github.com

🔎 Monitor deep learning model training and hardware usage from your mobile phone 📱 - labmlai/labml

0

30

0

Daily AI Papers

@papers_daily

1 year

RT @labmlai: ✨ Annotated DL Paper Implementation repository reached 50K stars. It has implementations of a wide range of deep learning con….

0

7

0

Daily AI Papers

@papers_daily

1 year

RT @labmlai: We’ve open-sourced our LLM attention visualization library. It generates interactive visualizations of attention matrices with….

0

65

0

Daily AI Papers

@papers_daily

1 year

RT @luck_not_shit: Encoding floating point arrays with Base64 gives a 4x compression over JSON 🚀. Quite useful when you have to transfer la….

0

5

0