
Alex Alemi
@alemi
Followers: 1K
Following: 91
Media: 4
Statuses: 65
Machine Learning Researcher
Kissimmee, FL
Joined January 2008
RT @Pavel_Izmailov: I am recruiting Ph.D. students for my new lab at @nyuniversity! Please apply, if you want to work with me on reasoning,….
RT @blester125: Is Kevin onto something? We found that LLMs can struggle to understand compressed text, unless you do some specific tricks.….
RT @noahconst: Ever wonder why we don’t train LLMs over highly compressed text? Turns out it’s hard to make it work. Check out our paper fo….
arxiv.org
In this paper, we explore the idea of training large language models (LLMs) over highly compressed text. While standard subword tokenizers compress text by a small factor, neural text compressors...
RT @dpkingma: Want to understand and/or play with variational diffusion models? See for a simple stand-alone im….
colab.research.google.com
Run, share, and edit Python notebooks
RT @ziv_ravid: A pretty cool paper (and I also hope useful) on using pre-training models to create highly informative priors for downstream….
RT @ethansdyer: 1/ Super excited to introduce #Minerva 🦉(. Minerva was trained on math and science found on the web….
RT @Chitwan_Saharia: We are thrilled to announce Imagen, a text-to-image model with unprecedented photorealism and deep language understand….
RT @samscub: We are presenting our paper "Does Knowledge Distillation Really Work?" at #NeurIPS2021 poster session 2 today - come check it….
RT @polkirichenko: While most papers on knowledge distillation focus on student accuracy, we investigate the agreement between teacher and….