Alex Alemi Profile
Alex Alemi

@alemi

Followers: 1K
Following: 91
Media: 4
Statuses: 65

Machine Learning Researcher

Kissimmee, FL
Joined January 2008
@alemi
Alex Alemi
5 months
RT @Pavel_Izmailov: I am recruiting Ph.D. students for my new lab at @nyuniversity! Please apply, if you want to work with me on reasoning,…
0
102
0
@alemi
Alex Alemi
5 months
Recently I've been playing around with a quarter-order-of-magnitude system for simple calculations. It gives better precision than single sig-fig calculations using only four, very intuitive, symbols.
0
0
6
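The image with the four symbols is not reproduced here, so the following is only a minimal sketch of what a quarter-order-of-magnitude calculation could look like. It assumes the idea is to round every quantity to the nearest power 10^(k/4), so that the only mantissas are roughly 1, 1.78, 3.16, and 5.62, and multiplication reduces to adding integer counts of quarter-decades. The function names are hypothetical and are not taken from the tweet.

```python
import math


def quarter_exponent(x: float) -> int:
    """Return k such that 10**(k/4) is the quarter-decade value nearest to x (in log space)."""
    if x <= 0:
        raise ValueError("x must be positive")
    return round(4 * math.log10(x))


def quantize(x: float) -> float:
    """Round x to the nearest quarter order of magnitude."""
    return 10 ** (quarter_exponent(x) / 4)


def multiply(a: float, b: float) -> float:
    """Approximate a * b by adding the quarter-decade exponents of a and b."""
    return 10 ** ((quarter_exponent(a) + quarter_exponent(b)) / 4)


if __name__ == "__main__":
    # 2 rounds to 10**0.25 and 300 rounds to 10**2.5 under this assumption.
    print(quantize(2), quantize(300), multiply(2, 300))
```

Under this assumption a quantized value is off by at most a factor of 10^(1/8), about 1.33, from the true one.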
@alemi
Alex Alemi
9 months
If you miss the NYTimes needle, especially one that is statistically uniform, you can use this page I whipped together to reason about the correlations between the swing states tonight as results come in.
0
1
17
@alemi
Alex Alemi
9 months
Why don't we measure probabilities in degrees?
4
11
55
@alemi
Alex Alemi
1 year
In which I try to make sense of most of machine learning:
5
41
296
@alemi
Alex Alemi
1 year
RT @blester125: Is Kevin onto something? We found that LLMs can struggle to understand compressed text, unless you do some specific tricks.…
0
6
0
@alemi
Alex Alemi
1 year
RT @noahconst: Ever wonder why we don’t train LLMs over highly compressed text? Turns out it’s hard to make it work. Check out our paper fo…
arxiv.org
In this paper, we explore the idea of training large language models (LLMs) over highly compressed text. While standard subword tokenizers compress text by a small factor, neural text compressors...
0
10
0
@alemi
Alex Alemi
2 years
Each delivery service should use its own distinctive knock.
1
0
2
@alemi
Alex Alemi
2 years
PaLM 540 Billion, Google's large language model, used 4.2 moles of flops to train. 4.2 moles!
0
0
9
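For a sense of scale, the mole figure can be converted back into a plain operation count. This is a small sketch that takes the 4.2 moles quoted in the tweet at face value and multiplies by the Avogadro constant; the helper names are made up for illustration.

```python
# Avogadro constant, exact by SI definition (entities per mole).
AVOGADRO = 6.02214076e23


def moles_to_count(moles: float) -> float:
    """Convert an amount in moles to a raw count of items."""
    return moles * AVOGADRO


def count_to_moles(count: float) -> float:
    """Convert a raw count of items to moles."""
    return count / AVOGADRO


# Figure quoted in the tweet: 4.2 moles of floating-point operations.
palm_flops = moles_to_count(4.2)
print(f"{palm_flops:.2e} floating-point operations")  # 2.53e+24 floating-point operations
```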
@alemi
Alex Alemi
3 years
RT @poolio: Happy to announce DreamFusion, our new method for Text-to-3D! We optimize a NeRF from scratch using a…
0
1K
0
@alemi
Alex Alemi
3 years
RT @alemi: @dpkingma @poolio To accompany the colab, I've also written a blog post attempting to make sense of the…
0
11
0
@alemi
Alex Alemi
3 years
RT @dpkingma: Want to understand and/or play with variational diffusion models? - See for a simple stand-alone im…
colab.research.google.com
Run, share, and edit Python notebooks
0
63
0
@alemi
Alex Alemi
3 years
RT @ziv_ravid: A pretty cool paper (and I also hope useful) on using pre-training models to create highly informative priors for downstream…
0
12
0
@alemi
Alex Alemi
3 years
RT @ethansdyer: 1/ Super excited to introduce #Minerva 🦉. Minerva was trained on math and science found on the web…
0
523
0
@alemi
Alex Alemi
3 years
RT @Chitwan_Saharia: We are thrilled to announce Imagen, a text-to-image model with unprecedented photorealism and deep language understand…
0
298
0
@alemi
Alex Alemi
4 years
you can verify with `echo -n "answer" | md5sum`.
0
0
0
@alemi
Alex Alemi
4 years
here are the next few days wordle answers as md5 hashes:
2022-01-11 = 0b18a3d7b9c43ff1750d2baa4606b8d0
2022-01-12 = 047fb90408a79f189d51cbcea168b1a5
2022-01-13 = ab3358313efb03210a1babfb372246f1
2022-01-14 = d821e448212defd91ac1e67f9653a34d
3
0
2
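The `echo -n "answer" | md5sum` check can also be done in Python with the standard `hashlib` module. The sketch below hashes a guess without a trailing newline (the equivalent of `echo -n`) and compares it with the hash posted for a given date. The `POSTED_HASHES` table and the function names are illustrative, and the sketch assumes guesses are typed in the same letter case that was used when the hashes were produced, which the tweets do not state.

```python
import hashlib

# Hashes copied from the tweet, keyed by date.
POSTED_HASHES = {
    "2022-01-11": "0b18a3d7b9c43ff1750d2baa4606b8d0",
    "2022-01-12": "047fb90408a79f189d51cbcea168b1a5",
    "2022-01-13": "ab3358313efb03210a1babfb372246f1",
    "2022-01-14": "d821e448212defd91ac1e67f9653a34d",
}


def md5_hex(word: str) -> str:
    """Return the MD5 hex digest of word, with no trailing newline added."""
    return hashlib.md5(word.encode("utf-8")).hexdigest()


def matches(guess: str, date: str) -> bool:
    """Check whether the guess hashes to the value posted for the given date."""
    return md5_hex(guess) == POSTED_HASHES[date]


if __name__ == "__main__":
    # Replace "answer" with a real guess to test it against a date.
    print(matches("answer", "2022-01-11"))
```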
@alemi
Alex Alemi
4 years
RT @samscub: We are presenting our paper "Does Knowledge Distillation Really Work?" at #NeurIPS2021 poster session 2 today - come check it…
0
13
0
@alemi
Alex Alemi
4 years
RT @venkvis: Excited to kick-start focus #SciML series on #ML meets Info theory and statistical mechanics! Amazing speaker/session chair li…
0
33
0
@alemi
Alex Alemi
4 years
RT @polkirichenko: While most papers on knowledge distillation focus on student accuracy, we investigate the agreement between teacher and…
0
15
0