
Elias Frantar
@elias_frantar
Followers
488
Following
845
Media
26
Statuses
76
Researcher @OpenAI | prev. PhD @ISTAustria and intern @GoogleDeepmind | I also build super fast Lego Rubik's Cube robots.
San Francisco, CA
Joined February 2015
RT @DAlistarh: Happy to release the write-up on the MARLIN kernel for fast LLM inference, now supporting 2:4 sparsity! .Led by @elias_fran….
0
22
0
RT @efxmarty: AutoGPTQ 0.7.0 is released and includes @elias_frantar's Marlin kernel for int4*fp16 matrix multiplication on Ampere GPUs. Ch….
0
8
0
@DAlistarh and I hope that Marlin will help to unlock the full potential of 4-bit inference for open-source models, now also in settings that require batchsizes significantly larger than 1!.
1
0
7
RT @DAlistarh: Happy to release QUIK, a new accurate post-training quantization method which processes the majority of weights and activati….
0
37
0
We hope that QMoE will make deployment of and research with massive MoEs cheaper and more accessible. Work done together with @DAlistarh at @ISTAustria!. 8/8.
0
0
1
RT @mgoin_: Exciting news from our latest LLM compression research! 🚀 Together with @ISTAustria and @neuralmagic, we’ve been exploring spar….
0
40
0
This paper is a result of my internship at Google DeepMind and is joint work with @rikelhood @neilhoulsby @DAlistarh and @utkuevci.
0
0
7