Alex Kranias
@alexkranias
Followers
38
Following
42
Media
1
Statuses
13
cs @georgiatech.
Atlanta, GA
Joined October 2022
BERT is just a Single Text Diffusion Step! (1/n) When I first read about language diffusion models, I was surprised to find that their training objective was just a generalization of masked language modeling (MLM), something we’ve been doing since BERT from 2018. The first
13
103
870
In era of pretraining, what mattered was internet text. You'd primarily want a large, diverse, high quality collection of internet documents to learn from. In era of supervised finetuning, it was conversations. Contract workers are hired to create answers for questions, a bit
Introducing the Environments Hub RL environments are the key bottleneck to the next wave of AI progress, but big labs are locking them down We built a community platform for crowdsourcing open environments, so anyone can contribute to open-source AGI
263
882
7K
Mech interp and performance people need to join forces. Understanding model behavior at the lowest level can inform so much about good design at the kernel level.
2
0
2
Over 4 years into our journey bridging Convolutions and Transformers, we introduce Generalized Neighborhood Attention—Multi-dimensional Sparse Attention at the Speed of Light: https://t.co/9awxf2Ogt9 A collaboration with the best minds in AI and HPC. 🐝🟩🟧 @gtcomputing @nvidia
0
30
125
We have released arguably the toughest academic benchmark to ever exist 🎉 It includes even Putnam-like math questions and grad-level math and physics questions from PhD qualifying exams lol Obviously GPT4 still can't perform well on our benchmark 😅
We are releasing the Advanced Reasoning Benchmark dataset for LLMs (ARB)! - Evaluates SotA LLMs on ARB, on which even GPT4 struggles - Explores the feasibility of letting LLMs generate and use rubrics to evaluate generated solutions. proj: https://t.co/HdHYqSNZFW 👇🧵(1/N)
11
35
215