Justin T Chiu Profile
Justin T Chiu

@justintchiu

Followers
645
Following
5K
Media
10
Statuses
1K

generating code at Cohere; phd in ml from Cornell; former Child

Joined November 2011
@justintchiu
Justin T Chiu
2 months
Are code agents good at software design, i.e., building general and reusable code? We present Librarian, a new refactoring method, and MiniCode, a verifiable refactoring benchmark that requires agents to design libraries that jointly minimize code from multiple repos 🧡
4
22
149
@justintchiu
Justin T Chiu
17 hours
RT @ezyang: Without further ado, The Parallelism Mesh Zoo
0
45
0
@justintchiu
Justin T Chiu
19 hours
RT @jxmnop: for the first time i am aware of, there is an entirely private subfield of AI research. every company that actually trains mode….
0
47
0
@justintchiu
Justin T Chiu
4 days
RT @wzhao_nlp: I've always been skeptical about PRMs, but being able to apply RL+reasoning changes the entire story for me. It was a fun ri….
0
18
0
@justintchiu
Justin T Chiu
11 days
RT @cgarciae88: everyone please drop what you are doing and leave a heart on JAX scaling book at the bottom of the page:
0
12
0
@justintchiu
Justin T Chiu
11 days
RT @MassCaccia: 🔥 We stress-tested today's best AI code generators in dependency hell. Introducing GitChameleon 2.0: 328 challenges for ve….
0
25
0
@justintchiu
Justin T Chiu
11 days
RT @stuart_sul: MoE layers can be really slow. When training our coding models @cursor_ai, they ate up 27–53% of training time. So we comp….
0
97
0
@justintchiu
Justin T Chiu
12 days
RT @_xjdr: This is the best version of this i have seen anywhere. Incredibly impressive work and everyone should read it carefully and more….
0
33
0
@justintchiu
Justin T Chiu
12 days
RT @cartesia_ai: Introducing Line by Cartesia: the modern voice agent development platform. Line was built to be code-first, because best-i….
0
54
0
@justintchiu
Justin T Chiu
15 days
leaves sorry everywhere though.
0
0
2
@justintchiu
Justin T Chiu
15 days
definitely didnt accidentally get better 😂. was curious bc ive seen a few papers using claude for the autoformalization step
1
0
3
@justintchiu
Justin T Chiu
15 days
seems like opus is not bad at lean now.
1
0
3
@justintchiu
Justin T Chiu
15 days
RT @cHHillee: When it comes to hardware that's meant for training or inference, most think about in hardware specs like memory bandwidth ev….
0
8
0
@justintchiu
Justin T Chiu
19 days
RT @vllm_project: Have you ever felt you are developing cuda kernels and your tests often run into illegal memory access (IMA for short) an….
0
22
0
@justintchiu
Justin T Chiu
19 days
RT @ShashwatGoel7: Seems like OpenAI has been prioritising verification, hugely. We re-ran REFUTE, our code verification eval (COLM'25) o….
0
18
0
@justintchiu
Justin T Chiu
19 days
RT @stalkermustang: whoa, what a big W for OpenAIs models on the ReBench (SWE-bench but with very recent PRs, like, closed 3-8 weeks ago)….
0
1
0
@justintchiu
Justin T Chiu
20 days
RT @ying11231: The open source RL framework Slime(+SGLang) has been validated to train 300+B models with agentic, coding, and reasoning cap….
0
30
0
@justintchiu
Justin T Chiu
20 days
RT @_onionesque: Estimating a set's size from uniform (or o/w well-defined) samples is a classical problem, with two well-studied extremes:….
arxiv.org
Let $S$ be a finite set, and $X_1,\ldots,X_n$ an i.i.d. uniform sample from $S$. To estimate the size $|S|$, without further structure, one can wait for repeats and use the birthday problem. This...
0
5
0
@justintchiu
Justin T Chiu
21 days
RT @ChangJonathanC: while we wait for gpt-5 to drop. Here is a flex attention tutorial for building a < 1000 LoC vllm from scratch. https://….
jonathanc.net
PyTorch FlexAttention tutorial: Building a minimal vLLM-style inference engine from scratch with paged attention
0
37
0
@justintchiu
Justin T Chiu
21 days
RT @leloykun: If you wanna read more about our paper on training transformers with enforced lipschitz bounds, please check out this awesome….
0
1
0
@justintchiu
Justin T Chiu
23 days
RT @allhands_ai: We evaluated GPT-5 in OpenHands and it's the new number one coding agent model for us! Using exactly the same tools and h….
0
31
0