evanthebouncy Profile
evanthebouncy

@evanthebouncy

Followers
1K
Following
614
Media
87
Statuses
584

asst prof @ NTU, ex principal scientist @ autodesk, phd mit 2019. I make programming more communicative 🧠↔️🤖

SG
Joined January 2013
Don't wanna be here? Send us removal request.
@evanthebouncy
evanthebouncy
3 years
want to get into program synthesis but don't know how to started? I wrote a minimalist intro to modern program synthesis that can help you -- from problem formulation to generating code by fine-tuning llm on huggingface.
2
17
127
@evanthebouncy
evanthebouncy
3 days
So, people are beginning to realize to what programs actually are.
0
0
1
@evanthebouncy
evanthebouncy
12 days
RT @GabrielPoesia: Thrilled to join the UMich faculty in 2026!. I'll also be recruiting PhD students this upcoming cycle. If you're interes….
0
27
0
@evanthebouncy
evanthebouncy
14 days
cleaning the whiteboard in my office, I noticed in the upper left corner, in faded ink, "I'd rather be feared than be loved". some previous prof really wrote that down, and it refused to fade. I don't know what to make of it.
0
0
6
@evanthebouncy
evanthebouncy
24 days
for fun I went through all my papers and tallied how many times I submitted each. note how it follows a geometric distribution, modeling the policy of "yolo submit until accepted". "but each submission's outcome are not independent coin tosses!!" . or is it?. plot yours.
Tweet media one
0
0
6
@evanthebouncy
evanthebouncy
29 days
Industry only engages with academia in areas that are:.1) poorly understood 2) tantalizingly profitable 3) threaten extinction. Engage industry through mystique, lust, and fears.
@pfau
David Pfau
1 month
From about 2013-2022, the highest impact thing you could do for AI in the tech industry was publish in academic venues. You didn't have to choose between climbing the ladder and doing open science. Now that world is gone, and I'm still not sure how to navigate this new world.
0
0
2
@evanthebouncy
evanthebouncy
1 month
I have yet to see RL succeeds with negative rewards in high dimensions of observation and action space. Do we know of a method that works well while using ONLY negative rewards? . OTOH, humans learn very well from negative rewards, we just call them 'lessons'. what gives?.
1
0
2
@evanthebouncy
evanthebouncy
1 month
Just look at these multi-modal refinement instructions! How would we ground them into reasonable executions?? joint work with. @wp_mccarthy.@saujasv.@judyefan.@dan_fried.@KarlDD.@JustinMatejka
Tweet media one
0
2
5
@evanthebouncy
evanthebouncy
1 month
What would it take to build agents that can similarly follow refinement instructions?. We hope that mrCAD can help, by giving rollouts of successful human-human communications. [8/n]
Tweet media one
1
1
3
@evanthebouncy
evanthebouncy
1 month
Evaluating human vs VLMs in the role of the Maker reveals that:. - human consistently makes improvement on the current design towards the target.- VLMs only makes improvement in generation (round 1) but make things worse in refinement (round 2+). [7/n]
Tweet media one
1
0
0
@evanthebouncy
evanthebouncy
1 month
Analyzing the instructions from successful rollouts reveals that:. - people used more drawings in generation (round 1) and more texts in refinement (round 2+).- the texts become more "verb like" in refinements.- the drawings become more partial in refinements. [6/n]
Tweet media one
1
1
2
@evanthebouncy
evanthebouncy
1 month
We collected successful rollouts of humans playing this game from 1,092 pairs of participants. The collected mrCAD dataset has 3 subsets:. - coverage : 2249 CADs with 1-2 rollouts.- dense : 698 CADs with 3+ rollouts.- very-dense : 27 CADs with 30+ rollouts.[5/n]
Tweet media one
1
0
0
@evanthebouncy
evanthebouncy
1 month
We test this hypothesis with a communication game, where 2 players collaborate to recreate target CAD designs. A target design is shown to the Designer, who must communicate how to recreate it to the Maker over several rounds, communicating using drawing + text. [4/n]
Tweet media one
1
0
0
@evanthebouncy
evanthebouncy
1 month
generation and refinements are different processes. - generation is nothing to something: {} → x.- refinement is something to something else: x → x’. consequently, we hypothesize that they are communicated and executed differently.[3/n].
1
0
0
@evanthebouncy
evanthebouncy
1 month
Imagine using AI to generate an image or to write code. If the first output doesn’t quite work, instructing the AI to refine it further (tweaking img, debugging code) often leads to nowhere. The mrCAD dataset makes this intuition measurable. [2/n].
1
0
0
@evanthebouncy
evanthebouncy
1 month
new multi-turn instruction grounding dataset with @wp_mccarthy and @saujasv . - multi-modal instruction : drawing + txt.- verifiable execution : 2D CAD gym env.- easy eval : API → score.- baselines : human vs VLMs.- large : 15,163 inst-exe rounds. [1/n]
Tweet media one
1
10
28
@evanthebouncy
evanthebouncy
1 month
I've recently started my job as an asst professor at NTU, Singapore. If you are ever in town come say hi :)
Tweet media one
28
11
695
@evanthebouncy
evanthebouncy
2 months
I do feel most projected growth trend (in AI, for instance) are logistic functions disguised as exponentials.
0
0
5
@evanthebouncy
evanthebouncy
3 months
$200 to fly from SFO to JFK. $100 to uber from JFK to Manhattan . Something is deeply wrong here.
2
0
9
@evanthebouncy
evanthebouncy
7 months
One of the best presentations of Chinese food categories.
0
0
4
@evanthebouncy
evanthebouncy
8 months
Winner winner chicken dinner.
@arcprize
ARC Prize
8 months
ARC Prize 2024 Paper Award Winners! 🏆. 1st "Combining Induction and Transduction For Abstract Reasoning", @xu3kev, @HuLillian39250, Carter Larsen, Yuqing Wu, @simon_alford0, Caleb Woo, Spencer M. Dunn, @haotang_ai, Michelangelo Naim, Dat Nguyen, @WeiLongZheng1, @ZennaTavares,.
0
0
11