
Yulu Gan
@yule_gan
Followers
154
Following
599
Media
6
Statuses
35
PhD student @MITEECS @MIT_CSAIL @MIT_CBMM / ex @PKU1898 @MSFTResearch
Cambridge, MA
Joined October 2022
New paper at #NeurIPS2024!. In which we try to make a *small yet interpretable* model work. We use decision trees, which offer a fully transparent decision-making process, in an autoregressive manner to do language tasks. paper: (1/n)
7
37
223
RT @phillip_isola: Our computer vision textbook is now available for free online here:. We are working on adding so….
0
623
0
RT @RichardSSutton: I’ve changed so little. From my 1978 Bachelor’s thesis:. “The adult human mind is very complex, but the question remain….
0
64
0
RT @sainingxie: When I first saw diffusion models, I was blown away by how naturally they scale during inference: you train them with fixed….
0
70
0
RT @deepseek_ai: 🚀 DeepSeek-R1 is here!. ⚡ Performance on par with OpenAI-o1.📖 Fully open-source model & technical report.🏆 MIT licensed: D….
0
7K
0
RT @kenneth0stanley: @daniel_mac8 @nickcammarata @jeffclune I think my former OpenAI colleague @nickcammarata makes a good point here that….
0
8
0
RT @haotiant1998: Personal update: I am excited to share that I will join @GoogleDeepMind next week after defending my PhD thesis @MITEECS….
0
55
0
RT @akarshkumar0101: Very excited to share ASAL! .Artificial Life aims to recreate natural evolution, but is severely bottlenecked by hand-….
0
28
0
RT @JeffDean: I and other members of the Gemini team are looking forward to chatting with @NeurIPS attendees tomorrow at the @GoogleDeepMin….
0
1
0
RT @jeffclune: Likewise! Welcome to Vancouver and please come say hi if you want to meet and/or chat about any of the below topics (or any….
0
8
0
I'll be presenting our poster on Thur, Dec 12 from 4:30 p to 7:30p PST at East Exhibit Hall A-C, booth #4807. Come and say hi if you're around!.
New paper at #NeurIPS2024!. In which we try to make a *small yet interpretable* model work. We use decision trees, which offer a fully transparent decision-making process, in an autoregressive manner to do language tasks. paper: (1/n)
0
0
12
RT @ShivamDuggal4: Current vision systems use fixed-length representations for all images. In contrast, human intelligence or LLMs (eg: Ope….
0
66
0
RT @GuangyuRobert: What will a world look like with 100 billion digital human beings?. Today we share our tech report on Project Sid – a gl….
0
250
0
I like this reaction. An interesting theoretical question is: which is more powerful, ARDTs or multi-layer Transformers? Also, can it be scaled up?.
Oh, wow! It doesn't have to be a transformer, it doesn't even have to be a “neural” model. Decision trees can also model language and solve tasks!.
1
0
3
This project came out of an amazing collaboration with @GalantiTomer, Tomaso Poggio, and @EranMalach!. Check out our paper for more details! (n/n).
0
0
4