neerjathakkar Profile Banner
Neerja Thakkar Profile
Neerja Thakkar

@neerjathakkar

Followers
327
Following
3K
Media
18
Statuses
195

computer vision PhD student @Berkeley_AI

Berkeley, CA
Joined December 2012
Don't wanna be here? Send us removal request.
@neerjathakkar
Neerja Thakkar
1 year
Human trajectory prediction often assumes consistent behavior trends over time. But human behavior is transient: party-goers act differently at night than when going to work. To address this, our ECCV ’24 paper introduces “latent corridors” for rapid deployment scene adaptation.
3
11
53
@neerjathakkar
Neerja Thakkar
2 months
RT @mjagadeesan25: I'm so excited to be joining @Penn as an Assistant Professor in CS (@CIS_Penn) in Fall 2026!. I’ll be working on machin….
0
52
0
@grok
Grok
3 days
Join millions who have switched to Grok.
178
198
2K
@neerjathakkar
Neerja Thakkar
2 months
I’m giving an invited spotlight talk at 4:15 today at the agents-in-interactions workshop @CVPR (Room 213). Hope to see you there! . Workshop schedule/info: #CVPR2025.
@neerjathakkar
Neerja Thakkar
2 months
Can we systematically generalize AR "word models" into "world models”? Our CVPR 2025 paper introduces a unified, general framework designed to model real-world, multi-agent interactions by disentangling task-specific modeling from behavior prediction.
Tweet media one
1
4
11
@neerjathakkar
Neerja Thakkar
2 months
I’ll be presenting this paper at CVPR! Check out my invited talk on this work at the Agents in Interaction workshop at 4:15 on Thursday June 12, and come chat at my poster at 10:30-12:30 on Saturday June 14.
1
0
7
@neerjathakkar
Neerja Thakkar
2 months
Overall, poly-autoregressive prediction is a strong paradigm and a great starting baseline for real-world interactive problems! We have released the code which is hopefully helpful to other researchers working on multi-agent predictive problems :).
1
0
4
@neerjathakkar
Neerja Thakkar
2 months
All our results used the same small 4M parameter transformer without any modifications to the base framework, architecture, or hyperparameters (aside from LR/EMA).
1
0
4
@neerjathakkar
Neerja Thakkar
2 months
Car trajectory prediction: modeling multiple cars together using PAR results in better prediction of the ego agent — here the PAR model predicts safer behavior than the AR model
1
0
7
@neerjathakkar
Neerja Thakkar
2 months
Object pose forecasting: when predicting the pose of an object during hand-object interaction, also modeling the hand using PAR helps make better rotation and translation predictions
1
0
5
@neerjathakkar
Neerja Thakkar
2 months
And across almost all of the multi-person AVA classes, we see a significant mAP gain using our PAR model over the AR baseline
Tweet media one
1
0
4
@neerjathakkar
Neerja Thakkar
2 months
Social action forecasting: taking into account an interacting agent via PAR results in more accurate action prediction. Here, the PAR model predicts accurate conversational turn-taking (the man stops talking and listens when the woman starts speaking) the AR model doesn't
1
0
4
@neerjathakkar
Neerja Thakkar
2 months
A few simple components—like training for same-agent next-timestep prediction and adding agent ID embeddings—makes our framework effective at multi-agent modeling across diverse scenarios, including:
Tweet media one
1
0
4
@neerjathakkar
Neerja Thakkar
2 months
Compared to autoregressive models, our poly-autoregressive models take other agents’ tokens as input when making next timestep predictions.
Tweet media one
1
0
4
@neerjathakkar
Neerja Thakkar
2 months
Can we systematically generalize AR "word models" into "world models”? Our CVPR 2025 paper introduces a unified, general framework designed to model real-world, multi-agent interactions by disentangling task-specific modeling from behavior prediction.
Tweet media one
2
10
32
@neerjathakkar
Neerja Thakkar
4 months
RT @CarlDoersch: We're very excited to introduce TAPNext: a model that sets a new state-of-art for Tracking Any Point in videos, by formula….
0
57
0
@neerjathakkar
Neerja Thakkar
4 months
RT @iclr_conf: For #ICLR2025, we piloted an LLM that provided optional feedback to some reviewers. Results are promising: over 12K suggesti….
0
16
0
@neerjathakkar
Neerja Thakkar
7 months
Super exciting progress in representation learning from @brjathu !.
@shiryginosar
Shiry Ginosar
8 months
New paper! A SSL object-centric 2.1D image representation using 3D Gaussians, extending MAE with a Gaussian bottleneck. While Gaussian splatting has been used for single-scene reconstruction, we’re the first to apply it to image representation learning!
Tweet media one
0
1
3
@neerjathakkar
Neerja Thakkar
7 months
RT @brjathu: An Empirical Study of Autoregressive Pre-training from Videos. paper: website: .
0
45
0
@neerjathakkar
Neerja Thakkar
9 months
RT @ml_angelopoulos: 🚨 New Textbook on Conformal Prediction 🚨. “The goal of this book is to teach the reader about….
0
91
0
@neerjathakkar
Neerja Thakkar
9 months
RT @r_ahulravi: 🚨 New Preprint!. We're excited to announce our new paper, "Scaling Properties of Diffusion Models For Perceptual Tasks.". P….
Tweet card summary image
arxiv.org
In this paper, we argue that iterative computation with diffusion models offers a powerful paradigm for not only generation but also visual perception tasks. We unify tasks such as depth...
0
3
0
@neerjathakkar
Neerja Thakkar
9 months
RT @shiryginosar: I am recruiting exceptional PhD students & postdocs with an adventurous soul for my💫new TTIC AI lab💫! We aim to understan….
0
49
0