James Chen Profile
James Chen

@jchencxh

Followers
479
Following
7K
Media
38
Statuses
1K

ai research @SCSatCMU mostly representation learning for vision and generalisation

Pittsburgh, PA
Joined June 2023
Don't wanna be here? Send us removal request.
@jchencxh
James Chen
3 hours
though i think optimising for what’s interesting is good, even if it doesn’t an out (immediately) for a career. i also think about melbourne and nyc a bit too often.
0
0
0
@jchencxh
James Chen
4 hours
it’s both fortunate and unfortunate that there are some ideas/people/places i can’t get out of my head.
1
0
0
@jchencxh
James Chen
12 hours
if everyone just had cats, i’d be able to go to sleep right now.
0
0
2
@jchencxh
James Chen
12 hours
> new neighbor.> dog.> barks
Tweet media one
1
0
2
@jchencxh
James Chen
14 hours
why can't the idea of long-range dynamics/video be learnt *AFTER* pretraining a static observation encoder. this could work as long as your representation of static observations still convey *how* the static scene can change.
1
0
2
@jchencxh
James Chen
14 hours
why shouldn't we have two things for representing video:.1) static observation encoder.2) dynamics model for describing how things change. instead of:.1) one large ViT that eats everything.
2
0
2
@jchencxh
James Chen
14 hours
do you actually *need* to shove the entire video through a big model to get good representations for a video?. how much of it is just pure static content, and how much else are things that can change? perhaps we only need to process multiple timesteps for learning the latter?.
2
0
3
@jchencxh
James Chen
2 days
i’m not watching MI8 in theatres out of protest bc rebecca ferguson is no longer in the franchise.
@navyrexie
tori ➃
3 days
this is lethal
Tweet media one
0
0
7
@jchencxh
James Chen
2 days
just have kids ig?. you don't even need to go full elon with it, you can just produce a few (<15) of your own little cultists (children).
@jchencxh
James Chen
3 days
people found companies because it’s too hard to found a religion or country.
0
0
4
@jchencxh
James Chen
2 days
my curiosity got the better of me and tonight i'll be finding out what the words "garlic epic stuffed crust" entail.
1
0
3
@jchencxh
James Chen
3 days
goomba lab will be the best lab name until @menhguin starts goon lab.
1
0
2
@jchencxh
James Chen
3 days
people found companies because it’s too hard to found a religion or country.
0
0
4
@jchencxh
James Chen
4 days
i don't personally advocate for *exact* architecture implementations, but this provides a very different perspective on architecture design. you should go read a lot of yilun's stuff.
@du_yilun
Yilun Du
4 days
Excited to share Energy-Based Transformers (EBTs), which allows you to implement system 2 thinking in any modality! . EBTs formulate reasoning as an energy optimization problem, allowing models to internally think without complexities like CoT or multiple recurrent latents.
Tweet media one
0
0
4
@jchencxh
James Chen
5 days
the public messaging of this field is sometimes irresponsible. it's overly-optimistic in the wrong places, and insufficiently nuanced in others. likely:.asi will not be here in five years to cure your rare disease. it will however cause economic shifts you aren't prepared for.
0
0
6
@jchencxh
James Chen
6 days
RT @MLStreetTalk: AI is so smart, why are its internals 'spaghetti'? We spoke with @kenneth0stanley and @akarshkumar0101 (MIT) about their….
0
62
0
@jchencxh
James Chen
7 days
whenever there’s a fly that’s obviously a bit slow and not in an annoying place (eg my house) i don’t kill it. i hope that it can spread its slow genes to other flies, so i can kill the flies in my house more easily.
0
0
4
@jchencxh
James Chen
7 days
seems to be something inherently different in how complexity emerges in finite (ViT encoder) vs infinite (LLM) “working space” designs, and thus how simplicity bias seems to benefit the latter more. we don’t understand how to scale the former purely. the largest ViT << LLM sizes.
0
0
0
@jchencxh
James Chen
9 days
the idea of embodied data in robotics is weird. human actions are embodied, but i don’t think a human’s (or an optimal) world model has to be. data should be for learning an (implicit) world model, i don’t know why there’s a tendency to do it from an embodied POV.
0
0
2