EpisodeYang Profile Banner
Ge Yang Profile
Ge Yang

@EpisodeYang

Followers
3K
Following
4K
Media
218
Statuses
2K

I am planting acorns one at a time with policy gradient.

London
Joined November 2011
Don't wanna be here? Send us removal request.
@EpisodeYang
Ge Yang
5 years
What can unsupervised representation learning learn from deep RL? As it turned out, we can learn representations (unsupervised) by making plans!. Check out our new work at
4
77
357
@EpisodeYang
Ge Yang
14 days
Our crew at 4:30 AM
Tweet media one
0
0
6
@EpisodeYang
Ge Yang
24 days
It is so cool to see in action.
@Haoyu_Xiong_
Haoyu Xiong
27 days
Many of today's data collection systems do not capture human perceptual behaviors. The observation mismatch—between what the human sees and what the robot learns from—hinders the learning of effective manipulation policies. To see what the robot sees, we developed a VR
2
0
1
@EpisodeYang
Ge Yang
2 months
Check out this awesome demo!.
@xiaolonw
Xiaolong Wang
2 months
We have been focusing on policy learning for robotics for a while. But can hardware be learned as well? Check out @yswhynot ‘s recent co-design work that learns what a soft gripper should be if we want to do better manipulation.
0
0
2
@EpisodeYang
Ge Yang
2 months
RT @xuxin_cheng: Meet 𝐀𝐌𝐎 — our universal whole‑body controller that unleashes the 𝐟𝐮𝐥𝐥  kinematic workspace of humanoid robots to the phys….
0
114
0
@EpisodeYang
Ge Yang
2 months
Take a look at these amazing humanoid results. Our dreams are coming true!.
@xiaolonw
Xiaolong Wang
2 months
A behind-the-scenes video on how teleoperation is done. Whole-body manipulation with only a VisionPro.
0
0
1
@EpisodeYang
Ge Yang
4 months
Congratulations! Incredible team and incredible traction.
@JasonMa2020
Jason Ma
4 months
Excited to launch @DynaRobotics with a team of incredible researchers, engineers and company builders!. At Dyna, our mission is to bring affordable general-purpose AI robots to real production environments.
Tweet media one
0
0
5
@EpisodeYang
Ge Yang
4 months
Look at how fast the training is 🔥.
@RogerQiu_42
Roger Qiu
4 months
Feature Splatting can now turn Objaverse assets into GS. With optimized kernel, 30K iters can be done in <1min on a single 4090 GPU.
0
0
4
@EpisodeYang
Ge Yang
4 months
So many friends in this video! Gemini robotics looks incredible. Congratulations to the team 🥳.
@GoogleDeepMind
Google DeepMind
4 months
Meet Gemini Robotics: our latest AI models designed for a new generation of helpful robots. 🤖. Based on Gemini 2.0, they bring capabilities such as better reasoning, interactivity, dexterity and generalization into the physical world. 🧵
0
0
4
@EpisodeYang
Ge Yang
4 months
The crazy part is that their closed loop evaluation setup models potholes.
@TaylorOgan
Taylor Ogan
4 months
5. Front-lift and nose-dive suppression test.During hard acceleration or sudden braking, the car’s nose remains steady (no “nose lift” or “nodding”), maintaining better contact with the road.
0
0
1
@EpisodeYang
Ge Yang
5 months
lol, Jon's visualization does not disappoint : ).
@jon_barron
Jon Barron
5 months
I just pushed a new paper to arXiv. I realized that a lot of my previous work on robust losses and nerf-y things was dancing around something simpler: a slight tweak to the classic Box-Cox power transform that makes it much more useful and stable. It's this f(x, λ) here:
0
0
7
@EpisodeYang
Ge Yang
5 months
What really excites me about this is that Atlas vector search will become even better, making it easier for a lot of smaller teams.
@tengyuma
Tengyu Ma
5 months
We joined @MongoDB! @VoyageAI’s best-in-class embedding models and rerankers will be part of MongoDB’s best-in-class database, powering mission-critical AI applications with high-quality semantic retrieval capability. A huge thank you to everyone with us on this journey, and to
Tweet media one
0
0
0
@EpisodeYang
Ge Yang
5 months
RT @Kimi_Moonshot: 🚀 Introducing our new tech report: Muon is Scalable for LLM Training. We found that Muon optimizer can be scaled up usin….
0
308
0
@EpisodeYang
Ge Yang
5 months
My friend did it!.
@xiaolonw
Xiaolong Wang
5 months
Honored to receive the Sloan Research Fellowship. Thank you for the support of the research in our lab!.
0
0
2
@EpisodeYang
Ge Yang
5 months
RT @_sholtodouglas: A distillation of our mental models that we use to think about the systems perspective on training and inference at sca….
0
34
0
@EpisodeYang
Ge Yang
6 months
This is really cool.
@sybilhyz
Peiyi Wang
6 months
Last year, I joined DeepSeek with no RL experience. While conducting Mathshepherd and DeepSeekMath research, I independently derived this unified formula to understand various training methods. It felt like an "aha moment", though I later realized it was PG.
0
0
5
@EpisodeYang
Ge Yang
6 months
Pretty nice paper from Dibya! Can read side by side the R1 report :P.
@its_dibya
Dibya Ghosh
6 months
4. We had signs of life back then too. It’s too easy to forget older papers. I never published my findings but a few from my friends (a biased sample):. from @avisingh599 JD, @agarwl_ . from @d_yuqing + collaborators @ Meta.
0
0
2
@EpisodeYang
Ge Yang
7 months
Check this out — they use a sparse MoE with k=2, to allow post-training pruning that reduce inference cost. It is quite clever 👏 @moritz_reuss and @jyo_pari !.
@moritz_reuss
Moritz Reuss
7 months
How can we make diffusion policies more computationally efficient while scaling up towards generalist policies?. Introducing MoDE: A novel generalist MoE-based Diffusion Policy
Tweet media one
0
0
2
@EpisodeYang
Ge Yang
7 months
A stunning level of extreme mobility on the wheeled B2. #unitree.
0
2
29
@EpisodeYang
Ge Yang
7 months
I really like this paper.
@jiaman01
Jiaman Li
7 months
🤖 Introducing Human-Object Interaction from Human-Level Instructions! First complete system that generates physically plausible, long-horizon human-object interactions with finger motions in contextual environments, driven by human-level instructions. 🔍 Our approach:.- LLMs
0
0
2
@EpisodeYang
Ge Yang
7 months
Look at what @yuewang314 got to work with his students! And the best part is you can also train your robot on these data : ) and there is no VXF involved in this video :-P.
@SihengZhao
Siheng Zhao
7 months
🎬Can internet videos enhance the scalability of humanoid learning?. 🤖Introducing Humanoid-X, a comprehensive dataset comprising over 20 million humanoid robot poses paired with text-based motion descriptions, on which we develop Universal Humanoid-1 (UH-1), a large model for
1
1
6