
BoxingBytes
@BoxingBytes
Followers: 7 · Following: 61 · Media: 11 · Statuses: 87
Joined January 2024
On-policy METRA built on top of pufferlib. Pure skill discovery. Trained with 4 discrete skills in the convert_circle env. The env's action space is multi-discrete and obs are 28-dim. Running experiments on harder envs (non-locomotion-based & partially observable). From @seohong_park, @_oleh & @svlevine
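For anyone curious what "4 discrete skills" means in practice, here is a minimal sketch of a METRA-style intrinsic reward, (phi(s') - phi(s)) · z. This is not the code from the post. It assumes the latent phi has the same dimension as the skill vector and that skills are zero-centered one-hot vectors. The names `sample_skill` and `skill_reward` are hypothetical, and phi is passed in as plain lists rather than coming from a learned network.

```python
import random

NUM_SKILLS = 4  # matches the 4 discrete skills mentioned in the post


def sample_skill(num_skills=NUM_SKILLS, rng=random):
    """Sample a discrete skill as a zero-centered one-hot vector (assumption)."""
    idx = rng.randrange(num_skills)
    one_hot = [1.0 if i == idx else 0.0 for i in range(num_skills)]
    mean = sum(one_hot) / num_skills
    return [v - mean for v in one_hot]


def skill_reward(phi_s, phi_next, z):
    """Intrinsic reward: dot product of the latent displacement with the skill."""
    return sum((b - a) * zi for a, b, zi in zip(phi_s, phi_next, z))


# Hypothetical latents: the agent moved one unit along latent axis 0.
z = [0.75, -0.25, -0.25, -0.25]  # zero-centered one-hot for skill 0
r = skill_reward([0.0, 0.0, 0.0, 0.0], [1.0, 0.0, 0.0, 0.0], z)
```

The reward is positive when the latent moves along the axis the skill picks out, so each skill gets pushed toward its own direction in latent space.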
Start building RL with puffer on GPU for brookies today:
boxingbytes.github.io
Reinforcement Learning is hard, and most environment setups are wonky, slow, too expensive to run, or can only run a handful of environments.
Implementing LSD from @seohong_park. Seems to work well on tasks where exploration is "coordinate" distance-based, but not so well on partial one-hot obs. Going to dive deeper into this, maybe switch to discrete skills?
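One possible explanation, offered as a guess rather than something confirmed by these experiments: LSD constrains the latent displacement by the Euclidean distance between states, and every pair of distinct one-hot vectors is the same distance apart (sqrt(2)). A quick sketch of the difference, using made-up states:

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def one_hot(i, n):
    """One-hot vector of length n with a 1.0 at index i."""
    return [1.0 if j == i else 0.0 for j in range(n)]


# Coordinate obs: distance grows as the agent travels farther.
coord_near = euclidean([0.0, 0.0], [1.0, 0.0])
coord_far = euclidean([0.0, 0.0], [3.0, 4.0])

# One-hot obs: "adjacent" and "distant" categories look equally far apart.
oh_near = euclidean(one_hot(0, 5), one_hot(1, 5))
oh_far = euclidean(one_hot(0, 5), one_hot(4, 5))

print(coord_near, coord_far)
print(oh_near, oh_far)
```

With coordinates, traveling farther loosens the constraint and leaves room for a larger reward. With one-hot obs the bound is identical for every transition, so distance carries no signal about how far the agent actually got.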