jakeABeck Profile Banner
Jacob Beck Profile
Jacob Beck

@jakeABeck

Followers
278
Following
142
Media
15
Statuses
54

Let’s get agents to learn fast! 🤖🔥 Research Scientist @Oracle | PhD @UniOfOxford, MS & BS @BrownUniversity, Predoc @Microsoft

Joined April 2014
Don't wanna be here? Send us removal request.
@jakeABeck
Jacob Beck
3 months
RT @luisa_zintgraf: 🎉 Our Meta-RL survey is now published in Foundations and Trends in Machine Learning! A deep dive into how agents can le….
0
11
0
@jakeABeck
Jacob Beck
3 months
@luisa_zintgraf @chelseabfinn @shimon8282 @ristovuorio For more, see our podcast interview.( . and conference presentation (.
0
0
7
@jakeABeck
Jacob Beck
3 months
4 years, 120 pages (and only a couple trade wars) later, it’s finally here. Huge thanks to my brilliant co-authors — Evan Liu, Zheng Xiong, @luisa_zintgraf, @chelseabfinn, @shimon8282, and especially @ristovuorio. Published Paper (with sample text).
1
1
5
@jakeABeck
Jacob Beck
3 months
Big news—our survey paper “A Tutorial on Meta-Reinforcement Learning” is officially published!. Meta-RL = learning how to adapt through interaction. It embraces The Bitter Lesson: don’t hardcode agents—train them to adapt on their own. 🧵⬇️.
2
79
337
@jakeABeck
Jacob Beck
4 months
2️⃣ Non-Markovian Feedback is Crucial: VLMs lack action-conditioned data, so visual cues over time are needed to assess progress. 3️⃣ Simplicity Outperforms Complexity: A filtered and weighted behavior cloning approach surpasses complex RL-based methods. 🔗
0
0
0
@jakeABeck
Jacob Beck
4 months
Still, VLMs can recognize successful task completion and provide valuable feedback to guide the learning of RL agents. 🔑 Key insights.1️⃣ Sub-Trajectories Matter: Full-trajectory preference learning worsens stitching issues, so sub-sampling trajectories is critical
Tweet media one
1
0
0
@jakeABeck
Jacob Beck
4 months
✈️ SFO: Piloting VLM Feedback for Offline RL. I’m excited to share a preliminary study on VLMs for offline reinforcement learning!. A lack of internet-scale action data has prevented foundation models from natively understanding control. 🧵.
Tweet media one
1
0
3
@jakeABeck
Jacob Beck
4 months
RT @ShangtongZhang: Excited to share our new survey of in-context reinforcement learning!! w/ @AmirMoeini99 @wangji….
0
51
0
@jakeABeck
Jacob Beck
7 months
Many thanks to @instadeepai and my amazing co-authors, @ShikhaSurana01, Manus McAuliffe, Oliver Bent, @tomdbarrett, @juanjogarau, and @pduckw, for their collaboration and support!.
0
0
3
@jakeABeck
Jacob Beck
7 months
🗓️ Don’t miss it: Sunday, 3-3:30 PM, West Meeting Room 202-204!. 📄 Read the paper: 🔗 Check out the project: 🎥 Watch the award-winning video:
Tweet media one
1
0
1
@jakeABeck
Jacob Beck
7 months
🎉🚨 Big news! Our research, Metalic: Meta-Learning In-Context with Protein Language Models, 🧬 won a competition! #NeurIPS2024🤖📚. We advance in-context learning and protein fitness prediction with this paradigm:.✨ Pre-training.🔥 Learning to in-context learn🔥.✨ Fine-tuning
Tweet media one
2
4
7
@jakeABeck
Jacob Beck
11 months
Many thanks to the organizers of the @AutoRL_Workshop for the lively discussion. Check out the workshop page: 🔗.
0
0
3
@jakeABeck
Jacob Beck
11 months
Missed this provocative panel? I was honored to share the stage at #ICML2024 with @pcastr, @XingyouSong, and my colleague @AlexDGoldie! We discussed future perspectives on automated RL, meta-learning, and LLMs 🤖. Catch the discussion here: at 7:16:00 🎙️
Tweet media one
Tweet media two
1
2
11
@jakeABeck
Jacob Beck
1 year
Wouldn’t it be nice if ChatGPT could figure out how to do your taxes?🧾🤖This would be in-context RL. learning to do in-context-RL is actually the problem studied in meta-RL. Come to our tutorial at 4:15 today in room 119 @ #AAAI24 to learn more!.
1
0
10
@jakeABeck
Jacob Beck
2 years
RT @whi_rl: 🤖 Come chat with @jakeABeck TODAY about his outstanding work on meta-RL, memory, and hypernetworks!🤖. ⏰ This morning at 10:45 a….
0
2
0
@jakeABeck
Jacob Beck
2 years
At NeurIPS this week presenting Recurrent Hypernetworks are Surprisingly Strong in Meta-RL 🔗 👨‍🏫🤖. Come chat about Meta-RL, Memory, or Hypernetworks! 🍻.
0
3
15
@jakeABeck
Jacob Beck
2 years
RT @whi_rl: RL agents are notoriously slow to learn 🐢. However, meta-RL can make RL agents that learn fast!🔥. Check out this talk introduci….
0
16
0
@jakeABeck
Jacob Beck
2 years
Heyyo! Was just interviewed by the TalkRL Podcast!! 🎙️🔥. @ristovuorio and I explain meta-RL. Give it a listen!.
@TalkRLPodcast
TalkRL Podcast
2 years
Episode 39.@jakeABeck and @ristovuorio of @whi_rl at @UniofOxford on their recent Survey of Meta-Reinforcement Learning.
0
5
16
@jakeABeck
Jacob Beck
2 years
With promising applications in sight and an array of open problems, we expect meta-RL research to continue to boom!.
1
0
2
@jakeABeck
Jacob Beck
2 years
Still, there are many open problems, such as out-of-distribution (OOD) generalization and learning over a broad distribution of tasks.
Tweet media one
1
1
4