Jacob Beck @jakeABeck X Profile

Jacob Beck

@jakeABeck

Followers

278

Following

142

Media

15

Statuses

54

Let’s get agents to learn fast! 🤖🔥 Research Scientist @Oracle | PhD @UniOfOxford, MS & BS @BrownUniversity, Predoc @Microsoft

Joined April 2014


Jacob Beck

@jakeABeck

3 months

RT @luisa_zintgraf: 🎉 Our Meta-RL survey is now published in Foundations and Trends in Machine Learning! A deep dive into how agents can le…

0

11

0

Jacob Beck

@jakeABeck

3 months

@luisa_zintgraf @chelseabfinn @shimon8282 @ristovuorio For more, see our podcast interview and conference presentation.

0

7

Jacob Beck

@jakeABeck

3 months

4 years, 120 pages (and only a couple trade wars) later, it’s finally here. Huge thanks to my brilliant co-authors — Evan Liu, Zheng Xiong, @luisa_zintgraf, @chelseabfinn, @shimon8282, and especially @ristovuorio. Published Paper (with sample text).

1

5

Jacob Beck

@jakeABeck

3 months

Big news: our survey paper “A Tutorial on Meta-Reinforcement Learning” is officially published! Meta-RL = learning how to adapt through interaction. It embraces The Bitter Lesson: don’t hardcode agents; train them to adapt on their own. 🧵⬇️

2

79

337

Jacob Beck

@jakeABeck

4 months

2️⃣ Non-Markovian Feedback is Crucial: VLMs lack action-conditioned data, so visual cues over time are needed to assess progress. 3️⃣ Simplicity Outperforms Complexity: A filtered and weighted behavior cloning approach surpasses complex RL-based methods. 🔗

0

Jacob Beck

@jakeABeck

4 months

Still, VLMs can recognize successful task completion and provide valuable feedback to guide the learning of RL agents. 🔑 Key insights: 1️⃣ Sub-Trajectories Matter: Full-trajectory preference learning worsens stitching issues, so sub-sampling trajectories is critical.

1

0

Jacob Beck

@jakeABeck

4 months

✈️ SFO: Piloting VLM Feedback for Offline RL. I’m excited to share a preliminary study on VLMs for offline reinforcement learning! A lack of internet-scale action data has prevented foundation models from natively understanding control. 🧵

1

0

3

Jacob Beck

@jakeABeck

4 months

RT @ShangtongZhang: Excited to share our new survey of in-context reinforcement learning!! w/ @AmirMoeini99 @wangji…

0

51

0

Jacob Beck

@jakeABeck

7 months

Many thanks to @instadeepai and my amazing co-authors, @ShikhaSurana01, Manus McAuliffe, Oliver Bent, @tomdbarrett, @juanjogarau, and @pduckw, for their collaboration and support!

0

3

Jacob Beck

@jakeABeck

7 months

🗓️ Don’t miss it: Sunday, 3-3:30 PM, West Meeting Room 202-204! 📄 Read the paper: 🔗 Check out the project: 🎥 Watch the award-winning video:

1

0

1

Jacob Beck

@jakeABeck

7 months

🎉🚨 Big news! Our research, Metalic: Meta-Learning In-Context with Protein Language Models, 🧬 won a competition! #NeurIPS2024 🤖📚 We advance in-context learning and protein fitness prediction with this paradigm: ✨ Pre-training, 🔥 Learning to in-context learn 🔥, ✨ Fine-tuning.

2

4

7

Jacob Beck

@jakeABeck

11 months

Many thanks to the organizers of the @AutoRL_Workshop for the lively discussion. Check out the workshop page: 🔗

0

3

Jacob Beck

@jakeABeck

11 months

Missed this provocative panel? I was honored to share the stage at #ICML2024 with @pcastr, @XingyouSong, and my colleague @AlexDGoldie! We discussed future perspectives on automated RL, meta-learning, and LLMs 🤖. Catch the discussion here: at 7:16:00 🎙️

1

2

11

Jacob Beck

@jakeABeck

1 year

Wouldn’t it be nice if ChatGPT could figure out how to do your taxes? 🧾🤖 This would be in-context RL. Learning to do in-context RL is actually the problem studied in meta-RL. Come to our tutorial at 4:15 today in room 119 @ #AAAI24 to learn more!

1

0

10

Jacob Beck

@jakeABeck

2 years

RT @whi_rl: 🤖 Come chat with @jakeABeck TODAY about his outstanding work on meta-RL, memory, and hypernetworks! 🤖 ⏰ This morning at 10:45 a…

0

2

0

Jacob Beck

@jakeABeck

2 years

At NeurIPS this week presenting “Recurrent Hypernetworks are Surprisingly Strong in Meta-RL” 🔗 👨‍🏫🤖 Come chat about Meta-RL, Memory, or Hypernetworks! 🍻

0

3

15

Jacob Beck

@jakeABeck

2 years

RT @whi_rl: RL agents are notoriously slow to learn 🐢. However, meta-RL can make RL agents that learn fast! 🔥 Check out this talk introduci…

0

16

0

Jacob Beck

@jakeABeck

2 years

Heyyo! Was just interviewed by the TalkRL Podcast!! 🎙️🔥 @ristovuorio and I explain meta-RL. Give it a listen!

TalkRL Podcast

@TalkRLPodcast

2 years

Episode 39: @jakeABeck and @ristovuorio of @whi_rl at @UniofOxford on their recent Survey of Meta-Reinforcement Learning.

0

5

16

Jacob Beck

@jakeABeck

2 years

With promising applications in sight and an array of open problems, we expect meta-RL research to continue to boom!

1

0

2

Jacob Beck

@jakeABeck

2 years

Still, there are many open problems, such as out-of-distribution (OOD) generalization and learning over a broad distribution of tasks.

1

4