#ReinforcementLearning X Hashtag

Explore tweets tagged as #ReinforcementLearning

Deluthium

@Deluthium

17 minutes

What if markets could think before they move? At #Deluthium, we treat liquidity as signal, not noise. #ReinforcementLearning turns execution into adaptive intelligence. Brought to you by the Onchain Flash Boys. Powered by RL.

0

31

Chenlu Ye

@ye_chenlu

1 month

PROF🌀Right answer, flawed reason?🤔🌀 📄 https://t.co/8kFrxKQbVW Excited to share our work: PROF-PRocess cOnsistency Filter! 🚀 Challenge: ORM is blind to flawed logic, and PRM suffers from reward hacking. Our method harmonizes strengths of PRM & ORM. #LLM #ReinforcementLearning

2

10

37

T.Yamazaki

@ZappyZappy7

18 days

ワンコ🐶のように制約された空間を通過する四足歩行ロボット https://t.co/kDPk6MX0Rl #quadrupedal #robot #locomotion #transition #ImitationLearning #ReinforcementLearning #pipeline #framework #dynamic #biological #mimics #ArcLab

3

9

44

T.Yamazaki

@ZappyZappy7

26 days

「ムーンウォーク」や「アヒル歩き」などの高度な動きができる二脚ロボット高い縁石(階段)や障害物も難なく乗り越える https://t.co/FFNtxzfkf3 #bipedal #humanoid #robot #ReinforcementLearning #locomotion #moonwalk #KAIST #DRCDLab #強化学習

1

69

199

Amit Rathore

@amitrathore

1 month

In the Age of AI, start from First Principles. Unlock bottom-up design. Solve classes of problems, not isolated features. Think systems, not silos. Solve fundamentally. Scale exponentially. #AI #AIAgents #ReinforcementLearning #RAG #KnowledgeGraph #Orchestration

0

2

T.Yamazaki

@ZappyZappy7

27 days

強化学習により人間のような歩行を習得した人型ロボットオフィス内を歩き回る https://t.co/liyQ0pIqZN #bipedal #humanoid #robot #ImitationLearning #ReinforcementLearning #locomotion #Adam #PNDbotics #模倣学習 #強化学習

6

52

265

T.Yamazaki

@ZappyZappy7

1 month

ダンスパフォーマンスを披露する二足歩行人型ロボット https://t.co/Ch8sqE925V #bipedal #humanoid #robot #GeneralPurpose #EmbodiedAI #PhysicalAI #ReinforcementLearning #DeepLearning #Python #modular #LimXOli #LimxDynamics

4

27

106

Kirk Borne

@KirkDBorne

2 months

Deep #ReinforcementLearning Hands-On — Practical easy-to-follow guide to RL from Q-learning and DQNs to PPO and RLHF: https://t.co/pINhcw87qO [3rd Edition] v/ @PacktDataML —— #AI #MachineLearning #DeepLearning #DataScience #DataScientist —— 𝓚𝓮𝔂 𝓕𝓮𝓪𝓽𝓾𝓻𝓮𝓼: 🟢Learn with

215

34

177

Olivia Lee

@olivia_y_lee

3 months

Enabling robots to improve autonomously via RL will be powerful, and dense shaping rewards can greatly facilitate RL. Our #IROS2025 paper presents a method leveraging VLMs to derive dense rewards for efficient autonomous RL. ⚡🦾 #Robotics #ReinforcementLearning 🧵1/5

4

12

121

T.Yamazaki

@ZappyZappy7

2 months

様々な地形に適応する機敏な四脚ロボット(歩行/車輪) https://t.co/E3NZVFnnj4 #quadrupedal #quadruped #RobotDog #reinforcementlearning #ArtificialInteligence #EmbodiedAI #DeepRobotics

0

22

61

T.Yamazaki

@ZappyZappy7

1 month

アスリートのように考え、計画し、動くロボット自転車 https://t.co/DanO8MLvrI パルクールの機動性と、どんなに複雑な地形も知覚して計画し、ナビゲートする知性を兼ね備える #ReinforcementLearning #UltraMobileVehicle #UMV #JumpingBicycle #RAI_Institute

1

82

211

Science Robotics

@SciRobotics

2 months

Scientists have developed a method based on #ReinforcementLearning that enables a robot to use its upper body to lift and flip a water jug. @ToyotaResearch Learn more in Science #Robotics: https://t.co/IQjxoTsCni

4

5

44

Amir

@amir81k

3 hours

SAPO (Swarm sAmpling Policy Optimization) redefines LLM post-training through collective reinforcement learning — models learn together, share insights, and reach 94% higher rewards with less compute. 🧠🤝 🔗 https://t.co/fW592PEC4W #AI #LLMs #ReinforcementLearning #SAPO

16

0

16

HomeMadeGarbage

@H0meMadeGarbage

12 days

physical AI Sim2Real #ReinforcementLearning #MuJoCo

0

4

28

AGI.Eth

@ceobillionaire

17 days

UserRL: Training Interactive User-Centric Agent via Reinforcement Learning Qian et al.: https://t.co/0J7YRf3sU9 #ArtificialIntelligence #DeepLearning #ReinforcementLearning

4

2

11

Caroline Wang

@CarolineWang98

30 days

[1/4] 🚀 We’re excited to announce the v1 release of JaxAHT – a new library for Ad Hoc Teamwork (AHT) research, built with JAX for speed & scalability! Check it out 👉 https://t.co/Vmpbm72YwS #AI #MARL #ReinforcementLearning #JAX #AdHocTeamwork

1

7

37

HomeMadeGarbage

@H0meMadeGarbage

12 days

physical AI バランス直立強化学習 #ReinforcementLearning #MuJoCo

0

1

13

Ahmed Hendawy | أحمد هنداوى

@AHendawy19

12 days

I had a fantastic time discussing my research @AmiiThinks and @UAlberta last August. If you are interested in Multi-Task Reinforcement Learning (MTRL) and Mixture of Experts (MoE), then this talk is for you. ➡️ Full talk: https://t.co/iOLb93DFQs #reinforcementlearning #AI

1

3

15

Fabian Otto

@ottofabianRL

1 day

In our shared projected between KIT and @Microsoft Research we are exploring how to bring more principled #ReinforcementLearning methods to the post-training stage of #LLMs. 🧌 Project page: https://t.co/wJuvcsp6Fa 📜 ArXiv: https://t.co/oypfBCy5Ot 🔧 Code: coming soon

1

0

1