Explore tweets tagged as #ReinforcementLearning
@Deluthium
Deluthium
17 minutes
What if markets could think before they move? At #Deluthium, we treat liquidity as signal, not noise. #ReinforcementLearning turns execution into adaptive intelligence. Brought to you by the Onchain Flash Boys. Powered by RL.
0
0
31
@ye_chenlu
Chenlu Ye
1 month
PROF🌀Right answer, flawed reason?🤔🌀 📄 https://t.co/8kFrxKQbVW Excited to share our work: PROF-PRocess cOnsistency Filter! 🚀 Challenge: ORM is blind to flawed logic, and PRM suffers from reward hacking. Our method harmonizes strengths of PRM & ORM. #LLM #ReinforcementLearning
2
10
37
@ZappyZappy7
T.Yamazaki
18 days
3
9
44
@ZappyZappy7
T.Yamazaki
26 days
A bipedal robot capable of advanced movements such as the "moonwalk" and the "duck walk." It clears high curbs (stairs) and obstacles with ease. https://t.co/FFNtxzfkf3 #bipedal #humanoid #robot #ReinforcementLearning #locomotion #moonwalk #KAIST #DRCDLab #強化学習
1
69
199
@amitrathore
Amit Rathore
1 month
In the Age of AI, start from First Principles. Unlock bottom-up design. Solve classes of problems, not isolated features. Think systems, not silos. Solve fundamentally. Scale exponentially. #AI #AIAgents #ReinforcementLearning #RAG #KnowledgeGraph #Orchestration
0
0
2
@ZappyZappy7
T.Yamazaki
27 days
A humanoid robot that learned a human-like gait through reinforcement learning walks around an office. https://t.co/liyQ0pIqZN #bipedal #humanoid #robot #ImitationLearning #ReinforcementLearning #locomotion #Adam #PNDbotics #模倣学習 #強化学習
6
52
265
@ZappyZappy7
T.Yamazaki
1 month
4
27
106
@KirkDBorne
Kirk Borne
2 months
Deep #ReinforcementLearning Hands-On — Practical easy-to-follow guide to RL from Q-learning and DQNs to PPO and RLHF: https://t.co/pINhcw87qO [3rd Edition] v/ @PacktDataML —— #AI #MachineLearning #DeepLearning #DataScience #DataScientist —— 𝓚𝓮𝔂 𝓕𝓮𝓪𝓽𝓾𝓻𝓮𝓼: 🟢Learn with
215
34
177
@olivia_y_lee
Olivia Lee
3 months
Enabling robots to improve autonomously via RL will be powerful, and dense shaping rewards can greatly facilitate RL. Our #IROS2025 paper presents a method leveraging VLMs to derive dense rewards for efficient autonomous RL. ⚡🦾 #Robotics #ReinforcementLearning 🧵1/5
4
12
121
@ZappyZappy7
T.Yamazaki
2 months
0
22
61
@ZappyZappy7
T.Yamazaki
1 month
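A robot bicycle that thinks, plans, and moves like an athlete. https://t.co/DanO8MLvrI It combines parkour-level mobility with the intelligence to perceive, plan for, and navigate even the most complex terrain. #ReinforcementLearning #UltraMobileVehicle #UMV #JumpingBicycle #RAI_Institute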
1
82
211
@SciRobotics
Science Robotics
2 months
Scientists have developed a method based on #ReinforcementLearning that enables a robot to use its upper body to lift and flip a water jug. @ToyotaResearch Learn more in Science #Robotics: https://t.co/IQjxoTsCni
4
5
44
@amir81k
Amir
3 hours
SAPO (Swarm sAmpling Policy Optimization) redefines LLM post-training through collective reinforcement learning — models learn together, share insights, and reach 94% higher rewards with less compute. 🧠🤝 🔗 https://t.co/fW592PEC4W #AI #LLMs #ReinforcementLearning #SAPO
16
0
16
@H0meMadeGarbage
HomeMadeGarbage
12 days
physical AI Sim2Real #ReinforcementLearning #MuJoCo
0
4
28
@ceobillionaire
AGI.Eth
17 days
UserRL: Training Interactive User-Centric Agent via Reinforcement Learning Qian et al.: https://t.co/0J7YRf3sU9 #ArtificialIntelligence #DeepLearning #ReinforcementLearning
4
2
11
@CarolineWang98
Caroline Wang
30 days
[1/4] 🚀 We’re excited to announce the v1 release of JaxAHT – a new library for Ad Hoc Teamwork (AHT) research, built with JAX for speed & scalability! Check it out 👉 https://t.co/Vmpbm72YwS #AI #MARL #ReinforcementLearning #JAX #AdHocTeamwork
1
7
37
@H0meMadeGarbage
HomeMadeGarbage
12 days
Physical AI: balancing upright with reinforcement learning #ReinforcementLearning #MuJoCo
0
1
13
@AHendawy19
Ahmed Hendawy | أحمد هنداوى
12 days
I had a fantastic time discussing my research @AmiiThinks and @UAlberta last August. If you are interested in Multi-Task Reinforcement Learning (MTRL) and Mixture of Experts (MoE), then this talk is for you. ​➡️ Full talk: https://t.co/iOLb93DFQs #reinforcementlearning #AI
1
3
15
@ottofabianRL
Fabian Otto
1 day
In our shared project between KIT and @Microsoft Research, we are exploring how to bring more principled #ReinforcementLearning methods to the post-training stage of #LLMs. 🧌 Project page: https://t.co/wJuvcsp6Fa 📜 ArXiv: https://t.co/oypfBCy5Ot 🔧 Code: coming soon
1
0
1