Explore tweets tagged as #ReinforcementLearning
What if markets could think before they move? At #Deluthium, we treat liquidity as signal, not noise. #ReinforcementLearning turns execution into adaptive intelligence. Brought to you by the Onchain Flash Boys. Powered by RL.
0
0
31
PROF🌀Right answer, flawed reason?🤔🌀 📄 https://t.co/8kFrxKQbVW Excited to share our work: PROF-PRocess cOnsistency Filter! 🚀 Challenge: ORM is blind to flawed logic, and PRM suffers from reward hacking. Our method harmonizes strengths of PRM & ORM. #LLM #ReinforcementLearning
2
10
37
ワンコ🐶のように制約された空間を通過する四足歩行ロボット https://t.co/kDPk6MX0Rl
#quadrupedal #robot #locomotion #transition #ImitationLearning #ReinforcementLearning #pipeline #framework #dynamic #biological #mimics #ArcLab
3
9
44
「ムーンウォーク」や「アヒル歩き」などの高度な動きができる二脚ロボット 高い縁石(階段)や障害物も難なく乗り越える https://t.co/FFNtxzfkf3
#bipedal #humanoid #robot #ReinforcementLearning #locomotion #moonwalk #KAIST #DRCDLab #強化学習
1
69
199
In the Age of AI, start from First Principles. Unlock bottom-up design. Solve classes of problems, not isolated features. Think systems, not silos. Solve fundamentally. Scale exponentially. #AI #AIAgents #ReinforcementLearning #RAG #KnowledgeGraph #Orchestration
0
0
2
強化学習により人間のような歩行を習得した人型ロボット オフィス内を歩き回る https://t.co/liyQ0pIqZN
#bipedal #humanoid #robot #ImitationLearning #ReinforcementLearning #locomotion #Adam #PNDbotics #模倣学習 #強化学習
6
52
265
ダンスパフォーマンスを披露する二足歩行人型ロボット https://t.co/Ch8sqE925V
#bipedal #humanoid #robot #GeneralPurpose #EmbodiedAI #PhysicalAI #ReinforcementLearning #DeepLearning #Python #modular #LimXOli #LimxDynamics
4
27
106
Deep #ReinforcementLearning Hands-On — Practical easy-to-follow guide to RL from Q-learning and DQNs to PPO and RLHF: https://t.co/pINhcw87qO [3rd Edition] v/ @PacktDataML —— #AI #MachineLearning #DeepLearning #DataScience #DataScientist —— 𝓚𝓮𝔂 𝓕𝓮𝓪𝓽𝓾𝓻𝓮𝓼: 🟢Learn with
215
34
177
Enabling robots to improve autonomously via RL will be powerful, and dense shaping rewards can greatly facilitate RL. Our #IROS2025 paper presents a method leveraging VLMs to derive dense rewards for efficient autonomous RL. ⚡🦾 #Robotics #ReinforcementLearning 🧵1/5
4
12
121
様々な地形に適応する機敏な四脚ロボット(歩行/車輪) https://t.co/E3NZVFnnj4
#quadrupedal #quadruped #RobotDog #reinforcementlearning #ArtificialInteligence #EmbodiedAI #DeepRobotics
0
22
61
アスリートのように考え、計画し、動くロボット自転車 https://t.co/DanO8MLvrI パルクールの機動性と、どんなに複雑な地形も知覚して計画し、ナビゲートする知性を兼ね備える #ReinforcementLearning #UltraMobileVehicle #UMV #JumpingBicycle #RAI_Institute
1
82
211
Scientists have developed a method based on #ReinforcementLearning that enables a robot to use its upper body to lift and flip a water jug. @ToyotaResearch Learn more in Science #Robotics: https://t.co/IQjxoTsCni
4
5
44
SAPO (Swarm sAmpling Policy Optimization) redefines LLM post-training through collective reinforcement learning — models learn together, share insights, and reach 94% higher rewards with less compute. 🧠🤝 🔗 https://t.co/fW592PEC4W
#AI #LLMs #ReinforcementLearning #SAPO
16
0
16
UserRL: Training Interactive User-Centric Agent via Reinforcement Learning Qian et al.: https://t.co/0J7YRf3sU9
#ArtificialIntelligence #DeepLearning #ReinforcementLearning
4
2
11
[1/4] 🚀 We’re excited to announce the v1 release of JaxAHT – a new library for Ad Hoc Teamwork (AHT) research, built with JAX for speed & scalability! Check it out 👉 https://t.co/Vmpbm72YwS
#AI #MARL #ReinforcementLearning #JAX #AdHocTeamwork
1
7
37
I had a fantastic time discussing my research @AmiiThinks and @UAlberta last August. If you are interested in Multi-Task Reinforcement Learning (MTRL) and Mixture of Experts (MoE), then this talk is for you. ➡️ Full talk: https://t.co/iOLb93DFQs
#reinforcementlearning #AI
1
3
15
In our shared projected between KIT and @Microsoft Research we are exploring how to bring more principled #ReinforcementLearning methods to the post-training stage of #LLMs. 🧌 Project page: https://t.co/wJuvcsp6Fa 📜 ArXiv: https://t.co/oypfBCy5Ot 🔧 Code: coming soon
1
0
1