
Rajeev Ranjan Pandey
@rrpandey_in
Followers
87
Following
309
Media
47
Statuses
360
PhDing @IITBHU_Varanasi | #ReinforcementLearning | Sharing PhD journey, insights and paper summaries.
Varanasi, India
Joined November 2023
Life is full of choices: short-term fun vs long-term payoff. Value Iteration is the algorithm that helps #RL agents figure it out. Here’s an explanation on how it works with some real world examples 👇.
1
0
0