
Dimitri Bertsekas
@DBertsekas
Followers
11K
Following
39
Media
22
Statuses
113
https://t.co/2usZLzRs4A
ASU and MIT, USA
Joined February 2017
RT @simoptim: @DBertsekas Agree. When an appropriate feature space is constructed, aggregation methods often outperform neural networks. I’….
0
1
0
Sharing my new paper (joint with Yuchao Li and Kim Hammar) on Feature-Based Belief Aggregation for Partially Observable Markov Decision Problems," The paper gives favorable computational results involving very large scale problems.#ReinforcementLearning.
0
3
23
I am pleased to share my new paper (joint with Yuchao Li) on .Error Bounds for Aggregation Methods. In my view, aggregation is an under-appreciated off-line training approach in #reinforcementlearning.
1
5
34
I am pleased to share at High quality AI-generated podcast links for my books:.1) Lessons from AlphaZero . 2) Parallel and Distributed Computation.See for PDF copies.#reinforcementlearning#machinelearning.
0
7
37
A free PDF of the 1996 Neuro-Dynamic Programming book by myself and John Tsitsiklis, the 1st book in #reinforcementlearning, has been posted at An AI-generated podcast that summarizes the book can be found at
3
37
178
I am often asked about the relative merits of various #reinforcementlearning approaches, such as policy gradient and value-based methods. The last lecture of my RL course deals with this question, and related training issues, see:.
0
28
164
I am pleased to share a review of my book "A course in reinforcement learning" (2nd edition) This is the textbook for my RL course at ASU (free PDF at .#reinforcementlearning #machinelearning.
4
95
492
A free PDF of my book "Rollout, Policy Iteration, and #ReinforcementLearning " has been posted at my web site An extensive research account on rollout algorithms, including multiagent rollout, and the connection with Newton's method.
4
54
284
RT @KimHammar1: A recording of my guest lecture at ASU on aggregation for approximating POMDPs is available here: .
0
2
0
Just posted a videolecture on a Viterbi-like rollout/#reinforcementlearning algorithm for most likely sequence generation in Markov chains, and HMM inference, at Applies to large state spaces where the Viterbi algorithm is intractable.
1
31
138
RT @tomssilver: This week's #PaperILike is "Model Predictive Control and Reinforcement Learning: A Unified Framework Based on Dynamic Progr….
0
54
0
A video lecture on #reinforcementlearning was posted at Originally delivered at an IEEE Symposium on ADPRL, Orlando, 2014. Several of the ideas await further exploration. Slides at
3
33
165
RT @victor_explore: This is an amazing playlist to learn Reinforcement Learning by Dimitri Bertsekas
0
25
0