Dimitri Bertsekas @DBertsekas X Profile

Dimitri Bertsekas

@DBertsekas

Followers

11K

Following

39

Media

22

Statuses

113

https://t.co/2usZLzRs4A

ASU and MIT, USA

Joined February 2017

Don't wanna be here? Send us removal request.

Dimitri Bertsekas

@DBertsekas

4 days

RT @simoptim: @DBertsekas Agree. When an appropriate feature space is constructed, aggregation methods often outperform neural networks. I’….

0

1

0

Dimitri Bertsekas

@DBertsekas

8 days

Sharing my new paper (joint with Yuchao Li and Kim Hammar) on Feature-Based Belief Aggregation for Partially Observable Markov Decision Problems," The paper gives favorable computational results involving very large scale problems.#ReinforcementLearning.

0

3

23

Dimitri Bertsekas

@DBertsekas

11 days

I am pleased to share at slides, podcast and an essay on my lecture:. “Ten Simple Rules for Mathematical Writing”. Since its original delivery in a slide presentation at MIT (2002), it has been referenced widely and used in mathematical writing courses.

3

28

158

Dimitri Bertsekas

@DBertsekas

13 days

I am pleased to share my new paper (joint with Yuchao Li) on .Error Bounds for Aggregation Methods. In my view, aggregation is an under-appreciated off-line training approach in #reinforcementlearning.

1

5

34

Dimitri Bertsekas

@DBertsekas

1 month

I am pleased to share the link to my videolecture from 5/2/2025, at Harvard University:.Reinforcement Learning, Model Predictive Control, and Newton's Method for Solving Bellman's equation .Slides at

1

55

368

Dimitri Bertsekas

@DBertsekas

2 months

I am pleased to share at High quality AI-generated podcast links for my books:.1) Lessons from AlphaZero . 2) Parallel and Distributed Computation.See for PDF copies.#reinforcementlearning #machinelearning.

0

7

37

Dimitri Bertsekas

@DBertsekas

2 months

I am pleased to share podcasts (<30 mins) describing two of my books:.Neuro-Dynamic Programming. A Course in Reinforcement Learning.Free PDF of both books can be found at

3

89

465

Dimitri Bertsekas

@DBertsekas

2 months

RL is an older name for a much broader AI methodology, that does not connect well to the present content of the field (what are you reinforcing?) We thought Neuro-Dynamic Programming is a more descriptive name (the marriage of DP theory and NN technology).

1

3

13

Dimitri Bertsekas

@DBertsekas

2 months

A free PDF of the 1996 Neuro-Dynamic Programming book by myself and John Tsitsiklis, the 1st book in #reinforcementlearning, has been posted at An AI-generated podcast that summarizes the book can be found at

3

37

178

Dimitri Bertsekas

@DBertsekas

3 months

I am often asked about the relative merits of various #reinforcementlearning approaches, such as policy gradient and value-based methods. The last lecture of my RL course deals with this question, and related training issues, see:.

0

28

164

Dimitri Bertsekas

@DBertsekas

3 months

Thanks to @AntoMon for this thoughtful review!.

1

4

Dimitri Bertsekas

@DBertsekas

3 months

I am pleased to share a review of my book "A course in reinforcement learning" (2nd edition) This is the textbook for my RL course at ASU (free PDF at .#reinforcementlearning #machinelearning.

4

95

492

Dimitri Bertsekas

@DBertsekas

3 months

I am pleased to share the full set of videolectures, slides, textbook, and other supporting material of the 7th offering of my Reinforcement Learning class at ASU, which was completed two days ago; check

16

238

1K

Dimitri Bertsekas

@DBertsekas

3 months

I am pleased to share the video from my yesterday's lecture "Abstract Dynamic Programming, Reinforcement Learning, Newton's Method, and Gradient Optimization" at the ASU Math Dept.This is an overview lecture on the relations between DP and RL.

3

90

436

Dimitri Bertsekas

@DBertsekas

3 months

A free PDF of my book "Rollout, Policy Iteration, and #ReinforcementLearning " has been posted at my web site An extensive research account on rollout algorithms, including multiagent rollout, and the connection with Newton's method.

4

54

284

Dimitri Bertsekas

@DBertsekas

3 months

RT @KimHammar1: A recording of my guest lecture at ASU on aggregation for approximating POMDPs is available here: .

0

2

0

Dimitri Bertsekas

@DBertsekas

4 months

Just posted a videolecture on a Viterbi-like rollout/#reinforcementlearning algorithm for most likely sequence generation in Markov chains, and HMM inference, at Applies to large state spaces where the Viterbi algorithm is intractable.

1

31

138

Dimitri Bertsekas

@DBertsekas

5 months

RT @tomssilver: This week's #PaperILike is "Model Predictive Control and Reinforcement Learning: A Unified Framework Based on Dynamic Progr….

0

54

0

Dimitri Bertsekas

@DBertsekas

5 months

A video lecture on #reinforcementlearning was posted at Originally delivered at an IEEE Symposium on ADPRL, Orlando, 2014. Several of the ideas await further exploration. Slides at

3

33

165

Dimitri Bertsekas

@DBertsekas

5 months

RT @victor_explore: This is an amazing playlist to learn Reinforcement Learning by Dimitri Bertsekas

0

25

0