Minibatch AI
Posts about rl
REINFORCE on the short-corridor gridworld, Part 1 (RL S&B Chapter 13) (26 Apr 2022)
Gradient Monte Carlo for Value Function Approximation (RL S&B Example 9.1) (06 Sep 2021)
Vectorising the Bellman equations (RL S&B Examples 3.5, 3.8) (31 Aug 2021)