Syllabus
Content
Core Bibliography
Reinforcement Learning
- Professor: Prof. Dr. Thiago Silva
- Workload: 30 h
Decision-making under uncertainty. Monte Carlo simulation. Markov decision processes. Approximate dynamic programming. Q-learning. Proximal Policy Optimization (PPO). Applications in industry.
- Decision-making under uncertainty
- Monte Carlo simulation
- Markov decision processes
- Approximate dynamic programming
- Q-learning
- Proximal Policy Optimization (PPO)
- Applications in industry
- SUTTON, Richard S.; BARTO, Andrew G. Reinforcement learning: An introduction. MIT Press, 2018.
- BERTSEKAS, Dimitri P. Dynamic programming and optimal control. Belmont, MA: Athena Scientific, 2005.
- POWELL, Warren B. Approximate dynamic programming: Solving the curses of dimensionality. John Wiley & Sons, 2011.