F Learning goals

The purpose of this course is to give you an introduction and knowledge about reinforcement learning (RL). After having participated in the course, you must, in addition to achieving general academic skills, demonstrate:

Knowledge of

RL for Bandit problems
Markov decision processes and ways to optimize them
the exploration vs exploitation challenge in RL and approaches for addressing this challenge
the role of policy evaluation with stochastic approximation in the context of RL

Skills to

define the key features of RL that distinguishes it from other machine learning techniques
discuss fundamental concepts in RL
describe the mathematical framework of Markov decision processes
formulate and solve Markov and semi-Markov decision processes for realistic problems with finite state space under different objectives
apply fundamental techniques, results and concepts of RL on selected RL problems.
given an application problem, decide if it should be formulated as a RL problem and define it formally (in terms of the state space, action space, dynamics and reward model)

Competences to

identify areas where RL are valuable
select and apply the appropriate RL model for a given business problem
interpret and communicate the results from RL