Download PDF
States and accentuators
Discrete choice
Continuous choice
States and deterministic environments
Intertemporal decision making
Planning
Lotteries and risk aversion
Fully observable Markov Decision Processes (MDP)
Decision making with unknown uncertainty
Sensors and Partially Observable Markov Decision Processes (POMDP)
Reinforcement learning
Control theory
Active learning
Simultaneous games
Sequential games
Deep sequential games
Repeated episodic games
Tree search
Games with imperfect or incomplete information
Stochastic games
For environment with no actors, transition model is matrix (for finite state).
Can multiply matrix n times to get state in n periods.
Transition matrix is perumation matrix. 1 or 0 only.
Does time stop to allow decisions. eg chess v driving.