Importance Sampling
https://www.coursera.org/learn/sample-based-learning-methods/lecture/6PRvh/course-introduction
Repeated random sampling
RL: estimate values directly from experience
DP (dynamic programming): the agent knows the transition probabilities
Monte Carlo: Estimate values by averaging over a large number of random samples
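The averaging idea can be sketched in a few lines of Python. This is only an illustration, not the course's code: the names `mc_estimate` and `roll_die` are hypothetical, and a fair die roll stands in for the return of one sampled episode.

```python
import random


def mc_estimate(sample_return, num_samples, seed=0):
    """Estimate an expected value by averaging num_samples random samples."""
    rng = random.Random(seed)  # seeded so the run is reproducible
    total = 0.0
    for _ in range(num_samples):
        total += sample_return(rng)
    return total / num_samples


def roll_die(rng):
    """Hypothetical stand-in for one sampled return: a fair six-sided die."""
    return rng.randint(1, 6)


# The true expected value of a fair die is 3.5; the average approaches it
# as the number of samples grows.
estimate = mc_estimate(roll_die, 100_000)
print(estimate)
```

Note that nothing here needs the transition probabilities: the estimate comes only from sampled outcomes, which is the contrast with DP drawn above.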
Continual exploration: give every action a non-zero probability in every state
Such a policy is always stochastic (it selects actions by probability)
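One common way to meet this requirement is an epsilon-soft (epsilon-greedy) policy. The notes do not name a specific scheme, so treat the sketch below as an assumption; `epsilon_soft_probs`, the action values, and `epsilon=0.1` are all illustrative.

```python
def epsilon_soft_probs(q_values, epsilon):
    """Return action probabilities that are non-zero for every action.

    Each action gets epsilon / n; the greedy action gets the remaining
    1 - epsilon on top of that. Requires 0 < epsilon <= 1.
    """
    n = len(q_values)
    greedy = max(range(n), key=lambda a: q_values[a])
    probs = [epsilon / n] * n
    probs[greedy] += 1.0 - epsilon
    return probs


# Hypothetical action values for one state with three actions.
probs = epsilon_soft_probs([1.0, 3.0, 2.0], epsilon=0.1)
print(probs)
```

Because every entry is at least `epsilon / n`, no action is ever ruled out, so the agent keeps exploring while still favoring the action it currently believes is best.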