RL - HackMD

# RL ## 1 Define the following properties of the FishingDerbyRL MDP: - State Space S: The total number of states of the environment (which is represented by a grid world). - Action Space A: All possible actions carried by the diver. ## 4.1 1. Not improving/learning $\alpha = 1$ and $\alpha = 0$ 2. High variance but fast learning High $\alpha < 1$ 3. Low variance and high long-term return Low $\alpha$, high $\gamma$ 4. High variance and high long-term return High $\alpha$, high $\gamma$