[ML.1] AML - Reinforcement Learning assignment typos