Exercise 1.1 (Self-Play): If a reinforcement learning algorithm plays against itself, it might develop a strategy in which the algorithm facilitates winning by helping itself.

Reinforcement Learning: An Introduction, by Sutton and Barto.

This textbook provides a clear and simple account of the key ideas and algorithms of reinforcement learning that is accessible to readers in all the related disciplines.

Reinforcement learning (RL) can be viewed as an approach that falls between supervised and unsupervised learning. It is an area of machine learning in which a learner, often called the agent, learns to behave in an interactive environment by performing actions and observing the rewards or results it gets from those actions. In doing so, the agent discovers which actions give the best rewards.

RL has had tremendous success in many areas of machine learning. The field has been able to solve a wide range of complex decision-making tasks that were previously out of reach for a machine. Deep reinforcement learning is the combination of RL and deep learning, and it opens up many new applications in domains such as healthcare, robotics, smart grids, finance, and many more.
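
To make the idea of an agent discovering which actions give the best rewards more concrete, here is a minimal sketch in Python. It is not taken from the textbook's code; it assumes a simple multi-armed bandit setting with Gaussian reward noise and an epsilon-greedy agent, and the names run_bandit, true_means, and epsilon are hypothetical choices made for this illustration.

```python
import random


def run_bandit(true_means, steps=5000, epsilon=0.1, seed=0):
    """Epsilon-greedy agent on a multi-armed bandit (illustrative assumption).

    true_means holds the hidden average reward of each action; the agent
    never sees these values and only learns from sampled rewards.
    """
    rng = random.Random(seed)
    n_actions = len(true_means)
    estimates = [0.0] * n_actions  # running estimate of each action's mean reward
    counts = [0] * n_actions       # how many times each action has been tried

    for _ in range(steps):
        # Explore with probability epsilon, otherwise exploit the best estimate.
        if rng.random() < epsilon:
            action = rng.randrange(n_actions)
        else:
            action = max(range(n_actions), key=lambda a: estimates[a])

        # The environment responds with a noisy reward for the chosen action.
        reward = rng.gauss(true_means[action], 1.0)

        # Incremental sample-average update of the chosen action's estimate.
        counts[action] += 1
        estimates[action] += (reward - estimates[action]) / counts[action]

    return estimates, counts


if __name__ == "__main__":
    estimates, counts = run_bandit([0.2, 0.5, 1.5])
    best = max(range(len(estimates)), key=lambda a: estimates[a])
    print("estimated values:", [round(v, 2) for v in estimates])
    print("pull counts:", counts)
    print("action the agent prefers:", best)
```

The agent receives no labels telling it which action is correct. It only sees the reward that follows each choice, and the occasional random exploration is what lets it find the action with the highest average reward.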

