Reinforcement-learning algorithms 1,2 are inspired by our understanding of decision making in humans and other animals, in which learning is supervised through reward signals in response to ...
Biological and artificial agents must achieve goals to survive and be useful. This goal-directed or hedonistic behaviour is the foundation of reinforcement learning (RL) 1, which is learning to choose ...