Intro to Reinforcement Learning: The ExploreExploit ...
· This is called exploitation in reinforcement learning where one can take the optimal decisions with the highest possible outcome given current acquired knowledge or information. ... we should explore all the machines to find out the machine with the best win rates.