Training the AI to select empty buttons after a random initialization
To initialize a random button is clicked
To win the agent must click each unoccupied button once