Trial-and-error Reinforcement Learning