Critical factors in the empirical performance of temporal difference and evolutionary methods for reinforcement learning