Advanced Reinforcement Learning Posted byczxttkl April 13, 2017 Leave a comment on Advanced Reinforcement Learning Why TD($latex lambda$)? Why actor-critic? Why eligibility trace? Why contextual regret minimization?