Skip to content

Advanced Reinforcement Learning

Posted byczxttkl April 13, 2017 Leave a comment on Advanced Reinforcement Learning

Why TD($latex lambda$)?

Why actor-critic?

Why eligibility trace?

Why contextual regret minimization?

Posted byczxttklApril 13, 2017Posted inAlgorithm

Post navigation

Previous Post Previous post:
English Grammars

Next Post Next post:
Upgrade Cuda from 7.x to 8.0 on Ubuntu

Leave a comment

Cancel reply

Your email address will not be published. Required fields are marked *

Comment *

Name *

Email *

Website

Save my name, email, and website in this browser for the next time I comment.

Δ

Search for:

Recent Posts

New progress of generative modeling – flow matching
Learn GPU Optimization
Journey to Agents
LLM Long Context
Information Bottleneck + RL Exploration

Recent Comments

màn hình led on Focal loss for classification and regression
Ben You on Notes on “Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor”
chickenTensor on EmbeddingBag from PyTorch

Archives

Categories

Algorithm
finance
interview
latex
leetcode
leetsql
Network Technology
Python
R
RL
Uncategorized

Meta

Log in
Entries feed
Comments feed
WordPress.org

czxttkl, Proudly powered by WordPress.