Skip to content

Monthly Archives: October 2019

TRPO, PPO, Graph NN + RL

Posted byczxttklOctober 30, 2019November 15, 2019Posted inAlgorithmLeave a comment on TRPO, PPO, Graph NN + RL

Notes on “Recommending What Video to Watch Next: A Multitask Ranking System”

Posted byczxttklOctober 10, 2019November 11, 2019Posted inAlgorithmLeave a comment on Notes on “Recommending What Video to Watch Next: A Multitask Ranking System”

Search for:

Recent Posts

New progress of generative modeling – flow matching
Learn GPU Optimization
Journey to Agents
LLM Long Context
Information Bottleneck + RL Exploration

Recent Comments

màn hình led on Focal loss for classification and regression
Ben You on Notes on “Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor”
chickenTensor on EmbeddingBag from PyTorch

Archives

Categories

Algorithm
finance
interview
latex
leetcode
leetsql
Network Technology
Python
R
RL
Uncategorized

Meta

Log in
Entries feed
Comments feed
WordPress.org

czxttkl, Proudly powered by WordPress.