Skip to content

czxttkl

Author Archives: czxttkl

How does Metropolis-Hastings algorithm work?

Posted byczxttklAugust 29, 2022August 29, 2022Posted inAlgorithmLeave a comment on How does Metropolis-Hastings algorithm work?

Terms you need to know when working for Ads

Posted byczxttklAugust 8, 2022July 31, 2023Posted inNetwork TechnologyLeave a comment on Terms you need to know when working for Ads

Tools needed to facilitate long-term value optimization

Posted byczxttklJune 2, 2022September 2, 2022Posted inAlgorithmLeave a comment on Tools needed to facilitate long-term value optimization

Some SOTA Model-based RL

Posted byczxttklMarch 16, 2022March 20, 2022Posted inAlgorithm, RLLeave a comment on Some SOTA Model-based RL

Laplacian Approximation and Bayesian Logistic Regression

Posted byczxttklFebruary 15, 2022February 16, 2022Posted inAlgorithmLeave a comment on Laplacian Approximation and Bayesian Logistic Regression

Markov Chain and Markov Decision Process on Graphs

Posted byczxttklDecember 6, 2021September 1, 2022Posted inAlgorithmLeave a comment on Markov Chain and Markov Decision Process on Graphs

Check object memory in Python

Posted byczxttklDecember 4, 2021December 4, 2021Posted inPythonLeave a comment on Check object memory in Python

Normalizing Flows

Posted byczxttklNovember 15, 2021November 15, 2021Posted inAlgorithmLeave a comment on Normalizing Flows

Leetcode 695. Max Area of Island

Posted byczxttklAugust 30, 2021August 31, 2021Posted ininterview, leetcodeLeave a comment on Leetcode 695. Max Area of Island

Data Parallelism and Model Parallelism

Posted byczxttklAugust 9, 2021January 4, 2022Posted inNetwork TechnologyLeave a comment on Data Parallelism and Model Parallelism

Posts pagination

Newer posts 1 2 3 4 5 … 41 Older posts

Recent Posts

  • Information Bottleneck + RL Exploration
  • View LLMs as compressors + Scaling laws
  • TQQQ/UPRO + volatility
  • More details in DPO
  • Minimal examples of HuggingFace LLM training

Recent Comments

  • màn hình led on Focal loss for classification and regression
  • Ben You on Notes on “Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor”
  • chickenTensor on EmbeddingBag from PyTorch

Archives

  • April 2025
  • June 2024
  • May 2024
  • February 2024
  • January 2024
  • November 2023
  • February 2023
  • January 2023
  • November 2022
  • October 2022
  • September 2022
  • August 2022
  • June 2022
  • March 2022
  • February 2022
  • December 2021
  • November 2021
  • August 2021
  • July 2021
  • May 2021
  • April 2021
  • March 2021
  • February 2021
  • December 2020
  • October 2020
  • September 2020
  • August 2020
  • July 2020
  • June 2020
  • May 2020
  • April 2020
  • March 2020
  • February 2020
  • January 2020
  • October 2019
  • September 2019
  • July 2019
  • May 2019
  • April 2019
  • March 2019
  • February 2019
  • January 2019
  • October 2018
  • August 2018
  • May 2018
  • April 2018
  • February 2018
  • January 2018
  • December 2017
  • October 2017
  • September 2017
  • August 2017
  • July 2017
  • June 2017
  • May 2017
  • April 2017
  • March 2017
  • February 2017
  • January 2017
  • November 2016
  • October 2016
  • September 2016
  • August 2016
  • July 2016
  • June 2016
  • May 2016
  • April 2016
  • February 2016
  • January 2016
  • December 2015
  • November 2015
  • October 2015
  • September 2015
  • August 2015
  • July 2015
  • June 2015
  • May 2015
  • April 2015
  • March 2015

Categories

  • Algorithm
  • finance
  • interview
  • latex
  • leetcode
  • leetsql
  • Network Technology
  • Python
  • R
  • RL
  • Uncategorized

Meta

  • Log in
  • Entries feed
  • Comments feed
  • WordPress.org
czxttkl, Proudly powered by WordPress.