czxttkl

Author Archives: czxttkl

Information Bottleneck + RL Exploration

Posted by czxttkl on April 1, 2025 (updated April 12, 2025) in Algorithm

View LLMs as compressors + Scaling laws

Posted by czxttkl on June 24, 2024 (updated July 14, 2024) in Algorithm

TQQQ/UPRO + volatility

Posted by czxttkl on May 14, 2024 (updated July 13, 2024) in finance

More details in DPO

Posted by czxttkl on May 13, 2024 (updated June 5, 2024) in Algorithm

Minimal examples of HuggingFace LLM training

Posted by czxttkl on February 15, 2024 in Network Technology, Python

Causal Inference 102

Posted by czxttkl on February 6, 2024 (updated March 5, 2024) in Algorithm

Reinforcement Learning in LLMs

Posted by czxttkl on January 23, 2024 (updated February 5, 2024) in Algorithm, RL

Llama code anatomy

Posted by czxttkl on January 17, 2024 (updated January 31, 2024) in Algorithm

Improve reasoning for LLMs

Posted by czxttkl on January 15, 2024 (updated May 13, 2024) in Algorithm

Dollar cost average on TQQQ vs QQQ [Real Data]

Posted by czxttkl on January 13, 2024 (updated January 15, 2024) in finance, Python

