📖 Qiyao Wang's Blog
Total Blogs
2025
- Aug. 01, 2025 CS336 Assignment 1: Detailed Implementation #Class #Basics
- Feb. 27, 2025 Three Sampling Methods: Temperature, Top K and Top P #Decoding
- Feb. 26, 2025 Greedy Search and Beam Search #Decoding
- Feb. 25, 2025 Autoregressive Decoding: Basic Manner of Decoder-Only LLMs #Decoding
- Feb. 24, 2025 Proximal Policy Optimization (PPO) and RLHF #RL
- Feb. 23, 2025 Basic Knowledge of Reinforcement Learning before PPO #RL
- Jan. 08, 2025 CMU DLSys Course Homework 1: Implementation and Reflection (Part1) #MLSys
- Jan. 07, 2025 Chain-of-Thought Reasoning without Prompting #Reasoning
- Jan. 01, 2025 CMU DLSys Course Homework 0: Implementation and Reflection #MLSys
2024
- Dec. 28, 2024 Pattern Recognition and Machine Learning #ML