10) Lecture 9 - Temporal Difference Prediction Reinforcement Learning Phase ReasoningLLMsfromScratch3просмотра13 дней назад
9) Lecture 8 - Monte Carlo Methods Reinforcement Learning Phase Reasoning LLMs from Scratch5просмотров13 дней назад
8) Lecture 7 - Dynamic Programming Reinforcement Learning Phase Reasoning LLMs from Scratch5просмотров13 дней назад
7) Lecture 6 - Value Functions Reinforcement Learning Reasoning LLMs from Scratch3просмотра13 дней назад
28) How DeepSeek Rewrote Quantization Part 2 Accumulation Precision Online Quantization5просмотров14 дней назад
27) How DeepSeek Rewrote Quantization Part 1 Mixed Precision Fine-grained quantization3просмотра14 дней назад