Reasoning Models on Latent

Reasoning Models on Latent https://latent-site.pages.dev/tags/reasoning-models/ Recent content in Reasoning Models on Latent Hugo zh 2026 Latent Sun, 05 Jul 2026 00:00:00 +0000 03 | 推理模型——o1、R1 和 test-time compute 到底在做什么 https://latent-site.pages.dev/posts/2026/07/03-%E6%8E%A8%E7%90%86%E6%A8%A1%E5%9E%8Bo1r1-%E5%92%8C-test-time-compute-%E5%88%B0%E5%BA%95%E5%9C%A8%E5%81%9A%E4%BB%80%E4%B9%88/ Sun, 05 Jul 2026 00:00:00 +0000 https://latent-site.pages.dev/posts/2026/07/03-%E6%8E%A8%E7%90%86%E6%A8%A1%E5%9E%8Bo1r1-%E5%92%8C-test-time-compute-%E5%88%B0%E5%BA%95%E5%9C%A8%E5%81%9A%E4%BB%80%E4%B9%88/ 解释推理模型（o1/o3、DeepSeek-R1）的工作原理：test-time compute、内部思维链、强化学习训练，以及它和传统 CoT Prompt 的本质区别。