2025-02-13 Checkpoints Fine-Tuned on Long Chain-of-Thought (CoT) Examples (2025-02-12 17:18)
2025-02-13 Supervised Fine-Tuning (SFT) Data (2025-02-12 17:10)
2025-02-08 The Multi-Stage Training Pipeline for Large Language Models (2025-02-08 23:20)
2025-02-07 DeepSeek's Cold-Start Data (2025-02-07 17:34)
2025-02-07 Supervised Learning (2025-02-06 17:06)
2025-02-07 Pure Reinforcement Learning (RL) (2025-02-06 17:04)
2025-02-07 A Breakdown of the DeepSeek Technical Paper (2025-02-07 14:40)
2024-09-02 Mr. Toad Goes to See a Therapist (蛤蟆先生去看心理医生) (2024-09-01 21:36)
2023-07-16 Reading Notes on "Eight Crises" (《八次危机》), Part 9 (2023-07-15 23:00)
2023-07-15 Reading Notes on "Eight Crises" (《八次危机》), Part 8 (2023-07-14 22:49)