00:00 - Intro
02:15 - 1: Generation (Perplexity)
15:40 - 2: Memory (Attention)
28:00 - 3: Efficiency (GEMM)
38:40 - 4: Scaling (Chinchilla)
46:37 - 5: Reasoning (RASP)
55:33 - Conclusion