🚀Day4ofOpenSourceWeek:OptimizedP

春蕴评趣事 2025-02-28 09:35:29

🚀 Day 4 of OpenSourceWeek: Optimized Parallelism Strategies

✅ DualPipe - a bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.

🔗 github.com/deepseek-ai/Du…

✅ EPLB - an expert-parallel load balancer for V3/R1.

🔗 github.com/deepseek-ai/ep…

📊 Analyze computation-communication overlap in V3/R1.

🔗 github.com/deepseek-ai/pr…

0 阅读:0
春蕴评趣事

春蕴评趣事

感谢大家的关注