🚀 Day 4 of OpenSourceWeek: Optimized Parallelism Strategies
✅ DualPipe - a bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
🔗 github.com/deepseek-ai/Du…
✅ EPLB - an expert-parallel load balancer for V3/R1.
🔗 github.com/deepseek-ai/ep…
📊 Analyze computation-communication overlap in V3/R1.
🔗 github.com/deepseek-ai/pr…