DeepSeek open-source optimization parallel strategy, releases DualPipe and EPLB

PANews|Feb 27, 2025 02:39
According to the DeepSeek (@ deepseek_ai) announcement, on the the fourth day of the Open Source Week, the team opened a number of optimization and parallel strategies, including DualPipe (two-way pipeline parallel algorithm, optimizing the calculation communication overlap in V3/R1 training), EPLB (expert parallel load balancer, improving the efficiency of computing resource allocation) and calculation communication overlap analysis tools to help optimize the training performance.
Share To
Timeline
HotFlash
APP
X
Telegram
CopyLink