DeepSeek open-source optimization parallel strategy, releases DualPipe and EPLB

PANews
PANews|Feb 27, 2025 02:39
According to the DeepSeek (@ deepseek_ai) announcement, on the the fourth day of the Open Source Week, the team opened a number of optimization and parallel strategies, including DualPipe (two-way pipeline parallel algorithm, optimizing the calculation communication overlap in V3/R1 training), EPLB (expert parallel load balancer, improving the efficiency of computing resource allocation) and calculation communication overlap analysis tools to help optimize the training performance.
+5
Mentioned
Share To

Timeline

HotFlash

APP

X

Telegram

Facebook

Reddit

CopyLink

Hot Reads