DeepEP is the first open-source library for efficient expert-parallel communication, designed specifically for training and inference of Mixture-of-Experts (MoE) models.

On February 25, the second day of its "Open Source Week", DeepSeek released the source code of DeepEP. DeepEP is the first open-source EP (Expert Parallelism) communication library for MoE (Mixture-of-Experts) model training and inference. It provides highly optimized all-to-all communication, supports low-precision computation including FP8, and targets modern high-performance computing workloads.

DeepEP is also deeply optimized for the asymmetric-bandwidth forwarding scenario from NVLink to RDMA, delivering high throughput, and it supports controlling the number of SMs (Streaming Multiprocessors) used, balancing throughput between training and inference tasks. For latency-sensitive decoding scenarios, DeepEP provides low-latency kernels based on pure RDMA with adaptive-routing support, enabling more flexible control of GPU resources across different workloads.
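To make the dispatch-and-combine pattern concrete, below is a minimal conceptual sketch of expert-parallel all-to-all communication using PyTorch's standard torch.distributed primitives. This is not DeepEP's own API: the function and variable names are illustrative, and the sketch assumes an already-initialized process group (e.g. NCCL) and that a gating network has already assigned each token a destination rank. DeepEP replaces these generic collectives with kernels tuned for NVLink and RDMA.

```python
# Conceptual sketch (not DeepEP's API) of the all-to-all pattern an EP
# communication library optimizes. In expert parallelism, each rank holds
# a subset of experts; tokens routed to remote experts are exchanged with
# one all-to-all (dispatch), processed locally, then returned with a
# second all-to-all (combine).
import torch
import torch.distributed as dist

def moe_dispatch_combine(tokens: torch.Tensor,
                         dest_rank: torch.Tensor,
                         world_size: int) -> torch.Tensor:
    # tokens: [num_tokens, hidden]; dest_rank: [num_tokens] ints in
    # [0, world_size), the rank owning each token's target expert.
    # Sort tokens by destination so each rank's slice is contiguous.
    order = torch.argsort(dest_rank)
    sorted_tokens = tokens[order]

    # Exchange per-rank send counts so every rank knows what it receives.
    send_counts = torch.bincount(dest_rank, minlength=world_size)
    recv_counts = torch.empty_like(send_counts)
    dist.all_to_all_single(recv_counts, send_counts)

    # Dispatch: move each token to the rank that owns its expert.
    recv_tokens = tokens.new_empty(int(recv_counts.sum()), tokens.size(1))
    dist.all_to_all_single(recv_tokens, sorted_tokens,
                           recv_counts.tolist(), send_counts.tolist())

    expert_out = recv_tokens * 2.0  # stand-in for the local expert MLPs

    # Combine: the reverse all-to-all returns results to their source rank.
    combined = tokens.new_empty(sorted_tokens.shape)
    dist.all_to_all_single(combined, expert_out,
                           send_counts.tolist(), recv_counts.tolist())

    # Undo the sort to restore the original token order.
    out = torch.empty_like(combined)
    out[order] = combined
    return out
```

Every forward pass through a MoE layer performs one such dispatch and one combine, which is why the efficiency of this all-to-all exchange dominates expert-parallel communication cost.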