
Perspective AI|Feb 18, 2025 08:47
DeepSeek has introduced NSA, which stands for Native Sparse Attention. It's a new attention mechanism designed to make AI models run faster, especially when they have to handle very long inputs.
NSA is built around what DeepSeek calls a dynamic hierarchical sparse strategy, which works in two steps: it first squishes groups of tokens into compact summaries so the model keeps a view of the whole context (coarse-grained token compression), and then zooms in on the tokens that matter most (fine-grained token selection).
This helps the AI run quickly on today's hardware without giving up quality. According to DeepSeek, it is as good as, or even better than, full attention when it comes to general benchmarks, understanding long pieces of text, and following instructions.
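To make the compress-then-select idea concrete, here is a toy sketch in plain Python. It is not DeepSeek's implementation: the function name `sparse_attention`, the parameters `block_size` and `top_k`, and the use of a simple average to summarize each block are all assumptions made for illustration. It only shows the general pattern of scoring cheap block summaries first and then running ordinary attention over the tokens in the best-scoring blocks.

```python
import math


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def sparse_attention(query, keys, values, block_size=2, top_k=1):
    """Toy two-stage sparse attention for a single query (illustrative only)."""
    dim = len(query)

    # Coarse stage: split token positions into blocks and summarize each
    # block's keys with a plain average (an assumption made for this sketch).
    blocks = [
        list(range(start, min(start + block_size, len(keys))))
        for start in range(0, len(keys), block_size)
    ]
    summaries = [
        [sum(keys[i][d] for i in block) / len(block) for d in range(dim)]
        for block in blocks
    ]
    block_scores = [dot(query, summary) for summary in summaries]

    # Fine stage: keep only the top_k highest-scoring blocks.
    ranked = sorted(range(len(blocks)), key=lambda b: block_scores[b], reverse=True)
    chosen = sorted(ranked[:top_k])
    selected = [i for b in chosen for i in blocks[b]]

    # Standard scaled dot-product attention, restricted to the selected tokens.
    scale = math.sqrt(dim)
    weights = softmax([dot(query, keys[i]) / scale for i in selected])
    out_dim = len(values[0])
    output = [
        sum(w * values[i][d] for w, i in zip(weights, selected))
        for d in range(out_dim)
    ]
    return output, selected


if __name__ == "__main__":
    query = [1.0, 0.0]
    keys = [[0.0, 1.0], [0.0, 1.0], [5.0, 0.0], [4.0, 0.0], [0.0, -1.0], [0.0, -1.0]]
    values = [[1.0], [2.0], [3.0], [4.0], [5.0], [6.0]]
    output, selected = sparse_attention(query, keys, values)
    print("attended positions:", selected)
    print("output:", output)
```

With six tokens and `top_k=1`, the query only attends to two of them, which is where the speedup comes from: the expensive attention step sees a small slice of the input, while the cheap block summaries still cover the whole thing.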
Interestingly, #DeepSeek chose to announce NSA right around the time when Grok 3 by xAI, another big AI model, was launched.
This timing might be a clever move to get more eyes on NSA, since everyone would be talking about AI because of #Grok3's launch.
It's like releasing a new song when there's already a big music event happening, so more people hear about it.
Note: This is just about the timing of the announcements and doesn't take sides on which is better.