DeepSeek FlashMLA
DeepSeek's New Model "Model 1" Revealed, Possibly an "Efficient Inference Model"
Sina Finance (Xin Lang Cai Jing) · 2026-01-21 06:58
Core Insights
- DeepSeek has pushed a series of FlashMLA code updates to its official GitHub repository, drawing attention to a model named "Model 1" [1][2]
- Model 1 is speculated to be the code name of the new model that DeepSeek is expected to release around the Chinese New Year [2]

Model Specifications
- Model 1 is one of the two main model architectures supported in DeepSeek FlashMLA, alongside DeepSeek-V3.2 [2]
- It is likely an efficient inference model with lower memory usage than V3.2, which would make it suitable for edge devices or cost-sensitive scenarios [2]
- Model 1 may also serve as a long-sequence specialist optimized for sequences longer than 16K, which would suit tasks such as document understanding and code analysis [2]