Group 1 - DeepSeek has launched the new DeepSeek-OCR2 model, which utilizes the innovative DeepEncoderV2 method to dynamically rearrange image components based on their meaning, rather than scanning mechanically from left to right [1] - The new model achieved a performance of 91.09%, an improvement of 3.73% over its predecessor, while reducing the maximum visual token usage from 1156 to 1120 [1] - The release of DeepSeek-OCR2 is significant as it may disrupt traditional document processing methods and pave the way for native multimodal reasoning [1] Group 2 - Haitong International states that DeepSeek-OCR represents a new generation of "compressed storage" by mapping text to visual representations and compressing it at high rates, achieving about 97% text restoration accuracy at less than 10x compression [2] - At a 20x compression rate, the model maintains approximately 60% accuracy, suitable for scenarios with higher tolerance for errors [2] - Huachuang Securities highlights DeepSeek-OCR's capability to process 33 million pages of data daily on 20 A100 nodes and its strong support for minor languages, indicating a significant advantage for global business deployment [2] Group 3 - Jin Modern has collaborated with Baidu on the development of large model applications and complementary OCR recognition capabilities [3] - Hanwang Technology has provided clients with various platforms, including a low-code development platform and an OCR platform [3]
或颠覆文档处理模式,DeepSeek OCR模型再更新