大语言模型中的记忆遗忘机制 - filings, earnings calls, financial reports, news - Reportify

大语言模型中的记忆遗忘机制

Search documents

突破新领域，深度求索发布文字识别模型DeepSeek-OCR

Bei Ke Cai Jing· 2025-10-20 12:37

Core Insights - DeepSeek has released a new model called DeepSeek-OCR on the open-source community platform Hugging Face, which is designed for Optical Character Recognition (OCR) to extract text from images [1][3] Group 1: Model Description - DeepSeek-OCR is described in a related paper as a preliminary study on the feasibility of compressing long contexts through optical two-dimensional mapping [3] - The model achieves a decoding (OCR) accuracy of 97% when the number of text tokens is within 10 times the number of visual tokens (compression ratio < 10) [3] - Even at a compression ratio of 20, the OCR accuracy remains around 60%, indicating significant potential for research areas such as long context compression and memory forgetting mechanisms in large language models [3]

Seek .(US:SKLTY)

长上下文压缩

大语言模型中的记忆遗忘机制

Artificial Intelligence

长上下文压缩

大语言模型中的记忆遗忘机制

Artificial Intelligence