AI秒破18世纪“天书”账本，谷歌新模型盲测刷屏全网

Core Insights - Google has potentially solved two longstanding challenges in AI with a mysterious model that successfully recognized and corrected a 200-year-old merchant's handwritten ledger, showcasing advanced reasoning capabilities that astonished historians [1][3][15]. Group 1: Model Performance - The mysterious model achieved near-perfect performance in handwritten text recognition (HTR) and corrected a formatting error in the original ledger, indicating its ability to understand the logic and context behind the text [3][15]. - The model's performance in HTR reached human expert-level accuracy, with a strict Character Error Rate (CER) of 1.7% and a Word Error Rate (WER) of 6.5% on a challenging test set [13][15]. - Compared to previous models, the new Gemini model demonstrated significant improvements, with Gemini-2.5-Pro showing a 50-70% enhancement over earlier versions [11][15]. Group 2: Historical Context and Challenges - Recognizing historical handwriting requires not only visual recognition but also an understanding of the historical context, making it a complex task for AI models [5][8]. - The model's ability to accurately transcribe difficult historical documents, including those with ambiguous numbers and inconsistent styles, marks a significant advancement in AI capabilities [19][23]. - The model successfully interpreted complex historical currency and measurement systems, showcasing its potential for abstract reasoning and contextual understanding [23][24]. Group 3: Expert Validation - Historian Mark Humphries utilized the model to test its capabilities, emphasizing that the final accuracy in historical text recognition is crucial for practical use [8][9]. - The model's performance in transcribing a ledger with non-standard formats and mixed languages was particularly impressive, as it corrected errors and inferred missing context [20][23]. - Humphries noted that the model's ability to perform multi-step reasoning and contextual inference suggests a shift towards genuine understanding in AI, beyond mere pattern recognition [24].