非结构化数据解析
Search documents
从“模型为王”到“数据为基”:WPS 365如何帮企业挖掘数据金矿?
Xin Lang Cai Jing· 2026-02-03 11:33
来源:连线Insight 编辑/子夜 "即便是性能卓越的'神模',12个月后用户留存率也可能降至较低水平。" 2026年1月27日,WPS 365 AI协同办公峰会在上海举办。会上,中金公司研究部执行总经理、计算机行业首席分析师于钟海如此说道。 这番话放在三年前,可能没人会信。 但在2026年初,这个问题的答案变得越来越不重要。模型正在变成基础设施,真正的竞争焦点已悄然转移。 当大模型的上下文有限,模型能力趋同,那么toB AI的竞争实质,其实是效率竞争,是"谁能为AI提供更丰富、更准确,可被理解的上下文"。 这种情况下,企业数据的重要性会大幅拔高。 连线Insight在峰会现场观察到,延锋国际、东方航空、上海信投等华东龙头企业分享的落地案例指向同一个结论:AI项目从Demo到上线,最大障碍不是 算力或模型,而是如何让散落各处、格式混乱的企业文档真正被AI理解。 企业级AI应用的竞争重心,正从"模型能力"转向"数据治理"。 而在这场数据竞争中,一个容易被忽视的技术环节正在成为关键:非结构化数据的解析,尤其是复杂文档的解析与知识化能力,直接决定了企业数据资产 的质量上限。 WPS 365 统合进行知识治理,图源 ...
两个“卖铲”的程序员,不营销却在不到2年撬动7个亿
虎嗅APP· 2025-11-30 03:09
Core Insights - The article discusses the emergence of Reducto AI, a startup that addresses the critical issue of document parsing in the AI landscape, particularly focusing on the challenges posed by unstructured data [6][8][36]. - Reducto AI has rapidly gained attention and funding, raising $108 million in just 18 months, highlighting the urgent need for high-accuracy document processing solutions in various industries [9][13][22]. Company Overview - Reducto AI was founded in January 2023 by Adit Abraham and Raunak Chowdhuri, both MIT graduates with strong backgrounds in technology and product management [26][27]. - The company focuses on providing an API-first document AI platform that includes various APIs for parsing, extracting, splitting, and editing documents, aiming to convert unstructured data into structured formats [15][17][21]. Market Opportunity - The global market for document processing is estimated to exceed $100 billion, growing at over 40% annually, driven by the fact that over 80% of enterprise data is unstructured [9][36]. - Reducto AI positions itself as a "seller of shovels" in the AI gold rush, focusing on high-accuracy solutions for complex documents, which traditional OCR tools struggle to handle [38][39]. Product and Technology - Reducto's core technology, the Agentic OCR framework, allows for high accuracy in document parsing by employing a multi-step process that mimics human reading [17][21]. - The platform has achieved over 99% accuracy in document processing, significantly outperforming competitors like AWS and Google in specific use cases [39][41]. Growth Metrics - Reducto AI's Annual Recurring Revenue (ARR) grew from zero to over $1 million within six months of launching its core product, and it surpassed $5 million by October 2025 [22][23]. - The company processed hundreds of millions of pages of documents within a short timeframe, demonstrating its rapid scalability and market demand [23]. Competitive Landscape - The document processing market is competitive, with major players like Amazon and Google offering bundled services, but Reducto aims to carve out a niche by focusing on accuracy and specialized solutions [36][38]. - Challenges include potential competition from larger tech companies and emerging startups that may offer similar or superior capabilities at lower costs [40][41].