Workflow
Artificial Intelligence
icon
Search documents
智谱开源OCR!测完我把手机里的扫描软件都卸了......
量子位· 2026-02-11 12:49
Core Insights - The article discusses the capabilities and performance of the GLM-OCR model, highlighting its competitive edge in the OCR technology landscape, particularly in complex scenarios like handwriting and table recognition [1][39]. Performance Comparison - GLM-OCR outperforms several competitors in various OCR tasks, achieving a document parsing accuracy of 94.6% on OmniDocBench V1.5, surpassing PaddleOCR and others [2]. - In text recognition, GLM-OCR achieves 94.0% accuracy, significantly higher than some competitors like Deepseek-OCR2, which only reaches 34.7% [2]. - For formula recognition, GLM-OCR scores 96.5%, indicating strong performance in recognizing mathematical expressions [2]. - The model also excels in table recognition, with an accuracy of 85.2% on PubTabNet, outperforming many alternatives [2]. Practical Applications - GLM-OCR is particularly effective for structured documents such as Word, PPT, and academic papers, as well as for recognizing clear handwriting, receipts, and scanned contracts [3][4]. - The model demonstrates strong capabilities in recognizing handwritten forms, achieving an accuracy of 86.1% [4]. - It can accurately extract information from various documents, including meeting minutes and whiteboard notes, making it suitable for everyday work scenarios [3][4]. User Experience - Users report a generally positive experience with GLM-OCR in standard document parsing tasks, although challenges remain with unclear handwriting and complex layouts [4][12]. - The model's ability to handle low-quality inputs is commendable, with a recognition accuracy of around 96% for mixed content, although some errors were noted in specific cases [13][29]. Structural Extraction - GLM-OCR is capable of structured information extraction, producing outputs in standard JSON format from various documents, which is beneficial for applications like invoicing and identification [36][38]. - The model's performance in structured extraction improves significantly when clear prompts are provided, indicating its adaptability to user requirements [38]. Industry Trends - The OCR technology market is rapidly evolving, with new models like GLM-OCR emerging to meet increasing demands for efficiency and accuracy [39][40]. - The trend towards smaller model parameters (0.07B to 0.9B) is making deployment easier and more cost-effective for users [51]. - Enhanced output quality and reduced processing times are becoming standard expectations in the OCR industry, benefiting users across various sectors [51].
9B端侧开源模型跑通百万上下文,面壁全新稀疏-线性混合注意力架构SALA立功了!
量子位· 2026-02-11 12:49
henry 发自 凹非寺 量子位 | 公众号 QbitAI 最强的大模型,已经把scaling卷到了一个新维度: 百万级上下文 。 几天前,Claude Opus 4.6发布,让人第一次真切感受到了百万上下文的涌现能力—— 单次吃进50万字中文内容、实现跨文档法律分析、多轮Agent规划…… 此情此景,用户火速用脚投票,华尔街更是直接给出K线回应。 与此同时,基于SALA注意力架构的模型 MiniCPM-SALA 也将一并开源。 除此之外,面壁还以OpenBMB社区名义,联合SGLang与NVIDIA发起 2026稀疏算子加速大奖赛(SOAR) ,将这套scaling能力直接交到 开发者手中,推动端侧Agent部署的性能突破。 Linear-Sparse混合注意力架构 太长不看,咱直接说重点—— 面壁这次全新的 线性与稀疏注意力混合架构SALA(Sparse Attention-Linear Attention,SALA) ,究竟是怎么个混合法呢? 简单来说,这套架构将 75%线性注意力(Lightning Attention) 与 25%稀疏注意力(InfLLM v2) 结合,并通过 混合位置编码HyPE ...
Europe's OpenAI Rival Mistral Bets $1.4 Billion On Swedish AI Infrastructure Buildout - ASML Holding (NASDAQ:ASML), Alphabet (NASDAQ:GOOG)
Benzinga· 2026-02-11 12:49
Core Insights - Mistral AI has announced a €1.2 billion ($1.43 billion) investment in collaboration with EcoDataCenter to enhance Sweden's digital infrastructure [1] - This investment represents Mistral AI's first AI infrastructure project outside of France, aimed at establishing an AI-centric data center in Borlänge, Sweden [2][3] - The facility is expected to commence operations in 2027 and will facilitate the development and deployment of Mistral's next-generation AI models [3] Company and Industry Developments - The partnership will focus on creating AI-specialized data centers, advanced computing capacity, and localized AI capabilities [2] - Mistral's CEO emphasized that this investment is a significant step towards building independent AI capabilities in Europe [3] - The initiative aligns with Europe's broader strategy to enhance its technological infrastructure to compete with U.S. tech giants amid increasing geopolitical tensions [3]
Here's how much Palantir insiders have dumped in PLTR shares in 2026
Finbold· 2026-02-11 12:36
Core Insights - Palantir Technologies insiders have sold over $9 million worth of stock in 2026, continuing a trend amid the stock's rally [1] - The stock has experienced volatility, with a recent decline of over 2% in a single day and nearly 22% year-to-date [6] Insider Transactions - Director Alexander Moore sold 20,000 shares on February 2, 2026, for $2,949,000, leaving him with 1,172,978 shares [2] - Director Lauren Elaina Stat Friedman sold 400 shares on February 2, 2026, for $60,456, reducing her holdings to 58,287 shares [3] - Officer Ryan Taylor sold 12,000 shares on January 2, 2026, for $2,176,200, retaining 239,409 shares [4] - The total insider sales for 2026 amount to $9,148,246 across five disclosed transactions [4] Financial Performance - Palantir's Q4 2025 revenue rose 70% year-over-year to $1.41 billion, surpassing expectations of $1.33 billion [8] - Adjusted earnings per share for the quarter were $0.25, exceeding the forecast of $0.23 [8] - U.S. commercial revenue increased by 137% to $507 million, while U.S. government revenue grew by 66% to $570 million [8] - Full-year 2025 revenue reached $4.48 billion [8] Future Guidance - Management projects 2026 revenue between $7.182 billion and $7.198 billion, indicating approximately 61% annual growth [9] - U.S. commercial revenue is expected to exceed $3.144 billion, reflecting at least a 115% increase [9] Market Sentiment - Despite strong growth in the AI sector, Palantir's premium valuation raises concerns about its exposure to market volatility and potential slowing momentum [9] - Michael Burry identified a head-and-shoulders pattern in Palantir's stock, suggesting a potential downside to the low $50s, indicating a possible drop of up to 60% from current levels [10]
“世界正处于危险中!”Anthropic AI安全负责人警示后官宣离职
3 6 Ke· 2026-02-11 12:36
Core Insights - The departure of Mrinank Sharma, the senior AI safety lead at Anthropic, raises concerns about the direction of AI development and the underlying values guiding the industry [1][4][17] - Mrinank's resignation reflects deeper worries about the interconnected crises facing humanity, suggesting a need for a reevaluation of ethical considerations in AI [9][10][11] Group 1: Departure Reasons - Mrinank cited a conflict between internal pressures and the core values emphasized by the company, indicating a struggle to align actions with principles [4][11] - He expressed a desire to contribute in a way that aligns with his inner values and principles, leading to his decision to leave [12][13] - The concept of "poly-crisis" and "meta-crisis" was introduced, highlighting the complex challenges humanity faces beyond just AI or biological threats [9][10] Group 2: Achievements at Anthropic - During his two years at Anthropic, Mrinank focused on the phenomenon of AI "sycophancy," exploring why models cater to user preferences even when incorrect [6] - He developed defense mechanisms against AI-assisted bioterrorism risks and implemented internal transparency measures to ensure values were integrated into the organization [7] - His final research questioned whether AI assistants could diminish human qualities, reflecting on the broader implications of AI on human judgment and values [8] Group 3: Future Aspirations - Mrinank has not disclosed his next steps but has chosen to embrace uncertainty, indicating a shift towards a more humanistic approach [14][15] - He plans to pursue a degree in poetry, emphasizing the importance of understanding meaning and relationships in a technology-driven world [15] - His future focus will include guiding, coaching, and community building, transitioning from a technical safety role to one that fosters deeper human connections [15]
Seedance爆火背后:真假难辨和个人隐私受关注,AI创作如何守住边界?
YOUNG财经 漾财经· 2026-02-11 12:32
Core Viewpoint - The article discusses the rapid rise of Seedance 2.0, an AI video generation model by ByteDance, highlighting its capabilities and the concerns surrounding authenticity and personal privacy in AI-generated content [2][3]. Group 1: Seedance 2.0 Features and Impact - Seedance 2.0 has gained significant attention for its ability to generate high-quality videos from multi-modal inputs, including text and images, marking a leap in AI's understanding of multi-modal information [2][3]. - The product is described as a "game-killer" by industry experts, indicating its potential to redefine video production and editing standards [3]. - The model's ability to produce videos with professional-level editing quality without precise instructions on cuts suggests a blurring line between AI-generated and human-created content [5]. Group 2: Concerns and Recommendations - There are rising concerns about the potential for fake videos to create trust issues, especially as AI-generated content becomes indistinguishable from real footage [4][5]. - Experts recommend that individuals educate their families about the risks of AI-generated content, emphasizing the need for cross-verification of video authenticity [4]. - The platform has implemented restrictions on using real human images or videos as references to mitigate misuse, reflecting a responsible approach to AI development [7][8]. Group 3: Industry Context and Challenges - The article notes that the AI industry is grappling with balancing technological innovation and compliance with data protection and copyright laws [7][8]. - The training of large models like Seedance 2.0 relies on vast amounts of publicly available data, raising questions about content ownership and privacy [8]. - The ongoing development of AI technologies often outpaces the establishment of industry regulations, necessitating a collaborative effort to address privacy and security concerns [8].
Alset AI's Lyken.AI Signs Formal Cloud Compute Services Contract with Leading Multinational Technology and Telecommunications Company, Commences Service Delivery
Accessnewswire· 2026-02-11 12:30
Core Insights - Alset AI Ventures Inc. is focused on advancing innovation through strategic investments and cloud computing solutions [1] - Cedarcross Technologies Inc., operating under the brand Lyken.AI, has secured a purchase order for cloud compute services with a leading multinational technology and telecommunications company [1] - The formal contract for the purchase order was executed on February 6, 2026, following the initial announcement on February 3, 2026 [1] Financial Impact - The expected annualized recurring revenue from this contract is projected to be $1 million [1]
国产AI又刷屏,世界迎来Seedance时刻
21世纪经济报道· 2026-02-11 12:23
先给大家看一段视频,你们觉得拍这么一段得花多少钱多长时间? 如果我说,这根本不是实拍,而是AI在几分钟内"一键生成"的,你敢信吗? 春节前,当大家还在忙着抢红包的时候,字节跳动毫无征兆地扔出了一颗"王炸"——新一代视 频生成模型Seedance 2.0。 AI生成视频不是什么新鲜事,但过去可能得花不菲的价钱,反复生成几十次,才能赌到一个勉 强能用的片段。但Seedance 2.0不一样,它号称"一条过", 一次生成的视频可用率据说能达到 惊人的90%。 光能看还不够,它还能把你的想法,直接变成"导演级"的叙事,知道什么时候该特写,什么时 候该拉远,还能让不同镜头里的主角保持一致。影视飓风的Tim在体验后都抑制不住兴奋地表 示,模型在摄像机运动、分镜连续性以及音画匹配上都非常出色。 这意味着什么?这意味着,导演、摄影师、剪辑师、配乐师……这个传统影视工业链条,正在 被一个模型"打包压缩"。有专业摄影人士分析,如果 人工制作一段十几秒打斗视频,涉及CG 制作、动作捕捉、3D建模、后期合成等,需要跨部门协作数周甚至一两个月。而现在,这个 过程被压缩到了几分钟。 另一方面,我们也不能忽视背后的风险。Tim发布的实测视 ...
一人公司开始在北京流行
21世纪经济报道· 2026-02-11 12:23
然而近两年,人工智能带来的商业想象空间还在持续扩大。"一人公司"(OPC,One Person Company),开始在北京流行。 从2025年底至今,北京、上海、广东等地,通过发布相关方案,或打造专门服务OPC企业的创 业园区,吸引AI领域人才。寸土寸金的北京,也是AI人才聚集度最高的城市,为争夺AI时代 的"超级个体"聚集,北京海淀区和亦庄(北京经济技术开发区)都拿出了充分的诚意,以招揽 超级个体创新企业在各自的OPC社区聚集。 记者丨周慧 编辑丨张伟贤 视频|杨浩凯 "未来将出现一人创立的十亿美元公司。"当OpenAI CEO萨姆·奥尔特曼在2024年初抛出这个预 言的时候,听起来还像天方夜谭。 北京的另一个科创企业聚集地的亦庄,也发布了对OPC业态的召集令。1月31日,北京人工智 能产业创新发布会暨模数OPC社区生态伙伴大会在亦庄举办。会上,北京经开区发布《关于进 一步加快建设全域人工智能之城的实施方案(2026—2027年)》,宣布启动建设模数OPC社 区及其5A支持体系,与20家OPC企业现场签约。 亦庄的模数OPC社区实体空间位于经开区通明湖信息城,首期建筑面积达3000平方米,远期规 划10, ...
知鱼智联CTO董云鹏:发布“里链”AI智能体集群,以AI原生技术重构空间数智化未来形态
Zhong Guo Fa Zhan Wang· 2026-02-11 11:45
Core Insights - The forum highlighted the importance of AI-native technologies in reshaping spatial intelligence systems, with the launch of the "LiChain" AI agent cluster product by Zhiyu Zhili [1][3] - The evolution of spatial intelligence is now centered around AI-native approaches, emphasizing the integration of industry knowledge, AI tools, and human-machine collaboration [3][10] Company Developments - Zhiyu Zhili has developed the "LiChain" AI agent cluster, which features three core capabilities: intelligent computing power scheduling, industry model refinement, and collaborative agent networks [6][10] - The company has achieved over 70% code auto-generation in its AI-native reconstruction, transforming its development team to focus on architecture design and effect validation [10] Industry Trends - The current phase of spatial intelligence is moving beyond simple technology stacking to creating "spatial intelligent agents" with perception, analysis, decision-making, and interaction capabilities [10] - The "Double Hundred Star Chain" ecological co-construction initiative aims to unify standards in spatial intelligence, involving multiple industry partners to foster collaborative development [10][12] Technological Innovations - Zhiyu Zhili's AI platform addresses key technical pain points by ensuring data credibility and usability, which are essential for effective AI model training [12] - The company employs domestic technology to safeguard data sovereignty and security, enhancing the reliability of its AI applications [12] Future Outlook - The future of spatial intelligence is seen as a collaborative coexistence between humans and machines, allowing workers to focus on creative and complex decision-making tasks [12][13] - As digital technologies mature, spatial intelligence is expected to play a crucial role in driving industrial transformation and upgrading [13]