Workflow
Z Potentials
icon
Search documents
Z Tech|ICLR 2026字节发布:从短句到篇章,DiscoX为长文翻译提供评测新范式
Z Potentials· 2026-02-12 02:27
Core Insights - DiscoX has developed a long-form translation evaluation dataset consisting of 200 texts, with an average length of 1,712 tokens, focusing on translation accuracy, logical and stylistic consistency across paragraphs, terminology precision, and adherence to professional writing standards [4][9][12]. Group 1: Evaluation Framework - Metric-S is introduced as a novel evaluation framework for long-form translation that does not require reference answers, allowing for interpretable results through a multi-agent evaluation system [4][5][16]. - The evaluation process includes three stages: instruction adherence check, comprehensive quality assessment across accuracy, fluency, and appropriateness, and a deduplication and attribution mechanism to ensure fair scoring [17][18][19]. Group 2: Advantages of DiscoX and Metric-S - DiscoX enables precise assessment of long-form translations, revealing the shortcomings of models in handling such tasks, and provides detailed multi-dimensional scoring [7][8]. - The framework reduces the need for expensive manual annotation by utilizing a no-reference evaluation approach, addressing the lack of standard reference translations in business documents and academic papers [8][12]. Group 3: Model Performance - The evaluation of 20 representative models on DiscoX shows that the leading model, GPT-5-high, scored 76.66, which is still below the human expert level of 80.16, indicating that high-quality long-form translation remains a significant challenge for current LLMs [23][24][25]. - The performance of models varies across dimensions, with GPT-5 excelling in accuracy, Kimi-K2 in fluency, and Claude-4 series showing high accuracy but lower fluency [29].
深度|AI教母李飞飞最新访谈:AI的下一个前沿不是语言,而是空间智能
Z Potentials· 2026-02-12 02:27
Core Insights - AI is described as a civilization-level technology that should involve the entire global population, not just a select few [4][5][6] - The rapid development of AI has exceeded many expectations, creating a sense of responsibility to ensure diverse perspectives are included in its evolution [6][7] - The importance of localizing AI to understand different languages and cultures is emphasized, as it affects who is included or excluded from AI systems [9][10] Group 1: AI's Impact and Responsibility - AI is a new generation of computing that will eventually be integrated into any device or system that relies on chips [5][8] - The societal implications of AI are vast, affecting various sectors such as healthcare, agriculture, and education [7][8] - The need for diverse voices in AI development is crucial, as it shapes how the technology will influence the real world [7][9] Group 2: Future of AI and Spatial Intelligence - Spatial perceptual intelligence is identified as a key frontier for AI, moving beyond language models to immersive, three-dimensional interactions [11][12] - World Labs aims to develop next-generation models that enable AI to reason, understand, and interact in three-dimensional spaces, impacting various applications [12][13] - The evolution of AI will allow for more active and agentic capabilities, enhancing its ability to model real-world interactions [13][14] Group 3: Market Applications and Innovations - The gaming industry is highlighted as a significant market for AI innovations, with World Labs already engaging developers to enhance creativity and innovation [14][15] - The tools developed by World Labs are currently being utilized by game developers, indicating a positive reception and potential for future growth in this sector [14][15]
速递|硅谷禁忌打破!Founders Fund等领投Anthropic200亿美元融资,同时押注OpenAI
Z Potentials· 2026-02-12 02:27
Anthropic 即将完成一轮超过 200 亿美元的融资,本轮由包括彼得·蒂尔的 Founders Fund 、 D.E. Shaw & Co. 及 Dragoneer Investment Group 在内的投资机 构共同领投。这家人工智能公司的投资方阵容将因此进一步扩大,本轮融资有望成为史上规模最大的初创企业融资轮次之一。 知情人士透露,本轮其他共同领投方还包括 Iconiq 和 MGX 。由于信息未公开,该人士要求匿名。彭博社此前报道称,投资机构 Coatue Management 和新 加坡 GIC 预计也将参与本轮融资。 彭博社报道称,此次交易中 Anthropic 的估值预计将达到约 3500 亿美元,此数值未计入募资金额,最早可能于本周公布。该估值较 Anthropic 前一轮估值 增长近一倍,使其稳居全球最具价值初创企业行列 。此前约五个月,该公司刚完成 130 亿美元融资——这反映出投资者对这家 AI 开发商的狂热追捧,其 去年营收年化增速已突破 90 亿美元。 本轮融资堪称硅谷与华尔街投资机构的 "全明星阵容"。知情人士透露,其他参投方还包括 Accel 、黑石集团、贝莱德、 TPG 、 ...
速递|GitHub前CEO创办Entire,创开发工具领域种子轮融资纪录,获6000万美元融资
Z Potentials· 2026-02-12 02:27
Entire 希望帮助开发者更好地应对 AI 编程智能体生成的海量代码。 当前流行的开源项目尤其面临着代码贡献建议激增的困扰 ,这些代码质量参 差不齐——可能包含设计拙劣甚至无法运行的 AI 生成代码。 GitHub 前首席执行官托马斯·多姆克,正如其领投方 Felicis 所宣称的,为一家开发工具初创公司筹集了史上最大规模的种子轮融资。 这家名为 Entire 的初创公司以 3 亿美元的估值筹集了 6000 万美元。 Entire 提供一款开源工具,帮助开发者更好地管理由 AI 智能体编写的代码。 Entire 的技术包含三个组成部分。其一是与 Git 兼容的数据库,用于统一 AI 生成的代码。 Git 是一种分布式版本控制系统,在企业中广受欢迎,并 被 GitHub 和 GitLab 等开源站点所使用。 另一个组成部分是所谓的 "通用语义推理层",旨在让多个 AI 智能体协同工作。最后的组成部分则是一个专为智能体与人类协作而设计的 AI 原生 用户界面。 Entire 公司推出的首款产品是一款名为 Checkpoints 的开源工具,它能自动将智能体提交至软件项目的每段代码与其生成背景。 包括提示词和对 ...
速递|冲刺“世界模型”:Runway获E轮3.15亿美金弹药,英伟达、Adobe共同押注
Z Potentials· 2026-02-11 04:08
图片来源: Runway 知情人士 透露, AI 视频生成初创公司 Runway 已完成 3.15 亿美元 E 轮融资,公司估值飙升至 53 亿美元,较之前水平近乎翻倍。 公司在其宣布融资的博客中表示,新资金将使 Runway 能够 " 预训练下一代世界模型,并将其引入新产品和行业 " 。 世界模型是一种能够构建环 境内部表征的人工智能系统,从而能够对未来事件进行规划,许多顶尖学者认为这类模型对突破大语言模型的局限至关重要。 据公司发言人透露,展望未来, Runway 计划运用新资金将其约 140 人的团队在研发、工程和市场拓展等岗位进行快速扩容。 本轮融资由 General Atlantic 领投,参投方包括英伟达、富达管理与研究公司、 AllianceBernstein 、 Adobe Ventures 、未来资产、 Emphatic Capital 、 Felicis 、 Premji 以及 AMD Ventures 。 参考资料: https://techcrunch.com/2026/02/10/ai-video-startup-runway-raises-315m-at-5-3b-valuatio ...
深度|Loopit 预示的交互生成未来,比Sora更革命的一步
Z Potentials· 2026-02-11 04:08
Core Insights - The article discusses the evolution of AI-generated content, highlighting the transition from static content production to interactive experiences with the introduction of Loopit, which allows users to create dynamic, interactive environments rather than just viewing content [2][5][13]. Group 1: Evolution of AI Content Generation - In 2024, Sora demonstrated that AI could generate realistic worlds, but it remained limited to linear narratives [2]. - Loopit, set to launch before the 2026 Spring Festival, represents a significant advancement by enabling the creation of interactive scenes that respond to user input, moving beyond simple content generation [2][5]. - This shift allows users to become "lightweight developers," defining behavior logic with simple commands, thus changing the relationship between content and users [5][11]. Group 2: Interactive Generation and User Engagement - Loopit introduces a new content form where interaction is central, allowing users to influence and evolve the experience through their actions [14][20]. - The platform's design emphasizes immediate feedback, enhancing user engagement by providing a sense of control and agency over the created environment [15][17]. - This interactive model contrasts with traditional content consumption, where users were passive recipients, thus redefining the creator-user dynamic [14][20]. Group 3: Overcoming the "Impossible Triangle" - The article identifies a structural dilemma in interactive content creation, termed the "impossible triangle," which struggles to balance high freedom, high quality, and low barriers to entry [21]. - Loopit addresses this challenge by simplifying the creation process, allowing users to generate interactive scenes with minimal input, thus broadening creative possibilities [21][24]. - Advances in technology, such as cloud rendering and lightweight engines, enable high-quality visuals on mobile devices, further enhancing user experience [21]. Group 4: Future of Content and Experience - The future landscape suggests a shift from traditional content consumption to immersive experiences, where users actively participate in creating narratives [24]. - Loopit symbolizes a transition from static content to interactive systems, positioning users as "Prompt Engineers" who shape their experiences [24]. - This evolution indicates a potential decline in the relevance of traditional content formats, emphasizing the importance of user interaction and experience over passive consumption [24].
速递|OpenAI重大创收机遇:扩张电商业务,迁移支付数据直面税务合规深水区
Z Potentials· 2026-02-11 04:08
亚马逊和其他大型市场平台多年来一直在争论,何时应由市场平台(而非个体卖家)负责征收销售税。与此同时,各州通过法院裁决越来越多地将这一责 任转移到市场平台身上。 OpenAI 一直将在 ChatGPT 内购物吹捧为一个重大的商业机遇,因为它正试图筹集数百亿美元的新资金。与此同时,该公司仍在完善一些线上商务的基本 操作。 这包括找出处理州销售税的最佳方式 ——两位曾与 OpenAI 商务团队交流过的人士透露,负责其内部商务的人员尚未决定应如何处理通过其平台进行购物 时销售税的收取问题。 OpenAI 去年底开始在 ChatGPT 内部添加结账功能,通过应用内直接销售来自 Etsy 或 Shopify 等电商平台商家的商品。 这些企业负责处理交易流程,包 括销售税相关的大部分工作。但若要让购物功能真正形成规模, ChatGPT 可能需要引入更广泛的商品品类(包括大型品牌),这可能迫使其承担更多交易 处理工作——包括销售税的代收代缴。 这可能意味着要建立自己的税收代征代缴能力,并增设税务合规团队。如果未来 OpenAI 真的建立起大规模的购物业务,还可能面临各州税务稽查。其他线 上公司就曾因各州认定其应代收销售税,而 ...
速递|Anthropic的最新200亿美元融资,或最快于下周敲定
Z Potentials· 2026-02-10 02:07
图片来源: Anthropic 据知情人士透露, Anthropic 正在敲定一轮融资的最终细节,该轮融资预计筹集逾 200 亿美元,最早可能于下周完成。 知情人士称,这家 OpenAI 的竞争对手最初计划筹集 100 亿美元,但由于投资者兴趣远超预期, 目前正以 3500 亿美元估值推进超过原定目标两倍以上的融 资。 因相关细节未公开,知情人士要求匿名。 彭博新闻社此前报道 , Anthropic 在本轮融资中已获得 Coatue Management 、新加坡主权财富基金 GIC 及 Iconiq Capital 分别超过 10 亿美元的出资承诺, 此外战略投资者英伟达公司和微软公司的投资金额可能高达 150 亿美元。 此次最新融资轮将使 Anthropic 的估值较此前水平接近翻倍,距离该公司筹集 130 亿美元资金仅过去五个月——这一迹象反映出投资者对这家 AI 初创企业 的狂热追捧,其年化营收增速持续飙升,去年夏季已突破 90 亿美元大关。 参考资料: 本周 Anthropic 迎来高光时刻,发布了专为企业工作流程自动化优化的新型 AI 模型,引发软件与金融服务板块数十亿美元规模的抛售潮 。过去一年 ...
Z Potentials|沈俊潇:从 Meta 出走,剑桥博士创立 Memories.ai,获 Samsung Next、Susa Ventures 千万美元押注
Z Potentials· 2026-02-10 02:07
Core Insights - The article emphasizes the importance of visual long-term memory in AI, arguing that understanding the world requires more than just intelligence; it necessitates memory capabilities [1][2] - Memories.ai aims to create a foundational system for visual long-term memory, focusing on encoding video into structured data that can be efficiently retrieved and stored [2][10] - The company believes that the future of AI will require a system that understands human context and preferences, acting as a bridge between humans and various agents [8][18] Group 1: Company Vision and Technology - Memories.ai is developing the Large Visual Memory Model (LVMM), which transforms video into AI-consumable structured representations, enabling efficient retrieval and long-term storage [2][10] - The company differentiates itself by focusing on memory rather than intelligence, addressing the limitations of current AI systems that primarily rely on text-based memory [9][15] - The technology aims to provide a comprehensive understanding of the world, akin to human perception, rather than just processing text [1][25] Group 2: Market Position and Applications - The company is targeting three main business directions: consumer-grade AI hardware with cameras, enterprise-level AI hardware for security and operations management, and long-term memory systems for humanoid robots [22][21] - Current applications include partnerships with security companies to enhance real-time monitoring and behavior modeling, demonstrating the practical value of visual memory systems [26][27] - The company envisions becoming a centralized visual memory platform, providing unified video storage, understanding, and management capabilities for various industries [28][30] Group 3: Funding and Talent Strategy - Memories.ai has successfully raised over $8 million in seed funding, with notable investors including Samsung Next and Susa Ventures, which supports its technology development and market expansion [30][31] - The company emphasizes a focused approach, concentrating solely on visual memory and video encoding, avoiding distractions from hardware development [32][33] - By offering competitive compensation packages, the company aims to attract top-tier research talent, which is crucial for advancing its technology and product development [31][32]
速递|红杉再领投,一年内实现了从30亿到110亿美元,法律AI初创Harvey融资2亿美元
Z Potentials· 2026-02-10 02:07
据悉, Harvey 正在以 110 亿美元的估值进行融资,距离其估值达到 80 亿美元仅过去数月 法律 AI 初创企业 Harvey 的增长势头似乎无法阻挡,风险资本正持续向其注入资金。据《福布斯》报道,知情人士透露,该公司正就新一轮 2 亿 美元融资进行谈判,由红杉资本与新加坡政府投资公司领投,投后估值达 110 亿美元。 若本轮融资完成, Harvey 的估值将在数月内飙升 30 亿美元。去年 12 月,该公司确认已在秋季完成由 Andreessen Horowitz 领投的 1.6 亿美元融 资,投后估值为 80 亿美元。 回顾今年 6 月, Harvey 宣布完成由凯鹏华盈与 Coatue 领投的 3 亿美元 E 轮融资,估值达 50 亿美元。 此前数月( 2025 年 2 月),该公司刚以 30 亿美元估值完成了由红杉资本领投的 3 亿美元 D 轮融资。 这家为律师事务所提供 LLM 人工智能支持的初创公司,截至 2025 年底实现了 1.9 亿美元的年经常性收入( ARR ),创始人兼首席执行官温斯顿· 温伯格在领英上透露 。 该数据较去年 8 月的 1 亿美元 ARR 实现大幅增长( 具体取决 ...