量子位

Search documents
亿万打工人在用的WPS,未来可能要重塑你的工作流
量子位· 2025-07-30 09:44
Core Viewpoint - The article highlights the launch of WPS AI 3.0 by Kingsoft Office, which integrates AI capabilities into traditional office software, enhancing productivity and document management through features like a knowledge base and multi-modal AI functionalities [4][5][9]. Group 1: WPS AI 3.0 Features - WPS AI 3.0 introduces the native Office document knowledge base, allowing users to upgrade cloud documents into a searchable knowledge base, addressing issues of fragmented knowledge retrieval and sharing [5][15]. - The knowledge base supports various document formats, including tables, PPTs, and PDFs, enabling seamless integration without the need for re-uploading [15][19]. - WPS Lingxi, the AI assistant, offers comprehensive functionalities such as AI writing, PPT generation, and semantic search, allowing users to handle diverse office tasks efficiently [9][22]. Group 2: Document Management and Retrieval - The knowledge base transforms document management from a manual process to an AI-driven one, automatically categorizing and archiving documents based on their content [13][16]. - Users can perform natural language queries to retrieve specific information, with the AI providing detailed answers and sourcing the information accurately [17][18]. - The system can handle complex documents, including large tables and various formats, ensuring high accuracy in content retrieval and management [19][20]. Group 3: AI Writing and PPT Generation - WPS Lingxi supports various writing tasks, from short social media posts to long reports, with pre-set templates that cater to specific writing needs [22][24]. - The AI can generate structured content, such as scripts and presentations, while allowing users to interactively refine and adjust the output [38][40]. - Users can edit documents directly within WPS, utilizing AI to optimize text and structure without losing the original content [30][35]. Group 4: Integration and User Experience - The integration of WPS Lingxi into the WPS Office suite allows for a seamless user experience, where users can interact with the AI in a familiar environment [54][56]. - The AI's ability to understand document formats and user intent enhances the editing process, making it more intuitive and efficient [55][62]. - Kingsoft Office's long-standing expertise in document processing underpins the effectiveness of WPS AI 3.0, positioning it as a leader in the AI office software market [57][64].
Qwen全面升级非思考模型,3B激活、256K长文、性能直逼GPT-4o
量子位· 2025-07-30 09:44
Core Viewpoint - The article highlights the rapid advancements and performance improvements of the Qwen3-30B-A3B-Instruct-2507 model, emphasizing its capabilities in reasoning, long text processing, and overall utility compared to previous models [2][4][7]. Model Performance Enhancements - The new model Qwen3-30B-A3B-Instruct-2507 shows significant improvements in reasoning ability (AIME25) by 183.8% and capability (Arena-Hard v2) by 178.2% compared to its predecessor [4]. - The long text processing capability has been enhanced from 128K to 256K, allowing for better handling of extensive documents [4][11]. - The model demonstrates superior performance in multi-language knowledge coverage, text quality for subjective and open tasks, code generation, mathematical calculations, and tool usage [5][7]. Model Characteristics - Qwen3-30B-A3B-Instruct-2507 operates entirely in a non-thinking mode, focusing on stable output and consistency, making it suitable for complex human-machine interaction applications [7]. - The model's architecture supports a context window of 256K, enabling it to retain and understand large amounts of input information while maintaining semantic coherence [11]. Model Series Overview - The Qwen series has released multiple models in a short time, showcasing a variety of configurations and capabilities tailored for different scenarios and hardware resources [12][18]. - The naming convention of the models is straightforward, reflecting their parameters and versions, which aids in understanding their specifications [14][17]. Conclusion - The Qwen3 series is positioned as a comprehensive model matrix, catering to diverse needs from research to application, and is ready to address various demands in the AI landscape [19].
从OpenAI离职创业到估值1700亿美元,Anthropic用4年时间引硅谷巨头疯狂押注
量子位· 2025-07-30 09:44
Core Viewpoint - Anthropic, the company behind Claude, is set to raise $5 billion in a new funding round, bringing its valuation to $170 billion, making it the second AI unicorn to reach a valuation of over $100 billion after OpenAI [1][2]. Funding and Valuation - In March, Anthropic's valuation was $61.5 billion, indicating a nearly threefold increase in less than six months [3][5]. - The latest funding round, led by Iconiq Capital, will significantly boost Anthropic's total funding to approximately $20 billion [8][16]. - Amazon, a major investor, is expected to participate in this funding round, further solidifying its position as Anthropic's largest investor with a total investment of $4 billion [9][14]. Competitive Landscape - The rapid growth of Anthropic's valuation puts pressure on competitors like OpenAI and xAI, both of which are also raising substantial funds for data centers and talent acquisition [4]. - OpenAI's latest valuation stands at $300 billion, while xAI aims for a valuation of $200 billion [4]. Product and Revenue Growth - Anthropic's Claude models, particularly Claude 3.7 Sonnet, have established a strong competitive edge in AI programming, outperforming GPT-4 in benchmark tests [20][22]. - The company generates 70-75% of its revenue from API usage, with significant earnings from token consumption, while traditional consumer services contribute only 10-15% [25][26]. - Annualized revenue has surged from $1 billion at the beginning of the year to $4 billion, with projections reaching $9 billion by year-end, driven by its advantages in code generation [27][28].
1.5B刷新数学代码SOTA!快手&清华精细化Token管理,LLM推理能力飙升
量子位· 2025-07-30 09:44
Core Insights - The article discusses a new approach called Archer, developed by a team from Kuaishou and Tsinghua University, which utilizes a small model with 1.5 billion parameters to outperform larger state-of-the-art (SOTA) models in various reasoning benchmarks [1][3][18]. Group 1: Methodology - Archer's success is attributed to the refined management of the model's learning process, allowing it to retain essential knowledge while being flexible in reasoning [2][21]. - The method employs a "dual-token constraint" strategy, where tokens are not split but are given customized training rules based on their characteristics [10][11]. - Tokens are categorized into low-entropy (knowledge-based) and high-entropy (reasoning-based) types, with different training constraints applied to each [17][21]. Group 2: Performance Metrics - In mathematical reasoning tasks, Archer achieved significant improvements, with an 18.1% increase in accuracy on AIME24 and a 10.3% increase on AIME25 compared to the original model [18]. - Archer surpassed existing SOTA methods like DAPO, solving 6.6% more problems on AIME24 and 5.2% more on AIME25 [18]. - For code generation, Archer also showed a 3.4% accuracy improvement on LiveCodeBench v5 and a 2.6% improvement on v6 compared to DAPO [19]. Group 3: Efficiency - Archer's training efficiency is notable, requiring only 1900 H800 GPU hours, significantly less than the 16000 H100 hours needed by Nemotron, demonstrating a cost-effective approach to achieving high performance [20]. Group 4: Key Insights - The core insight of Archer is the balance between knowledge stability and reasoning exploration, which is crucial for enhancing the model's capabilities [21][24]. - Experimental validation indicates that without proper constraints on low-entropy tokens, the model's knowledge can deteriorate, while excessive constraints on high-entropy tokens can hinder reasoning flexibility [24].
腾讯入局具身智能,宇树首批用上“大脑”
量子位· 2025-07-30 09:44
明敏 发自 凹非寺 量子位 | 公众号 QbitAI 不造硬件、不量产、不做商业化。 这是腾讯加入当下具身智能热潮的姿势。 那要做什么? 一个具身智能的 通用外接大脑 。而且不是端到端,是 模块化 提供能力。也就是各家机器人可以从中获取自己想要的部分能力。 效果be like,搭载了该大脑的宇树机器人,可以实时处理人类语音指令,闲聊、完成任务,还能判断自己能干什么、不能干什么。 比如它能看到桌子上比原先多了一个玩偶,但是它没有灵巧手,所以并不能拿起玩偶。 这就是具身智能Tarios平台,在WAIC 2025期间正式亮相。 它集成了目前腾讯在具身智能领域的软件能力,包括多模态、规划、感知算法,以及开发、仿真、数据等工具。 包括宇树、越疆、乐聚、帕西尼、擎朗、众擎等大热具身智能领域玩家,都已火速达成合作。 而且还没啥后顾之忧——腾讯再次强调了自己不下场做硬件本体、不搞量产、不搞商业化。 这一波直接格局打开,谁都能来当"腾讯系"机器人了(doge)。 爆火具身智能,需要"外接大脑" 首先来拆解一下Tairos平台本身。 它核心包含两个方面: 模型算法 云服务 模型层面主要包含三部分:多模态感知模型、规划大模型、感知 ...
o3出圈玩法“看图猜位置”,豆包也安排上了!还是人人免费用那种
量子位· 2025-07-30 06:06
Core Viewpoint - The article discusses the new visual reasoning feature of the Doubao APP, which enhances its ability to analyze images and provide contextual information, making it a versatile tool for users [1][4][66]. Group 1: Doubao APP Features - Doubao APP has upgraded its visual reasoning capabilities, allowing it to analyze images and provide detailed contextual information, such as identifying locations and historical timelines [4][8]. - The app can perform image searches and utilize various image analysis tools (zooming, cropping, rotating) to derive conclusions from images [7][50]. - Users can easily engage with the app by uploading images or taking photos to receive instant analysis and information [5][26]. Group 2: Practical Applications - Doubao APP can assist users in identifying objects or details within images, such as distinguishing between AI-generated and real images [11][20]. - The app can also help with educational tasks, such as solving complex math problems, and has been validated against human solutions [40][43]. - It can extract structured data from financial reports and other documents, enhancing productivity in both personal and professional contexts [46][49]. Group 3: Industry Trends - The article highlights a broader trend in the industry towards visual reasoning capabilities, with major models like OpenAI's o3 and o4-mini leading the charge [68][70]. - The development of multi-modal technologies supports the integration of visual reasoning into various applications, addressing both industry needs and user demands [72][75]. - The increasing prevalence of mixed media information necessitates advanced visual reasoning capabilities to improve information processing and understanding [76].
1.5B参数撬动“吉卜力级”全能体验,国产开源之光多模态统一模型,来了
量子位· 2025-07-30 04:48
AIGC 的范式,已经被悄然 改变。 从割裂地处理文本、图像、声音,到现在,大众在应用领域的反馈已经证明, AI需要以更接近人类认知的方式,融合多模态信息 。 新的技术趋势值得关注,也有人第一时间开源了对新范式的深入思考: 鱼羊 西风 发自 凹非寺 量子位 | 公众号 QbitAI 听说了吗,GPT-5这两天那叫一个疯狂造势,奥特曼怕不是真有些急了 (doge) 。 但有一说一,回顾上半年最火AI事件,GPT-4o带来的"吉卜力"风暴,还是热度TOP。 △ 数据来自微信指数 不仅由"万物皆可吉卜力"为始,GPT-4o生图功能被网友们疯玩至今,更重要的是,还引发了更深的技术思考: 昆仑万维已开源 多模态统一模型Skywork UniPic ,和GPT-4o呈现出类似的图像一体化能力,在单一模型里实现 图像理解、文本到图像生 成、图像编辑 三大核心能力的深度融合。 对生图提示词的理解力,是这样的: 提示词:两位寿司师傅在江户时代熙攘的街市投掷彩虹寿司。他们头顶的纸灯笼明灭闪烁。整个场景呈现出像素化的复古游戏画风。 把图片转换成吉卜力风格,也很有内味儿: 并且相比狂卷大参数量的同类模型,Skywork UniPic ...
第三届世界科学智能大赛圆满收官!开放多项真实数据,1.6万人共探产业场景关键科学问题
量子位· 2025-07-30 02:29
Core Insights - The third World Scientific Intelligence Competition was held in Shanghai, featuring 30 teams competing for awards in five major categories, with a total of 5 first prizes, 10 second prizes, and 15 third prizes awarded [1][3] - The competition aimed to select global talent in the field of AI for Science, with no restrictions on nationality or region, and attracted nearly 16,000 participants from around 30 countries and regions [1][4] Group 1: Competition Overview - The event was co-hosted by the Shanghai Institute of Scientific Intelligence and Fudan University, with support from various institutions including Alibaba Cloud and Shanghai Fosun Pharmaceutical [1] - The competition focused on high-value industrial scenarios, with real-world data sets provided for the challenges, such as aviation safety and renewable energy power forecasting [4][5] - A new "Physical AI track" was launched to address core technological challenges in space intelligence and reasoning models, promoting the application of AI technology [4] Group 2: Open Collaboration and Platform Development - The competition emphasized open-source principles, providing access to real data from industrial scenarios and offering computational resources and toolchain support for participants [5] - Outstanding models from the competition will be deployed on the newly launched Xinghe Qizhi Scientific Intelligence Open Platform, which aims to facilitate collaboration among scientists, AI researchers, and engineers [5] - The platform currently hosts over 200 scientific models across 12 disciplines and has accumulated more than 12PB of scientific data, attracting over 120 research teams [5] Group 3: Youth Engagement - The competition introduced a middle school category, attracting 331 teams from 146 schools in Shanghai, with an average participant age of around 14 years [7] - This initiative aims to enhance the youth training system and showcase the innovative potential of young participants in the field of scientific intelligence [7] Group 4: Future Directions - The organizing committee plans to continue leveraging the competition platform to launch more cutting-edge events focused on scientific intelligence, fostering a sustainable ecosystem for innovation and talent development [10]
我在WAIC看见的十大趋势
量子位· 2025-07-30 02:29
Core Viewpoint - The article highlights the unprecedented enthusiasm and advancements in the AI industry showcased at the Shanghai World Artificial Intelligence Conference (WAIC), emphasizing the transformative impact of DeepSeek and the emergence of various trends in AI technology and applications [3][4]. Group 1: Key Trends in AI - Trend 1: DeepSeek has fundamentally changed the perception of AI in China, with a growing belief in the potential for achieving AGI (Artificial General Intelligence) [6][7]. - Trend 2: New foundational large models are not only focused on state-of-the-art (SOTA) performance but also on reasoning, multimodality, and cost-effectiveness [8][11]. - Trend 3: Open-source large models have entered a new phase in China, with significant players like Tongyi Qianwen leading the way [17][18][28]. Group 2: Integration of Hardware and Software - Trend 4: The integration of chips and models is creating a fully domestic AI ecosystem, with a focus on collaboration between hardware and software [32][34]. - Trend 5: AI infrastructure is rapidly developing, with vertical industry models providing direct productivity benefits, as seen in sectors like energy and finance [50][60]. Group 3: Consumer-Focused Innovations - Trend 6: AI innovation is shifting towards consumer-facing products, with AI agents becoming a new focal point in various applications [66][81]. - Trend 7: The first wave of commercial AI terminals includes automobiles, headphones, and glasses, indicating a growing market for AI-integrated hardware [88][99]. Group 4: Robotics and Non-Transformer Architectures - Trend 8: The field of embodied intelligent robots is experiencing rapid growth, with advancements in capabilities and applications [112][134]. - Trend 9: Non-Transformer architectures are emerging from research into practical applications, showcasing innovative approaches in AI development [144][146]. Group 5: Competitive Landscape - Trend 10: The gap between China's AI capabilities and those of Silicon Valley has narrowed to approximately six months, highlighting China's unique advantages in resources and talent [150][155].
超越DeepSeek-R1,数学形式化准确率飙升至84% | 字节&南大开源
量子位· 2025-07-30 00:24
Core Viewpoint - The article discusses the significant advancements made by ByteDance's Seed team and Nanjing University in the field of mathematical formalization through the introduction of the CriticLean framework, which enhances the accuracy of converting natural language mathematical statements into formal code. Group 1: CriticLean Framework - The CriticLean framework has improved the accuracy of converting natural language mathematical statements to Lean 4 code from 38% to 84% [2][33]. - The framework innovatively places the evaluation model at its core, utilizing the CriticLeanGPT model trained through reinforcement learning to assess the conformity of formalized code to its original meaning [2][7]. Group 2: Challenges in Mathematical Formalization - The core challenge in mathematical formalization lies in accurately converting natural language descriptions into machine-verifiable formal code, which requires deep understanding and faithful representation of mathematical semantics [4]. - Existing research has made progress in generative models and compilation effectiveness, but significant bottlenecks remain in semantic alignment for complex problems, including semantic gaps and insufficient data diversity [5][6]. Group 3: CriticLeanGPT Model - The CriticLeanGPT model was trained using a dataset of 48,000 entries, enhancing its ability to evaluate semantic accuracy [10]. - The model can identify 12 common error types, with type errors accounting for 24.9% and mathematical representation errors at 23.8% [10]. Group 4: CriticLeanBench Benchmark - CriticLeanBench is the first benchmark focused on semantic evaluation in formalization tasks, designed to assess the model's ability to convert natural language mathematical statements into formally verified theorems [12]. - The benchmark includes 500 test samples, with CriticLeanGPT achieving an accuracy of 87%, surpassing other models like GPT-4o and Claude 3.5 [20][21]. Group 5: FineLeanCorpus Dataset - The FineLeanCorpus dataset consists of 285,957 high-quality formalization samples, covering 16 mathematical fields from high school to university level [26]. - Each sample in the dataset has undergone syntax checks and semantic validation, achieving an accuracy rate of over 84% through manual sampling [27]. Group 6: Performance Improvement - The application of the CriticLean framework in automated formalization processes has led to a significant accuracy increase from 38% to 84%, with the semantic evaluation contributing 30 percentage points to this improvement [33].