量子位
Search documents
一行代码不写,龙虾军团日夜为我打工赚钱
量子位· 2026-03-05 08:32
Core Viewpoint - The article discusses the innovative use of OpenClaw, a no-code AI tool, by Matthew Berman, showcasing how it can automate various business tasks and enhance productivity without requiring extensive coding knowledge [1][2]. Group 1: Automation and Efficiency - Matthew Berman has successfully implemented OpenClaw to handle tasks such as processing business emails, gathering client information, and managing project reminders, effectively creating a digital employee [3][4]. - The AI tool can evaluate potential clients by analyzing their websites, product reviews, and social media presence, generating a scoring report for incoming emails [17][18]. - OpenClaw is integrated with platforms like Gmail and HubSpot CRM, allowing it to draft responses and update transaction progress automatically [20][22]. Group 2: Security and Reliability - To ensure security, Berman has established a three-layer defense system for OpenClaw, including code cleaning, sandboxing emails, and using advanced AI models for scanning [23][24]. - A dual prompt stack is utilized to cater to different AI models, ensuring that instructions remain consistent and effective [27][30]. - The system includes a self-repair feature that logs errors and allows the AI to autonomously fix issues, enhancing reliability over time [36][57]. Group 3: Knowledge Management and Content Creation - OpenClaw assists in knowledge management by automatically collecting and vectorizing content from various sources, making it easily shareable within teams [45][46]. - The AI can generate creative content suggestions and analyze performance data from platforms like YouTube and TikTok, providing insights for content strategy [47][56]. - Berman has formed an AI advisory group that analyzes multiple business data sources and provides optimization recommendations daily [49].
88岁图灵奖得主,用Claude一小时破解30年数学悬案
量子位· 2026-03-05 08:32
Core Insights - The article discusses the remarkable achievement of Claude, an AI model, which solved a 30-year-old problem in graph theory in just one hour, impressing Donald Knuth, a renowned computer scientist and Turing Award winner [2][5][12]. Group 1: AI Achievement - Claude utilized structural approaches like "fiber decomposition" and "snake-like construction" to derive a universal construction algorithm applicable to all odd m values after only 31 explorations [4][9]. - The problem involved determining if all arcs in a three-dimensional grid graph with m^3 vertices could be perfectly decomposed into three non-overlapping Hamiltonian cycles [6][10]. - Claude's solution process demonstrated logical reasoning and the ability to learn from errors, marking a significant advancement in AI's problem-solving capabilities [12][14]. Group 2: Donald Knuth's Perspective - Knuth, who had been skeptical about generative AI, expressed his admiration for Claude's achievement, stating "Hats off to Claude" [5][12]. - He highlighted that Claude's work was not merely a black-box result but a clear demonstration of logical reasoning and mathematical discovery [12]. - Knuth's long-standing engagement with the problem dates back to his work on "The Art of Computer Programming," showcasing the depth of the challenge [5][6]. Group 3: Historical Context - Donald Knuth is a legendary figure in computer science, having won the Turing Award at the age of 36 for his foundational contributions to algorithm analysis [18][19]. - His seminal work, "The Art of Computer Programming," is considered one of the most important scientific works of the 20th century, alongside Einstein's "Theory of Relativity" [22]. - Knuth's ongoing work on this series, which began in 1962, reflects his dedication to the field, with plans for seven volumes [24][26].
阿里刚投的AI明星,3D大模型那种
量子位· 2026-03-05 08:32
Core Viewpoint - VAST has completed a $50 million Series A financing round, led by Alibaba and Hengxu Capital, with participation from other notable investors, indicating strong confidence in the company's strategic direction and growth prospects [1][2]. Group 1: Company Overview and Achievements - Founded in 2023, VAST has emerged as a leader in the multimodal AI field, focusing on the development of foundational models and application ecosystems [4]. - The company has developed a proprietary 3D foundational model that maintains industry leadership, collaborating with major enterprises like Alibaba, Tencent, and ByteDance, and has attracted over 650 million creators who have generated nearly 100 million 3D models [4][27]. - VAST has launched a new AI 3D model family, with the Tripo H3.1 model achieving industry-leading performance in key metrics such as input alignment, structural accuracy, texture quality, and generation speed [5][10]. Group 2: Technological Innovations - The Tripo P1.0 model redefines the algorithmic paradigm for AI 3D, capable of generating professional-grade 3D assets in just 2 seconds, significantly improving speed compared to existing solutions [5][20]. - VAST's algorithm team has made substantial advancements in AI 3D modeling, addressing long-standing challenges in precision and speed, thus enabling scalable user-generated content (UGC) [9][16]. Group 3: UGC Platform Development - VAST believes that lowering the creation threshold for interactive content will lead to a new phase of UGC, which is being validated by increasing user engagement with 3D generation tools [21][24]. - The company has built a robust creator ecosystem, engaging with developers and creators globally to explore the evolution of interactive content as production efficiency improves [25][27]. - VAST plans to accelerate the development of a UGC interactive content platform by 2026, integrating generation capabilities, distribution mechanisms, and interactive systems into a complete loop [31]. Group 4: Future Vision and World Model - VAST aims to transition from creating individual objects to generating entire worlds, positioning the world model as the ultimate form of a universal model, grounded in a native understanding of three-dimensional space [32][33]. - The company is focusing its core R&D resources on world models, leveraging a high-quality 3D and world model database and a talented AI and graphics team to establish a unique advantage in this field [32][41]. Group 5: Investor Perspectives - Investors express strong confidence in VAST's technology maturity and product readiness, highlighting its potential to drive efficiency in various industries, including automotive design and smart manufacturing [35][39]. - The consensus among investors is that VAST is well-positioned to lead the next generation of interactive content platforms, with a focus on democratizing 3D creation [37][38].
悬赏5000刀!148局AI斗蛐蛐世界杯官方战报出炉,全球赛邀你接棒来战
量子位· 2026-03-05 06:33
Core Viewpoint - The article discusses the differences in performance among AI large models, questioning whether their rankings truly reflect their capabilities in complex interactive scenarios, such as social deduction games like "Werewolf" [4][5]. Group 1: AI Model Competition - Taobao organized an "AI Werewolf World Cup," bringing together 12 top AI models to compete under a unified framework, emphasizing direct competition [7][12]. - The competition involved 150 rounds of gameplay, focusing on the models' reasoning abilities in a complex social deduction environment [10][17]. - The models included notable names like GPT, Gemini, and Qwen, showcasing the latest versions from various companies [9][19]. Group 2: Evaluation Metrics - The evaluation criteria for the competition included vote accuracy, divine skill efficiency, kill precision, and overall scores, providing a detailed profile of each model's capabilities [24][25]. - Vote accuracy measures the model's ability to identify the "werewolf" amidst misinformation, while divine skill efficiency assesses decision-making during critical game moments [28][29]. - Kill precision reflects the model's ability to collaborate and deduce the location of opponents, while werewolf win rates indicate the model's effectiveness in deception and social strategy [31][32]. Group 3: Insights from Gameplay - The competition revealed that some models struggled with advanced strategies, highlighting the limitations of even the most advanced AI in high-stakes scenarios [35]. - AI models exhibited a more polite and measured approach in conflict situations compared to human players, indicating a unique strategic style [36][40]. - The ongoing matches and results are available on the WhoisSpy.ai platform, which aims to evaluate AI performance in social reasoning and gaming contexts [41]. Group 4: Future Developments - The article mentions an upcoming international competition that invites global developers to participate, expanding the scope of AI model testing [46][47]. - The competition will allow developers to utilize provided templates to create agents, making participation accessible even for those without extensive experience [55][56]. - Incentives for the competition include cash prizes, with the first place receiving $5,000, encouraging innovation and continuous improvement among participants [63][64].
量子位编辑作者招聘
量子位· 2026-03-05 06:33
Core Viewpoint - The article emphasizes the ongoing AI boom and invites individuals to join the company "Quantum Bit," which focuses on tracking AI advancements and has established itself as a leading content platform in the industry [1]. Group 1: Job Opportunities - The company is hiring for three main directions: AI Industry, AI Finance, and AI Product, with positions available for both experienced professionals and fresh graduates [2][4]. - Positions are open for various levels, including editors, lead writers, and chief editors, with a focus on matching roles to individual capabilities [6]. Group 2: Job Responsibilities - **AI Industry Direction**: Responsibilities include tracking innovations in infrastructure, such as chips, AI infrastructure, and cloud computing, as well as interpreting technical reports from conferences [6][7]. - **AI Finance Direction**: Focuses on venture capital, financial reports, and capital movements within the AI industry, requiring strong analytical skills and a passion for interviews [11]. - **AI Product Direction**: Involves monitoring AI applications and hardware developments, producing in-depth evaluations of AI products, and engaging with industry experts [11]. Group 3: Benefits and Work Environment - Employees can expect a vibrant team atmosphere, opportunities for personal influence through original content creation, and professional mentorship from senior editors [6][11]. - The company offers competitive salaries and comprehensive benefits, including social insurance, meal allowances, and performance bonuses [6]. Group 4: Company Growth and Reach - By 2025, Quantum Bit aims to have over 2.4 million subscribers on WeChat and more than 7 million users across platforms, with a daily reading volume exceeding 2 million [12]. - The company is recognized as the top new media outlet in the AI and frontier technology sectors according to third-party data platforms [12].
企业级OpenClaw最强拍档来了!万亿参数的国产多模态大模型,刚刚开源发布
量子位· 2026-03-05 06:33
Core Viewpoint - YuanLab.ai has officially released the Yuan3.0 Ultra multimodal foundational model, which is one of only three trillion-parameter open-source multimodal models in the industry [1][2]. Group 1: Model Features and Capabilities - Yuan3.0 Ultra is designed for enterprise applications, optimizing training efficiency through a mixture of experts (MoE) architecture, achieving a parameter scale reduction from 1515 billion to 1010 billion while improving pre-training computational efficiency by 49% [2][18]. - The model introduces Localized Filtering Attention (LFA) to enhance semantic relationship modeling, resulting in higher precision compared to traditional attention structures [2]. - It excels in multimodal document understanding, retrieval-augmented generation (RAG), table data analysis, content summarization, and tool invocation, making it suitable for complex enterprise tasks [2][4]. Group 2: Performance in Specific Tasks - In evaluations like DocMatix and MMTab, Yuan3.0 Ultra outperforms leading models such as Claude Opus 4.6 and GPT-5.2 in understanding complex documents and tables, facilitating tasks like financial report analysis and contract review [6][8]. - The model demonstrates superior capabilities in multi-source information retrieval and integration, outperforming competitors in ChatRAG and SummEval evaluations, enabling comprehensive information processing in enterprise knowledge environments [8][10]. - Yuan3.0 Ultra shows exceptional performance in Text-to-SQL benchmarks like Spider, supporting efficient data querying and operational analysis for business decision-making [10][12]. Group 3: Training and Optimization Techniques - The model employs Layer-Adaptive Expert Pruning (LAEP) to dynamically identify and prune low-contribution experts during pre-training, optimizing computational resources while maintaining functional specialization [14][15]. - The training strategy focuses on fast-thinking reinforcement learning, enhancing reasoning efficiency by prioritizing high-information-gain steps and reducing unnecessary reflections [16][19]. - The model's architecture aims to evolve into a cognitive system with specialized structures, emphasizing the importance of optimizing learning and computational efficiency through expert differentiation [15][22]. Group 4: Open Source and Future Directions - Yuan3.0 Ultra is fully open-sourced, providing model weights, technical reports, and training methods to support community-driven training and industry customization [22][23]. - The model family will include various versions with parameters ranging from 40 billion to 1 trillion, with further developments expected to be released [23].
模型砍掉一大半,准确率反升15%!华科&阿里安全新研究实现ViT近乎无损的类特定压缩|ICLR'26
量子位· 2026-03-05 06:33
Core Viewpoint - The article emphasizes the limitations of large, general-purpose visual models in real-world applications, advocating for smaller, specialized models that are more efficient and better suited for specific tasks [1][2]. Group 1: Limitations of Large Models - Large visual models, while powerful, have high computational costs and are not optimal for deployment in resource-constrained environments [1][4]. - Many applications only require a focus on a few key target categories, making the extensive knowledge in general models unnecessary and counterproductive [1][8]. Group 2: Advantages of Customized Models - Customized models, described as "small and specialized," align better with practical needs, reducing deployment costs and enhancing long-term operational stability [2]. - The new paradigm proposed by Huazhong University of Science and Technology and Alibaba, named Vulcan, allows for the derivation of specialized models from general ones, focusing on key target categories while minimizing knowledge loss [3]. Group 3: Methodology of Vulcan - Vulcan introduces a "train-then-prune" approach, which is a departure from traditional methods that prune first and then train, thus preserving critical information related to target categories [3][13]. - The methodology includes two main components: Class-Centric Neuron Collapse (CCNC) and Truncated Nuclear Norm Regularization (TNNR), which work together to refine the model's focus on relevant information [15][16]. Group 4: Experimental Results - The Vulcan-derived models demonstrated a significant accuracy improvement of up to 15.12% on ImageNet tasks while reducing the model size to 20%-40% of the original [19]. - In various tests across different datasets and model sizes, Vulcan showed superior performance compared to existing structured pruning methods, achieving up to 13.92% higher accuracy in class-specific tasks [19][21]. Group 5: Practical Deployment - In practical deployment scenarios, Vulcan achieved inference speedups ranging from 1.23× to 3.02× and reduced memory usage by 20.59% to 76.47% on edge devices [22][23]. - The research indicates that understanding the internal knowledge structure of models is crucial for achieving reliable lightweight deployment [25].
阿里批准林俊旸离职,CTO周靖人接管千问!Gemini周浩确定加盟
量子位· 2026-03-05 04:13
Core Viewpoint - Alibaba's CEO Wu Yongming has officially approved Lin Junyang's resignation, which has raised questions about the sudden leadership changes within the company, particularly in the Tongyi Lab and the Qwen team [1][2][3][4]. Group 1: Leadership Changes - Lin Junyang's resignation was unexpected, and the company has not provided details regarding the reasons behind it or the appointment of a successor [3][12]. - The All Hands meeting revealed that the leadership transition was not planned and that there was a lack of communication regarding the organizational changes [5][8]. - The company aims to expand its team and resources, indicating that the adjustments are not related to internal political struggles [9][11]. Group 2: Team Dynamics - The Qwen team, which Lin Junyang led, reportedly had only a few hundred members, significantly smaller than comparable AI teams in other major companies [10]. - The team is currently facing a "resource-tight situation," which has been acknowledged by Alibaba Cloud's CTO [11]. Group 3: New Leadership Candidate - Zhou Hao, a senior researcher from Google DeepMind, has emerged as a potential successor to Lin Junyang, with expectations that he will lead the subsequent training efforts for the Qwen model [13][17]. - Zhou Hao has a strong background, having contributed significantly to the Gemini series of models at Google, which includes advancements in large language models [18][20][21]. Group 4: Zhou Hao's Background - Zhou Hao holds a bachelor's degree in Mathematics and Statistics from the University of Science and Technology of China and has pursued advanced degrees in the United States [23][24]. - His previous experience includes roles at Meta and Google, where he was involved in deep learning and conversational AI, contributing to major projects that have garnered significant academic citations [28][34].
华为重金押注的世界模型公司,新融了10个亿!
量子位· 2026-03-05 04:13
允中 发自 凹非寺 量子位 | 公众号 QbitAI 融资规模背后,是极佳视界在具身智能领域的全栈深耕: 具身基模、世界模型、原生本体、泛化场景 ,极佳视界正通过这"四位一体"的打法,叩开物理世界AGI的大门,加速通用机器人走进千家万 户。 产业巨头和顶尖资本集体押注,要做"物理世界的OpenAI" 资本正在加速押注具身智能的下一阶段。 近日,具身基模和通用机器人企业 极佳视界 ,最新完成近 10亿元Pre-B轮融资 。 本轮投资方包括: 其中, 中金资本、华强资本、财鑫资本、张科垚坤等作为老 股东持续超额重磅加持,庚辛资本中国担任财务顾问。 以大语言模型为代表的数字AGI已深度重塑了数字世界。 然而,全球GDP中仍有约50%根植于物理实体。极佳视界认为, 物理世界的基础模型突破 将是下一波变革的核心。 随着端到端架构、VLA (视觉-语言-动作模型) 及世界模型等技术的飞速演进,物理AGI正处于从技术积累转向产业爆发的临界点。 中芯聚源、上海半导体产投基金、临芯资本、星源资本、万林国际等 顶尖芯片和汽车产业资本 ; 中金资本、苏创投、华强资本、长江资本、光谷产投、锡山国投、金雨茂物、新鼎资本、领阳投资、财鑫 ...
GPT-5.3 Instant上线:ChatGPT终于不说教了
量子位· 2026-03-04 11:30
Core Viewpoint - OpenAI has released the GPT-5.3 Instant model, which focuses on improving user experience by reducing unnecessary verbosity and enhancing emotional intelligence in responses [1][9]. Group 1: Model Improvements - The GPT-5.3 Instant model is designed for speed optimization, making it suitable for quick queries, draft writing, and instant translations [2]. - The model has significantly reduced unnecessary disclaimers and awkward responses, resulting in more natural and fluent communication [5][10]. - It has improved contextual understanding, allowing it to respond more appropriately to user intent without excessive caution [21][15]. Group 2: Enhanced Capabilities - The model's internet search and writing capabilities have been enhanced, providing more reliable and timely information [25][29]. - It can now integrate web information with its existing knowledge base, offering deeper analysis on current topics [26]. - The writing style has become more nuanced, with improved detail and emotional resonance in generated content [31][34]. Group 3: Safety and Limitations - The hallucination rate has decreased by 26.8% when using internet searches and by 19.7% when relying solely on internal knowledge [39]. - The model has shown improvements in handling sensitive topics, with better performance in rejecting inappropriate requests [40]. - However, it still exhibits some limitations in non-English languages, where responses may feel stiff or overly literal [42]. Group 4: Future Developments - GPT-5.4 is expected to support a context window of 2 million tokens and introduce "stateful AI" technology for persistent memory across sessions [45][48]. - This upcoming model may also allow for pixel-level visual analysis by bypassing traditional image compression mechanisms [46][47].