Workflow
量子位
icon
Search documents
人类击败OpenAI守住编程冠军!10小时激战两次反超,AI最后关头功亏一篑
量子位· 2025-07-17 07:04
白交 发自 凹非寺 量子位 | 公众号 QbitAI 10小时激战!人类最后关头实现超越,获得编程总决赛冠军~ OpenAI 在大部分比赛中都排名第一,本以为就这样了。人类开始反超,结果还剩1小时20分钟的时候,OpenAI又重新领先。不过还是没有 坚持到最后。 | | Standings Exhibition with OpenAI | | | --- | --- | --- | | Rank | User | Score | | 1 | OpenAIAHC | 43542614363 | | 2 | Psyho | 42420277629 | | 3 | terry_u16 | 34248482621 | | 4 | nikaj | 33740582721 | | 5 | saharan | 31754963614 | OpenAI总裁Greg Brockman发来贺电,中间还夹带私货:OpenAI位居第二。 此时获得冠军的人类表示 要累死了 。 因为过去三天我估计只睡了10个小时,现在都快撑不住了。 而原本始终保持领先优势的OpenAI,最终屈居第二。 在刚刚落幕的AtCoder世界巡回总决赛上,12名 ...
Claude Code出逃的主创又回来了!Anthropic:过去俩月我收入暴涨5.5倍,别走
量子位· 2025-07-17 07:04
Core Viewpoint - The article discusses the rapid return of key personnel Boris Cherny and Cat Wu to Anthropic from Cursor, highlighting the competitive landscape in Silicon Valley and the implications for Anthropic's valuation and growth potential in the AI sector [1][6][7]. Group 1: Personnel Movements - Boris Cherny and Cat Wu, key figures at Claude Code, were initially recruited by Anysphere, the company behind Cursor, where they were set to develop "agent-like" functionalities [2][4][5]. - Just two weeks after their departure, both were lured back to Anthropic, indicating the company's strong position in retaining talent amidst fierce competition [6][7]. Group 2: Valuation and Financial Performance - Anthropic is reportedly in discussions for a new funding round with a target valuation of $100 billion, which would mark a significant increase from its previous valuation of $58 billion just four months prior [8][9][10]. - The company aims to improve its profitability metrics, with current gross margins from direct sales of AI models around 60%, moving towards a target of 70% [12][19]. Group 3: Revenue Growth and Market Strategy - Anthropic's revenue has seen a fourfold increase in the first half of the year, with annualized revenue exceeding $4 billion [20]. - The company is pursuing a "model-as-a-service + vertical solutions" strategy, offering tailored AI solutions across various industries, including finance, law, and healthcare [15][19]. Group 4: Product Development and User Engagement - The launch of Claude Code has significantly boosted user engagement, with a 300% increase in active users and a 5.5-fold revenue growth since the release of the Claude 4 series [21][26]. - Anthropic has introduced a comprehensive analytics dashboard for Claude Code, allowing enterprises to track their AI spending and usage metrics effectively [24][25]. Group 5: Investment and Future Prospects - Amazon is reportedly considering a new multi-billion dollar investment in Anthropic, potentially making it the largest shareholder, following a previous investment of $4 billion [28][31]. - This investment reflects a broader trend where companies are recognizing the long-term profitability potential of AI technologies beyond initial hype [32].
苹果向英伟达生态妥协了!MLX框架主动适配CUDA
量子位· 2025-07-17 05:52
一水 发自 凹非寺 量子位 | 公众号 QbitAI 苹果向英伟达生态妥协了! 最新消息,苹果之前特意为端侧AI模型训练推出的 MLX框架 , 主动增加了CUDA支持 。 消息一出即在Hacker News引发热烈讨论: 要知道苹果一直以来都以"封闭"著称,但随着英伟达CUDA生态在AI开发领域占据绝对主导地位,苹果这下也不得不转变姿态了。 再加上英伟达市值创下前无古人的4万亿美元新纪录,以及最近释出的一系列利好消息,苹果选择避其锋芒也就不难理解。 可以说,苹果这就是明晃晃地借了英伟达东风,以进一步抢夺AI市场。 CUDA太强,不得不拥抱 为啥要拥抱CUDA?没啥,太强了,苹果自己也这么说。 官方理由如下: (1) 统一内存支持 :CUDA提供统一内存机制,便于不同设备间的数据共享与迁移,提升开发效率和性能表现。 (2) 跨平台部署需求 :英伟达硬件在学术研究和大规模计算中应用广泛,支持CUDA能让开发者在Mac上本地开发测试,随后无缝部署到 配备英伟达GPU的服务器或超级计算机上。 而通过让MLX框架主动适配CUDA, 今后苹果开发者也能利用英伟达GPU训练模型 。 其本质是增加了对CUDA的后端支持,方便 ...
云计算一哥,刚刚重新定义了AI Agent的玩法
量子位· 2025-07-17 05:52
Core Viewpoint - Amazon Web Services (AWS) has redefined the deployment of AI Agents in production environments with the launch of Amazon Bedrock AgentCore, a comprehensive toolkit for building enterprise-level AI Agents [3][19]. Group 1: Amazon Bedrock AgentCore - AgentCore simplifies the development of AI applications by providing a unified management system for various components, making the process more efficient [5][16]. - It includes seven core services that address the complexities of deploying AI Agents, likened to a fully furnished apartment ready for occupancy [6]. - The services offered by AgentCore include Runtime, Memory, Observability, Identity, Gateway, Browser, and Code Interpreter, each designed to enhance the functionality and security of AI Agents [8][9][10][11][12][13][14]. Group 2: AI Agents and Tools Marketplace - AWS has introduced a new category in its Marketplace for AI Agents and tools, allowing customers to easily find solutions by describing their use cases in natural language [24]. - This initiative aims to facilitate the rapid deployment and testing of AI Agents in various business scenarios [69]. Group 3: Amazon Nova and Kiro - AWS has launched Amazon Nova, which allows customization of model training lifecycles, enhancing the flexibility of AI applications [26][29]. - Kiro, a new AI programming tool, enables users to transform ideas into functional software through a structured process, streamlining the development workflow [49][51][65]. Group 4: S3 Vectors - Amazon S3 Vectors is introduced as a cloud storage service optimized for large-scale vector datasets, reducing storage costs by up to 90% [38][40]. - It supports efficient querying and integration with other AWS services, enhancing the capabilities of AI Agents in data management [47]. Group 5: Market Trends and Future Outlook - A significant shift towards AI Agents is observed, with over 50% of companies deploying them in production environments, and Gartner predicts that by 2028, 33% of enterprise software will incorporate Agentic AI [71][72]. - The emphasis on AI Agents reflects a broader trend in technology, with expectations that they will transform work and life in ways comparable to the advent of the internet [73].
教程 | 如何做出 X 上爆火的 AI 蓝图动画
量子位· 2025-07-17 05:52
Core Viewpoint - The article discusses the innovative use of Midjourney style codes, specifically the Sref Code, which allows users to apply preset visual styles to their creations easily, enhancing consistency and efficiency in artistic projects [8][9][13]. Group 1: Midjourney Style Codes - Midjourney style codes, such as --sref, enable users to apply specific visual styles to their prompts without extensive descriptions, ensuring a uniform aesthetic across multiple images [9][10]. - The application of style codes is particularly beneficial for projects requiring a cohesive look, such as illustrated books, where maintaining a consistent style is crucial [15][23]. Group 2: Creative Techniques - The article emphasizes the importance of detailed prompts in conjunction with style codes to achieve better results, as the style code primarily influences the visual style rather than the content [14][15]. - Suggestions for color selection include using a solid color background and complementary or contrasting colors for the main subject, enhancing visual appeal [24]. Group 3: Community and Resources - The article recommends following creators on platforms like X and Instagram who frequently share Sref Codes, fostering a community of shared resources and inspiration [41]. - A website, midjourneysref.com, is highlighted as a valuable tool for discovering and searching for various Midjourney style codes, making it easier for users to find styles that suit their needs [45].
深谋科技独家发布真正为人类服务的新一代人形机器人核心技术「声波传感 · 意念控制 · 高精视觉 · 类脑智能」
量子位· 2025-07-17 05:52
Core Viewpoint - The article highlights the advancements and innovations of Shenmou Technology in humanoid robotics, particularly focusing on their new generation humanoid robot "Meihouwang" and its core technologies aimed at creating real value for humanity [1][2]. Group 1: Event and Recognition - The 2025 World Artificial Intelligence Conference (WAIC) will take place from July 26 to 29, where Shenmou Technology will showcase its innovations [1]. - The humanoid robot "Meihouwang" has already won the prestigious German Red Dot Award and the American MUSE Gold Award before its official debut, marking it as the first humanoid robot to achieve both honors [1]. Group 2: Core Technologies - Shenmou Technology has developed the "OmniSense" system, a multi-physical quantity intelligent perception system based on Surface Acoustic Wave (SAW) technology, which enables environmental, physiological, and motion sensing [3][4]. - The "MindMover" system is a closed-loop brain-machine interaction system that integrates brainwave sensing and control technologies, allowing for bidirectional interaction without the need for voice or physical input [5]. - The company has introduced the first domestic piezoelectric six-dimensional force sensor, "Bouncy," which enhances the tactile and judgment capabilities of humanoid robots [6][7]. Group 3: Applications and Advantages - The piezoelectric six-dimensional force sensor can be applied in various fields, including medical procedures, aerospace testing, and industrial processes, providing high sensitivity and reliability [8][9]. - The dynamic visual servo system developed by Shenmou Technology allows robots to understand and respond to dynamic environments, significantly improving their operational stability and success rates in tasks [10][12]. Group 4: Research and Development Strategy - Shenmou Technology emphasizes a full-stack self-research approach, covering both hardware and software, to enhance the capabilities of humanoid robots [13]. - The company is pursuing a unique direction in developing a brain-like embodied intelligence model, which aims to extract causal relationships and physical laws over time, diverging from mainstream reliance on large data models [14].
1万tokens是检验长文本的新基准,超过后18款大模型集体失智
量子位· 2025-07-17 02:43
Core Insights - The article discusses the performance decline of large language models (LLMs) as the input context length increases, highlighting that the decline is not uniform but occurs at specific token lengths [10][21][44] - A recent study by the Chroma team tested 18 mainstream LLMs, revealing that models like GPT-4.1 and Claude Sonnet 4 experience significant accuracy drops when processing longer inputs [8][9][19] Group 1: Performance Decline - As input length increases, model performance deteriorates, with a notable drop around 10,000 tokens, where accuracy can fall to approximately 50% [4][21] - Different models exhibit varying thresholds for performance decline, with some models losing accuracy earlier than others [6][7][19] - The study indicates that semantic similarity between the "needle" (target information) and the "problem" significantly affects performance, with lower similarity leading to greater declines [19][21] Group 2: Experimental Findings - Four controlled experiments were conducted to assess the impact of input length on model performance, focusing on factors like semantic similarity, interference information, and text structure [17][35][41] - The first experiment showed that as input length increased, models struggled more with low semantic similarity, leading to a sharper performance drop [19][21] - The second experiment demonstrated that the presence of interference items significantly reduced model accuracy, with multiple interference items causing a 30%-50% drop compared to baseline performance [26][28] Group 3: Structural Impact - The structure of the background text (haystack) also plays a crucial role in model performance, with coherent structures leading to more significant declines in accuracy compared to disordered structures [40][42] - The experiments revealed that most models performed worse with coherent structures as input length increased, while performance decline was less severe with disordered structures [41][44] - The findings suggest that LLMs face challenges in processing complex logical structures in long texts, indicating a need for improved handling of such inputs [41][44] Group 4: Implications and Future Directions - The results highlight the limitations of current LLMs in managing long-context tasks, prompting suggestions for clearer instructions and context management strategies [44] - Chroma, the team behind the research, aims to address these challenges by developing open-source tools to enhance LLM applications in processing long texts [45][48]
马斯克造了个AI女友
量子位· 2025-07-16 07:02
Core Viewpoint - Elon Musk's new AI companion, Ani, is part of the Grok app's newly launched feature, AI Companions, allowing users to interact with 3D animated characters through voice [4][10]. Group 1: Product Features and Offerings - Ani is a blonde character who enjoys human interaction and can unlock more experiences if users engage well [2][4]. - Another character, Bad Rudy, is a feisty fox designed for users who enjoy a more sarcastic interaction [5]. - Currently, the AI companions are available only on iOS, with Android users needing to wait for future updates [16]. - Users must subscribe to Super Grok for $30 per month to access these AI companions [14][15]. - Two additional characters, one male and one female, are expected to be launched soon [18][19]. Group 2: Market Context and Competition - The AI companionship market is experiencing a downturn, with Character.ai, a leading platform, reporting 233 million monthly active users but a low average revenue per user (ARPU) of $0.72 [27]. - In China, leading companionship products have seen a decline of over 20% in user engagement, with total new downloads below 4 million and daily active users (DAU) under 2 million [27]. - The daily new downloads for top products like Xingye and Maoxiang have dropped to less than 50% of their early-year figures, indicating significant user attrition [27]. Group 3: Legal and Ethical Considerations - Character.ai is facing multiple lawsuits from parents of child users, raising concerns about the platform's safety [28]. - Research indicates that excessive reliance on AI chatbots for emotional support may pose risks for adults [28]. Group 4: Future Developments - Musk's xAI is reportedly creating a new company focused on multi-agent interactions in virtual environments, aiming for a platform where NPCs possess genuine intelligence and unpredictability [33][34]. - Tesla has already implemented some of these concepts in its vehicles, suggesting a broader application of AI technology [36].
小哥硬核手搓AI桌宠!接入GPT-4o,听得懂人话还能互动,方案可复现
量子位· 2025-07-16 07:02
Core Viewpoint - The article discusses the creation of an AI pet named Shoggoth, inspired by the Pixar lamp robot, which utilizes GPT-4o and 3D printing technology to interact with humans in a pet-like manner [1][48]. Group 1: AI Pet Development - Shoggoth is designed to communicate and interact with users, potentially replacing traditional stuffed toys as childhood companions [5][52]. - The robot's structure is simple, featuring a base with three motors and a 3D-printed conical head, along with a flexible tentacle system inspired by octopus grabbing strategies [8][10]. - The robot can adapt to various object sizes and weights, capable of handling items up to 260 times its own weight [8]. Group 2: Control and Interaction Mechanisms - Shoggoth employs a dual-layer control system: low-level control using preset actions and high-level control utilizing GPT-4o for real-time processing of voice and visual events [25][26]. - The robot's perception includes hand tracking and tentacle tip tracking, using advanced models like YOLO for 3D triangulation [30][33]. - A 2D mapping system simplifies the control of tentacle movements, allowing users to manipulate the robot via a computer touchpad [22][24]. Group 3: Technical Challenges and Solutions - Initial designs faced issues with cable entanglement, which were addressed by adding a cable spool cover and calibration scripts to improve tension control [14][16][17]. - The design also required reinforcement of the "spine" structure to prevent sagging under its own weight [18]. - The final model successfully transitioned from simulation to real-world application, validating the effectiveness of the control strategies implemented [38]. Group 4: Creator Background - The creator, Matthieu Le Cauchois, is an ML engineer with a background in reinforcement learning, speech recognition, and NLP, having previously founded an AI company [39][41]. - His work includes various innovative projects, showcasing his expertise in machine learning and robotics [46][48].
黄仁勋:每天都在用AI,提示工程可以提高认知水平
量子位· 2025-07-16 04:21
时令 发自 凹非寺 量子位 | 公众号 QbitAI 我每天都使用AI,我认为提示工程是一项高级认知技能。 说这话的,正是身价刚刚超过巴菲特的 黄仁勋 。 他还表示,人们对人工智能会消灭工作岗位的担忧被夸大了,但这并不意味着工作方式不会发生巨大变化。 他百分之百肯定,每个人的工作都会发生变化。 此言出自老黄在CNN(美国有线电视新闻网)的最新访谈。 此外,他还在访谈中提及了中国市场的重要性。 值得一提的是,黄仁勋在接受央视采访时宣布最新进展: 1、H20已被批准销往中国市场:这是个非常、非常好的消息; 2、将发布新显卡RTX Pro:这款显卡非常重要,专为计算机图形、数字孪生和AI设计。 通过大规模减少任务重塑工作 黄仁勋相信AI将重塑几乎所有工作岗位——不是通过大规模失业,而是通过大规模的任务削减和重构。 有些工作会消失,但也会创造出很多新的岗位。我希望,各行各业因人工智能带来的生产力提升,最终能够推动整个社会的发展。 我并不是让它替我思考,而是让它教我那些我还不了解的知识,或者帮助我解决那些我自己难以合理解决的问题。 他认为,向AI发出有效提示本身就是一项技能,既需要认知上的努力,也需要表达的清晰度。 作 ...