Workflow
DeepMind
icon
Search documents
MOSS孙天祥新公司要让AI自己写100篇论文,还要全网直播一个月
3 6 Ke· 2026-02-12 09:52
Core Insights - The article discusses a month-long live demonstration of an AI system named FARS, which aims to autonomously conduct the entire research process, producing 100 complete research papers without human intervention [1][20]. Company Overview - Analemma, the company behind FARS, was founded less than a year ago and has secured tens of millions of dollars in angel funding from notable investors such as Sequoia China and Meituan [1]. - The founder, Tianxiang Sun, was a key developer of MOSS, a significant model in the AI field, which gained attention for its capabilities [11][12]. Technology and Architecture - FARS, or Fully Automated Research System, is a multi-agent system composed of four modules: Ideation, Planning, Experiment, and Writing, which collaborate in a shared file system [2][4]. - The system utilizes APIs from various closed-source models, including Claude, GPT, and Gemini, along with self-developed models for certain tasks [5]. Research Focus and Methodology - FARS focuses on AI research itself, allowing for fully automated experiments that do not require physical laboratories [8]. - The system is designed to produce "short papers" that emphasize clear hypotheses and reliable validation, diverging from traditional academic publishing norms [7]. Quality Control and Evaluation - Each paper produced by FARS will undergo review by at least three team members with over five years of research experience before being uploaded to arXiv, ensuring a level of quality control [8]. - The team plans to invite peer reviews rather than submitting to traditional academic conferences, focusing on the practical citation and value of the results [8]. Competitive Landscape - FARS is part of a growing trend in automated research systems, competing with others like Sakana AI's AI Scientist and AI-Researcher from Hong Kong University [17][19]. - Unlike its competitors, FARS aims for real-time, large-scale, and fully transparent public deployment, which is a bold move in the field [19]. Future Directions - The live demonstration of FARS will begin on the company's website and social media platforms, marking a significant step in evaluating the system's capabilities [20]. - The results of this experiment could provide insights into the potential of AI to conduct research autonomously, a question that remains to be answered through the quality of the 100 papers produced [20][21].
过劳病倒、职权被削、联创跑路:xAI 48小时内上演最惨烈人才地震
AI前线· 2026-02-11 03:40
48 小时内,xAI 两位联创出走,引发 Grok 难产猜测? 作者 | 冬梅、高允毅 这两天,全球首富马斯克旗下人工智能公司 xAI 连丢两位联创, 一个是 Yuhuai (Tony) Wu(吴宇 怀),另一个是深度学习大神 Jimmy Ba。 他俩都曾是杰弗里·辛顿(Geoffrey Everest Hinton)的 门徒。 在推特上,他们纷纷发表了"离职"声明。 Yuhuai (Tony) Wu: "这家公司——以及我们之间如同家人般的情谊——将永远铭刻在我的记忆中。我会深深怀 念这里的人们、作战室,以及我们并肩作战过的所有战役。 我的人生新篇章即将开启。这是一个充满无限可能的时代:一支配备人工智能的小团队可以 移山填海,重新定义一切皆有可能。" 致埃隆 @elonmusk,感谢你们相信我们的使命,也感谢你们带给我们这段毕生难忘的旅 程。 Jimmy Ba: 在 xAI 的最后一天。 xAI 的使命是推动人类技术在卡尔达舍夫科技树上不断攀升。很荣幸能参与创立 xAI。还要特别 感谢 @elonmusk 感谢你们让我们齐聚一堂,共同踏上这段非凡的旅程。我为 xAI 团队的成就感到无比自豪,并将 永远以朋友 ...
Proptech startup Smart Bricks raises $5M pre-seed led by a16z
Yahoo Finance· 2026-02-10 14:00
For much of his career, Mohamed Mohamed worked at institutions — BlackRock, Goldman Sachs, and McKinsey — all of whom, he said, “treated real estate as a computational problem.” “They had proprietary data pipelines, internal valuation models, simulation tools, and increasingly, early AI systems supporting underwriting and capital allocation,” he told TechCrunch, explaining how these firms analyzed property investments. But he knew that regular people who also invested in real estate didn’t have access ...
数字经济双周报(2026年第3期):智能体开启AI从工具到伙伴新时代-20260210
Yin He Zheng Quan· 2026-02-10 06:52
Core Insights - The report highlights the transition of AI Agents from tools to partners, marking a new era in digital economy [2][4] - The AI Agent market is projected to grow significantly, with an expected increase from $8.2 billion in 2023 to $14.1 billion by 2033, representing a growth rate of 68.18% [4] - The report emphasizes the importance of regional collaboration and ecosystem construction in accelerating AI industry transformation in China [11][12] Section Summaries 1. Focus of the Report: AI Agents Transitioning to Partners - The evolution of AI technology can be categorized into three stages, with AI Agents now entering a phase of explosive growth [4] - The AI Agent market is expected to grow from $12 billion in 2023 to $20.55 billion by 2033, indicating a compound annual growth rate (CAGR) of 5.5% [4] 2. China Dynamics: Accelerating AI Industry Transformation - Central government policies are increasingly focused on integrating AI into various sectors to promote high-quality development [11][12] - Local governments are implementing tailored policies to foster AI integration, leveraging regional strengths [12] - Significant capital inflows into leading AI companies reflect market confidence in technological advancements [12] 3. U.S. Dynamics: Intensifying Capital and Commercialization - Major AI companies are attracting substantial investments, with valuations and funding targets on the rise [13][14] - Strategic investments by tech giants are enhancing their integration with key AI firms, reinforcing their positions in the industry [13] - The competition in AI commercialization is heating up, with diverse revenue models being explored [14] 4. European Dynamics: Balancing Regulation and Industry Competitiveness - The UK is collaborating with tech giants to develop frameworks addressing new digital threats like deepfakes [16] - Regulatory bodies are investigating the misuse risks of generative AI applications, focusing on data protection and online safety [17] - The EU is initiating investigations to ensure fair competition in the AI sector, emphasizing interoperability and data access [17] 5. Other Countries: Strengthening AI Infrastructure - AI demand is driving a supercycle in the storage industry, with South Korean firms enhancing their market positions [18] - Storage chip prices are rising due to AI needs, leading to record performances for major manufacturers [18] - The global semiconductor landscape is shifting, with a focus on diversifying supply chains to meet AI demands [19]
马斯克 vs 哈萨比斯 vs 杨立昆:谁定义的才是AI的真实未来?
3 6 Ke· 2026-02-09 12:51
当埃隆·马斯克公开判断"2026 年实现 AGI(通用人工智能),2030 年集体智能将碾压人类"时,整个科技圈迅速被点燃。 在他看来,AI 正处在一条几乎无法减速的加速曲线上:能力每 7 个月翻倍,现有模型还有百倍潜力尚未释放,一旦放缓节奏,人 类反而可能失去对系统的控制权。这种近乎"悬崖式推进"的判断,也让 AI 的未来被推向更极端的讨论区间。 与之相对的,则是相对而言更加保守的声音,以DeepMind CEO 戴密斯·哈萨比斯为代表的其他从业者则认为:"2030年前AGI落地 概率仅50%",他强调物理世界交互能力才是关键,安全测试必须先行;而"AI教父"辛顿更直接呼吁全球签署"AI开发暂停条约", 以防失控。 前 Meta 首席 AI 科学家杨立昆的态度则更加冷静甚至悲观。他和不少研究者直言,当前被频繁讨论的 AGI,更像是一种叙事工 具;仅依赖大语言模型,几乎不可能真正通向通用人工智能。 关于 AGI 的争论从未停歇:它会在何时到来?是否真的存在?又是否会从根本上改变人类社会?答案仍然高度分裂。 为此,Morketing整理了包括2026年达沃斯世界经济论坛核心对话、《财富》、《The Verge》 ...
腾讯研究院AI速递 20260209
腾讯研究院· 2026-02-08 16:03
Group 1: Claude Opus 4.6 Release - Anthropic launched Claude Opus 4.6, outperforming GPT-5.2 by approximately 144 Elo in GDPval-AA knowledge work assessment and achieving top scores in Terminal-Bench 2.0, Humanity's Last Exam, and BrowseComp [1] - The Opus model supports a context window of 1 million tokens and an output limit of 128,000 tokens, achieving 76% in long context retrieval tests, which is four times better than Sonnet 4.5 [1] - The product line has been updated with new features, including agent teams in Claude Code, an upgraded Excel, and a research preview for PowerPoint, along with new API functionalities like adaptive thinking and context compaction [1] Group 2: OpenAI GPT-5.3-Codex Release - OpenAI released GPT-5.3-Codex shortly after Claude Opus 4.6, achieving 77.3% in Terminal-Bench 2.0, regaining the highest score and being 25% faster than its predecessor, GPT-5.2-Codex [2] - This model is the first to participate in creating its own model, utilizing early versions for debugging its training process, managing deployment, and analyzing evaluation results [2] - The OSWorld-Verified score improved from 38.2% to 64.7%, nearing the human benchmark of 72%, with a cybersecurity CTF score of 77.6%, marking it as the first high-capability cybersecurity model [2] Group 3: Claude Opus 4.6 Fast Mode - Anthropic introduced a Fast Mode for Claude Opus 4.6, which is 2.5 times faster than the standard version, available to Claude Code and API users, with initial support from platforms like Cursor and GitHub Copilot [3] - Pricing for Fast Mode has significantly increased, with input costs at $30 per million tokens and output costs at $150 per million tokens, while long context pricing has doubled, offering a 50% discount until February 16 [3] - This mode is recommended for rapid code iteration and real-time debugging, with automatic fallback to the standard version after hitting rate limits [3] Group 4: Pony Alpha Model - The OpenRouter platform launched the mysterious anonymous model Pony Alpha, which excels in programming, logical reasoning, and role-playing, available for free [4] - Speculation surrounds the model's identity, with guesses including DeepSeek-V4, GLM new models, Opus 5.3, Codex 4.6, or Grok 4.2, but no consensus has been reached [4] - Pony Alpha supports reasoning with a context of 200,000 tokens, with users successfully creating complete web applications containing 500 lines of code, hinting at a possible Chinese origin due to its name [4] Group 5: ByteDance Seedance 2.0 Launch - ByteDance quietly launched Seedance 2.0, which supports self-storyboarding, synchronized audio-visual generation, multi-shot narratives, and up to 12 multimodal reference files [5] - The usability rate improved from under 20% to over 90%, with actual production costs reduced to near theoretical levels, fundamentally changing the industry's economics [5] Group 6: Tencent WorkBuddy Internal Testing - Tencent opened internal testing for WorkBuddy, a desktop AI agent capable of planning and executing complex multimodal tasks on local computers [7] - Core capabilities include automatic batch file processing, document/spreadsheet/PPT generation, deep data analysis, and industry research, with built-in model switching and high-risk command interception [7] - Since its internal testing began on January 19, it has served over 2,000 Tencent employees, targeting non-technical workplace groups like HR, administration, operations, and sales to lower the AI tool usage barrier [7] Group 7: Waymo and DeepMind Collaboration - Waymo introduced a world model built on DeepMind Genie 3, capable of generating highly realistic and interactive 3D environments, simulating rare driving scenarios like tornadoes and elephants [8] - The model supports three control mechanisms: driving behavior, scene layout, and language, converting ordinary driving record videos into multimodal simulations, showcasing the Waymo Driver's perspective [8] - Waymo Driver has completed nearly 200 million miles of fully autonomous driving, with the world model enabling the system to rehearse billions of miles of complex scenarios in a virtual environment [8] Group 8: Elon Musk's Future Plans - Elon Musk revealed SpaceX plans to launch 20,000 to 30,000 times annually, predicting that within five years, space computing power will exceed the global total [9] - The Tesla AI5 chip is set for mass production in Q2 next year, with the AI6 chip following within a year, and Optimus expected to reach a production capacity of 1 million units in three years and 10 million in four years [9] - Musk described Optimus as a "money-making perpetual motion machine," asserting that without breakthrough innovations, the U.S. will fall behind China in AI, electric vehicles, and humanoid robot manufacturing [9] Group 9: AI Growth Projections - ARK Invest forecasts that global GDP growth will exceed 7% by 2030, driven by the integration of five technologies, with a bullish Bitcoin price target of $1.5 million by 2030 [12] - The differentiated development of AI between China and the U.S. sees China breaking through with an open-source approach, while the U.S. leads in application-level global competitiveness, with proprietary data being a decisive advantage in the AI era [12] - Tesla is positioned to lead the Robotaxis market through vertical integration, with future travel costs potentially dropping to $0.20 per mile, and a market capitalization of a trillion dollars by 2030 is anticipated [12]
Waymo联手DeepMind打造世界模型:基于Genie 3,让自动驾驶「脑补」罕见场景
机器之心· 2026-02-07 07:00
Core Insights - Waymo has launched the Waymo World Model, a new standard in large-scale, hyper-realistic autonomous driving simulation, built on DeepMind's Genie 3 [1][4] - The model can generate highly realistic and interactive 3D environments tailored for the strict requirements of autonomous driving [4][8] - Waymo Driver has completed nearly 200 million miles of fully autonomous driving, enhancing road safety through extensive virtual world training [4][28] Group 1: Model Capabilities - Waymo World Model leverages Genie 3's extensive world knowledge to simulate rare events that are difficult to replicate in real life, such as tornadoes and encounters with elephants [4][9] - The model supports high-fidelity, multi-sensor data generation, including camera images and LiDAR point clouds, providing a comprehensive training and testing environment for autonomous systems [4][8] - The simulation allows for real-time adjustments through simple language prompts, driving inputs, or scene layouts, enhancing the model's adaptability [4][11][16] Group 2: Simulation Control Mechanisms - The model features three main control mechanisms: driving behavior control, scene layout control, and language control, enabling the simulation of various driving scenarios [11][13][16] - Driving behavior control allows for the simulation of counterfactual events, assessing how the Waymo Driver would respond under specific conditions [11] - Scene layout control enables customization of road layouts and traffic signals, while language control provides flexibility in adjusting time of day and weather conditions [13][16] Group 3: Realism and Accuracy - Waymo World Model can convert real-world videos into multi-modal simulations, achieving high levels of realism and factual accuracy [22] - The model's efficient variants allow for long-duration simulations while maintaining high fidelity, supporting large-scale testing [24] - By simulating rare scenarios, Waymo Driver prepares for complex driving situations, setting a higher safety benchmark for autonomous systems [28]
像挖币一样挖激活函数?DeepMind搭建「算力矿场」,暴力搜出下一代ReLU
机器之心· 2026-02-07 04:09
Core Insights - The article discusses the evolution of activation functions in neural networks, highlighting the transition from traditional functions like Sigmoid and ReLU to newer ones like GELU and Swish, emphasizing the impact on model performance [1][2]. Group 1: DeepMind's Innovation - Google DeepMind is revolutionizing the search for activation functions through a new method called AlphaEvolve, which explores an infinite space of Python functions rather than relying on predefined search spaces [2][4]. - The research paper titled "Finding Generalizable Activation Functions" showcases how DeepMind's approach led to the discovery of new activation functions, including GELUSine and GELU-Sinc-Perturbation, which outperform traditional functions in certain tasks [4][30]. Group 2: Methodology - AlphaEvolve utilizes a large language model (LLM) to generate and modify code, allowing for a more flexible and expansive search for activation functions [8][11]. - The process involves a "micro-laboratory" strategy, where synthetic data is used to optimize for out-of-distribution (OOD) generalization capabilities, avoiding the high costs of searching on large datasets like ImageNet [14][18]. Group 3: Performance of New Functions - The newly discovered functions demonstrated superior performance in algorithmic reasoning tasks, with GELU-Sinc-Perturbation achieving a score of 0.887 on the CLRS-30 benchmark, surpassing ReLU and GELU [34]. - In visual tasks, GELUSine and GELU-Sinc-Perturbation maintained competitive accuracy on ImageNet, achieving approximately 74.5% Top-1 accuracy, comparable to GELU [34][35]. Group 4: Insights on Function Design - The research indicates that the best-performing functions often follow a general formula combining a standard activation function with a periodic term, suggesting that incorporating periodic structures can enhance model generalization [25][35]. - The study highlights the importance of understanding the inductive biases introduced by activation functions, suggesting that periodic elements can help capture complex data structures beyond linear relationships [40][42].
AI将导致码农失业?资深程序员自述已不再手工写代码,拒绝AI很危险,职业将迎分化
Sou Hu Cai Jing· 2026-02-06 11:42
Core Viewpoint - The rapid evolution of AI programming models is reshaping the software development landscape, leading to concerns about job displacement among programmers [3][5][24]. Group 1: AI Programming Models Competition - OpenAI and Anthropic are in direct competition, with OpenAI recently launching GPT-5.3-Codex, claiming it to be the most powerful AI programming model [3][9]. - The competition has intensified as both companies focus on enhancing their programming capabilities, with significant updates being released in quick succession [4][8]. Group 2: Impact on Software Development - AI is transforming the way software development is conducted, with many developers reporting a shift from manual coding to AI-assisted coding, significantly increasing efficiency [16][18]. - The use of AI tools has led to a dramatic reduction in the time required for coding tasks, with some developers stating that projects can now be completed in a fraction of the time compared to traditional methods [16][22]. Group 3: Job Market and Employment Concerns - There is a growing concern that AI will lead to job losses in the programming sector, with predictions that up to 50% of entry-level jobs may disappear in the next few years [24][25]. - The employment rate for programmers in the U.S. has dropped by 27.5% over the past two years, with many layoffs attributed to the impact of AI [25][26]. Group 4: Changing Role of Programmers - The role of programmers is expected to evolve, with a shift towards overseeing AI tools rather than performing traditional coding tasks [30][28]. - Future software engineers may focus more on defining project requirements and guiding AI to execute tasks, rather than writing code themselves [29][30]. Group 5: Accessibility of Programming - AI is lowering the barriers to entry for programming, allowing individuals with little to no coding experience to develop applications by simply communicating their ideas to AI [19][20]. - While AI can handle many coding tasks, the ability to articulate clear project requirements remains a critical skill for success in the evolving landscape [21][22].
AI Is Not Driving Widespread Job Losses, Google DeepMind CEO Demis Hassabis Says — But Sees 'Beginnings' Of Slower Entry-Level Hiring
Yahoo Finance· 2026-02-05 20:31
Artificial intelligence is beginning to reshape how companies hire, but there is no sign of widespread job losses. Changes are emerging gradually rather than all at once, Google DeepMind CEO Demis Hassabis said during a discussion with Anthropic CEO Dario Amodei at the World Economic Forum in Davos, adding that current labor-market data does not point to broad job displacement. Early Pressure On Junior Hiring Hassabis told the Davos panel that the earliest effects are emerging at the junior end of orga ...