量子位
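Human brain cells on a chip play Doom! 200,000 living neurons navigate and kill enemies on their own, with learning efficiency that crushes deep reinforcement learning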
量子位· 2026-03-02 03:28
Core Insights
- The article discusses the development of a biological computing system where 200,000 human brain cells, referred to as "brain PU," learned to play the classic game Doom through reinforcement learning techniques [1][10]
- This achievement builds on previous work where brain cells learned to play Pong, showcasing advancements in translating digital game environments into signals that neurons can understand [12][15]
Group 1: Learning Process and Performance
- The process of teaching brain cells to play Doom was completed in under a week by independent developer Sean Cole using Cortical Labs' cloud platform API, contrasting with the 18 months taken for Pong [6][7]
- The biological system demonstrated superior sample efficiency compared to three mainstream reinforcement learning algorithms (DQN, A2C, and PPO) in terms of key performance metrics such as average hits per game and error rates [20][22]
- The study revealed that the biological cultures showed significant improvements in performance over time, while traditional algorithms did not exhibit similar enhancements [21][23]
Group 2: Experimental Design and Findings
- The research involved recording neural spike activities across 1,024 channels during 285 game sessions, with a sampling frequency of 20 kHz, allowing for detailed analysis of neural dynamics [16][25]
- The experiments were designed to compare the biological system's performance against reinforcement learning algorithms under controlled conditions, ensuring that both systems had the same training volume [17][18]
- The findings indicated that even with sparse input information, the biological system outperformed the algorithms, challenging the notion that more data always leads to better performance [22][23]
Group 3: Implications and Future Directions
- The research team introduced the concept of "Synthetic Biological Intelligence" (SBI), marking the first formal comparison between biological systems and reinforcement learning systems [29]
- The study suggests that biological systems may rely on more efficient learning processes, such as predictive coding and active inference, which differ from traditional backpropagation methods [30][31]
- Future goals include enhancing the capabilities of the neurons to not only play Doom effectively but also tackle more complex tasks, such as controlling robotic arms [38][39]
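As a rough aid to the recording figures quoted in this summary (1,024 channels sampled at 20 kHz across 285 sessions), the sketch below works out the raw sample throughput. The 2-byte sample width is a hypothetical value chosen only for illustration; the summary states neither the sample format nor the session length, so no total data volume is computed.

```python
# Figures taken from the summary.
CHANNELS = 1_024          # recording channels
SAMPLE_RATE_HZ = 20_000   # 20 kHz sampling frequency


def samples_per_second(channels: int = CHANNELS, rate_hz: int = SAMPLE_RATE_HZ) -> int:
    """Number of samples produced per second across all channels."""
    return channels * rate_hz


def raw_bytes_per_second(bytes_per_sample: int = 2) -> int:
    """Raw throughput in bytes per second.

    bytes_per_sample is an ASSUMPTION (16-bit samples); the source
    does not state the sample format.
    """
    return samples_per_second() * bytes_per_sample


if __name__ == "__main__":
    print(f"{samples_per_second():,} samples/s")
    print(f"{raw_bytes_per_second() / 1e6:.2f} MB/s (assuming 2 bytes per sample)")
```

Under that assumed sample width, the stream is on the order of tens of megabytes per second before any spike detection or compression.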
Silicon Valley goes all-in on "lobsters"! Anthropic, Microsoft, Meta, Notion and others all turn in their own Claw
量子位· 2026-03-01 02:01
Core Viewpoint
- The article discusses the recent surge in AI companies developing their own "Claw" systems, which are advanced AI agents capable of executing tasks and automating workflows, marking a significant shift in the AI landscape [2][12][48].
Group 1: Company Actions
- Meta has accelerated its efforts by integrating the Manus Agent into Telegram, focusing on long-term memory to enhance user interaction [5][17][18].
- Anthropic has rapidly released new features for Claude Cowork, including mobile remote control and automated task management, to maintain its competitive edge [6][22][23].
- Microsoft introduced Microsoft Copilot Tasks, which autonomously plans schedules, operates across applications, and manages timed tasks, enhancing productivity within its ecosystem [29][30][31][33].
- Notion's Custom Agents represent a significant transformation, allowing for 24/7 operation without manual input, marking its shift from a document tool to a collaborative platform [37][39][42].
- Perplexity launched Perplexity Computer, aiming to unify various AI functions, from research to deployment, under one system [44][46].
Group 2: Industry Trends
- The shift towards "Claw" systems is driven by AI models reaching a trust threshold, allowing companies to delegate more complex tasks to AI agents [49][51].
- There is a growing consensus that the next wave of AI growth will focus on the practical capabilities of AI rather than just knowledge accumulation [54][56].
- The commercialization of AI is evolving from selling tokens to selling labor hours, indicating a shift in how AI services are monetized [58][60].
Haidian makes a big move! 9 billion yuan in funding plus nearly 30 policies send three key signals for sci-tech innovation
量子位· 2026-02-28 10:30
Core Insights
- February 2026 is marked as a "super release month" in AI history, with significant product launches from global and domestic companies, indicating a new phase of practical application and autonomous evolution in AI technology [1][2]
- Chinese tech companies have transitioned from quantity advantages to becoming core leaders in the global AI arena, supported by government strategic guidance and institutional guarantees [3]
- Haidian District in Beijing is highlighted as a key innovation hub, with a strong focus on technology and industrial upgrades, making it well-suited for addressing global tech competition [4]
Group 1: Economic and Innovation Data
- Haidian District entered the "trillion club" in 2022, becoming the first district in Beijing and the second nationwide to achieve this status, with a projected GDP of 1.37 trillion yuan by 2025, reflecting a robust growth rate of 7.2% [9]
- The district has a significant innovation capacity, housing 2 national laboratories, 92 key laboratories, 37 universities, and 96 national research institutions, with a talent pool exceeding 2 million, including 692 academicians and 12,300 AI scholars [6][9]
- In 2025, Haidian is expected to see the establishment of over 24,000 new tech enterprises, bringing the total to 145,400, with nearly 40% of national "little giant" enterprises located in the district [9]
Group 2: Policy Initiatives and Funding
- On February 28, 2026, Haidian announced nearly 30 major policies aimed at supporting the tech innovation industry, backed by a special fund of no less than 9 billion yuan to assist enterprise growth [12][13]
- The district's policies include financial support for technology transfer institutions, consumer spending, high-quality industrial park development, and housing assistance for young talents [17]
- The newly established "Zhongguancun Science City Technology Achievement Transformation Fund" and "Technology Growth Fund" are designed to support early-stage quality projects, with a total scale of 20 billion yuan for the former and 80 billion yuan for the latter [23][25]
Group 3: Modern Industrial System and Collaboration
- The "1+X+1" modern industrial system framework emphasizes AI as the core engine, supported by five strategic emerging industries and three future industries, ensuring a clear direction for Haidian's tech innovation [32][34]
- The "Five Forces and Six Powers" mechanism aims to enhance collaboration among various stakeholders, breaking down barriers between universities, research institutions, and enterprises to facilitate technology transfer [40][42]
- Haidian is actively pursuing regional and international collaboration, establishing partnerships with neighboring districts and global tech parks to foster innovation and resource sharing [43][46]
Google abruptly releases Gemini 3.1 Pro! First use of a ".1" version number, with reasoning performance ×2
量子位· 2026-02-20 01:28
Core Viewpoint
- The article discusses the significant upgrades of Google's Gemini 3.1 Pro model compared to its predecessor, Gemini 3 Pro, highlighting improvements in multimodal generation, semantic understanding, and reasoning capabilities [1][9][10].
Group 1: Model Upgrades
- Gemini 3.1 Pro shows a noticeable enhancement in multimodal generation and semantic understanding, achieving a higher level of performance [1].
- The model can convert everyday data into interactive visual content, such as aerospace dashboards and city simulations [3][5].
- In the ARC-AGI-2 benchmark test, Gemini 3.1 Pro achieved a verification score of 77.1%, which is double that of Gemini 3 Pro [10].
Group 2: Performance Metrics
- The performance comparison table indicates that Gemini 3.1 Pro outperforms other models in various benchmarks, including academic reasoning and abstract reasoning puzzles [11].
- The overall ranking score of Gemini 3.1 Pro in Arena's evaluation is 13 points higher than that of Gemini 3 Pro, with significant improvements in text and code dimensions [12].
- The model supports a context length of 1 million tokens and has a knowledge cutoff date of January 2025, enhancing its multimodal understanding and long-context performance [11].
Group 3: User Experience and Applications
- Users have reported positive experiences with Gemini 3.1 Pro, generating complex visualizations and interactive applications, such as a 3D simulation of a flock of birds [17][20].
- The model has been utilized to create personal websites and educational applications, showcasing its versatility and advanced capabilities [24][25].
- The model is now available in Gemini applications and APIs, with specific access for Google AI Pro and Ultra users [29].
Group 4: Cost and Market Implications
- The release of Gemini 3.1 Pro marks Google's first use of a ".1" version number, indicating a rapid pace of development in large models [30].
- The pricing for Gemini 3.1 Pro remains competitive, with input costs at $2 for less than 200k tokens and $4 for more, while output costs are $4 for less than 200k tokens and $18 for more [36].
- The cost per ARC-AGI-2 task is approximately $0.96, significantly lower than the previous model, suggesting a shift in the cost-performance curve in AI development [37][41].
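The tiered pricing quoted in this summary can be turned into a small cost estimator. This is a minimal sketch under stated assumptions, not Google's billing logic: the dollar figures are copied as quoted in the summary, the unit (USD per one million tokens) is assumed because the summary does not give one, and selecting the tier by prompt length is likewise an assumption. The names `Rates` and `estimate_cost` are hypothetical.

```python
from dataclasses import dataclass

TIER_THRESHOLD = 200_000   # tokens; tier boundary quoted in the summary
TOKENS_PER_UNIT = 1_000_000  # ASSUMPTION: prices are per 1M tokens


@dataclass(frozen=True)
class Rates:
    input_usd: float   # price per TOKENS_PER_UNIT input tokens
    output_usd: float  # price per TOKENS_PER_UNIT output tokens


# Dollar figures exactly as quoted in the summary.
SHORT_CONTEXT = Rates(input_usd=2.0, output_usd=4.0)
LONG_CONTEXT = Rates(input_usd=4.0, output_usd=18.0)


def estimate_cost(prompt_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request.

    ASSUMPTION: the tier is chosen by prompt length alone.
    """
    rates = SHORT_CONTEXT if prompt_tokens < TIER_THRESHOLD else LONG_CONTEXT
    total = prompt_tokens * rates.input_usd + output_tokens * rates.output_usd
    return total / TOKENS_PER_UNIT


if __name__ == "__main__":
    print(f"${estimate_cost(100_000, 10_000):.2f}")  # short-context tier
    print(f"${estimate_cost(300_000, 10_000):.2f}")  # long-context tier
```

Keeping the rates in a small data structure makes it easy to swap in figures from the official price list, which should be checked before relying on any estimate.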
量子位 (QbitAI) is hiring editors and writers
量子位· 2026-02-20 01:28
Core Viewpoint
- The article emphasizes the ongoing AI boom and invites individuals to join the company "Quantum Bit," which focuses on tracking AI advancements and has established itself as a leading content platform in the industry [1].
Group 1: Job Opportunities
- The company is hiring for three main directions: AI Industry, AI Finance, and AI Product, with positions available for both experienced professionals and fresh graduates [2][4].
- Positions are open for various levels, including editors, lead writers, and chief editors, with a focus on matching roles to individual capabilities [6].
Group 2: Job Responsibilities
- **AI Industry Direction**: Responsibilities include tracking innovations in infrastructure, such as chips, AI infrastructure, and cloud computing, as well as producing accessible reports on technical conferences and papers [6][7].
- **AI Finance Direction**: Focuses on venture capital, financial reports, and analyzing capital movements within the AI industry, including interviews with investors and entrepreneurs [11].
- **AI Product Direction**: Involves monitoring AI applications and hardware developments, writing in-depth product evaluations, and engaging with product experts [11].
Group 3: Benefits and Work Environment
- Employees can expect a vibrant team atmosphere, opportunities for personal influence through original content creation, and professional mentorship from senior editors [6][11].
- The company offers competitive salaries and comprehensive benefits, including social insurance, meal allowances, and performance bonuses [6].
Group 4: Company Growth and Reach
- By 2025, Quantum Bit aims to have over 2.4 million subscribers on WeChat and more than 7 million users across platforms, with a daily reading volume exceeding 2 million [12].
- The company is recognized as the top new media outlet in the AI and frontier technology sectors according to third-party data platforms [12].
AMD and NVIDIA both invested! Fei-Fei Li's startup announces $1 billion in new funding
量子位· 2026-02-19 07:03
Core Insights
- World Labs, founded by AI pioneer Fei-Fei Li, has raised $1 billion in its latest funding round, achieving a valuation of $5 billion, significantly surpassing initial expectations of $500 million [2][48].
- The rapid growth of World Labs, which reached a fivefold revaluation in just over a year, reflects strong investor confidence in the potential of world models and spatial intelligence [8][19].
- The investment round included major players like AMD, NVIDIA, and Autodesk, indicating a strategic focus on building an ecosystem around spatial intelligence [9][11].
Funding and Valuation
- World Labs completed its first funding round shortly after its establishment in April 2024, achieving a valuation of $200 million with no products at that time [18].
- By the time of the latest funding, the company had raised a total of $230 million, reaching a valuation of $1 billion [19].
- The latest funding round attracted significant investments from various sectors, highlighting the diverse applications of spatial intelligence [9][15].
Strategic Importance of Spatial Intelligence
- Fei-Fei Li emphasizes that spatial intelligence is the next frontier in AI, crucial for understanding and interacting with the physical world [24][46].
- The Marble model, World Labs' first-generation spatial intelligence model, allows for multi-modal input and creates navigable 3D worlds, differentiating it from current video generation models [27][46].
- The applications of Marble span various fields, including robotics, game development, and mental health research, showcasing the versatility of spatial intelligence [30][42].
Investor Insights
- The involvement of major tech companies like AMD and NVIDIA signifies a recognition of the computational needs for spatial intelligence [12][15].
- Autodesk's $200 million investment aligns with the 3D design and industrial software ecosystem, indicating a focus on practical applications of spatial intelligence [13].
- The participation of Fidelity and Emerson Collective reflects a broader acceptance of World Labs within mainstream financial circles, emphasizing long-term investment in next-generation technologies [14][15].
Future Outlook
- The rapid advancements in AI and the increasing interest in spatial intelligence suggest that World Labs is well-positioned to lead in this emerging field [40][48].
- The company aims to collaborate across various industries, indicating a commitment to exploring the wide-ranging applications of spatial intelligence [42][43].
- The success of World Labs may signal a shift towards physical AI and general-purpose robotics, marking a significant evolution in the AI landscape [49][50].
Google's Gemini has learned to compose music from images; your WeChat Moments can now have its own BGM
量子位· 2026-02-19 07:03
Core Viewpoint
- Google has transformed Gemini into a comprehensive creative tool capable of generating music and album covers based on user input, significantly simplifying the creative process [1][2][4].
Group 1: Features of Gemini
- The latest Lyria 3 model integrated into Gemini allows users to create music by simply providing text or images, producing complete songs with lyrics, melodies, and vocals in seconds [2][4].
- The audio sampling rate of Lyria 3 is 48kHz, ensuring high-fidelity sound quality for generated music, enhancing the overall user experience [5][7].
- Users can upload photos, and the AI can generate music that captures the essence of the image, providing a personalized soundtrack for social media [7][9].
Group 2: Creative Capabilities
- Gemini can generate songs in various styles, from nostalgic African beats to classic Motown soul, showcasing its versatility in music production [10][13].
- The AI can produce natural-sounding vocals and lyrics, making it feel like users have a personal music producer at their disposal [11][12].
- The integration of the Nano Banana model allows for the automatic creation of album covers that match the generated music, further streamlining the creative process [3][15].
Group 3: Strategic Intent of Google
- Google aims to establish Gemini as a "super entry point" for digital life, integrating various services like cloud storage, photo albums, and YouTube into a single platform [16][18].
- This comprehensive approach reduces the need for users to switch between different applications, enhancing efficiency and convenience [17][18].
- By creating a seamless user experience, Google strengthens its position in the market, making it less likely for users to seek out independent applications [17][18].
After the Spring Festival Gala, why did AI and robots all head to the same place?
量子位· 2026-02-19 04:27
Core Viewpoint
- The article discusses the integration of AI technology and embodied intelligence into mainstream culture on Lunar New Year's Eve 2026, highlighting the need for technology companies to maintain engagement beyond initial exposure during the Spring Festival Gala [1][4].
Group 1: AI and Embodied Intelligence Engagement
- The Spring Festival Gala served as a peak moment for AI and robotics, but companies are anxious about sustaining interest beyond the event [4][5].
- Following the gala, tech companies are actively seeking to extend discussions and maintain engagement through online platforms, particularly Bilibili [6][7].
- Bilibili is emerging as a key platform for AI and robotics discussions, capitalizing on the momentum generated during the Spring Festival [8][30].
Group 2: Bilibili's Role in AI and Robotics
- Bilibili's collaboration with the Spring Festival Gala has deepened, with the platform becoming the exclusive bullet screen video platform for the event [33].
- The platform has a unique ecosystem that fosters discussions around AI and robotics, making it a prime location for brands to engage with a tech-savvy audience [37][39].
- Data indicates that Bilibili has a high concentration of users interested in AI, with nearly 100,000 active creators related to AI content each month [38][39].
Group 3: User Engagement and Content Creation
- The introduction of "interest rooms" on Bilibili allows for more targeted engagement, facilitating a smoother transition from awareness to deeper understanding of AI and robotics [62][63].
- Users are encouraged to interact with content, leading to a more engaged community that discusses, creates, and shares AI-related content [60][64].
- The active bullet screen culture on Bilibili enhances the information density of tech discussions, making it a unique platform for real-time feedback and interaction [52].
Group 4: Long-term Strategy for Brands
- Companies like Songyan Power, Yuanbao, and Yushu Technology are not just marketing for the Spring Festival but are strategically positioning themselves for long-term brand recognition and user engagement [69][70].
- The article emphasizes the importance of finding the right community to sustain interest and deepen user understanding of AI and robotics post-exposure [70][71].
- The future of human-machine coexistence will depend on who can effectively engage the most receptive and innovative audience [68].
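Understands people and understands execution even better: Ant's trillion-parameter open-source model maxes out both emotional intelligence and agent capability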
量子位· 2026-02-19 01:35
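By 克雷西, from 凹非寺 | 量子位 (WeChat official account: QbitAI)
It has become harder to find a model that can both get work done and chat like a real person. AI seems to be growing more rational, but also less "human".
At this juncture, Ant's Ling (百灵) model family has launched a new trillion-parameter flagship model, Ling-2.5-1T. It is billed as a general-purpose all-rounder and is also an instant model that responds efficiently.
Specifically, Ling-2.5-1T combines strong agent execution with emotional intelligence and writing ability.
It also aims to show that a trillion-parameter heavyweight can be nimble: it does not need to "spin in thought" for a long time before producing a result and, crucially, it avoids padding, which saves tokens.
Compared with the previous generation and with today's mainstream large instant models, Ling-2.5-1T shows a clear advantage in complex reasoning and instruction following.
| | Benchmark | Evaluation Config | Ling-2.5-1T | Ling-1T | Domestic model A (non-thinking) | Domestic model B (non-thinking) | GPT-5.2-chat |
| --- | --- | --- | --- | --- | --- | --- | --- |
| | C-SimpleQA | Acc | ...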
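Robot dogs radically converted into giant pandas in an extreme 30 days! Inside the Spring Festival Gala's hundred-robot group-control performance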
量子位· 2026-02-18 13:09
Core Viewpoint
- The article highlights the advancements in China's embodied intelligence industry, showcasing the impressive performances of robots at the Spring Festival Gala, particularly focusing on the capabilities of the company Magic Atom and its robots [5][6][66].
Group 1: Robot Performances
- The Spring Festival Gala featured a remarkable display of robots, including a dual-arm robot performing a 360-degree spin and a hundred robotic pandas dancing in unison, demonstrating high levels of coordination and technical prowess [3][8][32].
- The MagicBot Z1 successfully executed complex movements, showcasing its hardware performance and motion control capabilities, which are critical for high-difficulty actions [8][10][15].
- The performance of a hundred robotic pandas, known as MagicDog, tested the company's group control and large-scale coordination abilities, emphasizing the complexity of managing multiple robots simultaneously [17][20][23].
Group 2: Technical Innovations
- The MagicBot Z1 features a self-developed joint system with 24 basic degrees of freedom, expandable to 49, allowing for a wide range of motion and stability during dynamic actions [11][12].
- The company has optimized its robots' performance through continuous testing and iteration, significantly enhancing stability in high-speed jumps and continuous rotations [13][14].
- The robots' ability to perform tasks like noodle pulling and serving drinks reflects their advanced sensory perception and autonomous planning capabilities, distinguishing them from competitors [39][40][46].
Group 3: Commercialization and Globalization
- Magic Atom has adopted a comprehensive self-research approach, achieving a 90% self-research rate in hardware, which reduces supply chain risks and manufacturing costs [50][52].
- Since launching sales in May last year, the company secured 500 million yuan in intention contracts and completed millions in actual deliveries, expanding its commercial footprint across various sectors [53][54].
- The company's international business accounted for over 30% of its operations last year, with significant engagement in 27 countries, indicating a robust global expansion strategy [55][57].
Group 4: Industry Impact
- The successful execution of complex robotic performances at a national event signifies a maturation of the underlying technology and systems in the embodied intelligence sector [61][62].
- The robots' ability to perform intricate tasks in real-world scenarios demonstrates their potential for delivering continuous value, marking a significant shift from experimental stages to practical applications [64][66].