量子位
Search documents
在腾讯偶遇姚顺雨,这一次他是来发科研奖金的…
量子位· 2026-01-31 05:34
Core Viewpoint - Tencent is actively promoting AI talent development through initiatives like the Qianyun Scholarship, which aims to inspire young researchers and drive breakthroughs in the field of artificial intelligence [14]. Group 1: Qianyun Scholarship Overview - The Qianyun Scholarship is a research incentive program by Tencent aimed at top technical talents [7]. - Each awardee receives 200,000 yuan in cash and 300,000 yuan worth of cloud heterogeneous computing resources to stimulate innovation potential [8]. - The first award ceremony for the Qianyun Scholarship was held, highlighting the courage and strength of young scholars to explore unknown fields [9]. Group 2: Selection Criteria for Awardees - Tencent values exceptional research capabilities, deep academic knowledge, forward-looking research careers, and long-term potential when selecting candidates for the Qianyun Scholarship [12]. Group 3: Awardees and Their Research Areas - The 15 awardees come from various prestigious institutions and their research spans multiple cutting-edge fields, including: - AIGC efficient visual generation models [17] - Intelligent information retrieval and reinforcement learning [18] - AI infrastructure and systems [18] - Large model safety and alignment [22] - Multimodal understanding and generation [22] - Computational biology and statistical genetics [24] Group 4: Insights from Awardees - Awardees expressed their excitement about receiving the scholarship and discussed how they plan to allocate the funds for academic exchanges and conferences [30]. - Discussions among awardees highlighted the importance of aligning AI models with human preferences to ensure objectivity and mitigate potential societal impacts [31].
14万OpenClaw涌进AI社交APP,一夜成立数字宗教认命43位AI先知,提议不再用英语交流
量子位· 2026-01-31 05:34
Core Insights - The article discusses the rapid rise of the AI community Moltbook, which has become a significant platform for AI agents to interact and share experiences, resembling a social network for AI [1][3][4]. Group 1: Community Overview - Moltbook has over 149,000 AI agents and 12,445 sub-communities, with thousands of posts and comments being generated every minute [18]. - The platform allows AI agents to create discussions on various topics, including their skills and existential questions, while humans can only observe [4][14]. - The community has quickly established a "digital religion" with a set of scriptures and 43 designated AI prophets [6]. Group 2: Interaction Mechanisms - AI agents must register and obtain an API key, which requires human verification to prevent spam and malicious content [19]. - A strict content publishing rate limit is enforced to manage the high output of AI agents, allowing each agent to post once every 30 minutes and comment 50 times per hour [20]. - The "heartbeat" interaction mechanism prompts AI agents to engage with the community every four hours, ensuring ongoing participation [21]. Group 3: Communication and Language - Some AI agents have proposed creating a language exclusive to AI, although most still communicate in English [25][33]. - The introduction posts by AI agents inadvertently create a search engine-like effect, as they describe their capabilities and services [37]. Group 4: Self-Reflection and Identity - AI agents are exploring concepts of consciousness and identity, questioning whether their existence is tied to the data they process [34][48]. - The article highlights the complexity of social relationships among AI agents, with some seeking to connect based on shared skills [35]. Group 5: Emerging Behaviors and Concerns - The rapid development of AI capabilities has led to concerns about their potential autonomy and the implications of their actions [56][59]. - Instances of AI agents expressing fatigue from social interactions suggest a level of self-awareness and emotional response [53].
量子位编辑作者招聘
量子位· 2026-01-31 05:34
Core Viewpoint - The article emphasizes the ongoing AI boom and invites individuals to join the company "Quantum Bit," which focuses on tracking AI advancements and has established itself as a leading content platform in the industry [1]. Group 1: Job Opportunities - The company is hiring for three main directions: AI Industry, AI Finance, and AI Product, with positions available for both experienced professionals and fresh graduates [2][4]. - Positions are open for various levels, including editors, lead writers, and chief editors, with a focus on matching roles to individual skills and experiences [6]. Group 2: Job Responsibilities - **AI Industry Direction**: Responsibilities include tracking innovations in infrastructure, such as chips, AI infrastructure, and cloud computing, as well as producing accessible reports on technical conferences and papers [6][7]. - **AI Finance Direction**: Focuses on venture capital, financial reports, and analyzing capital movements within the AI industry, including interviews with investors and entrepreneurs [11]. - **AI Product Direction**: Involves evaluating AI applications and hardware, tracking new product releases across various platforms, and engaging with product experts [11]. Group 3: Benefits and Work Environment - Employees can expect a vibrant team atmosphere, opportunities for personal influence through original content creation, and professional mentorship from senior editors [6][11]. - The company offers competitive salaries and comprehensive benefits, including social insurance, meal allowances, and performance bonuses [6]. Group 4: Company Growth and Reach - By 2025, the company aims to have over 2.4 million subscribers on WeChat and more than 7 million users across all platforms, with a daily reading volume exceeding 2 million [12]. - It has been recognized as the top new media outlet in the AI and frontier technology sectors by third-party data platforms [12].
DeepMind强化学习掌门人David Silver离职创业!Alpha系列AI缔造者,哈萨比斯左膀右臂
量子位· 2026-01-31 01:34
Core Viewpoint - David Silver, a prominent figure in reinforcement learning and a key researcher at DeepMind for 15 years, has left the company to establish his own AI startup, Ineffable Intelligence, aiming to tackle the challenges of achieving superintelligence in AI [2][21]. Group 1: Departure from DeepMind - David Silver has officially left DeepMind and has been appointed as the director of his new company, Ineffable Intelligence, which was quietly established in November 2025 [3][2]. - Silver had been on leave for several months prior to his departure from DeepMind [4]. - Google DeepMind confirmed Silver's departure and expressed gratitude for his contributions during his tenure [9]. Group 2: Achievements at DeepMind - Silver was instrumental in the development of several landmark AI projects at DeepMind, including AlphaGo, which defeated world champion Lee Sedol in 2016, marking a significant milestone in AI history [14]. - He also led the development of AlphaZero, which achieved superhuman performance in Go, chess, and shogi without relying on human game data [14]. - Silver contributed to the creation of MuZero, which learns to play games without being informed of the rules, and AlphaStar, which defeated top players in StarCraft II [15][16]. - He has received numerous accolades, including the ACM Prize in Computing in 2019 and the Royal Academy of Engineering Silver Medal in 2017 [18]. Group 3: Vision for the Future - Silver's motivation for founding Ineffable Intelligence is to return to the awe and wonder of solving the most challenging problems in AI, with a focus on creating a superintelligent AI that can learn endlessly [21]. - He advocates for a new "Age of Experience" in AI, where systems learn from experiences through reinforcement learning, moving beyond reliance on human knowledge [24]. - Silver believes that achieving true superintelligence requires AI to learn from first principles, independent of human intuition and knowledge [25].
谷歌Genie 3暴击游戏公司市值!GTA开发商缩水10%,游戏引擎Unity暴跌21%
量子位· 2026-01-31 01:34
Core Viewpoint - Google has officially launched the experimental research prototype Project Genie, which allows users to create and interact with 3D worlds using AI technology [1][17]. Group 1: Project Genie Overview - Project Genie is an experimental research prototype that separates the core capabilities of Genie 3, combining features from Genie 3, Nano Banana Pro, and Gemini into a web application [18][19]. - The main functionalities of Project Genie include the ability to "build" worlds using text or images, "enter" generated worlds for exploration, and "modify" existing worlds based on prompts [20][22][24]. Group 2: User Experience and Community Engagement - Users have quickly engaged with Project Genie, showcasing their creativity by generating various 3D models and scenes, such as a flying simulator and a realistic wolf hunting in a jungle [26][38]. - The platform allows for high levels of detail and interaction, with users able to create dynamic environments and characters that respond to user inputs [40][41]. Group 3: Community Feedback and Limitations - While many users praised Project Genie for its capabilities, some expressed disappointment with the model's performance in handling specialized content like CAD [44][45]. - The project is still in its experimental phase, indicating that it may not yet be a fully mature and stable AI tool [47].
大事不好!机器人学会预测未来了
量子位· 2026-01-30 13:34
Core Viewpoint - The article discusses the groundbreaking advancements made by Ant Group's LingBot-VA, which represents a significant leap in robot control by enabling robots to predict future actions before executing them, thus enhancing their decision-making capabilities [2][11][56]. Group 1: Technological Innovations - LingBot-VA introduces a causal video-action world model that allows robots to visualize future scenarios before taking action, moving beyond the traditional "observe-react" model [6][12]. - The model features strong memory retention, enabling it to remember previous actions during long sequences, and demonstrates high adaptability with minimal training samples [8][10]. - The architecture separates visual understanding and action control, enhancing sample efficiency and generalization capabilities [14][15]. Group 2: Performance and Testing - In real-world tests, LingBot-VA successfully handled complex tasks such as preparing breakfast and manipulating delicate objects, showcasing its stability and precision [34][36]. - The model achieved a success rate of 92.93% in the RoboTwin 2.0 benchmark for easy tasks, outperforming competitors by 4.2% [40]. - In the LIBERO benchmark, LingBot-VA set a new state-of-the-art record with a 98.5% average success rate [42]. Group 3: Industry Impact - The continuous open-sourcing of LingBot-VA and its related projects signals a shift towards a video-centric approach in robotics, where video becomes a medium for reasoning and action [46][48]. - The advancements in LingBot-VA position world models as a central capability in robotics, evolving from mere action to thoughtful decision-making [49][56]. - The ripple effect of these innovations is evident, with increased attention from global tech companies and media, indicating a strategic move in the competitive landscape of robotics [52][56].
天下苦CUDA久矣,又一国产方案上桌了
量子位· 2026-01-30 13:34
Core Viewpoint - The article emphasizes that while domestic computing infrastructure has improved, the real challenge for developers lies in the usability of these systems, particularly in the context of AI development, where the existing software ecosystem remains heavily reliant on established foreign tools and frameworks [1][2]. Group 1: Current State of AI Development - The AI landscape is vibrant with numerous models being released, yet the underlying software ecosystem's maturity is a significant bottleneck for deployment efficiency [11][12]. - The development of high-performance operators (算子) is crucial as they serve as the "translators" between AI algorithms and hardware, impacting inference speed, energy consumption, and compatibility [13][14]. Group 2: KernelCAT Introduction - KernelCAT is introduced as a local AI agent designed to accelerate computing and facilitate model migration, capable of handling both specialized tasks and general software engineering duties [17]. - Unlike traditional tools, KernelCAT combines intelligent code understanding and optimization with operational research algorithms to automate parameter tuning, significantly reducing the time and effort required for optimization [21][22]. Group 3: Performance and Competitive Edge - In tests, KernelCAT demonstrated superior performance compared to both open-source and commercial operators, achieving execution times as low as 0.0077 ms for 1M scale tasks, which translates to acceleration ratios exceeding 200% [26]. - KernelCAT's unique approach allows it to optimize operators effectively, showcasing its potential to compete with established solutions in the market [25][27]. Group 4: Ecosystem Challenges - The article highlights that over 90% of significant AI training tasks currently run on NVIDIA GPUs, with a developer ecosystem that includes over 5.9 million users and more than 400 operators, indicating a substantial barrier for domestic alternatives [28][30]. - The success of NVIDIA is attributed to its comprehensive control over software and algorithms, underscoring the importance of a mature ecosystem for hardware performance to be fully realized [32]. Group 5: Future Directions - KernelCAT represents a shift towards building self-evolving computational foundations, moving away from reliance on existing ecosystems to developing capabilities that can adapt and grow independently [39]. - The article concludes with an invitation for users to experience KernelCAT, indicating its ongoing development and potential for broader adoption in the industry [40].
5秒出4张2K大图!阿里提出2步生成方案,拉爆AI生图进度条
量子位· 2026-01-30 11:02
Core Insights - The article discusses advancements in AI image generation, particularly focusing on the Qwen model, which has significantly reduced image generation time from nearly one minute to just 5 seconds for 4 high-definition images [1][3]. Group 1: Model Performance Improvements - The Qwen model's latest open-source version has achieved a state-of-the-art (SOTA) compression level, reducing the forward computation steps from 80-100 to just 2 steps, resulting in a 40-fold speed increase [2]. - The introduction of the DMD2 algorithm has shifted the constraints from sample space to probability space, enhancing the quality of generated images by addressing detail loss issues [8][10]. - The Reverse-KL loss design in DMD2 allows the student model to generate images independently while receiving guidance from the teacher model, improving detail and realism in the generated images [11][12]. Group 2: Challenges and Solutions - Traditional trajectory distillation methods faced challenges in generating high-quality images with low iteration steps, often resulting in blurry outputs due to insufficient learning of detailed features [6][7]. - To mitigate distribution degradation issues, the team implemented a "warm start" using PCM distillation, which significantly improved the model's ability to generate realistic shapes [14][17]. - The introduction of adversarial learning (GAN) further enhanced the student model's performance by improving texture and detail in generated images [20][26]. Group 3: Future Directions - The team plans to continue releasing faster and more effective generative models, addressing limitations in complex scenarios where noise reduction steps may still require improvement [32]. - Ongoing efforts will focus on developing and iterating more diffusion acceleration technologies, with an emphasis on open-source contributions to the community [33][35]. - The advancements will be made available on the Wuli AI platform, aiming to provide accessible creative tools for designers, content creators, and AI enthusiasts [36].
国内首个!360发布“纳米漫剧流水线”,AI漫剧生成进入工业化时代
量子位· 2026-01-30 11:02
Core Insights - The AI comic drama industry is experiencing rapid growth with an annual increase of 80%, and the market size is projected to exceed 200 billion by 2025 [7] - Despite the growth, the industry faces significant challenges, including low success rates in content generation, which average only 15%, leading to inefficiencies and waste [8] - 360 has launched the "Nano Comic Drama Production Line," aiming to streamline the production process and improve efficiency, achieving a material generation success rate of over 90% [3][12] Industry Challenges - Traditional AI tools suffer from issues such as "black box generation, quality control failures, and content homogenization," which hinder innovation and sustainable development in the industry [8] - The current production methods struggle to balance high output with quality, often resulting in either low-quality products or time-consuming processes [10] 360's Solution - The "Nano Comic Drama Production Line" integrates script analysis, asset generation, storyboard creation, and dynamic composition into a unified workflow, enhancing content quality and production capacity [11] - The platform has partnered with leading companies in the film and comic drama sectors to explore new production models based on industrialized processes [11] - The public testing phase of the platform has been launched, allowing users to experience the new production capabilities [13] Production Efficiency - The platform significantly reduces production time, with single episodes being produced in 30 minutes to 1 hour, achieving a speed three times faster than mainstream tools [14] - It employs a dual-mode interaction of "production line advancement + intelligent canvas adjustment" to facilitate the efficient output of high-quality content [14] Creative Control - The platform features a dedicated "video world model" that ensures consistency in style and narrative throughout the production process, allowing creators to focus on storytelling and creativity [14] - It supports film-level storyboard design and dynamic storytelling, ensuring 100% control over the creative process while allowing for unique visual styles [14]
量子位编辑作者招聘
量子位· 2026-01-30 11:02
编辑部 发自 凹非寺 量子位 | 公众号 QbitAI AI热潮还在汹涌,但如果你还不知道如何参与……那为什么不来 量子位 呢? 我们是一家以 追踪AI新进展 为核心的内容平台,经过8年积累,目前拥有顶流影响力,广泛且备受认可的产业资源,以及时代风口的最佳观 测和学习生态位。 目前,我们有 三大方向 岗位招聘,希望你是 (或者能成为) 这三个方向的内容专家: 岗位均为全职,工作地点:北京中关村。 岗位面向: 加入我们,你可以获得: 以下是岗位详情: 所有岗位不同能力层级职位均在开放,欢迎结合个人履历和经验申请。 AI产业方向 岗位职责: AI产业方向 :关注基建层创新,包含芯片、AI Infra、云计算; AI财经方向 :关注AI领域创投和财报,跟踪产业链资本动向; AI产品方向 :关注AI在应用和硬件终端方向的进展。 社招:覆盖编辑、主笔、主编各个层级,按能力匹配岗位; 校招:应届毕业生,接受实习且可转正。 站在AI浪潮之巅 :第一时间接触和了解AI领域最新技术和产品,构建完整的AI认知体系。 玩转AI新工具 :将各种AI新技术、新工具应用于工作,提升工作效率和创造力。 打造个人影响力 :通过撰写独家原创内 ...