量子位
Search documents
量子位编辑作者招聘
量子位· 2026-03-02 16:00
Core Viewpoint - The article emphasizes the ongoing AI boom and invites individuals to join the company "Quantum Bit," which focuses on tracking AI advancements and has established itself as a leading content platform in the industry [1]. Group 1: Job Opportunities - The company is hiring for three main directions: AI Industry, AI Finance, and AI Product, with positions available for both experienced professionals and fresh graduates [2][4]. - Positions are open for various levels, including editors, lead writers, and chief editors, with a focus on matching roles to individual capabilities [6]. Group 2: Job Responsibilities - **AI Industry Direction**: Responsibilities include tracking innovations in infrastructure, such as chips, AI infrastructure, and cloud computing, as well as interpreting technical reports from conferences [6][7]. - **AI Finance Direction**: Focuses on venture capital, financial reports, and capital movements within the AI industry, requiring strong analytical skills and a passion for interviews [11]. - **AI Product Direction**: Involves monitoring AI applications and hardware developments, producing in-depth evaluations of AI products, and engaging with industry experts [11]. Group 3: Benefits and Growth - Employees will have the opportunity to engage with cutting-edge AI technologies, enhance their work efficiency through new tools, and build personal influence in the AI field [6]. - The company offers competitive salaries, comprehensive benefits including social insurance, meal allowances, and performance bonuses, fostering a dynamic and open work environment [6]. Group 4: Company Growth Metrics - By 2025, Quantum Bit aims to have over 2.4 million subscribers on WeChat and more than 7 million users across platforms, with a daily reading volume exceeding 2 million [12]. - The company is recognized as the top new media outlet in the AI and frontier technology sector according to third-party data platforms [12].
中国机器人手机火爆MWC!老外快门按得根本停不下来
量子位· 2026-03-02 10:08
Core Viewpoint - The article discusses the emergence of a new type of smartphone, the "Robot Phone," which integrates a movable external gimbal camera, enhancing its capabilities for AI video calls and interactive photography, showcasing innovation at the MWC event [1][2][5]. Group 1: Product Features - The Robot Phone features a unique external gimbal camera that allows for full-angle AI video calls and real-time environmental understanding [2][6]. - It supports two modes, 90-degree and 180-degree, enabling users to capture images without adjusting the phone's position [11]. - The device incorporates a three-axis mechanical stabilization system combined with an AI stabilization engine, ensuring stable footage even during dynamic movements [12]. Group 2: Technological Innovations - The design of the Robot Phone includes a compact structure, reducing the size of micro motors and the 4DoF gimbal system by approximately 70% compared to mainstream solutions [8]. - The camera's movement mimics joint structures, allowing for versatile motion while maintaining a small form factor [9]. - The AI capabilities of the Robot Phone enable it to perform tasks such as voice interaction and content summarization locally on the device [22]. Group 3: Market Impact - The introduction of the Robot Phone has generated significant interest, with attendees at MWC expressing amazement and curiosity about its unique design [25][30]. - The product challenges traditional smartphone design by separating hardware capabilities, suggesting a new model where smartphones can work in conjunction with specialized devices [32][34]. - The innovation raises questions about potential collaborations between smartphone manufacturers and companies like DJI or Insta360 to create multifunctional devices [35].
硅谷开启预防式裁员!AI还没替代,先裁50%做准备,股价还能大涨17%
量子位· 2026-03-02 10:08
鹭羽 发自 凹非寺 量子位 | 公众号 QbitAI 这可能硅谷近期最值得关注的裁员事件。 一次性裁掉公司近一半员工,股价不跌反升,暴涨 17% 。 更让人不寒而栗的是,裁员并非业务收缩或AI已经取代岗位,而是硅谷从上至下的一次预判—— AI时代,企业将不会再需要那么多人。 而这次第一个动刀子的是推特 (现名) 的创始人 杰克·多西 ,他在社媒上广而告之Block的裁员通知: 做出这一决定并非Block遇到了麻烦,而是因为AI驱动下,小团队将比大团队更高效。而且,这种趋势正在迅速发展。 与此同时,市场的热烈反响也间接证明:硅谷大规模裁员,这才刚刚开始。 一次性裁员一半 除了,杰克·多西也是Block的联合创始人。他领导下的Block目前是全球金融科技领域的领头羊之一。 他在推文中宣布,将裁掉公司4000多名员工,约占总员工数量的 50% 。 这种裁员力度之大,在硅谷历史上都极为罕见。即便是亚马逊、微软这类科技巨头裁员,其裁员比例也极少单次高于25%。 至于他选择如此大刀阔斧裁员的理由,也并非常见的业务线收缩,而是 提前预见AI的强替代性 。 杰克·多西表示,目前公司业务发展势头向好,盈利能力不断提升。但随着A ...
西湖大学破解Rectified Flow反演不稳定难题,实现零成本稳定增强|ICLR'26
量子位· 2026-03-02 09:09
Core Insights - The article discusses the evolution of large-scale generative models in visual content production, shifting from "pure generation" to "controllable generation" and "intelligent editing" [1] - It highlights the need for models to possess stable and reliable inversion capabilities to accurately map input images back to their latent representation space [2] Group 1: Rectified Flow Model and Challenges - The Rectified Flow (RF) model is emerging as a significant alternative to diffusion models due to its efficient ODE inference and smooth generation trajectories [4] - However, RF faces fundamental numerical instability issues during the inversion phase, making it difficult to stably reconstruct input images for practical editing tasks [4][6] - Existing methods to improve inversion often rely on additional training or modifications, increasing engineering costs and affecting generalizability across different architectures [6] Group 2: Prox-Mean-Inversion (PMI) Introduction - The West Lake University research team proposed Prox-Mean-Inversion (PMI) to enhance the stability of RF inversion without altering the model structure or introducing extra parameters [6][7] - PMI incorporates a lightweight proximal correction step at each reverse ODE step, effectively suppressing trajectory deviation and divergence [8][9] - This plug-and-play mechanism allows for immediate integration into various RF models, providing significant stability improvements without additional training costs [6][12] Group 3: Mimic-CFG Strategy - The article introduces the mimic-CFG strategy to address the "identity crisis" during image editing, which can lead to loss of original features [14] - This strategy projects the guided velocity field onto the historical average direction, preserving essential structural information while responding to editing commands only in necessary dimensions [15][16] - The combination of PMI and mimic-CFG enhances both numerical stability and semantic controllability in RF models, improving the reliability of controllable editing tasks [19] Group 4: Experimental Results - The research team conducted comprehensive evaluations of PMI on the PIE-Bench dataset, demonstrating superior performance in image reconstruction tasks [20] - PMI achieved significantly higher PSNR (Peak Signal-to-Noise Ratio) and lower LPIPS (Learned Perceptual Image Patch Similarity) scores compared to traditional methods, indicating near-lossless restoration of original image details [21] - PMI's efficiency allows for faster convergence compared to optimization-based methods, making it a practical solution for real-world applications [22] Group 5: Conclusion and Future Outlook - PMI addresses the instability issues in RF models during inversion, providing a simple and effective solution that enhances reconstruction quality and reliability for subsequent editing tasks [26] - The method's plug-and-play nature allows for broad applicability across various RF models, showcasing its generalizability and engineering value [23] - Future developments in Flow models are expected to focus on efficiency and controllability, with stable inversion mechanisms like PMI playing a crucial role in advancing practical applications in video editing and multi-modal integration [27]
对话九合王啸:90%具身智能公司没未来,市场名额只有三五家
量子位· 2026-03-02 09:09
Core Viewpoint - The article discusses the investment landscape in the field of embodied intelligence, highlighting the influx of capital and the importance of discerning long-term trends in this emerging sector [1][2]. Group 1: Investment Philosophy and Strategy - Wang Xiao, a prominent investor, emphasizes the need for a balance between understanding technology and recognizing broader trends, advocating for a "fuzzy correctness" approach in investment decisions [12][14]. - The investment strategy focuses on early-stage technology investments, with a portfolio that includes various intelligent automation projects, such as robotics and autonomous driving [10][22]. - Wang Xiao believes that while there may be bubbles in valuations, they can drive industry progress, and he is cautious about excessive valuations in later-stage investments [32][33]. Group 2: Market Dynamics and Trends - The article notes a shift in the investment environment from a focus on consumer and internet sectors to a dominance of technology investments, particularly in the context of Chinese capital markets [16][90]. - The anticipated wave of IPOs in the embodied intelligence sector is seen as a positive development, potentially providing a feedback loop for growth in other tech areas [34][35]. - The article highlights the rapid development of embodied intelligence, comparing it to the early days of mobile internet, but suggests it has not yet reached a similar level of frenzy [40][41]. Group 3: Challenges and Opportunities - Wang Xiao identifies the challenge of distinguishing which companies will survive in a crowded market, suggesting that only a few will emerge as leaders [41][42]. - The article discusses the importance of technological paradigm shifts for new entrants to succeed, indicating that without such changes, competition will favor those with more resources [44][45]. - The potential for consumer markets in embodied intelligence is significant, with projections suggesting that even a small percentage of households could represent a vast market opportunity [51]. Group 4: Future Outlook - The article concludes with a focus on the evolving landscape of embodied intelligence, emphasizing the need for companies to adapt and innovate continuously to remain competitive [60][70]. - Wang Xiao expresses optimism about the future of technology and the importance of maintaining a forward-looking mindset in investment strategies [100][101].
量子位编辑作者招聘
量子位· 2026-03-02 09:09
Core Viewpoint - The article emphasizes the ongoing AI boom and invites individuals to join the company "Quantum Bit," which focuses on tracking AI advancements and has established itself as a leading content platform in the industry [1]. Group 1: Job Opportunities - The company is hiring for three main directions: AI Industry, AI Finance, and AI Product, with positions available for both experienced professionals and fresh graduates [2][4]. - Positions are open for various levels, including editors, lead writers, and chief editors, with a focus on matching roles to individual capabilities [6]. Group 2: Job Responsibilities - **AI Industry Direction**: Responsibilities include tracking innovations in infrastructure, such as chips, AI infrastructure, and cloud computing, as well as interpreting technical reports from conferences [6][7]. - **AI Finance Direction**: Focuses on venture capital, financial reports, and capital movements within the AI industry, requiring strong analytical skills and a passion for interviews [11]. - **AI Product Direction**: Involves monitoring AI applications and hardware developments, requiring a keen understanding of product experiences and market trends [11]. Group 3: Benefits and Growth - Employees can expect to engage with cutting-edge AI technologies, enhance their work efficiency through new tools, and build personal influence in the AI field [6]. - The company offers competitive salaries, comprehensive benefits, and a supportive environment for professional growth, including mentorship from senior editors [6][12]. Group 4: Company Impact - By 2025, Quantum Bit aims to have over 2.4 million subscribers on WeChat and more than 7 million users across platforms, with a daily reading volume exceeding 2 million [12]. - The company is recognized as the top new media outlet in the AI and frontier technology sector according to third-party data platforms [12].
让搜索Agent不「傻等」:人大团队依托扩散模型实现「一心二用」,边等搜索结果边思考,加速15%性能不减
量子位· 2026-03-02 09:09
Core Viewpoint - The article discusses the limitations of traditional search agents and introduces the concept of diffusion large language models (dLLM) as a potential solution to enhance search efficiency by allowing parallel reasoning and action during the search process [1][8][28]. Group 1: Limitations of Traditional Search Agents - Traditional search agents operate in a strictly serial manner, leading to delays as users wait for results before continuing their thought process [8]. - The current frameworks, such as ReAct, result in significant end-to-end time consumption due to this serial waiting [9]. - The article highlights that self-regressive models cannot perform parallel reasoning, which limits their efficiency in search tasks [10][16]. Group 2: Introduction of Diffusion Large Language Models (dLLM) - The dLLM can perform "dual-tasking," allowing it to think about the next steps while waiting for search results [5][11]. - Unlike traditional models, dLLMs utilize a non-sequential token generation process, enabling them to generate important parts of the output first [12][13]. - Initial tests of dLLMs as search agents showed poor performance, indicating that while they have potential, they require further training to be effective [14][16]. Group 3: Training Methodology for dLLM - The training process consists of two phases: Agentic SFT (Supervised Fine-Tuning) and Agentic VRPO (Variance-Reduced Preference Optimization) [18][20]. - The first phase involves generating high-quality search trajectories and ensuring the model learns to generate thoughts and tool calls without seeing the search results [19]. - The second phase focuses on refining the model's reasoning paths through preference learning, improving accuracy across various datasets [20]. Group 4: P-ReAct for Enhanced Efficiency - P-ReAct is introduced as a method to accelerate reasoning and tool calling without additional training [21][22]. - This method involves pre-filling boundary markers and adjusting confidence scores for tool calling areas, allowing the model to prioritize these calls [23][24]. - The implementation of P-ReAct resulted in significant improvements in response times and accuracy, demonstrating the effectiveness of the dLLM in search tasks [25][26]. Group 5: Performance and Implications - The dLLM-Searcher achieved an average accuracy of 57.0% on multiple benchmark datasets, surpassing traditional methods and showing strong generalization capabilities [25][27]. - The results indicate that dLLMs can match or exceed the reasoning capabilities of self-regressive models while leveraging their unique structural advantages [28]. - This advancement opens new avenues for optimizing search agent efficiency, suggesting a shift in how search tasks may be approached in the future [29].
OpenClaw最佳工具榜来了!这6款龙虾最受欢迎
量子位· 2026-03-02 04:53
Core Viewpoint - The article discusses the emergence of various ecological tools related to OpenClaw, highlighting a newly established directory that categorizes and ranks these tools for developers and users [1][2]. Group 1: OpenClaw Tools Directory - A third-party website, OpenClaw Directory, has gained recognition in developer communities, featuring 39 OpenClaw-related tools categorized by functionality [3][4]. - The directory allows users to filter tools by tags such as "essential," "customizable," and "open-source," and sort them by popularity, recency, ratings, and alphabetical order [4][5]. Group 2: Popular Tools Overview - The top six tools in the comprehensive ranking include: 1. **Claw for All**: A platform simplifying OpenClaw deployment and management for both developers and general users, accessible via web and mobile [9]. 2. **OpenClaw Launch**: A deployment tool enabling one-click setup of OpenClaw instances within 30 seconds, featuring a user-friendly interface and comprehensive support [11][12][14]. 3. **ClawTeam**: An AI Agent Teams tool that facilitates quick deployment and management of OpenClaw applications [14][15]. 4. **Vibeclaw**: Claims to run OpenClaw locally in a browser sandbox within one second, designed for developers needing rapid deployment [17][18]. 5. **Tinkerclaw**: A one-stop service platform for entrepreneurs to deploy and manage OpenClaw AI assistants without technical expertise [20][21]. 6. **ClawWrapper**: A tool aimed at simplifying the packaging and launching process of OpenClaw tools, offering customized deployment services and extensive support [22][23]. Group 3: Additional Resources - The OpenClaw Directory also features a blog section that provides comprehensive guides and tutorials, covering topics from basic introductions to advanced optimization strategies [26][27][30].
英伟达放弃GPU上LPU:新推理芯片被曝Groq即买即用,OpenAI第一个吃螃蟹
量子位· 2026-03-02 04:53
Core Viewpoint - Nvidia is set to unveil a new AI inference system at the upcoming GTC conference, featuring a chip optimized specifically for inference tasks, marking a significant architectural shift for the company [1][11]. Group 1: New Chip Development - The new chip's primary customer is OpenAI, which recently secured $110 billion in funding [2]. - This chip is based on the LPU (Language Processing Unit) architecture developed by the former Groq team, indicating Nvidia's first major integration of external architecture into its core AI computing products [5][6]. - The introduction of this chip is a direct result of Nvidia's $20 billion acquisition of Groq's core technology and team, showcasing a strategy of rapid deployment of mature solutions [7][8][9]. Group 2: Market Dynamics and Competition - The demand for inference solutions is rapidly increasing, prompting Nvidia to provide targeted solutions more quickly [17]. - Major clients like OpenAI are exploring more efficient inference alternatives, leading to partnerships with other chip companies [16][28]. - Competitors such as Cerebras and Amazon are enhancing their own inference architectures, with Cerebras claiming its chips can outperform Nvidia's GPUs in specific scenarios [31][40]. Group 3: Architectural Shift - The LPU architecture is designed to reduce latency and energy consumption by keeping data close to the processing unit, which is crucial for low-latency inference tasks [22][26]. - As AI applications evolve, the focus is shifting from training to inference, with inference becoming a more significant and frequent workload [24][29]. - Nvidia's move to incorporate LPU into its product line is a response to this shift, indicating a potential change in the company's computing focus [26][47]. Group 4: Future Prospects - Nvidia is expected to announce additional groundbreaking products at the GTC conference, including the new Rubin series GPUs and possibly the Feynman architecture chips [49][50].
13 vs 3,国产安全AI悄悄完成了对Claude的超越
量子位· 2026-03-02 03:28
Core Viewpoint - The article highlights the significant advancements of domestic AI security agents, particularly the HengNao security AI from Anheng Information, which has demonstrated superior capabilities in identifying and exploiting vulnerabilities compared to its international counterpart, Claude Code Security [2][3][15]. Group 1: Performance Comparison - HengNao security AI successfully replicated all three vulnerabilities identified by Claude Code Security and additionally discovered ten new 0day vulnerabilities, showcasing a "3+10" achievement [3][12][15]. - The replication process confirmed the accuracy of the vulnerabilities' exploitation conditions and impact range, aligning perfectly with the official disclosures [9][10]. Group 2: Technical Capabilities - HengNao's ability to identify vulnerabilities is attributed to its deep integration of AI capabilities with over a decade of security expertise and proprietary data from Anheng Information [19][20]. - The AI operates through a fully automated process, from code acquisition to vulnerability exploitation and report generation, enabling rapid and large-scale deep digging [21][22]. Group 3: Competitive Edge - HengNao's performance was validated in a competitive environment, where it ranked among the top three in the "Tianfu Cup" international cybersecurity competition, demonstrating its technical prowess against top human hackers [24][25]. - The article emphasizes that the advancements in AI security capabilities represent a qualitative leap from merely discovering known vulnerabilities to uncovering unknown risks through deep code reasoning and logical analysis [15][16]. Group 4: Future Directions - Anheng Information plans to leverage HengNao as a core engine to enhance proactive threat hunting and automated penetration testing services, aiming to transition from traditional manual security services to intelligent and automated solutions [26].