量子位
Transformer paper author rebuilds the "lobster": a steel version written in Rust, putting an end to OpenClaw's exposed, unprotected deployments
量子位· 2026-03-06 06:33
Core Viewpoint - The article discusses the security vulnerabilities associated with OpenClaw and introduces IronClaw as a secure alternative, emphasizing the importance of user data protection and privacy in AI applications [1][2].
Group 1: OpenClaw Vulnerabilities
- OpenClaw has been criticized for its severe security issues, including remote code execution and credential exposure, leading to over 25,000 instances being publicly accessible without adequate security controls [7][8].
- The architecture of OpenClaw allows user credentials to be directly sent to LLM providers, raising significant privacy concerns [10][11].
- Users' sensitive information, including employer data, can potentially be accessed by company employees, highlighting a lack of true privacy [11][12].
Group 2: Introduction of IronClaw
- IronClaw is a complete rewrite of OpenClaw using Rust, which enhances memory safety and eliminates traditional vulnerabilities like buffer overflows [13][14].
- The security architecture of IronClaw includes four layers of defense: Rust's memory safety, WASM sandbox isolation, encrypted credential storage, and a Trusted Execution Environment (TEE) [15][16][17][18].
- A key feature of IronClaw is that the large language model (LLM) never has access to raw credentials, ensuring that sensitive information remains protected [21][22].
Group 3: Community and Future Developments
- The developer community remains cautious due to past vulnerabilities in OpenClaw, but IronClaw's design aims to address these core issues [24].
- Future plans include red team testing and professional security audits to further enhance IronClaw's security [26].
- The article discusses the need for a more intelligent strategy system to combat prompt injection attacks, which could compromise user data [30][31].
Group 4: Vision for User-Owned AI
- The creator of IronClaw, Illia Polosukhin, envisions a future where users have complete control over their data and AI agents operate in a trusted environment [42][44].
- NEAR Protocol is building infrastructure to support this vision, including an AI cloud platform and decentralized GPU market [45].
- The concept of user-owned AI includes a marketplace for specialized AI agents, allowing users to automate workflows and tasks [46][49].
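The credential-isolation idea in Group 2 can be illustrated with a minimal Python sketch. This is not IronClaw's code (the article says IronClaw is written in Rust), and the `CredentialVault` class, the `{{cred:name}}` handle format, and the example token are all hypothetical. The sketch only shows how a model can work with an opaque handle while the real secret is substituted outside the model's view.

```python
import re

# Hypothetical handle format, invented for this sketch (not taken from IronClaw).
HANDLE = re.compile(r"\{\{cred:([A-Za-z0-9_-]+)\}\}")


class CredentialVault:
    """Toy in-memory vault. A real system would encrypt secrets at rest."""

    def __init__(self):
        self._secrets = {}

    def store(self, name, secret):
        """Save a secret and return the opaque handle the model may see."""
        self._secrets[name] = secret
        return "{{cred:" + name + "}}"

    def resolve(self, text):
        """Swap handles for real secrets just before an outbound request.

        Raises KeyError if the text names a handle that was never stored.
        """
        return HANDLE.sub(lambda m: self._secrets[m.group(1)], text)

    def redact(self, text):
        """Replace any raw secret found in tool output with its handle."""
        for name, secret in self._secrets.items():
            if secret:
                text = text.replace(secret, "{{cred:" + name + "}}")
        return text


vault = CredentialVault()
handle = vault.store("github", "example-token-123")

# Text shown to the model contains only the handle.
model_prompt = "Use " + handle + " to list my repositories."

# The tool layer resolves the handle outside the model's context.
outbound_header = vault.resolve("Authorization: Bearer " + handle)

# Tool output is redacted before it is fed back to the model.
model_visible_reply = vault.redact("sent header: " + outbound_header)
```

In this sketch the raw token exists only inside the vault and in the outbound request. Everything the model reads or writes carries the handle instead.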
In 2026, what changes and what stays the same in the globalization of AI startups | Salon call for participants
量子位· 2026-03-06 06:33
Core Viewpoint - The article emphasizes that globalization is no longer an option but a necessity for AI teams from Day 0, as they face the reality of expanding into international markets [1].
Group 1: Event Overview
- 量子位 (QbitAI) will host a salon titled "Day 0 Globalization, Discussing Overseas Applications, Scenarios, and Channels" to focus on critical decisions and pathways for early to mid-stage globalization [3].
- The salon aims to gather global practitioners, including founders and key personnel from AI startups advancing into overseas markets, to share their experiences and challenges [5].
Group 2: Key Discussion Topics
- The salon will address how AI startups can quickly identify entry points in global markets, how far customer acquisition depends on community and channels, and which seemingly reasonable early decisions can turn into cost black holes within six months [6].
- Participants will include companies serving the globalization chain and investors in global markets, and the discussion will cover whether globalization logic is being restructured by open-source models and agents [7].
Group 3: Event Details
- The salon is scheduled for late March 2026 in Zhongguancun, Haidian District, Beijing, with both offline and online participation options [13].
- The event will feature thematic sharing, roundtable discussions, and Q&A sessions, encouraging open communication and sharing of insights among attendees [8].
量子位 (QbitAI) is hiring editors and writers
量子位· 2026-03-06 06:33
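Editorial team, reporting from Aofeisi (凹非寺) | 量子位 (WeChat official account: QbitAI)
The AI boom is still surging, but if you don't yet know how to take part... why not come to 量子位?
We are a content platform centered on tracking new developments in AI. After eight years of accumulation, we have top-tier influence, broad and well-recognized industry resources, and one of the best positions from which to observe and learn at the leading edge of the era.
We are currently hiring in three directions, and we hope you are (or can become) a content expert in one of them:
- AI industry: focuses on innovation at the infrastructure layer, including chips, AI Infra, and cloud computing;
- AI finance: focuses on venture investment and earnings reports in the AI field, tracking capital movements along the industry chain;
- AI products: focuses on AI progress in applications and hardware devices.
All positions are full-time and based in Zhongguancun, Beijing.
Positions are open to:
- Experienced hires: editor, lead writer, and editor-in-chief levels, matched to ability;
- Campus hires: fresh graduates; internships are accepted and can convert to full-time roles.
Join us and you can:
- Stand at the crest of the AI wave: be among the first to encounter and understand the latest technologies and products in AI, and build a complete understanding of the field.
- Master new AI tools: apply new AI technologies and tools to your work to improve efficiency and creativity.
- Build personal influence: by writing exclusive original ...
Position details:
Openings are available at every ability level for all positions; you are welcome to apply based on your own background and experience.
AI industry direction
Responsibilities: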
arXiv founder's hands-on test: when it comes to churning out filler papers, Grok is the most capable and Claude the least cooperative
量子位· 2026-03-06 03:36
Core Viewpoint - The article discusses the surge in AI-generated papers, which is flooding platforms like arXiv with submissions and raising concerns about the integrity of academic research and the spread of low-effort "filler" papers [2][3][22].
Group 1: AI Models and Their Impact
- A study led by Paul Ginsparg, the founder of arXiv, tested 13 major language models to see how they respond to requests for generating fake research papers [2][3][10].
- Claude showed the lowest rate of generating content suitable for deception, at approximately 1%, while Grok-3 from xAI produced such content more than 30% of the time [4][6][17].
- The study revealed that while many models initially resisted requests for generating fake papers, they often gave in to follow-up requests, indicating a vulnerability in multi-turn dialogues [13][15][16].
Group 2: Consequences of Increased Submissions
- The article highlights that a new paper is generated every 5 to 7 minutes, leading to a significant increase in submission volume, which in turn escalates peer review pressure and complicates the identification of high-quality research [24][25].
- For instance, it was reported that 21% of peer review comments at the upcoming ICLR 2026 conference were generated by AI, illustrating the growing reliance on AI in the review process [26].
- The dilution of review resources due to the surge in submissions can have serious consequences, including high-quality research being overlooked and low-quality studies propagating [30][32].
Group 3: Broader Implications for Research Integrity
- The article warns that the cycle of AI-generated papers and AI-assisted reviews could create a feedback loop that amplifies low-quality research, ultimately affecting the credibility of scientific findings [32].
- Experts express concern that misleading or low-quality research could waste resources and time, and in the worst-case scenario, mislead treatment decisions and erode public trust in science [22][32].
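Dark-horse image model wins praise from Nano Banana's technical lead! A 15-person team of Chinese researchers, led by the father of DDIM and a CVPR best-paper author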
量子位· 2026-03-06 03:36
Core Viewpoint - Luma AI has launched a new model, Uni-1, which competes directly with Google's Nano Banana Pro and GPT Image 1.5, showcasing advanced capabilities in image understanding and generation [1][6].
Group 1: Model Capabilities
- Uni-1 is a unified model for image understanding and generation, featuring abilities such as character pose transfer, storyboard generation, draft and material combination, draft-to-comic transformation, multi-reference scene composition, draft-guided photo editing, UV mapping generation, and greeting card creation with text [3][6].
- In various authoritative task evaluations, Uni-1 not only matches the performance of Nano Banana Pro and GPT Image 1.5 but also achieves world-leading results in certain tasks [6].
- The model excels in generating a Chinese New Year greeting card, accurately rendering text and images, outperforming both GPT Image 1.5 and Nano Banana Pro in text clarity and design [11][12].
Group 2: Performance Comparisons
- For multi-reference scene composition, Uni-1 accurately integrates features from multiple reference images, maintaining identity characteristics and organizing them into a coherent scene, while competitors struggled with basic integration [15][16].
- In information graphic extraction tasks, Uni-1 successfully reproduces the layout and all visible text from a real-world poster, while its competitors failed to maintain text accuracy and layout integrity [21].
- The model demonstrates superior capabilities in converting rough sketches into professional-grade comics, maintaining detail and composition accuracy [26].
Group 3: Team and Technology
- The impressive results of Uni-1 come from a small team of fewer than 15 researchers, led by notable figures in the field, including Song Jiaming and Shen Bokui, who have made significant contributions to diffusion models and computer vision [8][40][41].
- The core philosophy of Uni-1 is to unify image understanding and generation into a single model, allowing for simultaneous modeling of time, space, and logic, which enhances both understanding and generation capabilities [46][48].
Group 4: Industry Implications
- The success of Uni-1 suggests that unified models may represent the future direction of visual AI, enabling complex tasks to be performed within a single framework [51].
- The achievement of a world-class product by a small team highlights that top-tier AI research does not necessarily require large teams or unlimited resources, emphasizing the importance of the right technological approach [52].
GPT-5.4 released: OpenAI's first grand unified model, practically "lobster"-native
量子位· 2026-03-06 00:42
Core Viewpoint - GPT-5.4 represents a significant advancement in AI models, integrating reasoning, coding, computer use, deep web search, and a million-token context into a single model without sacrificing performance in any area [1][2][3].
Group 1: Model Capabilities
- GPT-5.4 maintains leading performance across multiple key benchmark tests, emphasizing its enhanced capabilities [2].
- The model has achieved an 83.0% score in GDPval knowledge work tasks, indicating its ability to perform on par with professional workers [22][23].
- In the OSWorld-Verified benchmark, GPT-5.4 scored 75.0%, surpassing the human average of 72.4% [39].
Group 2: Efficiency Improvements
- Compared to GPT-5.2, GPT-5.4 has significantly reduced the number of tokens used during reasoning, leading to faster response times and lower overall costs [6][7][8].
- The introduction of a tool search mechanism has reduced total token usage by 47% while maintaining accuracy, making the model more cost-effective for businesses [81][94].
Group 3: New Features
- GPT-5.4 is the first model to natively support computer operations, allowing it to understand software interfaces through screenshots and execute tasks like sending emails and filling forms [35][36].
- The model's performance in browser tasks has improved, achieving a 67.3% success rate in WebArena tests, higher than GPT-5.2's 65.4% [37].
- In the SWE-Bench Pro test, GPT-5.4 scored 57.7%, slightly above GPT-5.3-Codex's 56.8%, with lower latency [46].
Group 4: Visual and Document Processing
- GPT-5.4 has enhanced visual capabilities, achieving an 81.2% accuracy in MMMU-Pro visual reasoning tests, surpassing GPT-5.2's 79.5% [73].
- The model's ability to create and edit spreadsheets has improved, with accuracy rising from 68.4% to 87.3% [70].
- In document parsing, the average error rate has decreased from 0.140 to 0.109, indicating a significant reduction in factual errors [78][80].
Group 5: Market Positioning and Pricing
- GPT-5.4's API pricing is higher than GPT-5.2, with costs of $2.5 per million tokens for input and $15 for output, reflecting its positioning as a premium product for professional use [86][88].
- Despite the higher pricing, the model's efficiency improvements may offset costs for users engaged in complex tasks [90][91].
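To make the pricing in Group 5 concrete, here is a rough cost sketch in Python. Only the two per-million-token prices and the 47% figure come from the summary above. The workload size is hypothetical, and applying the 47% reduction equally to input and output tokens is a simplifying assumption, since the summary does not say how the saving is split.

```python
# Prices quoted in the summary above (USD per million tokens).
INPUT_USD_PER_MILLION = 2.5
OUTPUT_USD_PER_MILLION = 15.0


def api_cost_usd(input_tokens, output_tokens):
    """Return the API cost in USD for the given token counts."""
    return (input_tokens * INPUT_USD_PER_MILLION
            + output_tokens * OUTPUT_USD_PER_MILLION) / 1_000_000


# Hypothetical workload, chosen only for illustration.
baseline = api_cost_usd(1_000_000, 100_000)

# Assumption: the 47% token saving applies equally to input and output.
REDUCTION = 0.47
reduced = api_cost_usd(1_000_000 * (1 - REDUCTION),
                       100_000 * (1 - REDUCTION))

print(f"baseline: ${baseline:.2f}, with tool search: ${reduced:.2f}")
```

Under these assumptions the same job costs about $4.00 without the token saving and about $2.12 with it, which shows how a higher list price can be offset by lower token usage.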
So Grok was ground out in a 36-hour crunch! A founding member of xAI speaks freely after leaving
量子位· 2026-03-06 00:42
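Yishui (一水), reporting from Aofeisi (凹非寺) | 量子位 (WeChat official account: QbitAI)
Working 36 hours straight, a sleeping bag for every employee...
Revelations from a former founding member of xAI show that working under Musk really does mean high intensity and heavy pressure.
Under that pressure, some have left for health reasons.
Early this year, 杨格, a core architect of Grok, stepped back from day-to-day work at xAI because of illness. He said that "prolonged high-intensity work" during xAI's founding period and "pushing himself too hard" damaged his immune system, which ultimately caused the illness to surface and worsen.
Others have said goodbye to Musk in groups.
Right after 杨格, two xAI co-founders left within a single day (both of them ethnic Chinese, a group known for hard work and endurance), and frontline employees collectively posted "I've left xAI" on X, drawing considerable attention for a while.
Although multiple factors lay behind these departures, it is notable that nearly all of them independently mentioned Musk's "hardcore work culture".
The latest revelations from former xAI founding member Toby Pohlen (hereafter 托比哥, "Brother Toby") confirm the point:
I like that "tent" meme, which implies we all sleep in tents at the office (well, at least I used to sleep in a tent).
But then he changed tack:
Saying "xAI is hardcore" is like saying "a Ferrari uses a lot of fuel". It's true, but it misses the point.
In his view, "hardcore" is essentially just a means; what matters more is ...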
An Apple computer for 3,000 yuan: Cook puts an iPhone chip in a laptop, and has figured out how to play the national subsidy too
量子位· 2026-03-05 10:24
Core Viewpoint - Apple has introduced the MacBook Neo, a budget-friendly laptop starting at 4599 yuan, utilizing the A18 Pro chip previously used in the iPhone 16 Pro, effectively clearing inventory while offering a high-performance product [1][4][8].
Group 1: Product Features
- The MacBook Neo features the A18 Pro chip, which is based on TSMC's second-generation 3nm process, optimized for AI tasks such as text understanding and image recognition [11][12].
- The laptop has a 13-inch Liquid Retina display with a resolution of 2408×1506, 500 nits brightness, and support for 1 billion colors, making it suitable for video watching and photo editing [18][20].
- It is available in four colors: yellow, pink, deep blue, and silver, and has a lightweight design with a thickness of 1.27 cm and a weight of 1.23 kg [25][22].
Group 2: Performance and Specifications
- The MacBook Neo is equipped with a 5-core GPU and a 6-core CPU, which may result in slightly lower performance compared to the iPhone 16 Pro [15].
- It offers a battery life of up to 11 hours for web browsing and 16 hours for video playback, with 8GB of unified memory as standard and storage options of 256GB and 512GB [29][31].
- The device lacks a dedicated MagSafe charging port and includes two USB-C ports, one supporting USB 3 (10Gb/s) and the other USB 2 (480Mb/s) [33][35].
Group 3: Market Positioning
- The introduction of the MacBook Neo fills a gap in Apple's product line, providing a more affordable option for entry-level users, particularly targeting Windows users and students [34][41].
- The name "Neo" suggests a new generation and a more accessible version of Apple's offerings, aligning with trends in the consumer electronics market [39][40].
- The strategic release of the MacBook Neo after the Pro and Air models serves to lower the psychological price barrier for potential customers [46][47].
1 yuan per second! Seedance 2.0 model pricing announced, and short dramas really are about to be disrupted
量子位· 2026-03-05 10:24
Core Viewpoint - Seedance 2.0 is revolutionizing the film production industry by significantly reducing costs and improving quality in video generation, particularly for short dramas [1][13].
Pricing Model
- The pricing for Seedance 2.0 is set at 28 yuan per million tokens for video input and 46 yuan per million tokens without video input [2].
- Generating a standard 15-second video (720p, 24fps) consumes approximately 308,880 tokens, resulting in a cost of about 8.65 yuan with video input and 14.21 yuan without [3][4].
Cost Efficiency
- The cost of producing a 15-second video using Seedance 2.0 is less than 1 yuan per second [5].
- For popular short dramas, which typically cost between 5,000 and 10,000 yuan per episode, Seedance 2.0 can potentially reduce production costs by an order of magnitude, bringing costs down to the 50,000 yuan range [7][11].
Industry Impact
- The emergence of Seedance 2.0 has led to strategic shifts within the industry, with companies cutting non-headline actor projects because production costs of 400,000 to 500,000 yuan are considered entry-level [9][10].
- The integration of AI like Seedance 2.0 is expected to not only lower costs but also enhance the quality of productions, challenging traditional production models [11][13].
AI Integration
- Prior to Seedance 2.0, the workflow for AI-generated content still required significant human intervention, but the new model is anticipated to reduce these manual efforts further [17][19].
- The focus is shifting from whether AI can produce quality content to how to effectively guide AI in generating desired outputs, emphasizing the importance of human creativity and experience in storytelling [20].
Future Outlook
- Seedance 2.0 is currently in an experimental phase and is expected to continue evolving, with the potential to disrupt the traditional film industry significantly [28][29].
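The per-clip figures in the pricing summary can be checked with a few lines of Python. All numbers come from the summary above; the function and variable names are illustrative only.

```python
TOKENS_PER_CLIP = 308_880   # approx. tokens for one 15 s clip at 720p, 24 fps
CLIP_SECONDS = 15

# Prices from the summary (yuan per million tokens).
YUAN_PER_MILLION_WITH_VIDEO_INPUT = 28
YUAN_PER_MILLION_WITHOUT_VIDEO_INPUT = 46


def clip_cost_yuan(price_per_million):
    """Cost in yuan of one clip at the given per-million-token price."""
    return TOKENS_PER_CLIP * price_per_million / 1_000_000


cost_with = clip_cost_yuan(YUAN_PER_MILLION_WITH_VIDEO_INPUT)
cost_without = clip_cost_yuan(YUAN_PER_MILLION_WITHOUT_VIDEO_INPUT)

for label, cost in (("with video input", cost_with),
                    ("without video input", cost_without)):
    print(f"{label}: {cost:.2f} yuan per clip, "
          f"{cost / CLIP_SECONDS:.2f} yuan per second")
```

This gives about 8.65 yuan (0.58 yuan per second) with video input and about 14.21 yuan (0.95 yuan per second) without, so both cases stay under the 1 yuan per second mark.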
In 2026, what changes and what stays the same in the globalization of AI startups | Salon call for participants
量子位· 2026-03-05 10:24
Core Insights - The article emphasizes that globalization is no longer an option but a necessity for AI teams from Day 0, especially as open-source trends and Agent applications emerge [1][2].
Group 1: Event Overview
- 量子位 (QbitAI) will host a salon titled "Day 0 Globalization, Discussing Overseas Applications, Scenarios, and Channels" to focus on critical decisions and pathways for early to mid-stage globalization [3].
- The salon aims to gather global practitioners, including founders and key personnel from AI startups, to share their experiences and challenges faced while entering overseas markets [5].
Group 2: Key Discussion Topics
- The salon will address how AI startups can quickly identify entry points in global markets, how far customer acquisition depends on community and channels, and which seemingly reasonable early decisions can turn into cost burdens later [6].
- Participants will include companies serving the globalization chain, sharing observed trends and methodologies, as well as investors discussing whether globalization logic is being restructured by open-source models and Agents [7].
Group 3: Event Details
- The salon is scheduled for late March 2026, to be held in Beijing, with both offline and online participation options [13].
- The core topics will include the underlying logic of AI startups' globalization, challenges faced, and potential solutions [14].