Workflow
量子位
icon
Search documents
量子位编辑作者招聘
量子位· 2026-01-06 01:01
编辑部 发自 凹非寺 量子位 | 公众号 QbitAI AI热潮还在汹涌,但如果你还不知道如何参与……那为什么不来 量子位 呢? 我们是一家以 追踪AI新进展 为核心的内容平台,经过8年积累,目前拥有顶流影响力,广泛且备受认可的产业资源,以及时代风口的最佳观 测和学习生态位。 目前,我们有 三大方向 岗位招聘,希望你是 (或者能成为) 这三个方向的内容专家: 岗位均为全职,工作地点:北京中关村。 岗位面向: 加入我们,你可以获得: 以下是岗位详情: 所有岗位不同能力层级职位均在开放,欢迎结合个人履历和经验申请。 AI产业方向 岗位职责: 任职要求: AI财经商业方向 岗位职责: 任职要求: AI产品方向 AI产业方向 :关注基建层创新,包含芯片、AI Infra、云计算; AI财经方向 :关注AI领域创投和财报,跟踪产业链资本动向; AI产品方向 :关注AI在应用和硬件终端方向的进展。 社招:覆盖编辑、主笔、主编各个层级,按能力匹配岗位; 校招:应届毕业生,接受实习且可转正。 站在AI浪潮之巅 :第一时间接触和了解AI领域最新技术和产品,构建完整的AI认知体系。 玩转AI新工具 :将各种AI新技术、新工具应用 ...
老黄All in物理AI!最新GPU性能5倍提升,还砸掉了智驾门槛
量子位· 2026-01-06 01:01
Core Viewpoint - NVIDIA is shifting its focus entirely towards AI, as evidenced by its absence of gaming graphics cards at CES 2026 and the introduction of new AI products and architectures [2][10]. Group 1: AI Product Launches - NVIDIA unveiled the next-generation Rubin architecture GPU, which boasts inference and training performance that are 5 times and 3.5 times better than the Blackwell GB200, respectively [4][17]. - The company introduced five new product families targeting various AI applications, including the NVIDIA Nemotron for Agentic AI, NVIDIA Cosmos for physical AI, and NVIDIA Alpamayo for autonomous driving [6][8][39]. - The Vera Rubin NVL72 architecture was officially launched, featuring six core components designed to enhance AI data center capabilities [14][15]. Group 2: Performance Metrics - The Rubin GPU achieves an inference performance of 50 PFLOPS and a training performance of 35 PFLOPS under the NVFP4 data type, significantly surpassing its predecessor [17]. - Each Rubin GPU is equipped with 288GB of HBM4 memory and offers a bandwidth of 22 TB/s, supporting the high computational demands of modern AI models [18]. - The overall architecture of the Vera Rubin NVL72 can deliver 3.6 exaFLOPS of NVFP4 inference performance and 2.5 exaFLOPS of training performance [37]. Group 3: Networking and Connectivity - The introduction of NVLink 6 enhances interconnect bandwidth to 3.6 TB/s per GPU, with a total bandwidth of 260 TB/s across the entire NVL72 rack [20][21]. - The Vera CPU integrates 88 custom Arm cores and features a bandwidth of 1.8 TB/s for NVLink C2C interconnect, facilitating efficient communication between CPU and GPU [22]. Group 4: AI Model Developments - The Alpamayo model, a large-scale open-source visual-language-action model for autonomous driving, was launched with 10 billion parameters [41]. - The Nemotron series expanded to include specialized models for speech recognition, visual-language processing, and safety, enhancing AI applications across various sectors [49][51]. - The Cosmos model for robotics was upgraded to generate synthetic data that adheres to real-world physical laws, aiding in the development of AI agents [54][58]. Group 5: Industry Impact and Future Outlook - NVIDIA's comprehensive approach to AI, integrating models, data, and tools, is expected to strengthen its competitive edge and ecosystem lock-in [10]. - The company plans to begin mass production of the Vera Rubin NVL72 in the second half of 2026, indicating a strong commitment to advancing AI infrastructure [38].
悲报!Stack Overflow彻底凉了,比18年前上线首月问题数量还少
量子位· 2026-01-05 09:39
Core Viewpoint - Stack Overflow, once a thriving platform for developers, is experiencing a significant decline in user engagement, with the number of questions now lower than during its initial launch period 18 years ago [1][21]. Group 1: Historical Context - Stack Overflow was launched in 2008 to provide high-quality, reusable answers to programming questions, quickly becoming a vital resource for developers [7][9]. - The platform's unique voting and reputation system allowed for the creation of a structured knowledge base, making it the default destination for technical searches on Google for a long time [10][12]. Group 2: Decline in Engagement - Despite a significant increase in the global developer population and the emergence of numerous tools and languages, the act of asking questions on Stack Overflow has drastically decreased [4][21]. - The peak of Stack Overflow included over 180 sub-sites covering various STEM fields, but the platform is now facing challenges due to the rise of AI tools like GitHub Copilot and ChatGPT, which have changed developers' problem-solving habits [15][17][20]. Group 3: Impact of AI - The introduction of AI tools has led to a shift from public questioning to private inquiries, with developers now preferring to ask AI for solutions rather than posting on Stack Overflow [19][22]. - While AI tools rely on the quality content from Stack Overflow, they have diverted traffic away from the platform, leading to a decline in user engagement [23][24]. Group 4: Internal Challenges - Prior to the rise of AI, Stack Overflow was already facing issues due to its strict moderation policies, which discouraged new users from participating [26][27]. - The platform's attempt to integrate AI features resulted in a decline in content quality, further eroding user trust and engagement [28][29]. Group 5: Future Considerations - The future of Stack Overflow may hinge on whether it can refocus on niche technical areas to regain its unique value or fully embrace AI to restructure its operational model [32].
1人1假期,肝完10年编程量!马斯克锐评:奇点来了
量子位· 2026-01-05 07:04
Core Insights - The article discusses the significant advancements in programming agents, highlighting their impact on productivity and efficiency in software development [2][3][6]. Group 1: Programming Agents Impact - Midjourney founder David expresses that his programming projects during the holiday season surpassed those of the past decade, indicating a transformative shift in productivity due to programming agents [3][4]. - Elon Musk comments on the emergence of programming agents, stating, "We have entered the Singularity," reflecting a consensus among tech leaders about the profound changes brought by AI [5][6]. - Rohan Anil, an engineer at Anthropic, claims that with programming agents like Claude's Opus, he could compress six years of work into just a few months, showcasing the efficiency gains possible with these tools [9][15]. Group 2: Performance Metrics - The latest LiveBench benchmark results show Claude 4.5 Opus leading in various categories, including coding and reasoning, with scores of 79.65 in coding and 94.52 in mathematics, indicating its superior performance among AI models [23][24]. - Other models, such as GPT-5.1 Codex Max and Gemini 3 Pro Preview, follow behind, with Claude consistently outperforming them in agentic coding tasks [24]. Group 3: Industry Reactions and Developments - Greg Brockman notes that Anthropic has achieved what OpenAI aimed for but could not, emphasizing the practical utility of their tools [25][26]. - Boris Cherny, a developer of Claude Code, shares insights on how to effectively utilize the programming agent, highlighting its user-friendly setup and capabilities [28][29]. - The competitive landscape is evolving, with ByteDance's TRAE China version SOLO being made freely available, indicating a growing interest in programming agents within the industry [31][32].
量子位编辑作者招聘
量子位· 2026-01-05 05:00
Core Viewpoint - The article emphasizes the ongoing AI boom and invites individuals to join the company "Quantum Bit," which focuses on tracking AI advancements and has established itself as a leading content platform in the industry [1]. Group 1: Job Opportunities - The company is hiring for three main directions: AI Industry, AI Finance, and AI Product, with positions available for both experienced professionals and fresh graduates [2][4]. - Positions are open for various levels, including editors, lead writers, and chief editors, with a focus on matching roles to individual capabilities [6]. Group 2: Job Responsibilities - **AI Industry Direction**: Responsibilities include tracking innovations in infrastructure, such as chips, AI infrastructure, and cloud computing, as well as interpreting technical reports from conferences [6][7]. - **AI Finance Direction**: Focuses on venture capital, financial reports, and capital movements within the AI industry, requiring strong analytical skills and a passion for interviews [11]. - **AI Product Direction**: Involves monitoring AI applications and hardware developments, requiring a keen understanding of product experiences and market trends [11]. Group 3: Benefits and Growth - Employees can expect to engage with cutting-edge AI technologies, enhance their work efficiency through new tools, and build personal influence in the AI field [6]. - The company offers competitive salaries, comprehensive benefits, and a supportive environment for professional growth, including mentorship from senior editors [6][12]. Group 4: Company Impact - By 2025, Quantum Bit aims to have over 2.4 million subscribers on WeChat and more than 7 million users across platforms, with a daily reading volume exceeding 2 million [12]. - The company is recognized as the top new media outlet in the AI and frontier technology sectors according to third-party data platforms [12].
结构化预处理让DeepSeek准确率提升51%,现已开源丨清华&深言
量子位· 2026-01-05 05:00
Core Insights - The article introduces LingoEDU, a new method that enhances the accuracy of DeepSeek by 51% through a structured approach to information processing [1][7][46] - LingoEDU focuses on creating a clear semantic structure, allowing for precise tracking of information back to its original source, thereby addressing the issue of "hallucination" in AI-generated content [5][44] Group 1: Methodology and Implementation - LingoEDU employs a preprocessing model that segments text into Elementary Discourse Units (EDUs), assigning unique index markers to each unit for accurate referencing [1][5][21] - The method allows for structured pre-processing of context before it enters the main model, improving the efficiency and accuracy of information generation [2][10] - By creating a semantic tree structure, LingoEDU ensures that every generated output can be traced back to its original text, thus enhancing the reliability of AI outputs [4][46] Group 2: Experimental Results - Experimental results indicate that LingoEDU significantly outperforms baseline models in terms of segmentation accuracy, cost, and efficiency [7][35] - In a comparative study, DeepSeek-R1's accuracy improved from 9.0% to 13.6% after implementing LingoEDU, marking a 51% relative increase [7][40] - The method was tested on a dataset of 248 articles, demonstrating superior performance in tree edit distance (TED) and document-level accuracy (DLA) compared to existing models [34][35] Group 3: Advantages and Value Proposition - LingoEDU retains the semantic integrity of the original text while providing a structured format that enhances information management and reduces processing costs [6][45] - The approach addresses the critical industry challenge of AI hallucination by ensuring that AI-generated content is both accurate and traceable [44][46] - LingoEDU is positioned as a transformative technology that shifts AI applications from "black box" models to more interpretable and controllable systems, setting a new standard for reliable AI [46][47]
华为开源7B多模态模型,视觉定位和OCR能力出色,你的昇腾端侧“新甜点”来了
量子位· 2026-01-05 05:00
Core Viewpoint - Huawei has launched the open-source model openPangu-VL-7B, targeting key scenarios in edge deployment and personal development, showcasing its lightweight and high-performance capabilities [3][24]. Group 1: Model Features and Performance - The openPangu-VL-7B model is designed for various terminal scenarios, excelling in tasks such as image information extraction, document understanding, video analysis, and object localization [2][7]. - The model achieves a latency of only 160 milliseconds for single-image inference on a single Ascend Atlas 800T A2 card, enabling real-time inference at 5 FPS, with a training phase MFU of 42.5% [4]. - During pre-training, the model completed over 3 trillion tokens in stable training, providing valuable practical references for developers using Ascend clusters [5]. Group 2: Benchmarking and Comparison - In various core tasks, openPangu-VL-7B outperforms other models of similar scale, demonstrating strong overall capabilities [7]. - The model's performance in benchmarks includes: - General Visual Question Answering (MMBenchyl.I_DEV: 86.5) - OCR & Document Understanding (OCRBench: 907) - Video Understanding (MVBench: 74.0) [8]. Group 3: Technical Innovations - The model features a high-performance visual encoder optimized for Ascend hardware, achieving a 15% throughput improvement over traditional GPU-optimized encoders [15]. - A mixed training scheme using "weighted per-sample loss + per-token loss" addresses learning balance across varying sample lengths, enhancing the model's understanding of both long and short responses [17][19]. - The model employs a unique positioning data format that improves accuracy and efficiency in visual localization tasks [20][21]. Group 4: Market Implications - The open-source nature of openPangu-VL-7B is a significant advantage for Ascend users, providing a lightweight, high-performance, and versatile multimodal model that enriches the Ascend ecosystem and stimulates innovation [24].
融资35亿后,Kimi神秘模型现身竞技场
量子位· 2026-01-05 05:00
Core Viewpoint - The emergence of a new model named Kiwi-do from Kimi, which is speculated to be a significant player in the large model arena, especially with its upcoming release and potential capabilities in multi-modal applications [1][19]. Group 1: Model Development and Performance - Kiwi-do is suggested to be linked to Kimi's previously mentioned K2-VL model, with indications that it has successfully passed the Visual Physics Comprehension Test (VPCT), showcasing its ability to solve complex visual tasks [15][17]. - The model's performance in SVG drawing tasks has been compared to K2-Thinking, revealing distinct differences in output quality [4][8]. - There is speculation that Kiwi-do may be a smaller parameter model, which could indicate a strategic approach to model development [12][13]. Group 2: Funding and Strategic Goals - Kimi recently announced a $500 million (approximately 3.5 billion RMB) Series C funding round, led by IDG, with participation from major investors like Alibaba and Tencent, resulting in a post-money valuation of $4.3 billion [21][22]. - The funds raised will be utilized to aggressively expand GPU resources to accelerate the training and development of the K3 model, with a long-term goal of becoming a leading AGI company [24][25]. - Kimi's approach to financing differs from other companies in the sector, as it is not currently pursuing an IPO, focusing instead on private market funding to support its growth strategy [27][28]. Group 3: Market Position and Future Outlook - Kimi aims to leverage its funding to enhance computational capabilities, which are critical in the large model industry, where operational costs are substantial [25][26]. - The company plans to time its IPO strategically in the future as a means to further accelerate its AGI ambitions [29]. - The K3 model is expected to achieve a significant leap in pre-training performance, aiming to match world-leading models and enhance user experience through innovative training techniques [32].
「AI 100」榜单启动招募,AI产品“年会”不能停丨量子位智库
量子位· 2026-01-05 05:00
Core Insights - The article discusses the emergence of numerous keywords in the AI product sector in China by 2025, highlighting the rapid evolution and innovation in AI technologies and applications [4]. - The "AI 100" list by Quantum Bit Think Tank aims to evaluate and recognize the top AI products that represent China's AI capabilities, focusing on both current leaders and future potential [12]. Group 1: AI 100 List Overview - The "AI 100" list is divided into three main categories: "Flagship AI 100," "Innovative AI 100," and the top three products in ten popular sub-sectors [6]. - The "Flagship AI 100" will focus on the strongest AI products of 2025, emphasizing those that demonstrate significant technological breakthroughs and practical value [7]. - The "Innovative AI 100" aims to identify emerging products in 2025 that have the potential to lead industry changes in 2026 [8]. Group 2: Sub-sector Focus - The ten sub-sectors for the top three product nominations include AI Browser, AI Agent, AI Smart Assistant, AI Workbench, AI Creation, AI Education, AI Healthcare, AI Entertainment, Vibe Coding, and AI Consumer Hardware [9]. Group 3: Application and Evaluation Criteria - The evaluation of the "AI 100" list combines quantitative and qualitative assessments, focusing on user data such as user scale, growth, activity, and retention, as well as hardware product shipment volumes [13]. - Qualitative assessments consider long-term development potential through expert evaluations and user surveys, examining factors like underlying technology, market space, functionality, monetization potential, team background, and growth speed [13].
宇树IPO搁浅传闻满天飞,王兴兴:别当真,也不用和外人解释
量子位· 2026-01-05 03:22
Core Viewpoint - The company, Yushu Technology, has clarified that it has not applied for a "green channel" for its A-share listing and that its listing process is proceeding normally despite rumors to the contrary [2][10][11]. Group 1: Clarification on Listing Rumors - Yushu Technology responded to media reports claiming that its A-share listing green channel had been halted, stating that these reports were misleading and damaging to its reputation [10][12]. - The company emphasized that it has not applied for a "green channel" and that its listing work is progressing as planned [10][11]. - Yushu Technology has taken steps to report the misleading information to relevant authorities and reserves the right to pursue legal action against those spreading false reports [10][11]. Group 2: Recent Developments - On January 4, Yushu Technology released a training video of its humanoid robot H2, showcasing its capabilities, which coincidentally occurred during the discussion of its listing [3][4][15]. - The video featured a character resembling the company's founder, Wang Xingxing, which sparked discussions among viewers [5][6]. - Following the release of the video, the misleading reports regarding the green channel were subsequently taken down [17]. Group 3: Listing Timeline - Yushu Technology submitted its counseling registration materials on July 8, 2025, with CITIC Securities as the counseling institution [18]. - The company announced on September 2, 2025, that it was actively preparing for its initial public offering (IPO), with plans to submit listing application documents between October and December 2025 [19]. - The company completed its IPO counseling work on November 15, 2025, and intends to apply for an IPO in the domestic market [25][26].