量子位

A Mini Programming Agent in 100 Lines of Code: Fixes 65% of Real-World Project Bugs, Works with Any LLM
量子位· 2025-07-27 11:57
Core Viewpoint
- The article discusses the launch of mini-SWE-agent, a lightweight programming agent that operates with only 100 lines of code, designed to solve 65% of the problems on the SWE-bench benchmark, while being compatible with various language models and easy to deploy locally [2][3][18].

Group 1: Project Overview
- mini-SWE-agent is an open-source project developed by the same team behind SWE-bench and SWE-agent, focused on simplifying the process of fixing code bugs in real GitHub projects [2][7].
- The architecture of mini-SWE-agent is significantly simplified, requiring only about 200 lines of code in total, and eliminates complex dependencies [14][10].
- The agent operates through the operating system's Bash environment, executing commands without any specialized tool interface, which makes it compatible with any language model [14][18].

Group 2: Performance and Features
- Despite its lightweight design, mini-SWE-agent maintains a performance level comparable to the original SWE-agent, solving approximately 65% of the problems on the SWE-bench Verified subset [3][18].
- The agent supports various runtime environments, including Docker and other virtualization platforms, facilitating easy deployment across different systems [16][18].
- It includes tools for batch inference and trajectory browsing, aiding users in large-scale evaluation and decision analysis [18].

Group 3: User Guidance and Applications
- mini-SWE-agent is recommended for users seeking quick local execution, simplified control flow, and stable evaluation environments, making it well suited to fine-tuning or reinforcement learning experiments [20].
- For users requiring a highly configurable toolchain and complex state management, the more feature-rich SWE-agent is suggested [20].
- The design philosophy of mini-SWE-agent emphasizes readability, convenience, and ease of extension, making it accessible to everyday developers [21][22].
LLM Privacy and Fairness Show a "Seesaw" Effect, and the Optimal Balancing Rule Has Just Been Found | Renmin University & Shanghai AI Lab
量子位· 2025-07-27 11:57
Core Insights
- The research from Renmin University and Shanghai AI Lab reveals that enhancing privacy protection in large language models (LLMs) can lead to a significant drop in fairness, with a decline of up to 45% [1][8].
- The study identifies a "seesaw effect" caused by coupled neurons that encode both fairness and privacy, leading to conflicts during model optimization [1][10].

Group 1: Ethical Challenges in LLMs
- The concept of "alignment tax" describes the trade-off where optimizing for alignment-related goals often sacrifices other foundational capabilities like general knowledge and reasoning [3].
- As LLMs are increasingly integrated into critical sectors such as healthcare, finance, and education, ensuring models maintain fairness and privacy has become essential [4][5].
- Users expect LLMs to protect privacy while also ensuring fairness, but achieving both simultaneously is challenging [7].

Group 2: SPIN Methodology
- The SPIN method is introduced as a training-free solution that involves precisely suppressing 0.00005% of key neurons to enhance both fairness and privacy [2][12].
- The approach involves three steps: identifying critical neurons, locating coupled neurons that impact both fairness and privacy, and implementing suppression to decouple their effects [13][15][16].
- SPIN demonstrates significant improvements in fairness and privacy metrics across various models, outperforming traditional fine-tuning methods [17][18][19].

Group 3: Performance and Robustness
- SPIN allows for zero-cost deployment, requiring only a one-time neuron scan, and operates without additional computational costs during inference [20].
- The method shows resilience even when trained on harmful data, maintaining stable improvements in fairness and privacy [26][31].
- SPIN's effectiveness is validated through various benchmark tests, indicating that it can enhance model performance without sacrificing intelligence [21][22].

Group 4: Broader Implications
- The principles behind SPIN can be extended to address other ethical conflicts in AI, such as balancing safety and utility [37].
- The research highlights the importance of understanding neuron-level interactions to create more responsible AI systems [12][37].
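Mechanically, suppression of this kind amounts to zeroing the activations of the selected neurons at inference time while leaving every weight untouched, which is why it is training-free and reversible. The toy below illustrates the idea on a two-layer MLP standing in for one transformer block; the suppressed indices are arbitrary placeholders, not the product of the paper's actual neuron-scanning procedure.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy two-layer MLP; W1's output units play the role of the "neurons"
# that a SPIN-style scan would select for suppression.
W1 = rng.normal(size=(8, 16))
W2 = rng.normal(size=(16, 4))

def forward(x, suppressed=()):
    """Run the toy block, optionally zeroing selected hidden neurons.

    No weight is modified, so the intervention is training-free and can
    be removed at any time. The indices here are arbitrary; the paper
    selects them with a one-time importance scan.
    """
    h = np.maximum(x @ W1, 0.0)        # hidden activations
    h = h.copy()
    h[..., list(suppressed)] = 0.0     # deactivate the coupled neurons
    return h @ W2, h

x = rng.normal(size=(2, 8))
baseline, _ = forward(x)
edited, hidden = forward(x, suppressed=[3, 7])
```

In a real model the same effect is typically achieved with a forward hook on the chosen layer, so deployment cost is indeed one scan plus a near-free masking operation per forward pass.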
Embodied Intelligence Gets a Heavyweight: Ten Years of Multimodal Groundwork and World Models Pave the Way for SenseTime's "Wuneng"
量子位· 2025-07-27 11:57
Core Viewpoint
- SenseTime officially announced its entry into the field of embodied intelligence with the launch of the "Wuneng" embodied intelligence platform at the WAIC 2025 large model forum [1][2].

Group 1: SenseTime's Technological Advancements
- SenseTime introduced the "Riri Xin V6.5" multimodal reasoning model, which features a unique image-text interleaved thinking chain that significantly enhances cross-modal reasoning accuracy [3][4].
- The new model outperforms Gemini 2.5 Pro in multimodal reasoning capabilities across multiple datasets, showcasing its competitive edge [8].
- Compared to its predecessor, Riri Xin 6.0, the V6.5 model has improved performance by 6.99% while reducing reasoning costs to only 30% of the previous version, resulting in a fivefold increase in cost-effectiveness [10].

Group 2: Transition to Embodied Intelligence
- SenseTime's shift towards embodied intelligence is a natural progression from its expertise in visual perception and multimodal capabilities to physical-world interactions [12][13].
- The company has accumulated over ten years of industry experience, particularly in autonomous driving, which has provided valuable data and world-model experience for the development of embodied intelligence [13].
- The "Wuneng" platform integrates the general capabilities of the Riri Xin multimodal model with the experience of building and utilizing world models, aiming to create an ecosystem for embodied intelligence [14].

Group 3: World Model Capabilities
- The "KAIWU" world model supports the generation of multi-perspective videos and can maintain temporal consistency for up to 150 seconds, utilizing a database of over 100,000 3D assets [16][18].
- It can understand occlusion and layering spatially, as well as temporal changes and motion patterns, allowing for realistic object representation [17][20].
- The platform can simultaneously process people, objects, and environments, creating a 4D representation of the real world [21].

Group 4: Industry Collaboration and Data Utilization
- SenseTime is pursuing a "soft and hard collaboration" strategy, partnering with various humanoid robot and logistics platform manufacturers to pre-install its models, enhancing the multimodal perception and reasoning capabilities of hardware [29].
- The company is addressing the common industry challenge of data scarcity by generating synthetic data in virtual environments and using real-world samples for calibration [32][33].
- The integration of first-person and third-person perspectives in training enhances the model's ability to learn from human demonstrations while executing tasks from its own sensory input [26][35].

Group 5: Future Outlook and Competitive Edge
- SenseTime is establishing a self-reinforcing data ecosystem through large-scale simulations, real data feedback from hardware, and the fusion of different perspectives, which is expected to drive continuous model upgrades [39].
- The company is positioned to lead the future of embodied intelligence by leveraging multimodal capabilities and hardware collaboration to build a competitive moat in the industry [40].
Running a 10-Billion-Parameter-Class LLM Smoothly on Hundred-Yuan Hardware: SJTU & Zenergize AI Open-Source an Edge-Native LLM
量子位· 2025-07-27 09:01
Core Viewpoint
- The next battleground for AI is shifting from the cloud to mobile devices, emphasizing the need for local computation to ensure user privacy and data security [2][3].

Group 1: Industry Trends
- Major smartphone manufacturers like Apple, Huawei, Samsung, Xiaomi, and OPPO are integrating large models into mobile devices, indicating a competitive landscape for edge AI [2].
- The challenges of running AI smoothly on local devices are significant, as evidenced by Apple's delayed launch of its core AI features [2][3].

Group 2: Technological Innovations
- A new collaboration between Shanghai Jiao Tong University and the startup Zenergize AI has led to the development of the SmallThinker series, which is designed specifically for edge computing [4].
- The SmallThinker models, including SmallThinker-4B-A0.6B and SmallThinker-21B-A3B, are optimized for local CPU inference without relying on high-end GPUs, achieving impressive performance metrics [5][23].

Group 3: Model Architecture
- SmallThinker employs a unique architecture that allows for efficient inference on devices with limited computational resources, avoiding the need for traditional model compression techniques [6][8].
- The model features three core technological characteristics: expert knowledge activation, preemptive expert routing to minimize I/O overhead, and a hybrid sparse attention mechanism that reduces memory usage by 76% [9][12][17].

Group 4: Performance Metrics
- In extreme memory-constrained scenarios (1GB RAM), the SmallThinker-4B-A0.6B model achieves a speed of 19.91 tokens/s, significantly outperforming competitors like Qwen3-1.7B [26][27].
- On standard PC configurations (8GB RAM), the SmallThinker-21B-A3B model demonstrates a speed of 20.30 tokens/s, doubling the performance of Qwen3-30B-A3B [29].

Group 5: Future Directions
- The development team plans to enhance the model's capabilities by scaling up with more high-quality data and aims to create a personal AI assistant that operates entirely on individual devices [32][33].
- The vision is to integrate AI seamlessly into daily life, providing a secure, private, and intelligent experience for users [34].
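The "expert knowledge activation" characteristic above is the sparse mixture-of-experts pattern: a router selects a few experts per token, so only a fraction of the parameters is ever computed, and on a flash-backed edge device only those experts' weights need fetching. The top-k routing sketch below is generic, not SmallThinker's implementation; all names and sizes are made up for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
NUM_EXPERTS, TOP_K, DIM = 8, 2, 16

# Router plus a bank of expert FFNs (reduced to single matrices for brevity).
router_w = rng.normal(size=(DIM, NUM_EXPERTS))
experts = [rng.normal(size=(DIM, DIM)) for _ in range(NUM_EXPERTS)]

def moe_forward(x):
    """Route one token through its top-k experts only.

    The router's decision is available before any expert weight is
    touched, so an edge runtime can prefetch just the chosen experts
    from flash while other compute proceeds -- the intuition behind
    "preemptive expert routing" (the actual prefetch is omitted here).
    """
    logits = x @ router_w
    chosen = np.argsort(logits)[-TOP_K:]            # ids of selected experts
    gate = np.exp(logits[chosen] - logits[chosen].max())
    gate = gate / gate.sum()                        # softmax over the selected
    y = sum(g * (x @ experts[i]) for g, i in zip(gate, chosen))
    return y, chosen

x = rng.normal(size=DIM)
y, chosen = moe_forward(x)
```

With TOP_K of 8 experts active, only a quarter of the expert parameters participate per token, which is how a 21B-parameter model can activate roughly 3B (the "A3B" in its name).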
AI Godfather Hinton in Dialogue with Shanghai AI Lab's Zhou Bowen: Multimodal Chatbots Already Have Consciousness; Making AI Smart and Making AI Kind Are Two Different Things
量子位· 2025-07-26 15:56
Core Viewpoint
- Geoffrey Hinton, known as the "father of artificial intelligence," visited Shanghai, China, for discussions on AI advancements, emphasizing the intersection of AI and scientific discovery [1][2][3].

Group 1: Hinton's Visit and Discussions
- Hinton's visit included a public dialogue with Zhou Bowen, director of the Shanghai Artificial Intelligence Laboratory, focusing on cutting-edge AI research [2][3].
- The dialogue covered topics such as multimodal large models, subjective experience, and training "kind" superintelligence [3][9].
- Hinton's presence was met with enthusiasm, as attendees applauded and recorded the event, highlighting his significance in the AI field [2].

Group 2: AI and Scientific Discovery
- Zhou Bowen presented the "SAGE" framework, which integrates foundational models, fusion layers, and evaluation layers to elevate AI from a tool to an engine for scientific discovery [3].
- Hinton noted that AI has the potential to significantly advance scientific research, citing examples like protein folding and weather prediction, where AI outperforms traditional methods [16][17].

Group 3: Perspectives on AI Consciousness
- Hinton expressed the view that current multimodal chatbots possess a form of consciousness, challenging conventional beliefs about AI capabilities [9][13].
- He discussed the importance of understanding subjective experience in AI, suggesting that many misconceptions exist regarding how these concepts operate [12].

Group 4: Training AI for Kindness
- Hinton proposed that training AI to be both intelligent and kind involves different methodologies, allowing countries to share techniques for fostering AI kindness without compromising intelligence [14][15].
- He emphasized the need for ongoing research to develop universal methods for instilling kindness in AI systems as they become more intelligent [15][16].

Group 5: Advice for Young Researchers
- Hinton advised young researchers to explore areas where they believe "everyone is wrong," encouraging persistence in their unique approaches until they understand the reasoning behind established methods [18].
Just Announced: The Company Behind a Hit Designer Toy's Overseas Expansion Releases the First Global Marketing AI Agent
量子位· 2025-07-26 09:01
Core Viewpoint
- The article emphasizes the transformation of overseas marketing through the introduction of AI agents, specifically highlighting the launch of Navos by Titanium Technology, which aims to enhance efficiency and reduce costs in marketing processes [1][3][43].

Group 1: Introduction of Navos
- Navos is introduced as the first global marketing AI agent product that empowers the entire marketing chain, including creativity, deployment, and data analysis [2][4].
- The launch of Navos marks the beginning of the AI agent era in the overseas marketing industry [3].

Group 2: Functionality and Advantages
- Navos operates through intelligent agent collaboration, allowing it to autonomously plan and execute tasks based on user input, thereby improving efficiency and reducing costs [4][7].
- The AI agent is designed to take over repetitive tasks, enabling human resources to focus on high-value decision-making [7][8].

Group 3: Technical Innovations
- Navos addresses unique challenges in the marketing domain, such as cross-modal semantic alignment for creative design, utilizing advanced models for multi-modal content generation [9][28].
- The AI agent can analyze data from platforms like TikTok to provide creative suggestions based on market trends and successful content characteristics [11][14].

Group 4: Efficiency Gains
- The marketing cycle duration has been significantly reduced from one to three months to potentially just a few hours or days, achieving efficiency improvements of 10 to 50 times [28].
- For mature clients, the return on investment (ROI) can increase by over three times, while for small to medium clients, the ROI can rise by 10 to 50 times [29].

Group 5: Market Potential and Future Outlook
- The article highlights the vast potential for growth in the overseas marketing sector, with predictions that the AI marketing market could exceed 3 trillion yuan by 2028 [43].
- Titanium Technology aims to expand Navos's applications across various industries and regions, with a focus on e-commerce, gaming, and short video content [40][42].

Group 6: Industry Context
- The marketing industry is inherently AI-friendly due to its reliance on data-driven decision-making and the presence of numerous repetitive tasks that AI can effectively handle [32][33].
- Titanium Technology has established itself as a leading player in the AI-driven overseas marketing space, having served over 80,000 enterprises globally [36].
A Domestic GPU Now Runs Full-Scale DeepSeek at 100 tokens/s!
量子位· 2025-07-26 09:01
Core Viewpoint
- The fastest chip for running full-scale DeepSeek is a domestic GPU from Moore Threads, achieving a speed of 100 tokens/s, significantly faster than foreign GPUs at 50 tokens/s and domestic counterparts at 15 tokens/s [1][4].

Group 1: Moore Threads' Achievements
- Moore Threads has developed an AI super factory that goes beyond just creating faster chips, focusing on a comprehensive transformation of the entire technology stack [6][10].
- The AI super factory is not a physical chip manufacturing facility but a systemic overhaul that includes innovations in chip architecture, cluster design, and software algorithms [9][10].

Group 2: Key Components of the AI Super Factory
- The AI super factory's production efficiency is defined by five core elements: generality of accelerated computing, effective chip performance, node efficiency, cluster efficiency, and cluster stability [13].
- A full-function GPU serves as the foundation of the AI super factory, evolving from basic graphics acceleration to a versatile computing platform capable of handling various AI tasks [14][16].

Group 3: MUSA Architecture
- The MUSA architecture acts as the "chief designer" of the super factory, allowing for scalable and configurable chip designs that optimize resource allocation [25][26].
- MUSA's innovative design enables global resource sharing, reducing bottlenecks and improving efficiency during multi-task operations [27][29].

Group 4: Full-Stack Software System
- Moore Threads has created a full-stack software system that integrates deeply with the MUSA hardware architecture, enhancing developer experience and operational efficiency [35][36].
- The software stack includes optimized drivers, core operator libraries, and tools for performance analysis, significantly improving task handling and resource utilization [41][42].

Group 5: KUAE Computing Cluster
- The KUAE computing cluster is a soft-hard integrated system that extends the performance advantages of individual GPUs to large-scale deployments, enabling efficient training of massive AI models [43][44].
- The cluster supports various parallel training strategies and provides end-to-end training optimization, ensuring high performance and stability [45][46].

Group 6: Zero-Interrupt Fault Tolerance Technology
- Moore Threads has developed a unique zero-interrupt fault tolerance technology that allows for continuous operation of the AI super factory, minimizing downtime and recovery costs [47][49].
- This technology enhances the overall stability and reliability of the system, ensuring high effective training time and reducing the impact of potential failures [51][52].

Group 7: Future of AI and Computing Needs
- The demand for computing power is expected to grow exponentially, driven by advancements in generative AI and the need for complex task execution [54][56].
- Moore Threads aims to provide a comprehensive solution that addresses the challenges of AI model training, emphasizing the importance of stability, reliability, and efficiency in future computing [58][61].
A "Ladder Tournament" for Large Models: Agents Evolve on Real Kaggle Tasks | Open-Sourced by Georgia Tech and Stanford
量子位· 2025-07-26 09:01
Core Viewpoint
- The article discusses the introduction of MLE-Dojo, an interactive framework designed to train and evaluate large language model (LLM) agents on machine learning engineering tasks, addressing the limitations of existing benchmarks that do not simulate real-world iterative workflows [1][2].

Group 1: Existing Problems and Solutions
- Current benchmarks for LLMs are mostly static and fail to capture the dynamic workflows of machine learning engineering, lacking assessments of continuous experimentation and structured feedback [6].
- Many platforms do not support advanced training paradigms like supervised fine-tuning (SFT) or reinforcement learning (RL), limiting the development of more autonomous AI agents [7].
- Existing benchmarks often focus on isolated tasks, missing the complexity and interconnections of end-to-end machine learning processes, which MLE-Dojo aims to address by providing a comprehensive training and evaluation environment [8].

Group 2: MLE-Dojo Features
- MLE-Dojo consists of over 200 real Kaggle competitions, covering various domains such as tabular data, computer vision (CV), and natural language processing (NLP), providing unprecedented breadth and depth for evaluating AI agents [12].
- The framework offers a Gym-style interactive environment where agents can perform actions like requesting task information, validating code, and executing code in a secure sandbox [13].
- MLE-Dojo provides advanced features such as detailed error reports and a HumanRank score, which measures the agent's relative position on human leaderboards, offering a standardized performance metric across tasks [14].

Group 3: Evaluation of LLMs
- The research team evaluated eight leading LLMs using a multi-dimensional assessment system rather than relying on a single metric [16].
- The HumanRank score reflects the model's performance relative to human competitors, while the Elo rating system provides a dynamic ranking based on head-to-head match results [17][18].
- The AUP (Area Under the Performance Profile) metric assesses the robustness and consistency of models across various tasks, with higher scores indicating better performance stability [18].

Group 4: Performance Analysis
- Gemini-2.5-Pro emerged as the top performer in the Elo rating, demonstrating strong competitive capabilities and surpassing 61.95% of human players in the HumanRank score [20].
- Different models exhibited distinct problem-solving strategies, with some being more aggressive in executing code while others were more conservative, impacting their efficiency and overall performance [23].
- The analysis revealed that stronger models tend to generate longer and more complex solutions, indicating deeper reasoning and multi-step problem-solving capabilities [24].

Group 5: Cost-Performance Trade-off
- High-performing models often incur significant computational costs, with top reasoning models consuming more tokens and resources [25].
- Some models, like DeepSeek-r1, show potential for competitive performance with higher cost-effectiveness, indicating a direction for future model optimization [25].
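A Gym-style environment of the kind described reduces to a small discrete action vocabulary with structured observations and rewards. The toy below is an interface sketch only; the action names, reward shaping, and class name are assumptions for illustration, not MLE-Dojo's real API.

```python
class MiniMLEEnv:
    """Toy Gym-style environment: an agent can request task info,
    validate code, or execute code for a score."""

    ACTIONS = ("request_info", "validate_code", "execute_code")

    def __init__(self, task_description, scorer):
        self.task = task_description
        self.scorer = scorer       # maps submitted code -> score in [0, 1]
        self.best = 0.0

    def reset(self):
        self.best = 0.0
        return {"observation": self.task}

    def step(self, action, payload=""):
        """Return (observation, reward, done), Gym-style."""
        if action == "request_info":
            return {"observation": self.task}, 0.0, False
        if action == "validate_code":
            try:
                compile(payload, "<agent>", "exec")
                return {"observation": "syntax ok"}, 0.0, False
            except SyntaxError as e:
                # Detailed error reports are part of the structured feedback.
                return {"observation": f"error: {e}"}, 0.0, False
        if action == "execute_code":
            score = self.scorer(payload)
            reward = max(0.0, score - self.best)   # reward only improvements
            self.best = max(self.best, score)
            return {"observation": f"score={score:.3f}"}, reward, False
        raise ValueError(f"unknown action {action!r}")
```

Rewarding only improvements over the agent's best prior score is one common shaping choice for iterative-experimentation loops; it keeps the reward signal compatible with both SFT trajectory collection and RL.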
2.18x Inference Speedup for Very Large Models! SGLang and Meituan's Tech Team Open-Source a Speculative Sampling Training Framework
量子位· 2025-07-26 09:01
Core Viewpoint
- SpecForge is an open-source training framework designed for speculative sampling, specifically tailored for large models, achieving a 2.18x inference acceleration [1][15].

Group 1: SpecForge Overview
- SpecForge is developed by the SGLang team in collaboration with Meituan's search recommendation platform and Cloudsway.AI [1].
- The framework is built to address the challenges posed by the increasing size of models, which often leads to lower inference efficiency [4][6].
- SpecForge integrates deeply with the SGLang inference engine, providing a seamless training and inference process for speculative sampling [5][7].

Group 2: Technical Features
- The framework incorporates Eagle3, an advanced speculative sampling method that enhances inference speed by training a lightweight draft model to predict token distributions accurately [7].
- SpecForge supports various mainstream models, including complex MoE layers and Transformer variants, ensuring broad applicability [7].
- It features scalable distributed training through Fully Sharded Data Parallel (FSDP) and Tensor Parallelism (TP), optimizing resource utilization on GPU clusters [7][14].

Group 3: Training Modes and Efficiency
- SpecForge offers two training modes, Online and Offline, allowing users to choose based on their specific needs and resource availability [10][17].
- The Training-Time Test (TTT) architecture enhances the robustness of the draft model, encapsulating complex processes to simplify implementation for users [9].
- The framework is designed with a focus on memory-efficient training, significantly reducing memory overhead even for trillion-parameter models [7].

Group 4: Experimental Validation
- The effectiveness of SpecForge was validated through experiments on datasets like ShareGPT and UltraChat, demonstrating compatibility with the Eagle3 architecture [15].
- The draft models trained using SpecForge achieved a notable 2.18x inference acceleration on the MT-Bench benchmark [15].

Group 5: Future Developments
- SpecForge's roadmap includes plans to support additional model architectures and integrate visual-language models (VLM) into the framework [22].
- The team aims to enhance training efficiency through improved parallel strategies and kernel optimizations [22].
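Speculative sampling of the Eagle family speeds up decoding by letting a small draft model propose several tokens cheaply, which the large target model then verifies; tokens are kept while the two agree, so the output matches what the target model alone would produce. The toy below shows the greedy-verification variant only; Eagle3's real algorithm uses a probabilistic accept/reject rule over full token distributions, and `draft_next`/`target_next` are stand-in functions, not any library's API.

```python
def speculative_decode(draft_next, target_next, prompt,
                       num_draft=4, max_new=12):
    """Greedy speculative decoding toy.

    draft_next / target_next map a token sequence to the next token,
    standing in for the small draft model and the large target model.
    """
    tokens = list(prompt)
    while len(tokens) - len(prompt) < max_new:
        # Draft model proposes a short continuation cheaply.
        proposal = []
        for _ in range(num_draft):
            proposal.append(draft_next(tokens + proposal))
        # Target model verifies; accept the longest agreeing prefix.
        accepted = 0
        for tok in proposal:
            if target_next(tokens) == tok:
                tokens.append(tok)
                accepted += 1
            else:
                break
        if accepted < num_draft:
            # On the first disagreement, take the target's own token:
            # the result is identical to pure target-model greedy decoding.
            tokens.append(target_next(tokens))
    return tokens
```

The speedup comes from the acceptance rate: when the draft model agrees with the target often (which is what the framework trains it for), several tokens are confirmed per expensive target-model pass instead of one.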
Hinton's Shanghai Speech: Large Models Closely Resemble Human Intelligence; Beware of Raising a Tiger
量子位· 2025-07-26 09:01
Core Viewpoint
- Geoffrey Hinton emphasizes the importance of establishing a positive mechanism for AI development to ensure it does not threaten humanity, highlighting the complex relationship between AI and human intelligence [3][42][55].

Group 1: AI Development and Understanding
- Hinton discusses the evolution of AI over the past 60 years, identifying two main paradigms, logical reasoning and biological understanding, which have shaped current AI capabilities [8][10].
- He compares human understanding of language to that of large language models, suggesting that both operate on similar principles of feature interaction and semantic understanding [19][27].
- The efficiency of knowledge transfer in AI is significantly higher than in humans, with AI capable of sharing vast amounts of information rapidly across different systems [29][36].

Group 2: AI Safety and Collaboration
- Hinton warns that as AI becomes more intelligent, it may seek control and autonomy, necessitating international cooperation to ensure AI remains beneficial to humanity [42][55].
- He likens the current relationship with AI to raising a tiger cub, stressing the need to train AI so that it does not become a threat as it matures [49][51].
- He calls for a global AI safety institution aimed at researching and training AI to assist rather than dominate humanity [55][56].