多模态模型
Search documents
A股超4600股上涨,AI应用批量涨停,港股智谱狂飙近40%
21世纪经济报道· 2026-02-09 07:43
Market Overview - The A-share market saw significant gains, with the Shanghai Composite Index rising over 1% and the Shenzhen Component Index increasing over 2%. The total trading volume reached 2.25 trillion yuan, an increase of 103.8 billion yuan compared to the previous trading day, with over 4,600 stocks rising [1][2]. Sector Performance - The semiconductor sector experienced notable growth, with stocks like Juguang Technology rising over 17%, Chipone Technology nearly 15%, and Guoxin Technology increasing over 12%. Other semiconductor stocks also saw gains of over 6% [5]. - The precious metals sector saw a short-term rally, with Hunan Silver hitting the daily limit, and other stocks like Silver Industry and Zhongjin Gold also rising. Spot gold increased by 0.91% to $5,012 per ounce, while spot silver rose by 4.97% to $81.62 per ounce [5]. - The communication services sector saw afternoon gains, with stocks like Guanghuan New Network rising over 10% [5]. Company Highlights - Chinese online literature company Zhongwen Online reported a 20% increase in stock price, reaching 35.28 yuan [3]. - The company SFC announced a restructuring investment agreement with Anhui Guozhi, which could lead to a change in control for the company [5]. - Pop Mart, a Hong Kong-based company, saw its stock price rise nearly 6%, reaching 257 HKD, marking a new high since October 2025. The company reported over 10,000 global employees and over 100 million registered members [8]. Technological Advancements - Recent advancements in AI video generation were highlighted, with companies like Keling AI and ByteDance launching new models that enhance video and image generation capabilities. These developments are expected to drive growth in the content creation sectors such as film, gaming, and advertising [4].
行业周报:周观点:2026年,多模态模型有望迎来DS时刻
KAIYUAN SECURITIES· 2026-02-08 10:45
Investment Rating - The industry investment rating is "Positive" (maintained) [1] Core Insights - The report highlights that 2026 is expected to be a pivotal year for multimodal models, which are anticipated to drive growth in the film, gaming, and advertising sectors due to significant improvements in capabilities and cost reductions [5][12][15] - The launch of the Keling 3.0 series and Byte's Seedance 2.0 marks a competitive breakthrough in the multimodal field, enabling comprehensive video production processes and enhancing user experience [6][13] - Keling AI has rapidly commercialized its multimodal models, boasting over 60 million creators and generating more than 600 million videos, with an annual revenue run rate of $240 million [7][14] Summary by Sections Industry Overview - The computer index fell by 3.27% during the week of February 2-6, 2026, while the CSI 300 index decreased by 1.33% [4][16] Market Dynamics - The report discusses the competitive landscape with the introduction of advanced models like Keling 3.0 and Seedance 2.0, which are set to redefine video creation and content production [6][13] - The report emphasizes the importance of commercial viability for multimodal models, suggesting that 2026 will be crucial for cost reduction and quality enhancement [14] Investment Recommendations - Beneficiary companies include Wanjing Technology, Haitai Ruisheng, Hongsoft Technology, and others, with a focus on AI application investment opportunities [8][15]
行业周报:周观点:2026年,多模态模型有望迎来DS时刻-20260208
KAIYUAN SECURITIES· 2026-02-08 10:13
Investment Rating - The investment rating for the computer industry is "Positive" (maintained) [1] Core Insights - The report highlights that 2026 is expected to be a pivotal year for multimodal models, which are anticipated to drive growth in the film, gaming, and advertising sectors due to significant improvements in capabilities and reductions in costs [5][12][15] - The launch of the Keling 3.0 series and Byte's Seedance 2.0 models marks a significant advancement in the multimodal field, enabling comprehensive video production processes and enhancing competition among industry players [6][13] - Keling AI has rapidly commercialized its multimodal models, boasting over 60 million creators and generating more than 600 million videos by the end of 2025, with an annual revenue run rate of $240 million [7][14] Summary by Sections Industry Overview - The computer index fell by 3.27% during the week of February 2-6, 2026, while the CSI 300 index decreased by 1.33% [4][16] Multimodal Model Developments - The initial Sora model by OpenAI, launched in February 2024, is compared to a significant breakthrough in video technology, with subsequent models showing substantial advancements [5][12] - The Keling 3.0 series, launched on February 5, 2026, integrates various multimedia capabilities, marking a new era in AI-driven content creation [6][13] Commercialization Potential - The report emphasizes that 2026 will be crucial for the commercialization of multimodal models, driven by enhanced model capabilities and reduced costs, which will lower barriers to entry for users [7][14] Investment Recommendations - Beneficiaries of the anticipated growth in the multimodal sector include companies such as Wanjing Technology, Haitai Ruisheng, and Hongsoft Technology, among others [8][15]
Attention真的可靠吗?上海大学联合南开大学揭示多模态模型中一个被忽视的重要偏置问题
机器之心· 2026-02-04 01:04
一、研究意义 近年来,视觉 — 语言模型(Vision-Language Models,VLMs)在图像理解、视觉问答、多模态对话等任务中表现突出,并逐渐成为通用人工智能的重要技术基 础。然而,这类模型在实际部署时往往面临一个现实挑战: 模型推理成本高,速度慢。 近年来,Vision-Language Models(视觉 — 语言模型)在多模态理解任务中取得了显著进展,并逐渐成为通用人工智能的重要技术路线。然而,这类模型在实际应 用中往往面临推理开销大、效率受限的问题,研究者通常依赖 visual token pruning 等策略降低计算成本,其中 attention 机制被广泛视为衡量视觉信息重要性的关键 依据。 近日,上海大学曾丹团队联合南开大学研究人员,从 attention 可靠性的角度出发,系统揭示了 Vision-Language Models 中普遍存在的 attention 偏置问题,并提出了 一种无需重新训练的 attention 去偏方法,在多个主流模型、剪枝策略及图像与视频基准上验证了其有效性,为多模态模型的高效、可靠部署提供了新的思路。 为提升效率,研究者通常会采用 visual t ...
京产大模型成果登上国际顶级期刊
Xin Lang Cai Jing· 2026-01-29 20:54
2018年以来,GPT采用"预测下一个词元(Next-token prediction,NTP)"的自回归路线,实现了语言大 模型重大突破,开启了生成式人工智能浪潮。而擅长同时处理文字、图片、视频等多种形态信息的多模 态模型主要依赖对比学习、扩散模型等专门路线。在此背景下,一个重要问题困扰行业数年:能否用一 种简单、统一的方法即自回归路线,让AI(人工智能)同时学会高效地处理文字、图片和视频? 智源这项名为"通过预测下一个词元进行多模态学习的多模态大模型"的成果给出了肯定的答案。该成果 表明,只采用自回归路线,就可以统一多模态学习,训练出优秀的原生多模态大模型,这对于确立自回 归成为生成式人工智能统一路线具有重大意义。 (来源:千龙网) 值得一提的是,基于这一核心路径的迭代版本Emu3.5模型,已展现出对物理世界运行规律的初步学习 与模拟能力,能够尝试预测场景的下一步变化,为发展更通用、更接近人类认知方式的大模型与智能体 奠定了基础。 当地时间1月28日,北京智源人工智能研究院的一项突破性研究成果在国际顶级学术期刊《自然》 (Nature)上线,这是我国科研机构主导的大模型成果首次在《自然》正刊发表。 据悉, ...
Kimi发布并开源K2.5模型
Zheng Quan Shi Bao Wang· 2026-01-27 06:37
人民财讯1月27日电,1月27日,月之暗面Kimi发布并开源Kimi K2.5模型。据介绍,K2.5模型是Kimi迄 今最全能的模型,原生的多模态架构设计,同时支持视觉与文本输入、思考与非思考模式、对话与 Agent任务。 ...
又见印奇
3 6 Ke· 2026-01-27 00:25
Core Insights - The article discusses the evolution of AI commercialization, focusing on the experiences and insights of Yin Qi, founder of Megvii Technology, and his current role at StepFun. It highlights the challenges faced in the AI 1.0 era and the shift towards more viable business models in the AI 2.0 landscape. Group 1: AI Commercialization Challenges - Yin Qi reflects on the difficulties of closing the commercial loop during the AI 1.0 era, which significantly impacted his ventures [3] - He emphasizes that once a business model fails, it is challenging to revert, leading to a lack of scalable profits and viable products [4] - The majority of the "Six Little Tigers" in the AI sector are still in the early stages of commercialization, struggling to find effective business models [4] Group 2: Insights on Competitors and Market Dynamics - Yin Qi expresses skepticism about the commercialization strategies of many AI startups in Silicon Valley, noting that Google has an advantage due to its established revenue streams [4] - He identifies xAI, associated with Tesla, as having a potentially successful commercial model due to its strong integration of software and hardware capabilities [5] Group 3: StepFun's Strategic Direction - StepFun has recently secured over 5 billion RMB in funding, setting a record for single financing rounds in the domestic large model sector [6] - The company aims to combine AI with smart terminals, focusing on hardware development alongside foundational model research [7][10] - StepFun's recent release of the Step3-VL-10B model demonstrates superior performance in benchmarks compared to larger models, indicating a strong position in the market [8] Group 4: Talent and Team Composition - StepFun's team comprises top talents from Megvii and Microsoft, maintaining a high density of expertise and a balanced skill set [12] - Yin Qi hopes to attract back some of the talent that has left for other companies in the sector, emphasizing the importance of a strong team for future success [13] Group 5: Long-term Vision and Philosophy - Yin Qi advocates for a long-term approach to business, focusing on delivering tangible commercial results rather than merely pursuing theoretical advancements [15] - He acknowledges a shift from a passionate to a more pragmatic mindset, prioritizing clear customer and commercial value in AI developments [15]
北京形成人工智能闭环式产业生态
Bei Jing Shang Bao· 2026-01-25 17:18
Core Insights - The artificial intelligence industry has transitioned from a phase of technological exploration to a focus on practical applications, with a notable shift towards multi-agent systems that outperform single-agent systems in specific tasks [1] - AI is expanding beyond digital realms into the physical world, moving towards multimodal models and addressing core challenges such as temporal and spatial cognition [1] - Beijing is positioned as a central hub for AI development, benefiting from a comprehensive ecosystem that supports industry growth [1] Industry Development - By 2025, Beijing's core AI industry is expected to reach a scale of 450 billion yuan, with over 2,500 companies, accounting for approximately half of the national figures [2] - The city is home to nearly 60 listed companies and around 40 unicorns in the AI sector, including the first domestic AI chip and large model companies [2] - Beijing has 148 scholars listed in the "AI 2000 Global Most Influential Scholars" list, representing over 40% of the national total, with a total of 15,000 AI scholars in the city [2] Ecosystem and Policy Support - A comprehensive policy framework and a complete layout from foundational computing power to application scenarios have created a closed-loop industrial ecosystem in Beijing [2] - The collaboration between research institutions, enterprises, and policy levels is driving breakthroughs in new technologies and applications in the AI field [2] - There is an expectation that 2026 will be a pivotal year for the explosion of intelligent agents in China [2]
2026北京两会|对话市政协委员王仲远:北京形成了人工智能闭环式产业生态
Bei Jing Shang Bao· 2026-01-25 11:17
Core Insights - The artificial intelligence industry has transitioned from a phase of rapid development to a more pragmatic focus on application efficiency, particularly moving from single-agent systems to multi-agent systems [2][5] - Beijing is positioned as a core hub for AI development, with a comprehensive ecosystem that supports the industry through policies, talent, and technological advancements [3][6] Industry Trends - The development of foundational models, especially large language models, has slowed, while the application of these models is accelerating, emphasizing the shift towards multi-agent systems [5][9] - AI is expanding beyond digital realms into the physical world, necessitating advancements in multi-modal models and world models to tackle challenges in time-space cognition and physical reasoning [2][5] Market Potential - By 2025, Beijing's AI core industry is expected to reach a scale of 450 billion yuan, with over 2,500 companies, accounting for about half of the national figures [3] - The city is home to nearly 60 listed AI companies and around 40 unicorns, showcasing its leadership in the AI sector [3] Talent and Education - Beijing boasts a significant talent pool, with 148 individuals listed in the "AI 2000 Global Influential Scholars" ranking, representing over 40% of the national total [3][7] - The city has a complete talent development chain, supported by top universities and research institutions, fostering the growth of AI professionals [7][8] Policy and Ecosystem - The policy framework in Beijing is comprehensive and practical, supporting both disruptive innovations and the development of new research institutions, which contributes to a closed-loop industrial ecosystem [6][8] - The collaboration between research institutions, enterprises, and policy-makers is driving breakthroughs in new technologies and applications in the AI field [3][6] Future Outlook - The year 2026 is anticipated to be a pivotal year for the explosion of intelligent agents in China, with expectations for significant advancements in multi-agent systems [3][8] - The focus is on achieving commercial viability for large models, which is essential for high-quality development in the industry [9][10]
国内外AI应用冰火两重天-模型和应用的矛盾加剧
2026-01-20 01:50
Summary of Key Points from Conference Call Industry Overview - The AI application landscape is experiencing a stark contrast between domestic and international markets, with increasing contradictions between models and applications [1] - The semiconductor industry is in a significant expansion phase, driven by TSMC's increased capital expenditure forecast of 30%-40%, indicating strong demand confidence for the next two to three years [1][4] - Storage prices are rising rapidly due to resource factors, while power equipment supply and capacity issues may become long-term constraints [1][5] Core Insights and Arguments - TSMC's capital expenditure is projected to exceed $50 billion, marking the largest increase in recent years, which alleviates concerns about a peak in capital spending [4] - The AI industry in the US and China shows a clear divergence in stock performance, attributed to differences in technological development paths and market demands [3] - Multi-modal models, such as Google's NanoBanana, are expected to transform from generative tools to productivity tools by 2025, significantly enhancing potential applications in programming and healthcare [1][6] Storage Demand Changes - There is a noticeable shift in storage demand from training to inference, driven by the development of reasoning models that require extensive context information [7][8] - The demand for SSDs is expected to grow in tandem with the Agent market stabilizing, reflecting a critical change in storage needs [8] AI Model Development - The leading companies in foundational models are Anthropic, OpenAI, and Gemini, with significant advancements in multi-modal models enhancing AI's ability to process visual information [6][9] - Reinforcement learning is being integrated into vertical models, allowing AI to mimic human problem-solving approaches, which is particularly beneficial in specialized fields [10][11] Market Focus Differences - The domestic market is more focused on consumer (C-end) development, with major players like Alibaba, ByteDance, and Tencent leading the competition, while the overseas market emphasizes business-to-business (B-end) development [12] - Alibaba's Tongyi Qianwen integrates various traffic sources into a single entry point, enhancing product parsing capabilities and potentially stabilizing stock price fluctuations [14] Competitive Strategies - ByteDance's approach involves consolidating AI functions within its operating system, while Alibaba's strategy focuses on integrating its ecosystem into a super app format [13] - Tencent is transforming mini-programs into Agents, distributing AI functionalities across applications [13] International AI Company Developments - OpenAI and Anthropic have reached valuations in the tens of billions, with Anthropic gaining significant market attention due to its focus on programming workflows [15][17] - Google's release of automated node editing tools is impacting traditional workflow tools, although its primary focus remains on consumer applications [16] Investment Considerations - Companies like Google, Tencent, Alibaba, and Kuaishou are seen as clear investment targets due to their self-owned traffic ecosystems and proprietary model capabilities [21] - In the B2B application space, companies like Figma and Adobe need to demonstrate resilience against AI disruptions, while those focused on vertical model development are less affected [21]