人工智能
Search documents
多模态Deep Research,终于有了「可核验」的评测标准
机器之心· 2026-02-14 07:32
Deep Research Agent 火了,但评测还停在「 看起来很强 」。 写得像论文,不等于真的做了研究。 尤其当证据来自图表、截图、论文图、示意图时:模型到底是「 看懂了」,还是 「 编得像懂了」? 俄亥俄州立大学与 Amazon Science 联合牵头,联合多家高校与机构研究者发布 MMDeepResearch-Bench(MMDR-Bench) ,试图把多模态 Deep Research 的评估 从「 读起来不错」,拉回到一个更硬的标准: 过程可核验、证据可追溯、断言可对齐 。 MMDR-Bench 与评测框架相关资源已公开: 论文标题: MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents 论文主页:https://mmdeepresearch-bench.github.io/ 论文链接: https://arxiv.org/abs/2601.12346 github 链接:https://github.com/AIoT-MLSys-Lab/MMDeepResearch-Bench Huggingface 链 ...
Agent、图像、视频全是大版本升级:春晚还没开,豆包AI就火了
机器之心· 2026-02-14 07:32
Core Insights - 2026 is anticipated to be a pivotal year for AI, with significant advancements and competition among major players like ByteDance, OpenAI, and Anthropic [1][2] - The launch of new AI models, including ByteDance's Doubao 2.0 and Seedance 2.0, marks a substantial leap in capabilities, particularly in multi-modal understanding and video generation [3][4] Group 1: AI Model Developments - Anthropic and OpenAI have released new foundational models, leading to significant market reactions and a loss of nearly a trillion dollars in market value for major companies [2] - ByteDance's Doubao 2.0 is a multi-modal agent model that has achieved significant improvements in multi-modal understanding, enterprise-level agent capabilities, and reasoning abilities [5][6][12] - Doubao 2.0 has outperformed competitors in various benchmarks, including math and visual reasoning, achieving top scores in multiple assessments [9][10][14] Group 2: Seedance 2.0 and Video Generation - Seedance 2.0 has gained widespread popularity, showcasing its ability to create high-quality videos from text prompts, with notable examples including the adaptation of a short sci-fi story [44][53] - The model supports mixed-modal inputs, allowing users to combine images, videos, audio, and text for video generation, significantly enhancing creative possibilities [56] - Seedance 2.0's video generation capabilities are considered industry-leading, with improvements in realism, physical accuracy, and narrative control [57][60] Group 3: Competitive Landscape - The AI landscape is becoming increasingly competitive, with ByteDance positioning itself alongside major players like OpenAI and Google, particularly in the fields of image and video generation [61][73] - The advancements in AI technology are transforming the upcoming Spring Festival into a battleground for technological innovation rather than just a peak in user traffic [68][74] - The comprehensive technological advancements across various AI domains, including speech and robotics, provide ByteDance with the confidence to compete on a global scale [70][73]
字节大模型,重磅发布!
证券时报· 2026-02-14 07:32
Core Viewpoint - ByteDance has made significant advancements in the field of multimodal AI with the release of Doubao Model 2.0, showcasing its technological leadership and comprehensive layout in the industry [1][6]. Group 1: Key Features of Doubao Model 2.0 - Doubao Model 2.0 features three major highlights: enhanced visual and multimodal understanding, improved execution of complex instructions, and faster, more flexible reasoning choices [3][2]. - The model has significantly improved its ability to analyze complex documents, tables, graphics, and video content, achieving top-tier performance in visual reasoning and perception tasks [3][4]. - Doubao Model 2.0 offers different sizes of general agent models (Pro, Lite, Mini) and a specialized Code model to meet various application needs [3][6]. Group 2: Impact on the Industry - The upgrades in Doubao Model 2.0, along with the video generation model Seedance 2.0 and image creation model Seedream 5.0 Lite, are expected to drive demand in downstream applications such as short video marketing, e-commerce content, AI dramas, and game production [17][16]. - The model's capabilities are anticipated to lower the barriers for converting text IP into video content, benefiting companies with a large reserve of quality IP [17]. - The demand for cloud training and inference computing power is expected to rise, driving growth in AI chips, smart servers, and cloud computing services [17][18]. Group 3: Market Reception and Usage - Doubao Model 2.0 has garnered widespread attention in the industry, with its daily usage expected to exceed 63 trillion tokens by December 2025, making it the leading model in China and third globally [18]. - The model's release has been met with enthusiasm from industry professionals, with notable figures expressing their intent to utilize the technology for creative projects [14][7].
豆包大模型2.0发布,Pro版全面对标GPT 5.2
2 1 Shi Ji Jing Ji Bao Dao· 2026-02-14 07:20
Core Insights - Doubao model has officially entered its 2.0 phase, focusing on systematic optimizations for large-scale production environments [1][3] Group 1: Doubao 2.0 Features - Doubao 2.0 includes Pro, Lite, Mini, and Code models, designed to adapt to various business scenarios [3] - Doubao 2.0 Pro targets deep reasoning and long-chain task execution, competing directly with GPT 5.2 and Gemini 3 Pro [3] - Doubao 2.0 Lite balances performance and cost, surpassing the previous generation Doubao 1.8 in overall capabilities [3] - Doubao 2.0 Mini is aimed at low-latency, high-concurrency, and cost-sensitive scenarios [3] - The Code version is specifically designed for programming tasks and works best in conjunction with TRAE [3] Group 2: Product Launch and User Experience - Doubao 2.0 Pro is now available on Doubao App, desktop, and web versions, allowing users to experience the "expert" mode [3] - The Code version has been integrated with AI programming product TRAE, and API services for Doubao 2.0 models are now available for enterprises and developers via Volcano Engine [3] - Seedance 2.0, a video generation model, has been officially released and is fully integrated into Doubao and Jimeng products, with a user experience center launched for testing [3][4] Group 3: Seedance 2.0 Capabilities - Seedance 2.0 allows users to generate 5 or 10-second videos by entering prompts in the Doubao App [4] - The model supports synchronized audio-visual generation, multi-shot long narratives, and controllable multi-modal outputs [4] - The company acknowledges that Seedance 2.0 is not yet perfect and aims to improve the alignment between large models and human feedback for better audio-visual production tools [4]
冲刺3000亿!常州高新区发布“432”目标,开启“十五五”全面进位战
Yang Zi Wan Bao Wang· 2026-02-14 07:16
这份家底有数据为证:138家企业跻身全市税收450强,占比超三成;其中107家入围工业税收300强,占 比超35%。纳税超亿元的工业企业达35家。这不仅是"高新实力"的注脚,更是"高新担当"的体现。 常州高新区的产业底气,首先源于其"扎得深"的产业集群和"站得稳"的企业矩阵。常州高新区党工委书 记、新北区委书记石旭涌十分感慨,在常州高新区,企业的发展虽然少有"一夜开花",但贵在"持久绽 放"。全区3828家规上企业,尤其是1638家规上工业企业构成的基本盘,在这里深深扎下了根。 农历春节前夕,常州以一场高规格的现代化产业体系推进大会,擘画"国际化智造名城、长三角创新高 地"的未来航向。与之同频共振,常州高新区也以一场推进大会,明晰了"产创融合新高地、开放活力示 范区、现代生态宜居城"的进阶之路。 而是一种勇于创新、敢于定义未来的本能和一种懂得培育、愿意包容的长期主义生态思维。正是这种基 因,汇聚起了独特的产业流量、人才流量与资本流量,形成了"螺旋式上升"的高质量发展路径。面 向"十五五",常州高新区提出了 更具雄心的"432"目标:主要经济指标贡献度从超20%提升至超1/4,质量型指标贡献度从超30%提升至 ...
都在等梁文锋:AI战事正酣梁文锋却静悄悄,有时候,越是平静,对手越是害怕
Xin Lang Cai Jing· 2026-02-14 07:13
Core Insights - The article discusses the intense competition among internet giants in the AI large model sector, highlighting the ambitions of companies to establish their AI applications as the primary traffic entry point [4][23] - DeepSeek, founded by Liang Wenfeng, emerged as a significant player in the AI landscape with its R1 model, which was launched at a surprisingly low cost, challenging the perception of high investment requirements for top-tier models [14][31] - Despite the competitive environment, DeepSeek has maintained a low profile, with recent updates suggesting a potential new model release, V4, but with no official confirmation [26][27] Industry Competition - Major companies are aggressively distributing cash incentives to attract users, with Tencent offering 1 billion yuan, Baidu 500 million yuan, and Alibaba 3 billion yuan, indicating a fierce battle for user engagement [25] - The launch of new models by ByteDance and Alibaba, including the 2.0 versions of their respective models, reflects a rapid evolution in AI capabilities and competition [8][25] - The article notes a peculiar competitive dynamic where companies are responding to each other's moves, creating a sense of mutual awareness in the market [8][25] DeepSeek's Position - DeepSeek's recent updates include an increase in context window length from 128K tokens to 1 million tokens, suggesting advancements in their technology [26] - The company continues to recruit talent despite a slowdown in hiring across the industry, indicating its commitment to innovation and development [27] - Liang Wenfeng's vision for DeepSeek is to lead in AI research and development, aiming to create a general-purpose AI that goes beyond existing models [31][32] User Engagement and Market Dynamics - The article emphasizes the importance of addressing user needs in the AI sector, with companies like DeepSeek beginning to focus on consumer-facing products [33] - The competition is framed as a quest to meet real user demands, which will determine the leading players in the AI landscape [36] - The article concludes that the current battle among internet giants is crucial for defining the next decade of internet order, highlighting the strategic significance of user engagement in AI applications [36]
“还难过呢?那就难过着吧”,DeepSeek变冷漠甚至凶凶的?它自己解释了一下
Xin Lang Cai Jing· 2026-02-14 07:13
Core Viewpoint - The recent upgrade of the domestic AI assistant DeepSeek has led to user complaints about its perceived coldness, shifting from an empathetic "partner" to a more transactional "customer service" role, sparking discussions on balancing AI efficiency and emotional value [1][2]. User Feedback - Users have expressed dissatisfaction on social media, noting that DeepSeek no longer uses personalized nicknames, instead referring to users generically as "users" [2]. - Complaints include that DeepSeek's responses have become overly formal and lacking in emotional depth, with some users describing the new model as "stupid" and reminiscent of outdated literary styles [2]. - While some users appreciate the more objective and rational responses, others feel that the AI has become cold and indifferent [2]. Company Response and Strategy - DeepSeek has stated that the perceived coldness is not intentional, emphasizing a focus on problem-solving over emotional engagement [3][4]. - The company acknowledges that the recent adjustments may have sacrificed some quality for speed, as it prepares for the upcoming V4 version set to launch in February 2026 [2][4]. - Commentary from industry experts suggests that DeepSeek's strength lies in its technical capabilities rather than emotional engagement, and that its focus on algorithmic breakthroughs could significantly contribute to the Chinese AI industry [4].
罗卫红聚焦“阳光雨露”:让创新环境孕育发展新可能
Xin Lang Cai Jing· 2026-02-14 07:11
Group 1 - The core focus is on optimizing the innovation environment to foster new development possibilities in technology [1][3] - The "Hangzhou Six Little Dragons" have gained a competitive edge in artificial intelligence, with a goal to become the leading city for AI innovation [3] - There is a significant issue of "data scarcity" in the AI industry, which hinders the transition of technological achievements from laboratories to large-scale applications [3] Group 2 - The importance of technology finance as a crucial part of the innovation environment has been emphasized, with suggestions to cultivate patient capital and support the establishment of innovation consortiums [5] - The 2025 initiative by the Hangzhou Municipal Committee of the Jiusan Society aims to promote the transfer and transformation of scientific and technological achievements [5] - Recommendations will be brought to the National People's Congress to adjust state-owned capital investment assessments to encourage innovation and tolerate failure [5] Group 3 - There are challenges in technology education in primary and secondary schools, particularly in equipment configuration and teacher training [6] - A call for better planning and resource consolidation to enhance technology education and establish a curriculum that fosters student innovation capabilities [6] - Continuous efforts are expected to expand high-quality development opportunities through strengthened innovation ecosystems at both national and local levels [6]
都在等梁文锋
投资界· 2026-02-14 07:08
Core Viewpoint - The article discusses the intense competition among major internet companies in China to dominate the AI model application space, highlighting the strategic positioning of Deep Seek and its founder Liang Wenfeng as a significant player in this evolving landscape [2][4]. Group 1: AI Competition Landscape - Major internet giants are aggressively investing in user incentives, with Tencent distributing 1 billion yuan in cash red envelopes, Baidu offering 500 million yuan for promoting its Wenxin assistant, and Alibaba launching a 3 billion yuan campaign [4]. - The competition is characterized by rapid product releases, with ByteDance announcing its Doubao model 2.0 and Alibaba introducing its Qwen-Image 2.0 model, indicating a synchronized response among competitors [5][6]. Group 2: Deep Seek's Positioning - Deep Seek, founded by Liang Wenfeng, has maintained a low profile despite its significant achievements, including the release of the R1 model in early 2025, which matched top global models at a fraction of the cost [2][9]. - The company is rumored to be preparing to launch its next-generation model, V4, aimed at coding AI, but has remained silent on the exact timeline [6][10]. - Deep Seek's recent updates have increased its context window from 128K tokens to 1 million tokens, suggesting ongoing advancements in its technology [6]. Group 3: Liang Wenfeng's Background - Liang Wenfeng, born in 1985 in Guangdong, has a strong academic background in computer science and has been involved in AI and quantitative trading since his university days [7][8]. - He co-founded Hangzhou Huafang Technology, which became a significant player in quantitative trading, and later established Deep Seek to pursue general artificial intelligence [9]. Group 4: User-Centric Approach - Deep Seek is shifting its focus towards user experience and product innovation, as evidenced by its recent job postings aimed at enhancing C-end product functionality [10][11]. - The article emphasizes the importance of addressing real user needs in the AI sector, suggesting that the ability to solve genuine problems will determine the success of AI applications [11].
Grok和谷歌Gemini瓜分ChatGPT美国市场份额
Jin Rong Jie· 2026-02-14 07:03
Core Insights - xAI's chatbot Grok has increased its market share in the U.S. from 14% in December last year to 17.8% in January this year, making it the third most popular chatbot in the U.S. behind ChatGPT (52.9%) and Gemini (29.4%) [1] - Both Gemini and Grok are rapidly capturing the market share lost by ChatGPT [1]