多模态AI

Search documents
火山引擎多模态数据湖架构升级,驱动企业迈向AI原生时代
Cai Fu Zai Xian· 2025-06-17 08:15
火山引擎多模态数据湖解决方案在此背景下持续迭代。此前,该方案已实现海量结构化、半结构化及非 结构化数据的统一管理,为LLM(大语言模型)全生命周期训练提供数据支持。此次升级进一步强化了多 模态数据处理能力:新增模型数据处理蒸馏与多模态分析能力,优化与火山引擎各平台的联动机制,通 过MCP(多模态认知平台)简化数据开发流程,帮助企业高效识别与利用多模态数据资产。 在技术落地层面,火山引擎多模态数据湖聚焦三大核心场景: 2025年6月,火山引擎FORCE原动力大会在北京举办。火山引擎数智平台正式发布多模态数据湖全新产 品架构。该架构通过存储与计算能力的深度优化,构建兼容文本、图像、音频、视频等多元数据的处理 框架,为企业打造适应Agentic AI(智能体人工智能)时代的新一代AI Native数据基础设施,助力企业从 传统商业智能向AI驱动的决策模式转型。 随着全球数据规模爆发式增长,非结构化数据与多模态AI解决方案的占比正快速攀升。IDC预测,到 2028年全球数据总量将达393ZB,其中超80%为非结构化数据;Gartner则指出,到2027年,40%的生成 式AI解决方案将采用多模态技术,较2023年的1 ...
MiniMax发布推理模型对标DeepSeek,算力成本仅约53万美元
Di Yi Cai Jing· 2025-06-17 07:26
Core Insights - MiniMax, one of the "Six Little Dragons," has announced significant updates, starting with the release of its first open-source inference model, MiniMax-M1 [1] - MiniMax-M1 has shown competitive performance in benchmark tests, comparable to leading overseas models like DeepSeek-R1 and Qwen3 [3] - The model's training was completed in just three weeks using 512 H800 GPUs, with a total computing cost of only $534,700, which is an order of magnitude lower than initially expected [3][8] Performance Metrics - MiniMax-M1's context window length is 1 million tokens, which is eight times that of DeepSeek R1 and matches Google's Gemini 2.5 Pro, allowing superior performance in long-context understanding tasks [5] - In the TAU-bench evaluation, MiniMax-M1 outperformed DeepSeek-R1-0528 and Google's Gemini 2.5 Pro, ranking just below OpenAI o3 and Claude 4 Opus globally [7] - The model excels in coding capabilities, significantly surpassing most open-source models, with only a slight gap behind the latest DeepSeek R1 [7] Innovations and Cost Efficiency - MiniMax-M1 utilizes a hybrid architecture based on a lightning attention mechanism, enhancing efficiency in long-text input and deep reasoning tasks [7] - The introduction of the CISPO reinforcement learning algorithm has resulted in faster convergence performance compared to Byte's recent DAPO algorithm, contributing to the low training cost [8] - MiniMax's pricing strategy is tiered based on input length, with costs ranging from $0.8 to $2.4 per million tokens for input and $8 to $24 for output, offering competitive pricing against DeepSeek [8] Competitive Landscape - Concurrently, another competitor, Moonlight, has released its programming model Kimi-Dev-72B, which reportedly achieved the highest open-source model level in SWE-bench tests, surpassing the new DeepSeek-R1 [8] - However, Kimi-Dev-72B faced scrutiny for potential overfitting, as it generated less code than required for certain tasks, raising questions about its performance reliability [9] - The AI industry is witnessing renewed competition among the "Six Little Dragons," with MiniMax expected to release further updates in the coming days, potentially impacting the multi-modal AI landscape [9]
【公告全知道】谷子经济+多模态AI+短剧游戏+华为鸿蒙!公司多款谷子产品上线即售罄
财联社· 2025-06-12 14:31
Group 1 - The article highlights the importance of weekly announcements from Sunday to Thursday, which include significant stock market updates such as suspensions, increases or decreases in holdings, investment wins, acquisitions, earnings reports, and unlocks [1] - A company has successfully obtained multiple international IP licenses for domestic derivative products, with several of its millet products selling out immediately upon launch [1] - Another company has delivered samples of humanoid robot dexterous hand reducer bearings to clients, showcasing advancements in controllable nuclear fusion, solid-state batteries, nuclear energy, and state-owned enterprise reform [1] - The company focusing on innovative drugs has entered the maintenance dose phase for its semaglutide injection project, with expectations to apply for market approval in China by 2026 [1]
传媒行业周报:关注火山引擎原动力大会,聚焦AI应用及IP商业化行业周报
KAIYUAN SECURITIES· 2025-06-09 01:13
投资评级:看好(维持) 行业走势图 58% 传媒 沪深300 2025 年 06 月 08 日 数据来源:聚源 -29% -14% 0% 14% 29% 43% 2024-06 2024-10 2025-02 相关研究报告 《模型与应用再升级,新游表现亮眼, 继续布局 AI 、 IP — 行业周报》 -2025.6.2 《AI 社交应用不断推新,IP 产业资本 化、多元化加快 — 行 业 周 报 》 -2025.5.25 《多模态 AI 继续迭代,IP 产业资本化 或加快—行业周报》-2025.5.18 关注火山引擎原动力大会,聚焦 AI 应用及 IP 商业化 ——行业周报 | 方光照(分析师) | 田鹏(分析师) | | --- | --- | | fangguangzhao@kysec.cn | tianpeng@kysec.cn | | 证书编号:S0790520030004 | 证书编号:S0790523090001 | tianpeng@kysec.cn 证书编号:S0790523090001 火山引擎原动力大会及苹果 WWDC25 将举行,快手可灵 AI 商业化加快 6 月 11-12 日,字节跳 ...
一度飙涨超180%!可控核聚变概念,大爆发
Zheng Quan Shi Bao Wang· 2025-05-26 09:20
Market Overview - A-shares experienced slight fluctuations with major indices showing mixed results, as the North Stock Exchange 50 surged nearly 2% near the close, while the ChiNext Index barely held above the 2000-point mark, and the Shanghai 50 fell below 2700 points, marking a two-week low [1] - The total trading volume shrank to below 1 trillion yuan, the lowest in over a month [1] Index Performance - The Shanghai Composite Index closed at 3346.84, down 0.05% with a trading volume of 400.53 billion yuan [2] - The Shenzhen Component Index was at 10091.16, down 0.41% with a trading volume of 609.43 billion yuan [2] - The ChiNext Index closed at 2005.26, down 0.80% with a trading volume of 268.70 billion yuan [2] - The North Stock Exchange 50 rose to 1396.59, up 1.94% with a trading volume of 24.10 billion yuan [2] Sector Performance - The controllable nuclear fusion, gaming, artificial intelligence, and millet economy sectors saw significant gains, while passenger vehicles, chemical pharmaceuticals, energy metals, and liquor sectors faced declines [2] - The electronic industry attracted over 6.2 billion yuan in net inflow, while the automotive sector saw a net outflow of over 2.9 billion yuan [3] Investment Insights - Zhongyou Securities noted that the A-share index has rebounded to levels prior to the US-China trade war 2.0, indicating a need for new catalysts to boost market confidence [3] - According to招商证券, external tariff uncertainties remain, and more policy support is needed for stable internal growth, with a focus on sectors like automotive, non-ferrous metals, defense, and chemical pharmaceuticals [3] Nuclear Energy Sector - The nuclear energy sector experienced a significant rally, with the controllable nuclear fusion sector leading the gains, and related stocks like 哈焊华通 (20% limit up) and 常辅股份 also performing strongly [3][4] - The global narrative around nuclear power is expected to strengthen due to new policies from the Trump administration, which aims to expand the US nuclear energy sector significantly by 2050 [5][8] Artificial Intelligence Sector - The artificial intelligence sector saw a strong upward trend, with multiple sub-sectors closing at their highest points, driven by recent conferences and the introduction of new AI standards [5][6] - Companies like 中邮科技 and 星宸科技 saw significant gains, with many stocks hitting their daily limit [5] Conclusion - The current market dynamics indicate a mixed performance across sectors, with notable strength in nuclear energy and artificial intelligence, while broader market indices face challenges that require new catalysts for growth [3][5][6]
目标出货一亿台,Altman和Ive的新公司「io」到底要做什么硬件?
Founder Park· 2025-05-23 11:01
Core Insights - Sam Altman and Jony Ive are collaborating to create a new hardware device, which aims to be the third core device on users' desks after the MacBook Pro and iPhone [1][4][5] - OpenAI has announced the acquisition of Jony Ive's AI hardware startup "io" for nearly $6.5 billion, with plans to ship 100 million units of the new device [1][4][8] - The device is designed to reduce users' reliance on screens and is not intended to be a smartphone or wearable technology [1][5][10] Summary by Sections Acquisition and Collaboration - OpenAI's acquisition of "io" is seen as a significant opportunity, with Altman suggesting it could generate up to $1 trillion in additional value for the company [4][9] - The collaboration between Altman and Ive has evolved over the past 18 months, with a focus on developing a device that serves as a core interaction point between users and OpenAI [10] Device Concept and Design - The new device will be pocket-sized and designed for easy placement on desks, emphasizing a low-profile design [5][10] - Altman and Ive believe that existing devices do not meet user needs, and the new product aims to change how users interact with AI [10] Market Context and Competition - The announcement comes amid other tech giants like Google and Apple launching their own AI hardware products, including smart glasses [2][9] - Altman acknowledges the challenges of entering the hardware market, especially against established companies like Apple and Google [8][9]
数据复盘丨银行、保险等行业走强 29股获主力资金净流入超亿元
Zheng Quan Shi Bao Wang· 2025-05-22 09:57
涨停股中,从连续涨停天数来看,大于或等于2天的个股有19只,其中,宜宾纸业、棕榈股份、浪莎股份、三生国健、 滨海能源、ST岭南、*ST苏吴、*ST岩石、*ST节能、*ST双成、*ST赛隆均4连板,连续涨停板数量最多;其次是慧博云 通、莱绅通灵均3连板;永安药业、廊坊发展、重庆港、宝光股份、汇得科技、通达电气均2连板。 沪深两市主力资金净流出252.63亿元 6个行业主力资金呈现净流入 5月22日,上证指数全天窄幅震荡;深证成指、创业板指全天震荡走低;科创50指数早盘探底回升,随后震荡回落,临 近午盘有所回升,午后回落走低。截至收盘,上证指数报3380.19点,跌0.22%,成交额4383.35亿元;深证成指报 10219.62点,跌0.72%,成交额6643.55亿元;创业板指报2045.57点,跌0.96%,成交额2940.63亿元;科创50指数报 990.71点,跌0.48%,成交额164.4亿元。沪深两市合计成交11026.9亿元,成交额较上一交易日减少707.88亿元。 银行、保险等行业走强宜宾纸业、棕榈股份等股4连板 从盘面上来看,行业板块、概念跌多涨少。其中,银行、保险、传媒等行业涨幅靠前;重组蛋 ...
对话阶跃星辰段楠:“我们可能正触及 Diffusion 能力上限”
AI科技大本营· 2025-05-20 01:02
Core Viewpoint - The article discusses the advancements and future potential of video generation models, emphasizing the need for deeper understanding capabilities in visual AI, moving beyond mere generation to true comprehension [1][5][4]. Group 1: Video Generation Models - The team at Jumpscale has open-sourced two significant video generation models: Step-Video-T2V and Step-Video-TI2V, both with 30 billion parameters, which have garnered considerable attention in the AI video generation field [1][12]. - Current diffusion video models, even at 30 billion parameters, show limited generalization capabilities compared to language models, but possess strong memory capabilities [5][26]. - The future of video generation models may involve a shift from mere generation to models that possess deep visual understanding, requiring a change in learning paradigms from mapping learning to causal prediction learning [5][20]. Group 2: Challenges and Innovations - The article outlines six major challenges in AI-generated content (AIGC), focusing on data quality, efficiency, controllability, and the need for high-quality data [39][32]. - The integration of autoregressive and diffusion models is seen as a promising direction for enhancing video generation and understanding capabilities [21][20]. - The importance of high-quality, diverse natural data is highlighted as a critical factor in building robust foundational models, rather than relying heavily on synthetic data [14][16]. Group 3: Future Predictions - Predictions indicate that foundational visual models with deeper understanding capabilities may emerge within the next 1-2 years, potentially leading to a "GPT-3 moment" in the visual domain [4][36]. - The convergence of video generation with embodied intelligence and robotics is anticipated, providing essential visual understanding capabilities for future AI applications [37][42]. - The article suggests that the future of AIGC will enable individuals to easily create high-quality content, democratizing content creation [38][48].
百度居然悄悄拿了个榜单第一,关键是……他们自己好像还不知道?
3 6 Ke· 2025-05-19 11:57
再去刷了一圈AI圈子,结果发现好多KOL也都一脸蒙圈: 周末,躺在公园百无聊赖刷手机的我,差点被一条消息惊掉下巴: 全球AI圈公认的权威视频生成评测榜单VBench刚刚更新了最新一期图生视频(I2V)排名,排在第一,不是大名鼎鼎的OpenAI Sora,也不是 风头正劲的谷歌Imagen Video,而是百度的视频生成模型Steamer-I2V,总分更是飙到了89.38%! 讲真,我第一眼看到的时候也是满脸问号:百度?图生视频?榜单第一???这是啥情况? 和圈内朋友打了一通电话后,我发现,这是一个基于市场实际需求的明智选择。 首先,大家卷T2V(文生视频)热闹归热闹,但是真正用下来就发现问题不少:比如生成结果不可控,经常会"惊喜"变"惊吓",商业化难度很大。 相较于文生视频常见的不确定性和难以控制的结果,I2V(图生视频)的模式更像是给AI一个"明确的起点",提供了更高的可控性和稳定性。 只要上传一张图片,再输入一些简单的描述,就能自动生成一条专业级视频,成本甚至不到传统制作的1/20——自然,也就更容易被品牌和企业用户接 受。 "什么情况?VBench榜单第一怎么突然被百度承包了?" 想象一下,如果你是一个 ...
【私募调研记录】龙赢富泽调研海天瑞声
Zheng Quan Zhi Xing· 2025-05-16 00:13
Group 1 - The core viewpoint of the article highlights the recent research conducted by Longying Fuze on the listed company Haitan Ruisheng, focusing on its revenue growth driven by advancements in multimodal AI technology and data service operations in Southeast Asia [1] - Haitan Ruisheng's revenue growth in Q1 2025 is attributed to the iteration of multimodal large models, high-quality image/video data acquisition, increased demand for scenario-based text data, and the operation of its Southeast Asia data delivery system [1] - The company is exploring the development of public data resources, training data annotation talents, and establishing local data annotation bases, indicating a strategic focus on enhancing its data capabilities [1] Group 2 - Haitan Ruisheng has become a significant data service provider in collaboration with telecom operators, suggesting a sustained increase in data demand in the future [1] - The revenue growth points for 2025 are expected to stem from the evolution of multimodal AI technology, deep applications in vertical industries, and the operation of the Southeast Asia data delivery system [1] - The company’s core competitiveness lies in its dual-mode service products, technological platform capabilities, supply chain resource management, and data security and compliance abilities [1] Group 3 - The main competitors of Haitan Ruisheng include domestic companies such as Data Hall and Biao Bei, as well as international firms like Appen and potentially Scale AI, which possess stronger technological attributes [1]