SenseNova
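Shanghai's large-model "team" enters a high-yield phase (Shenzhou Kandian): a generated "cat diving" video draws 300 million views in one week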
Ren Min Ri Bao· 2025-07-03 00:10
Core Insights
- MiniMax, a Shanghai-based AI company, has launched the world's first open-source large-scale hybrid architecture inference model, MiniMax-M1, which ranks second globally among open-source models [1]
- The company has also released video generation model Hailuo 02, which achieved 300 million views within a week of its release on social media [1][6]
- MiniMax distinguishes itself by not following mainstream dense architectures and traditional attention mechanisms, focusing instead on AGI since before the rise of ChatGPT [1][8]

Performance and Cost Efficiency
- The competition in large models is shifting from mere parameter scale to efficiency, cost, and overall implementation capabilities [2]
- M1 supports a context input of 1 million tokens, comparable to Google's latest closed-source model Gemini 2.5 Pro, while its reinforcement learning phase cost only $535,000 [2]
- Hailuo 02 directly competes with Google's third-generation video generation model Veo3, showcasing superior performance in generating coherent and logical video sequences [3]

Innovation in AI Video Generation
- Hailuo 02 has pioneered a new category of AI video called "Animal Olympics" [4]
- The development of Hailuo 02 involved collaboration with a diverse team of directors, screenwriters, and artists to ensure high-quality output [5]
- High-quality data, innovative algorithms, and meticulous training processes are cited as key factors in the success of Hailuo 02 [6]

Strategic Positioning
- MiniMax remains one of the few startups still committed to foundational model research amidst a trend of major companies reducing their efforts in this area [7]
- The company is exploring "sparse activation" MoE architecture to reduce computational costs, diverging from the prevalent dense architecture approach [8]
- MiniMax aims to stay competitive in the long-term race of large model development, collaborating with other major players in Shanghai's AI ecosystem [9]
CICC | AI智道 (9): Breakthroughs in multimodal reasoning technology, extending to in-vehicle scenarios
中金点睛· 2025-06-02 23:45
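By Yu Zhonghai, Wei Guanfei, Xiao Kai, Zhao Liping. CICC Research. Taking MiniMax's new V-Triune framework as an example, a unified framework for reasoning and perception has received initial validation of its scalability and generalization. V-Triune uses a three-layer component architecture to bring visual reasoning and perception tasks into a single reinforcement learning framework: 1) multimodal sample data formatting; 2) verifier reward computation, using an asynchronous client-server architecture that decouples reward computation from the main training loop; 3) data-source-level metric monitoring, which makes it easier to trace problems and improve stability. Combined with engineering optimizations such as a dynamic IoU reward mechanism and frozen ViT parameters, the Orsta model series (32B parameters) achieved a performance gain of up to 14.1% on the MEGA-Bench Core benchmark. Multimodal reasoning helps step up intelligent driving capabilities. In intelligent driving, multimodal reasoning is an important way to strengthen the recognition and judgment of road traffic signs and to improve generalization in complex scenarios, and it is becoming a focus of algorithm development at leading intelligent driving companies. On May 30, 2025, the first version of NIO's world model NVM began rolling out. It offers full-scope understanding, imaginative reconstruction, and reasoning, can understand and reason over real-time multimodal information about the environment, and shows significant gains in scenarios such as choosing the optimal ETC lane and autonomous route-finding in parking lots. In addition, Li Auto's in-house VLA large model also has chain-of-thought reasoning capability, using multimodal reasoning to mimic how a human driver thinks. Figure 1: MiniMax multimodal RL ...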
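The snippet mentions a "dynamic IoU reward mechanism" without giving details. The sketch below is only one plausible reading: the reward for a predicted bounding box is its IoU with the ground-truth box, zeroed out when it falls under a threshold that tightens as training progresses. The function names, the `(x1, y1, x2, y2)` box layout, and the threshold schedule are all assumptions made for illustration; none of them are taken from V-Triune itself.

```python
from typing import Sequence, Tuple

Box = Tuple[float, float, float, float]  # assumed layout: (x1, y1, x2, y2)

# Hypothetical schedule: (training progress at which it applies, IoU threshold).
# These values are illustrative only, not the ones used by V-Triune.
DEFAULT_SCHEDULE: Sequence[Tuple[float, float]] = ((0.0, 0.5), (0.3, 0.75), (0.7, 0.9))


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = max(0.0, a[2] - a[0]) * max(0.0, a[3] - a[1])
    area_b = max(0.0, b[2] - b[0]) * max(0.0, b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def dynamic_threshold(progress: float,
                      schedule: Sequence[Tuple[float, float]] = DEFAULT_SCHEDULE) -> float:
    """Return the threshold of the last schedule stage whose start <= progress."""
    threshold = schedule[0][1]
    for start, value in schedule:
        if progress >= start:
            threshold = value
    return threshold


def dynamic_iou_reward(pred: Box, target: Box, progress: float) -> float:
    """IoU as reward, but only if it clears the current (progress-dependent) threshold."""
    score = iou(pred, target)
    return score if score >= dynamic_threshold(progress) else 0.0
```

Under this assumed schedule, a prediction with IoU 0.75 earns a reward of 0.75 early in training (`progress=0.0`) and 0.0 late in training (`progress=0.8`), so the same box is judged more strictly as the policy matures.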
SenseTime - TechNet China 2025: Foundation model introduced; expanding AI-powered application scenarios
2025-06-02 15:44
27 May 2025 | 7:19AM HKT TechNet China 2025: SenseTime (0020.HK) Foundation model introduced; expanding AI-powered user case We hosted SenseTime's management on May 21 at our TechNet Conference China 2025 in Shanghai. Management remains positive on the generative AI trend in China, and highlights their newly launched foundation model, SenseNova V6, carrying upgraded features with competitive costs across training and inferencing. The company also newly signed a MOU with the Faculty of Law at the Chinese Uni ...
TechNet China 2025: SenseTime (0020.HK) introduces foundation model; expanding AI-powered use cases
Goldman Sachs· 2025-05-28 05:15
27 May 2025 | 7:19AM HKT TechNet China 2025: SenseTime (0020.HK) Foundation model introduced; expanding AI-powered user case We hosted SenseTime's management on May 21 at our TechNet Conference China 2025 in Shanghai. Management remains positive on the generative AI trend in China, and highlights their newly launched foundation model, SenseNova V6, carrying upgraded features with competitive costs across training and inferencing. The company also newly signed a MOU with the Faculty of Law at the Chinese Uni ...
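AI terminals "flex their muscles" in Shenzhen: AI grades homework on the spot, and robots handle both brain work and physical tasks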
Nan Fang Du Shi Bao· 2025-05-23 08:14
Core Insights
- The 2025 Global AI Terminal Exhibition in Shenzhen showcased over 300 companies from 15 countries, highlighting the transition of AI terminals from experimental to practical applications [1]
- Shenzhen launched two industrial funds totaling 7 billion yuan, focusing on AI smartphones, humanoid robots, and large model integrated machines, signaling a priority on terminal development [1]
- The exhibition serves as a window into Shenzhen's evolution from hard technology innovation to a collaborative industrial ecosystem [1]

AI Glasses
- TCL's latest AR smart glasses, the RayNeo X3 Pro, let users take photos and get real-time object recognition and explanations through voice commands [2][3]
- The glasses weigh only 76 grams and combine photo capture, translation, and navigation, making them a true multi-modal AI terminal [2]
- RayNeo has held the largest share of China's AR glasses market for three consecutive years, with over 50% market share in Q1 of this year [3]

AI Large Models in Education
- SenseTime's SenseNova V6 series integrates AI large models with educational scenarios and can correct homework and provide step-by-step explanations [4][5]
- The model supports multi-modal recognition and real-time interaction, enhancing the learning experience by identifying errors and guiding students [5]
- The education sector is seen as a high-frequency area for large model deployment due to its structured data and high user acceptance [6]

Humanoid Robots
- The exhibition featured humanoid robots like TORA-ONE, demonstrating advanced dexterity and precision in tasks such as screwing in light bulbs [7]
- The robot's capabilities are supported by proprietary high-precision tactile sensing technology, allowing it to perceive various physical properties [7]
- Shenzhen's robot industry is projected to exceed 200 billion yuan in total output by 2024, with a year-on-year growth of 12.58% [8]

Investment and Ecosystem Development
- Shenzhen's AI and humanoid robot industry funds aim to support startups and foster unicorns, with a total of 7 billion yuan allocated [8]
- The city has over 2,600 AI-related companies, forming a robust ecosystem that supports rapid product iteration and market application [9]
- The robot industry in Shenzhen is expected to reach a total output value of 201.2 billion yuan by the end of 2024, with a significant number of enterprises contributing to the sector's growth [9]
Mosu Space (模速空间) reshapes the full-chain AI ecosystem
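Mosu Space. Photo by reporter Song Weiping. ◎ Reporters Song Weiping and Tan Rong. On the Xuhui Riverside along Shanghai's Huangpu River, an innovation engine known as the "AI super factory" is reshaping the artificial intelligence industry landscape at remarkable speed. On a recent visit to Mosu Space, a Shanghai Securities News reporter found a steady stream of visitors. Branded events such as "Mosu Roadshow Day" and "Mosu Observation", along with technical salons of all kinds, were in full swing, giving entrepreneurs a stage for cross-sector exchange and business cooperation. MiniMax (稀宇极智) plans to release its latest model, and StepFun (阶跃星辰) has released and open-sourced its 3D large model Step1X-3D... Nurtured by an ecosystem in which "upstairs and downstairs are upstream and downstream", companies keep upgrading their "model power" (模力), with algorithm optimization and product innovation sparking over clinking coffee cups. As the core vehicle for Shanghai's participation in global AI competition, Mosu Space aims to become "a future landmark on a par with Silicon Valley". It will join forces with top research institutions such as the Chinese Academy of Sciences and Shanghai Jiao Tong University, together with hundred-billion-scale industrial funds and computing resources on the scale of ten thousand accelerator cards, to build a full-chain ecosystem spanning industry, academia, research, and application. Limited space, boundless "model power". Mosu Space has gathered more than one hundred innovative companies. The "Big Dipper" (北斗七星) benchmark companies shine there alongside standout large-model ecosystem companies such as 秘塔科技, 无限光年, and 它石智航, jointly accelerating breakthroughs at the technical frontier. Among them, 无问芯穹, StepFun, MiniMax, and others are each showing the strength of combining their capabilities with real scenarios, in areas such as computing power and AIGC video, and by building a "technology R&D, scenario validation, commercial closed loop" ...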
Shanghai lights up its AI "Big Dipper" as stars shine on the west bank of the Huangpu River
第一财经· 2025-05-14 10:01
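The following article is from IT Times (IT时报), by Sun Yan. Artificial intelligence, a national strategic priority, is iterating on itself at remarkable speed and driving transformation and upgrading across industries. SenseTime, founded in 2014, settled in the West Bund area of Shanghai's Xuhui District in 2017 and has both witnessed and taken part in the development of AI in Shanghai. In the year it arrived, the General Office of the Shanghai Municipal People's Government issued the notice "Implementation Opinions on Promoting the Development of Next-Generation Artificial Intelligence in Shanghai". From the era of AI visual recognition through to the era of large models, forward-looking planning has kept SenseTime on the right path. 2025 is the first year of large-model applications. When SenseTime's Xu Li and Alibaba's Jack Ma exchanged views from a distance, both said that AI should become an "everyday item" for ordinary people. After all, cost-effectiveness is the line between life and death for large-model applications. On April 10 this year, SenseTime announced that it would issue 100 million yuan in computing vouchers to accelerate the commercial deployment of large models. Beyond computing power as infrastructure, SenseTime also takes on the role of foundation-model provider, continually pushing performance to stand alongside its global peers. On April 10, SenseTime released the SenseNova V6 (日日新) large model system, with multimodal reasoning capabilities benchmarked against Op ...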
CICC AI industry update: Agents released in quick succession, MCP ecosystem flourishing rapidly
中金· 2025-04-22 04:46
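2025042120250416 Summary
• OpenAI released the o3 and o4-mini series models, which incorporate image reasoning and have built-in web search, document parsing, image generation, and other functions. Although the release did not cause a sensation, it showed OpenAI's continued investment in AI technology. In the Sora update, the text-to-image feature shows good instruction following and style switching.
• At Google Cloud Next, Google launched the Gemini 2.5 series of reasoning models, which offer hybrid reasoning, and introduced an agent-to-agent protocol to promote collaboration. Google also updated its video, language, and music generation and image editing features and integrated them deeply with Google Workspace, strengthening its enterprise products.
• Meta released Llama 4. As a foundation model widely used by the global open-source community, its innovations bring significant progress to that community. Llama 4 comes in three versions, the largest of which is still in training. The Maverick version performs well but is controversial. Overall, the family shows strong tool-calling capability, high speed, and good cost-effectiveness.
• SenseTime released the SenseNova V6 series models, which feature ultra-long chains of thought and support image-text multimodal reasoning, 与阿 ...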
AI developments roundup: Meta open-sources Llama 4, OpenAI launches Pioneers Program
China Post Securities· 2025-04-15 10:50
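Research Institute. Analyst: Xiao Chengzhi, SAC registration no.: S1340524090001, Email: xiaochengzhi@cnpsec.com. Research assistant: Feng Yuwen, SAC registration no.: S1340124100011, Email: fengyuwen@cnpsec.com. Securities research report: financial engineering report. Published: 2025-04-15.
Recent research reports:
"Small-cap style persists, high- and low-volatility styles alternate: China Post Factor Weekly 20250413" - 2025.04.14
"Will there be one 'last drop' in April? Micro-cap Index Weekly 20250406" - 2025.04.07
"Since '924', market lows have followed each defensive turn in margin financing; continue to watch tech allocation opportunities: Sector Rotation Weekly 20250330" - 2025.03.31
"NVIDIA holds GTC 2025; reasoning models such as Skywork-R1V and Hunyuan T1 go live one after another: AI Developments Roundup 20250324" - 2025.03.25
"Reversal effect is strong, GRU model hits new high: China Post Factor Weekly 20250323" - 2025.03.24
"Micro-caps lead gains to a record high, adjustment pressure remains as April approaches: Micro-cap Index Weekly 20250316" - 2025.03.1 ...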
SenseTime, valued at 54 billion, plays a new card
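As soon as he took the stage, SenseTime chairman and CEO Xu Li remarked, "If you go three months without updating your understanding, you may be left behind." On April 10, SenseTime held its 2025 Technology Exchange Day, where Xu Li officially released the newly upgraded "日日新 SenseNova V6" (hereafter "SenseNova V6") large model system. In Xu Li's view, the development of multimodal models and the development of artificial general intelligence are roughly equivalent, and for SenseTime, which started in computer vision, moving from vision capabilities to native multimodal models is a natural extension. Lin Dahua, SenseTime co-founder and chief scientist for large models, told 21CBR that the company began exploring multimodality in May and June of last year, and that by September and October the technical route was largely proven. Lin said the company focuses on multimodal reasoning, rather than competing in the text-only track, because it firmly believes that future interaction will inevitably be multimodal. SenseNova V6, a native multimodal general-purpose MoE model with more than 600 billion parameters, can handle text, multimodal, and other tasks with a single model. Its technical breakthroughs fall into four areas: long chain of thought: more than 200B of high-quality multimodal long chain-of-thought data, with chains of thought up to 64K long; math and science capability: data analysis capability well ahead of GPT-4o; reasoning capability: first in China in multimodal deep reasoning, benchmarked against OpenAI o1; global memory: the first in China to break through on long-video understanding, supporting understanding and deep reasoning over 10-minute videos. Also worth noting is long memory. Lin Dahua ...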