Workflow
多模态
icon
Search documents
【公告全知道】谷子经济+多模态AI+短剧游戏+华为鸿蒙!公司多款谷子产品上线即售罄
财联社· 2025-06-12 14:31
Group 1 - The article highlights the importance of weekly announcements from Sunday to Thursday, which include significant stock market updates such as suspensions, increases or decreases in holdings, investment wins, acquisitions, earnings reports, and unlocks [1] - A company has successfully obtained multiple international IP licenses for domestic derivative products, with several of its millet products selling out immediately upon launch [1] - Another company has delivered samples of humanoid robot dexterous hand reducer bearings to clients, showcasing advancements in controllable nuclear fusion, solid-state batteries, nuclear energy, and state-owned enterprise reform [1] - The company focusing on innovative drugs has entered the maintenance dose phase for its semaglutide injection project, with expectations to apply for market approval in China by 2026 [1]
不靠价格战,豆包大模型靠技术杀出重围
Jing Ji Guan Cha Wang· 2025-06-12 13:51
Core Insights - ByteDance's subsidiary Volcano Engine launched new AI models, including Doubao 1.6 and Seedance 1.0 pro, at the Force Original Power Conference, marking a significant step towards the Agentic AI era [1][2] - The Doubao model has achieved a daily token usage of over 16.4 trillion, a 137-fold increase since its initial release, and holds a 46.4% market share in China's public cloud model market [1][2] - The company emphasizes long-term investment in technology innovation to enhance industrial applications and maintain a competitive edge in the AI landscape [2][13] Product Development - Doubao 1.6 supports multi-modal understanding and graphical interface operations, allowing it to perform tasks such as booking hotels and organizing receipts into Excel [3][5] - Seedance 1.0 pro can generate high-quality 1080P videos with seamless transitions, ranking first globally in video generation tasks [3][5] - The introduction of a pricing model based on input length significantly reduces costs, making advanced AI capabilities more accessible to enterprises [5][8] Market Positioning - Doubao models are utilized by 9 out of the top 10 global smartphone manufacturers, 80% of mainstream automotive brands, and 70% of systemically important banks in China [2][6] - The rapid growth in token consumption across various applications indicates a deepening integration of AI models in multiple industries, including finance, automotive, and education [4][6] Strategic Vision - The company aims to redefine the role of AI in business processes, transitioning from traditional software to Agent-based systems that enhance productivity [13][16] - ByteDance's commitment to technology innovation and cost reduction reflects a balanced approach to achieving commercial success while addressing social responsibilities [14][15] Industry Impact - The rise of Agentic AI is seen as a pivotal moment for digital transformation across industries, with the potential to reshape business processes and industry dynamics [16] - ByteDance's advancements in AI technology are expected to drive significant changes in how enterprises operate, enhancing efficiency and fostering innovation [16]
何小鹏:大模型道路,大家都在摸着石头过河
news flash· 2025-06-12 11:31
Core Viewpoint - The CEO of Xiaopeng Motors, He Xiaopeng, emphasized the importance of the new driving assistance chip "Turing" during the launch of the G7 SUV, indicating that the industry is still exploring the path of large models in autonomous driving technology [1] Group 1: Company Insights - Xiaopeng Motors introduced its latest SUV model, the G7, on June 10, highlighting the significance of the "Turing" chip for driving assistance [1] - The majority of the launch event was dedicated to discussing the capabilities and features of the "Turing" chip, showcasing the company's focus on advanced technology [1] Group 2: Industry Trends - The VLA solution is emerging as a preferred choice among leading players in China's driving assistance sector, with competitors like Li Auto also developing this solution [1] - There is a divergence in approaches between domestic companies and Tesla, with Tesla continuing to focus on an "end-to-end" solution rather than engaging with multi-modal large models [1]
格灵深瞳: 国泰海通证券股份有限公司关于北京格灵深瞳信息技术股份有限公司部分募投项目变更实施地点的核查意见
Zheng Quan Zhi Xing· 2025-06-12 10:28
Fundraising Overview - The company raised a total of RMB 182,622.31 million from the public offering of 46,245,205 shares at a price of RMB 39.49 per share, with a net amount of RMB 167,009.02 million after deducting fees [1][4] - The company has an excess raised fund of RMB 67,009.02 million [1] Project Investment Status - The company announced the use of raised funds for the "Multimodal Large Model Technology and Application R&D Project," with a total investment of RMB 100,006.17 million allocated for this project [1][2] Change of Project Implementation Location - The implementation location for the "Multimodal Large Model Technology and Application R&D Project" is being changed from Yanqing District to Daxing District, while still maintaining the original location in Haidian District [1][2] - The new location in Daxing District is strategically positioned with ample office space and proximity to key transportation hubs, enhancing operational efficiency and project management [1][2] Impact of Location Change - The change in location aligns with the company's long-term development strategy and does not affect the project's content or the intended use of raised funds [3][4] - The company will adhere to relevant regulations and strengthen supervision over the use of raised funds to ensure legality and effectiveness [3][4] Review and Approval Process - The change in project location was approved by the company's board and supervisory committee, confirming compliance with regulatory requirements [3][4]
展览展示|抢位2025智能机器人关键技术大会!高曝光商务合作虚位以待,共赴解锁新机遇
机器人圈· 2025-06-12 10:14
会 议 通 知 各有关单位: 主办单位: 《机器人技术与应用》杂志社 由《机器人技术与应用》杂志社发起,中国自动化学会机器人专业委员会,中国人工智能学会智能机器人专业委 员会、中国仪器仪表学会智能车与机器人专委会和 中国工程建设焊接协会机器人及智能焊接专业委员会 联合支持的" 2025智能机器人关键技术大会 "将于 2025年7月22-24日 在 齐齐哈尔市 举办,大会以"具身智能与多模态交互技 术的融合与突破"为主题,围绕机器人及人工智能领域前沿技术、关键共性技术、产业化路径与标准化建设和跨学科融 合等领域展开交流。 大会由张建伟院士、刘连庆、吴新宇、宋锐、訾斌、付宜利、张建华等教授联合发起,以"具身智能与多模态交互 技术的融合与突破"为主题,邀请王田苗、孙立宁、杨广中、赵杰、董凯、孙富春、刘辛军、喻俊志、喻洪流、刘洪 海、文力、徐静等行业专家学者与大家分享前沿技术和科研进展。 本次大会将携手行业顶流期刊联合征文,录用稿件均将于正刊发表,并于 2025年12月底前出版 ,诚邀大家积极 踊跃投稿!我们也热忱欢迎国内机器人领域相关企业、实验设备研发机构等报名参展,展示最新研究成果 诚挚邀请广大深耕于智能机器人及 ...
姜大昕走“窄门”
3 6 Ke· 2025-06-12 10:11
Core Insights - The article discusses recent personnel changes and strategic shifts at Jumpspace, highlighting the departure of Tech Fellow Duan Nan to JD's research institute and the cessation of investment in the role-playing agent product "Bubbling Duck" [1][32] - Jumpspace aims to focus on developing a native multimodal large model, which is seen as a challenging path with limited visibility in the competitive landscape of AI startups [4][22] Group 1: Personnel Changes and Strategic Shifts - Duan Nan, previously the head of video generation models at Jumpspace, has left to lead the visual and multimodal lab at JD's research institute [1][32] - The company has reportedly merged the team behind "Bubbling Duck" into its dialogue product, now known as "Jumpspace AI," retaining only a few employees for maintenance [1][4] - Jumpspace's response to the changes indicates a strategic pivot towards focusing on agent development as multimodal and reasoning capabilities mature by 2025 [1][4] Group 2: Market Position and Competitiveness - Despite being recognized as a "multimodal king," Jumpspace has struggled to gain significant market presence compared to competitors like Kimi and MiniMax, which have clearer branding and market strategies [4][6][22] - As of March 2025, Jumpspace's AI application has not made it to the top 15 in monthly active users, suggesting a lack of traction in the market [6][12] - The company’s cautious approach to marketing and investment contrasts sharply with competitors who have more aggressive funding and marketing strategies [8][28] Group 3: Technical Ambitions and Challenges - Jumpspace's ambition to create an end-to-end native multimodal large model is seen as a bold but risky strategy, with the potential for significant technological breakthroughs if successful [15][17][22] - The company faces challenges in attracting developers and users, as its models are perceived as lacking distinctiveness compared to offerings from other firms [14][22] - The competitive landscape is intensifying, with established players and emerging startups vying for talent and market share, putting pressure on Jumpspace to deliver results [25][30] Group 4: Future Outlook and Funding Needs - Jumpspace's future success hinges on its ability to demonstrate tangible results in its ambitious multimodal model development, which remains in the conceptual phase [22][24] - The company needs to secure additional funding to support its long-term goals, especially as the investment climate for AI startups has become more challenging [26][28] - The urgency for Jumpspace to prove its value proposition to investors is critical, as the competitive environment continues to evolve rapidly [30][31]
CVPR2025视频生成统一评估架构,上交x斯坦福联合提出让MLLM像人类一样打分
量子位· 2025-06-12 08:17
Video-Bench 视频评估框架,能够通过模拟人类的认知过程,建立起连接文本指令与视觉内容的智能评估体系。 简单地说,能够让多模态大模型(MLLM)"像人一样评估视频"。 实验结果表明,Video-Bench不仅能精准识别生成视频在物体一致性(0.735相关性)、动作合理性等维度的缺陷,还能稳定评估美学质量等 传统难题,显著优于现有的评估方法。 Video-Bench团队 投稿 量子位 | 公众号 QbitAI 视频生成技术正以前所未有的速度革新着当前的视觉内容创作方式,从电影制作到广告设计,从虚拟现实到社交媒体,高质量且符合人类期望 的视频生成模型正变得越来越重要。 那么,要如何评估AI生成的视频是否符合人类的审美和需求呢? Video-Bench的研究团队来自上海交通大学、斯坦福大学、卡内基梅隆大学等机构。 Video-Bench:基于MLLM的自动化视频评估框架 Video-Bench团队在面对已有的视频评估方法时,发现了两个问题: 1.简单的评分规则往往无法捕捉视频流畅度、美学表现等复杂维度—— 那么,当评判"视频质量"时,如何将人类出于"直觉"的模糊感受转化为可量化的评估指标? 2.现有基于大语 ...
CVPR2025视频生成统一评估架构,上交x斯坦福联合提出让MLLM像人类一样打分
量子位· 2025-06-12 08:16
Video-Bench团队 投稿 量子位 | 公众号 QbitAI 视频生成技术正以前所未有的速度革新着当前的视觉内容创作方式,从电影制作到广告设计,从虚拟现实到社交媒体,高质量且符合人类期望 的视频生成模型正变得越来越重要。 那么,要如何评估AI生成的视频是否符合人类的审美和需求呢? Video-Bench 视频评估框架,能够通过模拟人类的认知过程,建立起连接文本指令与视觉内容的智能评估体系。 简单地说,能够让多模态大模型(MLLM)"像人一样评估视频"。 实验结果表明,Video-Bench不仅能精准识别生成视频在物体一致性(0.735相关性)、动作合理性等维度的缺陷,还能稳定评估美学质量等 传统难题,显著优于现有的评估方法。 Video-Bench的研究团队来自上海交通大学、斯坦福大学、卡内基梅隆大学等机构。 Video-Bench:基于MLLM的自动化视频评估框架 Video-Bench团队在面对已有的视频评估方法时,发现了两个问题: 1.简单的评分规则往往无法捕捉视频流畅度、美学表现等复杂维度—— 那么,当评判"视频质量"时,如何将人类出于"直觉"的模糊感受转化为可量化的评估指标? 2.现有基于大语 ...
实测豆包1.6,最火玩法all in one!Seedance登顶视频生成榜一,豆包APP全量上线
量子位· 2025-06-12 07:11
海淀区高考模拟卷,豆包1.6文理科成绩全部突破700分,理科成绩更是比去年的豆包提升了154分。 | 海淀模拟全卷 | | | --- | --- | | 豆包大模型1.6: | 豆包-240615: | | 理科: 656+50=706 | 理科: 502+50=552 | | 文科:662+50=712 | 文科:572+50=622 | 视频领域, Seedance 1.0 Pro 亮相即登顶全球竞技场文生视频、图生视频双料第一。 明敏 发自 凹非寺 量子位 | 公众号 QbitAI 不愧是字节,一发大模型,各模态榜单格局全部被重构! 最新豆包大模型1.6系列 ,"小版本"更新但推理、数学、多模态能力全部冲入 全球第一梯队 。 | Artificial Analysis Video Arena Leaderboard | | | | | | Artificial Analysis Video Arena Leaderboard | | | | | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | | | Text to Video | ...
作业帮亮相2025AI+研发数字峰会 展示多模态交互技术创新成果
Zhong Jin Zai Xian· 2025-06-12 06:48
Group 1 - The 2025 AI+ R&D Digital Summit was held in Shanghai, focusing on "Embracing AI to Reshape R&D," featuring leading internet companies and experts sharing cutting-edge topics [1] - Zhou Shuran, a senior algorithm expert from Zuoyebang, highlighted the limitations of traditional voice interaction and the potential of large model technology to enhance user experience [3] - Zuoyebang has integrated voice recognition, natural language processing, and voice generation into a "Understanding-Reasoning-Generating" multimodal solution, significantly improving interaction efficiency and intelligence [3] Group 2 - In 2024, Zuoyebang plans to launch a fully end-to-end voice and streaming full-duplex interaction system, reducing first response time (TTFT) and first voice generation time (TTFS) through innovative design and optimization [4] - Voice interaction is positioned as the most natural human-computer interface, with Zuoyebang committed to advancing Voice-Agent technology to enhance educational experiences [4] - Zuoyebang's multimodal interaction technology has been scaled in various products, including the top educational app "Kuaidui AI," which has over 12 million daily active users and features an AI oral teacher solution [6]