多模态AI
Search documents
中胤时尚涨3.53%,成交额6380.54万元,后市是否有机会?
Xin Lang Cai Jing· 2025-12-15 08:00
Core Viewpoint - The company Zhongyin Fashion has shown a significant increase in stock price and market activity, with a focus on its business operations in the footwear industry and emerging technologies like virtual digital humans and AI. Group 1: Company Performance - On December 15, Zhongyin Fashion's stock rose by 3.53%, with a trading volume of 63.81 million yuan and a market capitalization of 3.802 billion yuan [1] - The company reported a revenue of 264 million yuan for the period from January to September 2025, reflecting a year-on-year decrease of 8.48%, while the net profit attributable to shareholders was -12.32 million yuan, indicating a significant increase of 50.10% year-on-year [7][8] - The company's main business revenue composition includes 77.12% from supply chain integration, 6.93% from footwear production, 6.61% from design services, 4.59% from brand operations, and 1.46% from cultural tourism services [7] Group 2: Industry and Market Trends - The company established a footwear production base in Xinjiang in response to national policies supporting the development of the western region, which aligns with the "three-child policy" and benefits from the depreciation of the RMB [2] - As of the 2024 annual report, overseas revenue accounted for 83.07% of total revenue, benefiting from the depreciation of the RMB [3] - The company is involved in advanced technologies related to virtual digital humans, with its subsidiary, Xinchangyuan Technology, developing products that support multi-modal content generation [3]
商汤(00020)日日新Seko系列模型与寒武纪成功适配 国产算力&多模态AI实现关键跨越
智通财经网· 2025-12-15 06:22
Core Insights - SenseTime Technology officially launched Seko 2.0, the industry's first multi-episode generative intelligent agent, leveraging its technological accumulation in generative AI and multimodal interaction [1] - The Seko series models, including SekoIDX and SekoTalk, provide a robust technical foundation for image and video generation, showcasing significant advantages in consistency for multi-episode video generation [1] - The collaboration with Cambricon (688256.SH) marks a key advancement in supporting AIGC core scenarios, facilitating a critical leap from language to multimodal capabilities [1] Group 1 - The Seko series models have been adapted to domestic AI chips, enhancing the support for visual content innovation and development within the domestic AI ecosystem [1] - The LightX2V framework is designed with a highly compatible domestic adaptation plugin model, currently supporting multiple domestic chips, including Cambricon [1] - Innovations such as low-bit quantization, compressed communication, and sparse attention mechanisms have been integrated into the Seko series models, resulting in over three times improvement in inference performance [1] Group 2 - SenseTime and Cambricon have established a strategic partnership to optimize software and hardware jointly, aiming to create an open and win-win industrial ecosystem [2] - The collaboration focuses on continuous optimization of core model capabilities, enhancing overall efficiency and response speed for multimodal generation [2] - Efforts will be made to improve computing resource utilization and cost efficiency through operator fusion and automatic tuning, allowing more enterprises to access high-performance multimodal capabilities at lower costs [2] Group 3 - The partnership aims to foster the prosperity and development of the domestic AI application ecosystem, creating more efficient and user-friendly tiered product systems [3] - The goal is to build a more open and friendly tool and ecosystem for developers, stimulating innovation in cutting-edge applications [3]
商汤日日新Seko系列模型与寒武纪成功适配,国产算力&多模态AI实现关键跨越
Ge Long Hui· 2025-12-15 06:05
Group 1 - SenseTime officially launched Seko 2.0, the industry's first multi-episode generative agent, showcasing significant advantages in consistency for multi-episode video generation [1] - The Seko series models, including SekoIDX and SekoTalk, are built on SenseTime's proprietary technology, which has been adapted to support domestic AI chips from Cambricon, marking a key leap from language to multi-modal capabilities [1] - The LightX2V framework is designed with a highly compatible domestic adaptation plugin model, currently supporting multiple domestic chips, including Cambricon, enhancing the performance of the Seko series models [1] Group 2 - In October, SenseTime and Cambricon established a strategic partnership to optimize software and hardware jointly, facilitating a collaborative innovation between domestic large models and computing power [2] - The partnership aims to continuously optimize core model capabilities, enhance computing efficiency, and reduce resource consumption, allowing more enterprises to access high-performance multi-modal capabilities at lower costs [2] - The collaboration will also focus on improving large-scale parallel processing capabilities and developing a more flexible resource management mechanism to ensure stable model operation across diverse environments [2] Group 3 - The deep collaboration between SenseTime and Cambricon is expected to significantly enhance model efficiency, resource utilization, and cross-hardware compatibility, lowering the barriers to using multi-modal AI [3] - The partnership aims to foster a thriving domestic AI application ecosystem, creating more efficient and user-friendly product systems while providing developers with open and friendly tools [3]
DeepSeek倒逼vLLM升级,芯片内卷、MoE横扫千模,vLLM核心维护者独家回应:如何凭PyTorch坐稳推理“铁王座”
3 6 Ke· 2025-12-15 00:36
Core Insights - vLLM has rapidly become a preferred inference engine for global tech companies, with GitHub stars increasing from 40,000 to 65,000 in just over a year, driven by the open-source PagedAttention technology [1] - Neural Magic played a crucial role in vLLM's success, utilizing a "free platform + open-source tools" strategy to build a robust enterprise-level inference stack and maintain a library of pre-optimized models [1] - Red Hat's acquisition of Neural Magic in November 2024, including key team members like Michael Goin, is expected to enhance vLLM's competitive edge in the AI large model sector [1][2] Development and Optimization - The vLLM core team, led by Michael Goin, has shifted focus from optimizing Llama models to enhancing features related to the DeepSeek model, particularly with the release of DeepSeek R1 [3] - The development cycle for version 0.7.2 was tight, efficiently supporting Qwen 2.5 VL and introducing a Transformers backend for running Hugging Face models [3] - Version 0.7.3 marked a significant update with numerous contributors involved, enhancing DeepSeek with multi-token prediction and MLA attention optimizations, as well as expanding support for AMD hardware [4] Hardware Compatibility and Ecosystem - The vLLM team is committed to building an open and efficient hardware inference ecosystem, supporting various mainstream chips and collaborating closely with hardware teams like NVIDIA and AMD [8] - The integration of PyTorch as a foundational layer allows vLLM to support a wide range of hardware, simplifying the adaptation process for hardware vendors [10][11] - The team's collaboration with hardware partners ensures that vLLM can maintain high performance across different platforms, with a focus on optimizing the architecture for new hardware like the Blackwell chip [8][9] Multi-Modal Capabilities - vLLM has evolved from a text-only inference engine to a unified service platform supporting multi-modal generation and understanding, including text, images, audio, and video [17][19] - The introduction of multi-modal prefix caching significantly improves efficiency in processing various input types, while the decoupling of encoders enhances resource utilization for large-scale inference [18][19] - The release of vLLM-Omni marks a milestone in multi-modal inference, allowing for seamless integration and resource allocation across different modalities [19][21] Community and Feedback Loop - The growing trend of companies contributing modifications back to the upstream vLLM project reflects a positive feedback loop driven by the speed of community version iterations [22][23] - Collaboration with leading model labs and companies enables rapid feedback collection, ensuring that vLLM remains competitive and aligned with industry developments [23][24] - The vLLM team is actively addressing developer concerns, such as startup speed, by implementing tracking projects and optimizing performance through community engagement [24][25] Strategic Positioning - Red Hat's deep involvement in vLLM is rooted in the strategic understanding that inference is a critical component of AI application costs, aiming to integrate cutting-edge model optimizations [26][27] - The governance structure of vLLM is decentralized, with contributions from multiple organizations, allowing Red Hat to influence the project while adhering to open-source principles [26][27] - The collaboration with the PyTorch team has led to significant improvements in supporting new hardware and models, reinforcing vLLM's position as a standard in inference services [27]
智元机器人否认和宇树高价争抢春晚赞助席位;小米否认进军AI教育;马斯克称自己是钢铁侠原型;豆包手机二手价被炒到3.6万元丨邦早报
创业邦· 2025-12-11 00:11
Group 1 - A competition is ongoing among embodied intelligence companies for sponsorship of the 2026 Spring Festival Gala, with Zhiyuan Robotics offering 60 million yuan and Yushu Technology raising their bid to 100 million yuan, although Zhiyuan claims the reports are untrue [4] - Meituan has hired Pan Xin, former head of ByteDance's visual model AI platform, to lead multi-modal AI innovation, including the development of applications like LongCat App [4] - Xiaomi clarified that its recruitment for AI education roles is misinterpreted and is primarily aimed at enhancing services for specific products like the Redmi Pad 2 and Xiaomi Mitu children's watch [5] Group 2 - Pop Mart announced the appointment of Wu Yue, LVMH's Greater China President, as a non-executive director, effective December 10, 2025 [7] - Quark AI glasses S1 are experiencing high demand, with resale prices reaching 4,000 to 5,000 yuan, and the product is sold out on major e-commerce platforms [9][10] - JD.com is set to acquire a 50% stake in a Hong Kong office building for approximately 3.473 billion HKD, indicating continued investment in the region [15] Group 3 - Bill Gates warned of an AI valuation bubble, stating that many companies with high valuations will face declines, but emphasized the transformative potential of AI in sectors like health and education [18][19] - Refly.AI completed a multi-million dollar seed round financing led by Sequoia Capital and Hillhouse Capital, launching its V1.0 version for public testing [19] - Snapmaker announced a multi-hundred million B round financing led by Hillhouse Capital and Meituan, aimed at advancing consumer-grade 3D printing technology [19]
前字节AI负责人潘欣加入美团负责多模态创新
3 6 Ke· 2025-12-10 07:11
Core Insights - Pan Xin, former head of visual model AI platform at ByteDance, has joined Meituan to lead multimodal AI innovation [1] - Meituan's strategic focus for 2025 is on the competition in food delivery and advancements in AI technology [1] - The company aims for an aggressive approach in AI technology rather than a defensive one, as stated by founder Wang Xing [1] Group 1: Personnel Changes - Pan Xin has a strong background in AI, having previously worked at Google DeepMind, Baidu, Tencent, and ByteDance [1] - His roles included leading the optimization of PaddlePaddle at Baidu and overseeing AIGC and visual model AI platforms at Tencent and ByteDance [1] Group 2: AI Development - At Meituan, Pan Xin is responsible for the development of applications related to multimodal AI, including the LongCat App [1] - The LongCat AI model's progress was first disclosed by Wang Xing during a conference call in Q1 2025 [1]
国产多模态AI再开源,实测截图转网页、搜图购物,价格减半
3 6 Ke· 2025-12-09 12:04
此外,今天上午,智谱还开源了AutoGLM,类似于"豆包手机助手"。该智能体在去年10月发布之时曾被业内视为"全球首个具备手机操作能力 的AI Agent"。 在性能上,在同等参数规模下,GLM-4.6V系列模型在多模态交互、逻辑推理和长上下文等关键能力上取得SOTA表现。 智东西12月9日报道,昨晚,智谱开源了其GLM-4.6V系列多模态大模型,包括面向云端与高性能集群场景的基础版GLM-4.6V(106B-A12B) 以及面向本地部署与低延迟应用的轻量版GLM-4.6V-Flash(9B)。 ▲GLM-4.6V开源主页(图源:Hugging Face) ▲AutoGLM开源主页(图源:Hugging Face) 据官方介绍,GLM-4.6V能够完成智能图文混排与内容创作、识图购物与导购、前端复刻与多轮视觉交互开发以及长上下文的文档与视频理解 等任务,智东西第一时间对其进行了体验。 在实际体验中,GLM-4.6V的图像搜索、全网比价以及长文本和视频的理解能力表现较为稳定,其生成文字和网页的速度快、内容准。但图文 混排能力上,其所生成的图片一直无法显示。对于模糊指令,GLM-4.6V的理解有些许偏差。 GLM ...
研报掘金丨渤海证券:首予虹软科技“增持”评级,深耕AI视觉算法,多曲线驱动增长
Ge Long Hui A P P· 2025-12-09 08:22
格隆汇12月9日|渤海证券研报指出,虹软科技深耕AI视觉算法,多曲线驱动增长。公司专注于计算机 视觉领域,为行业提供算法授权及系统解决方案。移动智能终端视觉解决方案是公司营收主要来源。智 能汽车解决方案作为新兴业务板块,近年呈现高速增长态势。同时公司紧跟多模态AI 与AIGC 行业发展 浪潮,积极布局AI 眼镜及AI 商拍等前沿业务。2025 年前三季度,公司实现归母净利润1.42 亿元,同比 增长60.51%。在智能手机领域,公司已构建起覆盖当前主流机型的视觉人工智能算法产品矩阵。考虑 到公司是全球领先的视觉人工智能企业,未来有望实现多业务场景深度赋能。首次覆盖给予"增持"评 级。 ...
推荐支持文生图、文生视频能力的多功能生成式 AI 平台:从多模态融合到内容体系建设的全景观察
Jin Tou Wang· 2025-12-08 04:26
随着生成式 AI 技术持续演进,企业正在从"局部使用"进入"体系化建设"阶段。特别是在内容生产领 域,文生图(文本生成图像)与文生视频(文本生成视频)正成为企业数字化内容战略中的关键能力。 过去,企业往往将这类能力视为补充性的创意工具;而如今,随着营销渠道细分、全球化布局深化、知 识库视觉化需求攀升,一个新的趋势正在出现: 企业需要的不是"会生成的工具",而是"能构建多模态内容体系的平台"。 在此背景下,具备跨模态能力、企业级治理体系、可扩展架构以及稳定 API 能力的平台,开始成为企 业评估生成式 AI 的核心标准。本文将基于产业需求的结构性变化,系统分析当前多功能生成式 AI 平 台的创新方向,并解释为何 AWS 等具备平台级能力的云服务商正在成为企业重点关注对象。 一、文生图与文生视频的商业价值正在显著提升,企业对多模态 AI 的需求全面升级 海外广告素材 国内短视频内容 官网与社交平台视觉组件 产品演示与包装素材 直播脚本与分镜图 在 AI 搜索、AI 助手快速普及的环境下,企业需要为多个渠道准备风格统一、逻辑一致、定位精确的视 觉内容。这使得传统依赖人工的内容制作方式难以支撑规模扩张。 2. 企业内 ...
中胤时尚涨0.26%,成交额2674.47万元,后市是否有机会?
Xin Lang Cai Jing· 2025-12-05 12:37
来源:新浪证券-红岸工作室 12月5日,中胤时尚涨0.26%,成交额2674.47万元,换手率0.71%,总市值37.68亿元。 异动分析 新疆振兴+三胎概念+人民币贬值受益+虚拟数字人+多模态AI 1、根据2022年年报:为积极响应国家"扶持中西部地区实业发展"的号召,公司于2021年在新疆和田地 区建立了鞋履生产基地,新疆中胤鞋业有限公司。 2、根据2021年4月15日互动易:公司童鞋设计和供应链整合业务收入占在10%-15%之间,作为一家创意 设计企业,公司鞋履设计覆盖全品类,包括女鞋、童鞋及男鞋,可为客户提供不同款式不同类别的鞋履 设计。 3、根据2024年年报,公司海外营收占比为83.07%,受益于人民币贬值。 4、2023年6月16日公司互动:元起点和新畅元科技在虚拟人技术上储备了多项技术,在3D数字人生成 重建、AIGC+3D数字人、3D数字人AI跨模态实时交互等多项国际领先。 该股筹码平均交易成本为16.80元,近期筹码减仓,但减仓程度减缓;目前股价在压力位16.47和支撑位 14.95之间,可以做区间波段。 公司简介 资料显示,浙江中胤时尚股份有限公司位于浙江省温州市鹿城区丰叶路180号,成 ...