阿里云百炼平台

Search documents
AI算力引领沪指反弹 市场风格切换暗流涌动
2 1 Shi Ji Jing Ji Bao Dao· 2025-10-21 12:36
10月21日,A股迎来反弹,沪指收复3900点整数关口。 截至当日收盘,上证指数上涨1.36%,报3916.33点;深证成指上涨2.06%,报13077.32点;创业板指上涨3.02%,报3083.72点。全部A股成交额1.89万亿元, 较前一个交易日增加了1400多亿元。 全市场超4600只个股上涨,近百只个股涨停。 行业板块走向分化。AI算力板块强势拉升,Wind光模块(CPO)指数全天涨超6%,光芯片指数上涨近5%,领涨概念板块。同时,受隔夜美股苹果股价上涨 影响,苹果产业链走强,闻泰科技、环旭电子涨停、工业富联上涨9.57%。而Wind煤炭开采、锂电电解液指数全天分别下跌了1.30%、1.59%,跌幅较大。 值得注意的是,随着近期科技板块波动加大,市场风格变化趋势成为当前的市场焦点。 | | | Wind热门概念指数 | | | | | | --- | --- | --- | --- | --- | --- | --- | | 光模块(CPO) 消费电子代工 | | 光芯片 | HBM | 一级地产商 | 通信设备 | 能源设备 | | 6.25% | 4.92% | 4.71% | 4.55% | ...
资金动向|北水减持阿里超17亿港元,连续8日抛售中芯国际
Ge Long Hui· 2025-10-20 12:57
| | 沪股通 | | | | | 深股通 | | | --- | --- | --- | --- | --- | --- | --- | --- | | 名称 | 潔跌幅 | 净买入额(亿) | 成交额 | 名称 | 潔跌幅 | 净买入额(亿) | 成交额 | | 阿里巴巴-W | 4.9% | -12.05 | 57.87 Z | 阿里巴巴-W | 4.9% | -5.49 | 39.11亿 | | 中心国际 | 3.9% | -5.14 | 32.31亿 | 中心国际 | 3.9% | 1.89 | 21.66 Z | | 脸用控股 | 3.2% | -1.62 | 29.33 Z | 腾讯控股 | 3.2% | 2.58 | 15.58 Z | | 小米斯 W | 2.6% | -3.92 | 21.68亿 | 作就非常体 | 2.8% | -1.38 | 12.64 Z | | 华虹半导体 | 2.8% | 1.05 | 17.11亿 | 小米菲国-W | 2.6% | 0.51 | 11.97亿 | | 中国海洋石油 | 2.3% | 1.46 | 17.07 Z | 南方恒生科技 | 3.2% ...
阿里云AI成果入选顶会 GPU用量削减82%
财联社· 2025-10-19 05:47
Core Viewpoint - Alibaba Cloud's Aegaeon solution addresses the common issue of GPU resource waste in AI model services, significantly improving GPU utilization rates and has been recognized at the prestigious SOSP 2025 conference [2][4]. Group 1: Aegaeon Solution - Aegaeon was successfully selected for the SOSP 2025 conference, highlighting its innovative approach to solving GPU resource waste in AI model services [2][4]. - During a beta test lasting over three months, Aegaeon reduced the number of NVIDIA H20 GPUs required for serving large models from 1192 to 213, achieving an 82% reduction in GPU usage [5]. - The system's ability to pool GPU resources breaks the inefficient model-to-GPU binding, allowing for more effective resource allocation [8]. Group 2: Technical Innovations - Aegaeon's core innovation is token-level scheduling, which dynamically decides whether to switch models after generating each token, enabling fine-grained management of resources [8]. - The system can support up to seven different models simultaneously on a single GPU, improving effective throughput by 1.5 to 9 times and achieving 2 to 2.5 times the request processing capability compared to existing solutions [9]. - Aegaeon reduces model switching overhead by 97% through various optimizations, ensuring real-time responsiveness for model switching [8]. Group 3: Industry Implications - The integration of system software and AI model technology is emerging as a new trend, with a focus on optimizing underlying systems to better support AI applications [9]. - The future of AI development will rely not only on hardware advancements but also on software innovations that maximize existing hardware potential [9].
Tokens经济崛起:中国AI云服务半年用量飙四倍,火山引擎领跑市场
2 1 Shi Ji Jing Ji Bao Dao· 2025-10-17 07:47
Core Insights - The AI market driven by large models is accelerating with a new metric, Token consumption, becoming a "real benchmark" for AI application deployment [1] - The IDC report reveals a staggering growth projection, with the volume of large model calls on public cloud in China expected to reach 536.7 trillion Tokens in the first half of 2025, a nearly 400% increase from 114 trillion Tokens in 2024 [1] - The market landscape is becoming clearer, with Volcano Engine holding a 49.2% market share, expanding its lead from 46.4% in 2024 [1] Market Dynamics - Volcano Engine leads the Chinese large model public cloud service market with a 49.2% share, followed by Alibaba Cloud at 27.0% and Baidu Smart Cloud at 17.0% [2] - A different report by Omdia shows Alibaba Cloud leading with a 35.8% share when considering the entire cloud service chain, indicating a shift from infrastructure competition to deepening model applications [2] Token Consumption as a Metric - The choice of "Token call volume" as a core statistic reflects a rethinking of evaluation standards in the AI industry, focusing on actual model usage rather than just computational supply [3] - Token consumption is closely tied to application deployment, showcasing a more sustainable and exponentially growing model for the AI industry [4] Growth Catalysts - Two key technological breakthroughs have significantly impacted market growth: the first in July 2024, when the YoY growth rate for large model public cloud services exceeded 160% following cost reductions from the Doubao model [5][6] - The second breakthrough occurred in February 2025, marked by the popularity of the DeepSeek-R1 inference model, indicating a shift from model training to inference services [6] Volcano Engine's Competitive Edge - Volcano Engine's rapid growth in the large model business is attributed to its strategic, technological, and scale advantages [7] - The Doubao model family has a leading iteration speed in the industry, covering multiple modalities including text, image, audio, and video [8] - The performance of Volcano Engine's MaaS platform, "Volcano Ark," has been significantly enhanced, with output rates for the DeepSeek-R1 model being 2.6 times that of some competitors [9] Industry Penetration - The AI cloud service market is expanding from the internet sector into traditional industries, with Volcano Engine serving major clients across various sectors, including automotive and finance [10] - The market is expected to have hundreds of times growth potential, with multi-modal models and Agent applications driving future growth [11] Future Trends - Volcano Engine is continuously upgrading its products and services, recently launching several new models and a "smart model routing" service to balance performance and cost [11] - The daily Token consumption has surpassed 30 trillion, reflecting a growth of over 80% since May 2025 [11] - The competition in the "Tokens economy" will favor those who provide the best performance at the lowest cost, shaping a more mature ecosystem in the AI cloud market [12]
阿里云AI基础设施成果入选顶级学术会议,显著提升GPU利用率
Yang Zi Wan Bao Wang· 2025-10-16 08:29
Core Insights - The top academic conference SOSP2025 held in Seoul, South Korea, accepted only 66 papers, with Alibaba Cloud's GPU pooling service multi-model research being successfully included, proposing the Aegaeon multi-model hybrid service system that significantly enhances GPU resource utilization [1][2] - The conference highlighted the trend of integrating system software with AI large model technology, as the number of global models continues to grow, with Hugging Face hosting over 1 million models [1] Group 1 - Alibaba Cloud's Aegaeon system innovatively implements scheduling at the token level, allowing for model switching based on precise execution time predictions and a novel token-level scheduling algorithm, achieving a 97% reduction in model switching overhead [2] - Aegaeon supports simultaneous service of up to 7 different models on a single GPU, improving effective throughput by 1.5 to 9 times and achieving 2 to 2.5 times the request processing capability compared to existing mainstream solutions [2] - The core technology of Aegaeon has been deployed on Alibaba Cloud's Bailian platform, reducing the required GPU count for serving multiple models by 82% [2] Group 2 - The Alibaba Cloud Bailian platform has launched over 200 leading industry models, including Qwen, Wan, and DeepSeek, with a 15-fold increase in model invocation over the past year [2]
当AI有了大脑和身体,世界将如何改变?| 品牌新事
吴晓波频道· 2025-09-28 00:31
Core Viewpoint - The article discusses the evolution and future direction of Alibaba's cloud and AI strategies, emphasizing the significance of the annual Yunqi Conference as a platform for showcasing advanced technologies and innovations in AI and cloud computing [4][10]. Group 1: Conference Overview - The Yunqi Conference has transformed from a local internet technology exhibition to an international technology event, attracting a diverse audience including foreign participants [9][8]. - The theme for this year's conference is "Cloud Intelligence Integration, Silicon-Carbon Symbiosis," highlighting the focus on AI and cloud technologies [4]. Group 2: ASI Declaration and Strategic Path - Alibaba's CEO, Wu Yongming, introduced the "ASI Declaration," outlining a three-step path towards achieving Super Artificial Intelligence (ASI) [11]. - The first phase involves AI learning from vast human knowledge, the second phase focuses on AI's autonomous actions to assist humans, and the third phase aims for AI to surpass human capabilities through self-learning and real-world data integration [13][14][15]. Group 3: Investment and Infrastructure - Alibaba plans to invest 380 billion yuan over the next three years in cloud and AI infrastructure, exceeding the total investment of the past decade [18]. - The company aims to enhance its AI capabilities significantly, with projections indicating a tenfold increase in energy consumption for global data centers by 2032 [18]. Group 4: AI Supercomputer and Technological Advancements - Alibaba is developing a comprehensive AI supercomputer that integrates AI chips, cloud computing platforms, and foundational models, representing a full-stack approach to AI [20][30]. - The newly released models, such as Qwen3-Max and Qwen3-Next, demonstrate significant advancements in performance and efficiency, with Qwen3-Max surpassing GPT-5 and Claude Opus 4 in various benchmarks [23][24]. Group 5: Agent Development and Ecosystem - The introduction of the ModelStudio-ADK framework allows for the rapid development of intelligent agents, enabling them to perform complex tasks autonomously [27]. - Over 200,000 developers have created more than 800,000 agents using Alibaba's cloud services, significantly impacting various industries, including automotive and finance [28]. Group 6: Competitive Landscape - Alibaba has established itself as a leading player in the global AI cloud market, competing with major international firms and aiming to secure a position among the few remaining global super cloud computing platforms [36][35]. - The demand for AI computing power has surged, with Alibaba's AI computing capacity increasing over fivefold in the past year, indicating a robust growth trajectory in the AI infrastructure sector [35].
高代码时代来临,阿里云百炼要让 Agent 真正跑在业务里
3 6 Ke· 2025-09-27 07:19
"在AI时代,智能体的开发是今天一个重要的开发范式。"在9月24日举办的2025云栖大会上,阿里云智能集团首席技术官周靖人提到了38次Agent,33次智 能体。 毫无疑问,Agent正在成为企业追求效率提升过程中绕不开的路径。在今年5月,普华永道调研了全球300名高管后,发现79%的受访公司已经在某些业务 中应用 AI Agent;其中66%表示生产力提升,57%看到了成本下降,55%感受到决策效率加快,54%提升了客户体验。 眼下,Agent平台更是已经成为了互联网大公司的兵家必争之地。海外,LangChain/LangGraph、微软AutoGen、Google ADK纷纷升级,国内,腾讯、华 为、百炼在 2025年9月集中推出和升级其Agent智能体平台。 然而,行业热度之下,现实中的落地体验或许还有待考究,比如,近两年涌现的低代码Agent平台,大多依赖"预定义编排"逻辑,适合做简单的问答或流 程自动化,但一旦涉及跨系统调用、长链条任务,或需要多轮反思与自主决策时,往往力不从心。更重要的是,企业真正关心的不是"能不能做出一个 Agent",而是"Agent能否稳定运行在业务体系"里,最好还能实现"规 ...
吴晓波探展模力工场:开发者从技术到商业化的关键一跃
AI前线· 2025-09-26 12:07
Core Viewpoint - The article discusses the current challenges and opportunities in the AI application market, emphasizing the need for effective connections between technology and business solutions, akin to how platforms like Dazhong Dianping (大众点评) helped consumers find suitable restaurants [4][6][9]. Group 1: Current AI Market Landscape - The AI application market is compared to the restaurant market a decade ago, highlighting the issue of information asymmetry [6][7]. - Despite a significant increase in AI-related projects on platforms like GitHub, with numbers rising from under 700,000 in 2020 to 1.81 million in 2023, only 25% of companies believe they have successfully implemented AI projects [8][9]. - The gap between technological advancements and commercial application is identified as a critical missing link in the current AI ecosystem [9]. Group 2: AI Infrastructure and Development - Alibaba Cloud announced major advancements in AI infrastructure, aiming to create a "super AI cloud," with the adoption rate of generative AI in China projected to rise from 8% in 2024 to 43% in 2025 [10][11]. - The need for application-level growth is emphasized, as foundational technologies are now mature [11]. Group 3: Challenges in AI Application Implementation - The AI Super Exchange hosted by Moduli Factory aims to address three main barriers to AI application deployment: unclear demand, lack of visibility for solutions, and inefficient matching between demand and supply [15][18]. - The exchange features a demand diagnosis platform, a real-time display of application features, and a matchmaking process for proposals and collaborations [17][18]. Group 4: Industry-Specific Solutions - Seven applications presented at the AI Super Exchange target specific industry pain points, such as: - Cloud operation automation, addressing the need for proactive maintenance in the industrial AI market, projected to reach $43.6 billion by 2024 [20][21]. - Intelligent bidding assistants that significantly reduce the time and error rates in the bidding process [26][28]. - AI-driven human resources solutions that shorten recruitment cycles and improve talent matching [30][31]. - Content creation tools that enhance efficiency for new media creators [34][36]. - Automation tools for repetitive office tasks, freeing up time for knowledge workers [37][38]. Group 5: Commercialization of AI Applications - Moduli Factory serves as an accelerator for AI application commercialization, providing exposure, user feedback, and industry connections to developers [44][49]. - The platform aims to bridge the gap between technology demos and commercial products, addressing the fact that 46.3% of companies are still hesitant to adopt AI due to a lack of suitable solutions [53][54]. Group 6: Developer Ecosystem and Future Opportunities - The "Autumn Competition" initiated by Moduli Factory is designed to create a self-reinforcing ecosystem for developers, offering support from model vendors, cost optimization, and guidance on sustainable business models [57][58]. - The article concludes by highlighting the historical opportunity for AI application developers to participate in this evolving landscape, as the focus shifts from technological breakthroughs to practical application [61][62].
赛道Hyper | 阿里Fun-ASR:语音AI新阶段演进方向
Hua Er Jie Jian Wen· 2025-09-01 02:49
Core Viewpoint - Alibaba Cloud's DingTalk has launched a new end-to-end speech recognition model, Fun-ASR, which enhances contextual understanding and transcription accuracy, capable of recognizing industry-specific terminology across ten sectors [1][2]. Group 1: Technological Advancements - Fun-ASR represents a significant iteration in speech recognition technology, moving from mere comprehension to contextual understanding [2]. - The model incorporates context awareness, allowing it to track specific terms and contexts during multi-turn conversations, improving accuracy in scenarios like meeting minutes [6][9]. - Fun-ASR's robustness enhances its usability in real-world business environments, effectively handling accents, noise, and specialized vocabulary [6][9]. Group 2: Market Positioning - Fun-ASR is positioned as a knowledge assistant rather than just an input tool, facilitating structured documentation and real-time knowledge base integration in various business scenarios [9][10]. - Unlike consumer-focused models, Fun-ASR targets B-end clients through Alibaba Cloud's services, aligning with a strategy similar to Microsoft's enterprise-focused approach [10][11]. - The model's integration into Alibaba's Baolian platform signifies its role as a foundational service in enterprise cloud computing, akin to databases and search functionalities [13][20]. Group 3: Industry Implications - The evolution of speech recognition is shifting towards becoming a digital infrastructure, similar to OCR, where high accuracy allows seamless integration into various systems [12][20]. - Fun-ASR's development reflects a broader trend in the industry, where speech AI is becoming a critical component of digital productivity rather than a standalone tool [9][20]. - The future of AI interaction is likely to be characterized by natural dialogue rather than traditional input methods, with Fun-ASR serving as a stepping stone towards this vision [21].
阿里云百炼平台首个停车MCP服务上线 捷停车提供全维数据和场景支持
Zheng Quan Shi Bao Wang· 2025-08-15 02:47
Core Insights - The launch of "Jie Parking - Parking Information MCP Service" on Alibaba Cloud's Bailian platform marks the first and only MCP service in the parking industry, providing efficient parking information queries and AI capabilities for developers [1][2] - Jie Parking, a subsidiary of Shenzhen Shunyi Tong Information Technology Co., Ltd., is a leading smart parking service platform in China, covering over 400 cities and more than 59,000 parking lots, with over 140 million registered users [1][2] - The MCP service aims to break the "information island" status of complex parking data, enabling a professional-grade parking capability that is ready to use [1][2] Industry Impact - The MCP service allows small and medium-sized enterprises in the parking industry to access professional parking capabilities at a low cost, enhancing efficiency without the need for in-house development [3] - It facilitates innovation in business models by providing access to previously hard-to-obtain static traffic data for related industries such as navigation, vehicle services, and payment systems, thereby improving consumer service experiences and generating additional revenue [3] - The integration of parking data into urban traffic management systems can help alleviate traffic congestion and promote staggered parking demand sharing through real-time parking space saturation alerts [3]