Xingchen (星辰) MaaS Platform
How is a high-quality dataset of more than 10 trillion tokens forged? An interview with Ruan Yilong of China Telecom Tianyi AI
QbitAI (量子位) · 2025-09-26 02:08
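Jin Lei, reporting from Aofeisi. QbitAI | WeChat official account QbitAI
As the saying goes, "whoever holds the data holds the world," and this central state-owned enterprise has clearly mastered high-quality datasets: more than 10 trillion tokens of general-purpose large-model corpus data, plus specialized datasets covering 14 key industries, with total storage of 350 TB. Despite its size, this is not a disorderly pile of raw data. It is carefully annotated and optimized industry data, multimodal data included, of the kind that can be put to work in an industry at any time.
Some readers may ask whether this really matters. The answer is a definite yes. A high-quality dataset is a collection of data that has been collected, processed, and otherwise prepared so that it can be used directly to develop and train AI models and can effectively improve model performance. Building such datasets is critical because they directly determine the accuracy, generalization, and usability of AI models: good data is the foundation for training efficient and accurate models.
So which central state-owned enterprise is this? It is a member of the "AI national team," China Telecom Tianyi AI, whose Xingchen (星辰) MaaS platform is the key to building high-quality datasets. The platform works like a data refinery: four core components operate together to form a complete "data-model-service" closed loop. Among them, the base model serves as the "power engine," providing basic cognition and reasoning capabilities; the data toolchain serves as the "raw material depot," continuously supplying high-quality data resources; ...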
One sentence from DeepSeek sent domestic chip stocks soaring! What exactly is the UE8M0 FP8 behind it?
QbitAI (量子位) · 2025-08-22 05:51
Core Viewpoint
- The release of DeepSeek V3.1 and its mention of the next-generation domestic chip architecture has caused significant excitement in the AI industry, leading to a surge in stock prices of domestic chip companies like Cambricon, which saw an intraday increase of nearly 14% [4][29].

Group 1: DeepSeek V3.1 and UE8M0 FP8
- DeepSeek V3.1 utilizes the UE8M0 FP8 parameter precision, which is designed for the upcoming generation of domestic chips [35][38].
- UE8M0 FP8 is based on the MXFP8 format, which allows for a more efficient representation of floating-point numbers, enhancing performance while reducing bandwidth requirements [8][10][20].
- The MXFP8 format, defined by the Open Compute Project, allows for a significant increase in dynamic range while maintaining an 8-bit width, making it suitable for AI applications [8][11][20].

Group 2: Market Reaction and Implications
- Following the announcement, the semiconductor ETF rose by 5.89%, indicating strong market interest in domestic chip stocks [4].
- Cambricon's market capitalization surged to over 494 billion yuan, making it the top stock on the STAR Market, reflecting investor optimism about the company's capabilities in supporting FP8 calculations [29][30].
- The adoption of UE8M0 FP8 by domestic chips is seen as a move towards reducing reliance on foreign computing power, enhancing the competitiveness of domestic AI solutions [33][34].

Group 3: Domestic Chip Manufacturers
- Several domestic chip manufacturers, including Cambricon, Hygon, and Moore Threads, are expected to benefit from the integration of UE8M0 FP8, as their products are already aligned with this technology [30][32].
- The anticipated release of new chips that support native FP8 calculations, such as those from Huawei, is expected to further strengthen the domestic AI ecosystem [30][33].
- The collaboration between DeepSeek and various domestic chip manufacturers is likened to the historical "Wintel alliance," suggesting a potential for creating a robust ecosystem around domestic AI technologies [34].
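To make the UE8M0 idea in Group 1 concrete, here is a minimal Python sketch of how a shared power-of-two scale might be chosen and decoded for one block of values. It assumes the encoding described in the Open Compute Project's Microscaling (MX) specification: an unsigned 8-bit exponent with no mantissa and no sign bit, bias 127, and code 255 reserved for NaN, paired with FP8 E4M3 elements. The function names `decode_ue8m0` and `choose_scale_code` are hypothetical, and nothing here is taken from DeepSeek's or any chip vendor's implementation.

```python
import math

E8M0_BIAS = 127    # assumed MX convention: scale value = 2 ** (code - 127)
E8M0_NAN = 0xFF    # assumed MX convention: code 255 is reserved for NaN
E4M3_EMAX = 8      # exponent of the largest FP8 E4M3 normal, 448 = 1.75 * 2**8


def decode_ue8m0(code: int) -> float:
    """Turn an 8-bit UE8M0 code into the power-of-two scale it represents."""
    if not 0 <= code <= 0xFF:
        raise ValueError("UE8M0 codes are 8-bit unsigned integers")
    if code == E8M0_NAN:
        return math.nan
    return 2.0 ** (code - E8M0_BIAS)


def choose_scale_code(block: list[float]) -> int:
    """Pick one shared scale code so the block's largest magnitude sits near the top of the E4M3 range."""
    max_abs = max(abs(x) for x in block)
    if max_abs == 0.0:
        return E8M0_BIAS  # all zeros: a scale of 1.0 is as good as any
    # frexp gives max_abs = m * 2**e with 0.5 <= m < 1, so floor(log2(max_abs)) == e - 1.
    _, e = math.frexp(max_abs)
    shared_exp = (e - 1) - E4M3_EMAX
    # Clamp into the representable code range 0..254.
    return max(0, min(E8M0_NAN - 1, shared_exp + E8M0_BIAS))


block = [0.5, -3.0, 100.0, 0.001]
code = choose_scale_code(block)
scale = decode_ue8m0(code)
scaled = [x / scale for x in block]  # these values would then be cast to FP8
print(code, scale, scaled)
```

Because the scale has no mantissa, applying it is only an exponent shift, and one byte covers scales from 2**-127 to 2**127. That is where the wide dynamic range at 8-bit width comes from. With this rule the largest scaled magnitude lands in [256, 512), so values above 448 would still need to saturate when cast to E4M3. The sketch leaves that cast out.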
China Telecom's first-half revenue and net profit both grow, with R&D investment boosting performance
Sina Finance (Xin Lang Cai Jing) · 2025-08-15 02:47
Core Insights
- China Telecom reported revenue of RMB 269.422 billion for the first half of 2025, representing year-on-year growth of 1.30% [1][2].
- The net profit attributable to the parent company was RMB 23.017 billion, a year-on-year increase of 5.53% [1][2].
- The growth in revenue was primarily driven by an increase in service revenue, particularly mobile communication services, which reached RMB 106.6 billion, also growing by 1.3% year-on-year [1][2].

Revenue Breakdown
- Total service revenue amounted to RMB 249.1 billion, reflecting year-on-year growth of 1.2% [2].
- Fixed-line and smart home service revenue reached RMB 64.1 billion, a slight increase of 0.2% [2].
- Revenue from the industrial digitalization business was RMB 74.9 billion, indicating strong performance [1][2].

User Metrics
- The penetration rate of 5G network users increased by 6.1 percentage points compared to the end of the previous year [1][2].
- The average revenue per user (ARPU) for mobile users reached RMB 46.0 [1][2].
- The broadband comprehensive ARPU was RMB 48.3 [2].

Technological Advancements
- China Telecom is advancing its "5G+AI+Cloud+Applications" integrated product system, launching products like the 5G industrial control intelligent agent [2][3].
- The company has developed nearly 500,000 new integrated gateways [2].
- Over 80 industry models and 30 industry agents have been launched, along with the Starry (Xingchen) MaaS platform and Starry Industry Agent platform [2].

Infrastructure Development
- The total number of 5G base stations reached 4.549 million, and the number of internet broadband access ports was 1.234 billion [3].
- Significant progress has been made in new infrastructure construction, particularly in 5G-A network capability upgrades and industrial digitalization [3].

Strategic Focus
- The company emphasizes the importance of artificial intelligence, computing services, and cloud business as key development directions [3].
- There is a commitment to enhancing research and development efficiency and strengthening the transformation of innovative results [3].
- The company aims to build a talent center and innovation hub to support its strategic emerging businesses and future industries [3].