Large Language Models
GPT-5 Lands, Kimi Falls Behind: The Large-Model "Mid-Tier Crisis" May Arrive Early
Xi Niu Cai Jing· 2025-08-19 08:34
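During the 2024 Spring Festival, Kimi spent more than 100 million yuan on advertising on Bilibili, at a customer acquisition cost (CPA) of 30 yuan per user, which drove a short-lived surge in MAU. However, as Doubao, Tongyi, Yuanbao and other competitors pushed the long-context limit up to a "million tokens" as standard, Kimi's first-mover advantage was quickly erased. More damaging still, vertical tools (Wind, CNKI) tied to specialized databases captured high-value customer groups in finance and academia with higher accuracy, undercutting demand for general-purpose long-text processing. In addition, Kimi faces a triple constraint of data, compute, and capital. On data, Alibaba, ByteDance, and Tencent can train continuously on their closed-loop e-commerce, short-video, and social ecosystems, whereas Kimi can rely only on public corpora and a limited set of partners, so its data flywheel is hard to start. On compute, after the US ban on sales of high-end GPUs, domestic substitutes deliver 30% to 50% less performance; Kimi must purchase external cloud resources, leaving its training costs more than 20% higher than those of large companies with their own clouds. On capital, Moonshot AI (月之暗面) has not secured new financing since August 2023; at the end of 2024 an equity dispute between the founder and investors escalated, and potential backers have largely adopted a wait-and-see stance. OpenAI recently launched GPT-5, which tightly couples a large language model with a reasoning model; the company claims a 47% lower factual error rate than GPT-4o and a new state-of-the-art (SOTA) first-attempt pass rate on multi-turn complex tasks. Capital markets responded quickly: Nvidia rose 6.4% and Microsoft 3.1% that day, while the A-share compute-leasing sector as a whole ...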
Futures by the Numbers: A Large-Model Reading of Sell-Side Strategy Consensus over the Past Week-20250819
SINOLINK SECURITIES· 2025-08-19 07:33
Group 1: Stock Index Futures Market Overview
- The four major index futures contracts experienced an overall increase last week, with the CSI 1000 index futures rising the most by 5.21%, while the SSE 50 index futures had the smallest increase of 2.19% [3][11]
- The average trading volume for the current, next, and quarterly contracts of IF, IC, IH, and IM increased compared to the previous week, with IH showing the largest increase of 65.56% and IM the smallest at 30.52% [3][11]
- As of last Friday's close, the annualized basis rates for the current contracts of IF, IC, IM, and IH were -1.00%, -7.95%, -8.22%, and 1.71% respectively, indicating a narrowing of the basis for IF, IC, and IM, while IH shifted from a discount to a premium [3][11]
Group 2: Cross-Period Price Differences
- As of last Friday's close, the cross-period price difference rates for the current contracts of IF, IC, IM, and IH were at the 18.10%, 32.40%, 14.20%, and 9.00% percentiles since 2019 [4][12]
- Currently, there are no arbitrage opportunities for the IF main contract based on the closing prices, as the required basis rates for both long and short arbitrage strategies do not meet the necessary thresholds [4][12]
Group 3: Dividend Forecasts
- After August, the strength of dividends is expected to weaken, but it will still impact the four major index futures. The estimated impact of dividends on the September main contracts for the CSI 300, CSI 500, SSE 50, and CSI 1000 indices is 3.62, 1.40, 1.39, and 0.89 respectively [5][11]
- The correlation between basis changes and dividend impacts, as well as investor trading sentiment, is expected to remain high under unchanged trading rules for index futures [5][13]
Group 4: Market Expectations
- The shift to a premium structure for the IH and IF main contracts, along with the continued narrowing of the discount for IC and IM, indicates a sustained positive sentiment towards the A-share market [5][13]
- Recent developments, such as the US-China tariff agreement and supportive monetary policy from the central bank, are expected to maintain a stable or narrowing basis in the upcoming week [5][13]
Group 5: Recent Sell-Side Strategy Insights
- A consensus among 10 brokerage firms indicates that incremental capital is continuously entering the market, with increased activity from foreign and insurance capital, while 8 firms noted a high market sentiment and active trading [6][37]
- There is a general positive outlook on technology growth, dividend stocks, and upstream resource sectors among the brokerage firms surveyed [6][37]
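The annualized basis rates quoted in this summary can be reproduced with a short calculation. The sketch below assumes one common convention, (futures price minus spot index) divided by the spot index and scaled by 365 over the days to expiry; the report does not state its exact formula, and the function names and sample prices are hypothetical.

```python
def annualized_basis_rate(futures_price, spot_price, days_to_expiry, days_per_year=365):
    """Return the annualized basis rate as a fraction (e.g. -0.01 means -1.00%).

    Assumed convention (not confirmed by the report):
        (futures - spot) / spot * days_per_year / days_to_expiry
    """
    if spot_price <= 0:
        raise ValueError("spot_price must be positive")
    if days_to_expiry <= 0:
        raise ValueError("days_to_expiry must be positive")
    basis = futures_price - spot_price
    return basis / spot_price * days_per_year / days_to_expiry


def basis_structure(rate):
    """Label a basis rate: futures above spot is a premium, below spot is a discount."""
    if rate > 0:
        return "premium"
    if rate < 0:
        return "discount"
    return "flat"


if __name__ == "__main__":
    # Hypothetical prices, chosen only to illustrate the arithmetic.
    rate = annualized_basis_rate(futures_price=3980.0, spot_price=4000.0, days_to_expiry=30)
    print(f"{rate:.2%}", basis_structure(rate))
```

Under this convention, a negative rate such as the -7.95% reported for IC corresponds to a discount, and the positive 1.71% for IH corresponds to a premium.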
The Starting Point of End-to-End VLA: A Look at Large Language Models and CLIP
自动驾驶之心· 2025-08-19 07:20
Core Viewpoint
- The article discusses the development and significance of end-to-end (E2E) algorithms in autonomous driving, emphasizing the integration of various advanced technologies such as large language models (LLMs), diffusion models, and reinforcement learning (RL) in enhancing the capabilities of autonomous systems [21][31].
Summary by Sections
Section 1: Overview of End-to-End Autonomous Driving
- The first chapter provides a comprehensive overview of the evolution of end-to-end algorithms, explaining the transition from modular approaches to end-to-end solutions, and discussing the advantages and challenges of different paradigms [40].
Section 2: Background Knowledge
- The second chapter focuses on the technical stack associated with end-to-end systems, detailing the importance of LLMs, diffusion models, and reinforcement learning, which are crucial for understanding the future job market in this field [41][42].
Section 3: Two-Stage End-to-End Systems
- The third chapter delves into two-stage end-to-end systems, exploring their emergence, advantages, and disadvantages, while also reviewing notable works in the field such as PLUTO and CarPlanner [42][43].
Section 4: One-Stage End-to-End and VLA
- The fourth chapter highlights one-stage end-to-end systems, discussing various subfields including perception-based methods and the latest advancements in VLA (Vision-Language-Action) models, which are pivotal for achieving the ultimate goals of autonomous driving [44][50].
Section 5: Practical Application and RLHF Fine-Tuning
- The fifth chapter includes a major project focused on RLHF (Reinforcement Learning from Human Feedback) fine-tuning, providing practical insights into building pre-training and reinforcement learning modules, which are applicable to VLA-related algorithms [52].
Course Structure and Learning Outcomes
- The course aims to equip participants with a solid understanding of end-to-end autonomous driving technologies, covering essential frameworks and methodologies, and preparing them for roles in the industry [56][57].
Limited-Time Price from 235,900 Yuan: AUDI E5 Sportback Opens Presale
Bei Jing Shang Bao· 2025-08-18 14:27
Core Points
- AUDI has opened presale for its first mass-produced model, the E5 Sportback, with a limited-time starting price of 235,900 yuan [1]
- The E5 Sportback features a closed grille design, integrated lighting modules, electronic side mirrors, and a continuous spoiler to reduce drag [3]
- The vehicle is powered by front and rear permanent magnet synchronous motors with a maximum motor speed of 21,000 rpm, and accelerates from 0 to 100 km/h in 3.4 seconds [3]
- The E5 Sportback is equipped with a CATL CTP battery, offering a maximum range of 773 km and adding 370 km of range in 10 minutes of fast charging [3]
- The car incorporates the new AUDI OS operating system and Qualcomm Snapdragon 8295 digital cockpit chip, creating an interactive smart cockpit [3]
- The vehicle features an advanced voice assistant powered by a customized language model, enabling semantic understanding and multi-turn dialogue [3]
- AUDI has partnered with Momenta to develop an advanced driver assistance system, integrating 27 perception hardware components for various driving scenarios [4]
- The E5 Sportback includes lidar, long-range millimeter-wave radars, ultrasonic radars, cameras, and an NVIDIA Orin-X chip for enhanced computational power [4]
Hon Hai Targets the Robot Brain with Nvidia's Backing: Humanoid Products Will Think, Judge, and Solve Problems
Jing Ji Ri Bao· 2025-08-17 23:13
Group 1
- Foxconn is collaborating with NVIDIA to develop the latest generation of humanoid robots, which will be showcased at the upcoming Foxconn Technology Day in November [1]
- The humanoid robots will utilize a multi-skill AI model based on Foxconn's first traditional Chinese AI large language model (code-named FoxBrain), which has been trained by NVIDIA [1]
- The AI model demonstrates strong capabilities in understanding and reasoning, excelling in data analysis, decision support, document collaboration, mathematics, logical reasoning, and code generation [1]
Group 2
- Foxconn's subsidiary, Hon Hai Precision Industry, is developing the industrial robot brand "FoxBot" to enhance factory automation, with Hon Hai being the main manufacturing and assembly unit [2]
- Hon Hai has the capacity to produce over 10,000 FoxBot robotic arms annually and plans to invest $1 billion to expand manufacturing in the United States [2]
- Guangyu, another subsidiary, is set to acquire a Belgian robotics company to enhance its production capabilities for robotic joints, aiming to improve load-bearing and torque [2]
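"Wisdom Converges on the Silk Road, Connecting the Four Seas": Sixth "Belt and Road" Publishing Cooperation Experience Exchange Conference Held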
Bei Jing Ri Bao Ke Hu Duan· 2025-08-17 05:58
Group 1
- The sixth "Belt and Road" Publishing Cooperation Experience Exchange Conference was successfully held in Guangzhou, focusing on the theme of "Intelligent Integration of the Silk Road: Innovation-Driven Collaborative Development of 'Belt and Road' Publishing" [1][3]
- Since 2013, the Chinese publishing industry has translated over 3,000 classic Chinese works, established topic databases with publishers from more than 50 countries, and launched over 1,200 cooperative books, resulting in 18,000 copyright trades [3][5]
- The China Publishing Group is promoting the translation of works like "Xi Jinping's Stories of Poverty Alleviation" to inject Chinese wisdom into global governance and is enhancing its competitive edge through innovative projects like large language models [5][12]
Group 2
- The Guangdong Publishing Group is leveraging its geographical advantage as the starting point of the "Maritime Silk Road" to upgrade publishing cooperation through digital innovation and regional collaboration [7][8]
- The group is implementing initiatives such as the "South Guangdong Mutual Translation Plan" and the "Guangdong Books Entering Overseas Libraries Plan" to promote local publications internationally [8][10]
- The global number of AI research papers has increased significantly from 36,000 in 2004 to 486,000 in 2024, with China leading in AI research contributions [12][14]
Group 3
- The conference highlighted the importance of cultural exchange and mutual understanding in publishing, emphasizing the need for innovative thinking to overcome cross-cultural communication barriers [14][15]
- Suggestions were made to establish a joint digital publishing platform between China and Arab countries, create a translation and digital support fund, and establish a "Belt and Road Publishing and Culture Academy" to enhance international communication [10][15]
- The need for a collaborative network for copyright protection and a cross-border rights protection mechanism was emphasized to address piracy challenges and ensure the vitality of innovation [15]
Robots Still Can't Hold Down a Job
创业邦· 2025-08-16 03:15
Core Viewpoint
- The article discusses the rapid evolution and increasing popularity of humanoid robots, highlighting their capabilities and the challenges they face in terms of intelligence and cost [6][7][25].
Group 1: Humanoid Robot Capabilities
- Humanoid robots have advanced significantly, showcasing skills such as dancing, performing in competitions, and assisting in household tasks [6][7].
- They can interact with humans, understand speech, and perform simple conversations, moving beyond the label of "artificial intelligence" [6][7].
- Despite their advancements, they still exhibit limitations such as single-action performance, slow efficiency in tasks, and high costs, with top models priced comparably to luxury cars [6][7].
Group 2: Technical Aspects
- Humanoid robots typically feature a human-like structure, with variations in hand and foot designs, affecting their functionality and cost [9][10].
- The cost of high-end dexterous hands can reach 100,000 to 200,000 yuan, making them a significant portion of the robot's total cost [9][10].
- Control methods for humanoid robots include remote operation, isomorphic arms, and voice control, but true AI autonomy remains a challenge [12][13].
Group 3: Application Scenarios
- Humanoid robots are categorized into B2B (business) and B2C (consumer) applications, with B2B focusing on entertainment, industrial manufacturing, tourism services, and healthcare [14][15].
- The entertainment sector is currently the most developed application area, while other sectors are still in basic application stages [14][15].
Group 4: Industry Challenges
- The main challenges for humanoid robots are their intelligence and cost, with current software capabilities being limited and requiring significant data for improvement [16][18].
- The industry consensus indicates that while physical capabilities are maturing, the software intelligence is lagging, restricting the robots' operational scope [18][19].
Group 5: Market Outlook
- Despite the challenges, the humanoid robot industry is experiencing rapid growth, with a reported 27.8% increase in revenue in the first half of the year [25].
- Over 15,280 new robot-related companies were registered in the first seven months of the year, marking a 43.81% increase compared to the previous year [25].
- More than 20 humanoid robot companies are pursuing IPOs, with 16 based in China, indicating strong market interest and investment potential [25][26].
Group 6: Company Strategies
- Companies like Yushutech focus on core technology and commercialization, while others like Zhiyuan Robotics emphasize a full-chain layout from hardware to software [25][26].
- The industry is characterized by diverse strategies and focuses, with some companies prioritizing intelligence and others emphasizing hardware capabilities [27].
Latest Survey of Visual Reinforcement Learning: A Full-Field Overview (NUS, Zhejiang University, CUHK)
自动驾驶之心· 2025-08-16 00:03
Core Insights
- The article discusses the integration of Reinforcement Learning with Computer Vision, marking a paradigm shift in how AI interacts with visual data [3][4]
- It highlights the potential for AI to not only understand but also create and optimize visual content based on human preferences, transforming AI from passive observers to active decision-makers [4]
Research Background and Overview
- The emergence of Visual Reinforcement Learning (VRL) is driven by the successful application of Reinforcement Learning in Large Language Models (LLMs) [7]
- The article identifies three core challenges in the field: stability in policy optimization under complex reward signals, efficient processing of high-dimensional visual inputs, and scalable reward function design for long-term decision-making [7][8]
Theoretical Foundations of Visual Reinforcement Learning
- The theoretical framework for VRL includes formalizing the problem using Markov Decision Processes (MDP), which unifies text and visual generation RL frameworks [15]
- Three main alignment paradigms are proposed: RL with human feedback (RLHF), Direct Preference Optimization (DPO), and Reinforcement Learning with Verifiable Rewards (RLVR) [16][18]
Core Applications of Visual Reinforcement Learning
- The article categorizes VRL research into four main areas: Multimodal Large Language Models (MLLM), Visual Generation, Unified Models, and Visual-Language-Action (VLA) Models [31]
- Each area is further divided into specific tasks, with representative works analyzed for their contributions [31][32]
Evaluation Metrics and Benchmarking
- A layered evaluation framework is proposed, detailing specific benchmarks for each area to ensure reproducibility and comparability in VRL research [44][48]
- The article emphasizes the need for effective metrics that align with human perception and can validate the performance of VRL systems [61]
Future Directions and Challenges
- The article outlines four key challenges for the future of VRL: balancing depth and efficiency in reasoning, addressing long-term RL in VLA tasks, designing reward models for visual generation, and improving data efficiency and generalization capabilities [50][52][54]
- It suggests that future research should focus on integrating model-based planning, self-supervised visual pre-training, and adaptive curriculum learning to enhance the practical applications of VRL [57]
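For reference, the three alignment paradigms named in this summary (RLHF, DPO, RLVR) are usually written as follows in the general literature. These are the standard forms, offered as a sketch; the survey's own notation may differ. The symbols are assumptions of this sketch: policy $\pi_\theta$, frozen reference policy $\pi_{\mathrm{ref}}$, learned reward model $r_\phi$, KL weight $\beta$, prompt $x$, output $y$, and preferred and rejected outputs $y_w$ and $y_l$.

```latex
% RLHF: maximize a learned reward while staying close to the reference policy.
\max_{\theta}\;
\mathbb{E}_{x \sim \mathcal{D},\, y \sim \pi_\theta(\cdot \mid x)}
\big[ r_\phi(x, y) \big]
\;-\; \beta\, \mathbb{E}_{x \sim \mathcal{D}}
\Big[ \mathrm{KL}\big( \pi_\theta(\cdot \mid x) \,\big\|\, \pi_{\mathrm{ref}}(\cdot \mid x) \big) \Big]

% DPO: optimize preference pairs directly, with no explicit reward model.
\mathcal{L}_{\mathrm{DPO}}(\theta) =
-\, \mathbb{E}_{(x,\, y_w,\, y_l) \sim \mathcal{D}}
\left[ \log \sigma\!\left(
\beta \log \frac{\pi_\theta(y_w \mid x)}{\pi_{\mathrm{ref}}(y_w \mid x)}
- \beta \log \frac{\pi_\theta(y_l \mid x)}{\pi_{\mathrm{ref}}(y_l \mid x)}
\right) \right]

% RLVR: same RL objective as RLHF, but the reward comes from a programmatic check.
r(x, y) =
\begin{cases}
1 & \text{if a verifier accepts } y \text{ for } x, \\
0 & \text{otherwise.}
\end{cases}
```

Here $\sigma$ is the logistic function. The practical difference is where the training signal comes from: a learned reward model (RLHF), pairwise preferences used directly in the loss (DPO), or an automatic check such as an exact-match or unit-test result (RLVR).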
Hong Kong Stocks Midday Review: Hang Seng Index Down 1.19%, Tech Index Down 1.08%; Pharma Stocks Strong, Tech Stocks Weak
Jin Rong Jie· 2025-08-15 04:21
Market Overview
- The Hong Kong stock market experienced a decline, with the Hang Seng Index down 1.19% to 25,215.1 points, the Hang Seng Tech Index down 1.08% to 5,515.77 points, and the National Enterprises Index down 1.26% to 9,013.58 points [1]
- Major technology stocks saw widespread declines, with Alibaba down 2.63%, JD.com down 3.84%, and Meituan down 3.14%, while Tencent saw a slight increase of 0.76% [1]
- Internet healthcare stocks surged, with Dingdang Health rising over 26%, while Chinese brokerage stocks strengthened, with Zhongzhou Securities up over 13% [1]
Company News
- Alibaba Group is launching a large-scale AI talent recruitment plan, aiming to hire nearly 1,000 people focusing on advanced technologies such as large language models and AI hardware, with positions available in major cities like Beijing and Shanghai [2]
- China Telecom reported a revenue of 271.5 billion yuan for the first half of the year, a year-on-year increase of 1.3%, and a net profit of 23 billion yuan, up 5.5% year-on-year [3]
- CK Hutchison Holdings reported a revenue of 240.66 billion HKD for the first half of the year, a year-on-year increase of 3.45%, but a significant net profit decline of 91.65% to 850 million HKD [4]
- JD.com reported a second-quarter revenue of 356.7 billion yuan, a year-on-year increase of 22.4%, but a net profit decline of approximately 50.8% to 6.2 billion yuan [4]
- NetEase reported a revenue of 56.72 billion yuan for the first half of the year, a year-on-year increase of 8.37%, and a net profit of 18.90 billion yuan, up 31.33% year-on-year [5]
Institutional Insights
- Analysts from Zhongtai International noted that the current valuation of Hong Kong stocks has significantly recovered, with the Hang Seng Index's forecast PE returning to the mid-level of 2018-2019, and the risk premium at a historical low [6]
- Guotai Junan analysts indicated that the overall pressure from capital outflows in Hong Kong stocks may be relatively controllable, with an expected net inflow of over 1.2 trillion yuan for the year [6]
- Ping An Securities highlighted that despite uncertainties from U.S. tariffs, China's export data in July was unexpectedly strong, supporting a bullish outlook for Hong Kong stocks [6]
- Everbright Securities stated that the overall profitability of Hong Kong stocks remains strong, with relatively scarce assets in sectors like internet, new consumption, and innovative pharmaceuticals, suggesting a favorable long-term investment outlook [6]
Stop Staring at GPT-5! Google's Genie 3 World Model Is the Real Core Battleground of Future AI
老徐抓AI趋势· 2025-08-15 04:00
Core Viewpoint
- The article emphasizes that while GPT-5 is receiving significant attention, the true focus should be on Google DeepMind's Genie 3, which represents a breakthrough in world modeling technology that could reshape the AI landscape [2][5].
Summary by Sections
Introduction
- The AI community is currently focused on GPT-5, but there is a risk of overlooking Genie 3, which is considered more significant [2].
World Model Definition
- World models generate interactive and logically consistent environments, allowing users to explore and interact, unlike traditional video which is static and fixed [6].
Genie 3 Demonstration
- Genie 3 can create a persistent world where changes made by users are retained, showcasing its ability to maintain logical consistency [9][11].
Disruptive Potential of World Models
- World models could democratize high-quality content creation, significantly reducing costs in gaming and film production, and have potential applications in robot training [14][20].
Applications in Autonomous Driving
- World models can generate training scenarios for autonomous vehicles, allowing for efficient data generation that adheres to physical laws, thus lowering training costs [15][19].
Relation to Metaverse and Mirror World
- The advent of world models could lower the production costs associated with the metaverse, making it more feasible and aligning with the concept of mirror worlds that blend reality and virtuality [20].
Future Investment Opportunities
- Companies and investors interested in autonomous driving, robotics, and immersive virtual experiences should closely monitor developments in world modeling technology, as it is seen as a key driver for these industries [22].