Workflow
大语言模型
icon
Search documents
Nicholas Chui:押注中国的“动物精神”正在回归
日经中文网· 2025-09-02 03:15
Core Viewpoint - The allocation of funds to Chinese stocks is increasing as investors recognize the Chinese government's shift towards economic support, marking a turning point for long-term growth expectations in China [1][2]. Group 1: Fund Flows and Market Performance - The return of funds to the Chinese market is not a short-term phenomenon, with a resurgence of investor confidence in China's long-term growth potential [2]. - Hong Kong's stock market has reached a high not seen in approximately 3 years and 10 months, while Shanghai's stock market is at its highest in nearly a decade [1]. Group 2: Consumer Sector Resilience - Consumer concept stocks, particularly in tourism and education, are performing strongly, supported by government policies and increasing national purchasing power [3]. - Companies like Xiaomi are diversifying their product offerings beyond smartphones to include electric vehicles and smart home appliances, enhancing brand recognition and product quality over time [3]. Group 3: Geopolitical Concerns - Concerns regarding US-China tensions persist, but there has been no panic selling among clients in response to tariff announcements, indicating a more measured approach to geopolitical risks [4][6].
大模型开始打王者荣耀了
量子位· 2025-09-02 01:40
Core Insights - The article discusses the implementation of the Think-In-Games (TiG) framework, which allows large language models to play the game Honor of Kings while learning in real-time, effectively bridging the gap between decision-making and action [1][3][4]. Group 1: TiG Framework Overview - TiG redefines decision-making based on reinforcement learning as a language modeling task, enabling models to generate strategies guided by language and optimize them through online reinforcement learning [3][4]. - The framework allows large language models to learn macro-level reasoning skills, focusing on long-term goals and team coordination rather than just micro-level actions [6][9]. - The model acts more like a strategic coach than a professional player, converting decisions into text and selecting macro actions based on game state [7][9]. Group 2: Training Methodology - The training process involves a multi-stage approach combining supervised fine-tuning (SFT) and reinforcement learning (RL) to enhance model capabilities [12][16]. - The research team utilized a "relabeling algorithm" to ensure each game state is tagged with the most critical macro action, providing a robust signal for subsequent training [9][11]. - The Group Relative Policy Optimization (GRPO) algorithm is employed to maximize the advantages of generated content while limiting divergence from reference models [9][11]. Group 3: Experimental Results - The results indicate that the combination of SFT and GRPO significantly improves model performance, with Qwen-2.5-32B's accuracy increasing from 66.67% to 86.84% after applying GRPO [14][15]. - The Qwen-3-14B model achieved an impressive accuracy of 90.91% after training with SFT and GRPO [2][15]. - The TiG framework demonstrates competitive performance compared to traditional reinforcement learning methods while significantly reducing data and computational requirements [17].
自搜索强化学习SSRL:Agentic RL的Sim2Real时刻
机器之心· 2025-09-02 01:27
Core Insights - The article discusses the development and effectiveness of SSRL (Structured Search Reinforcement Learning) in enhancing the training efficiency and stability of Search Agents using large language models (LLMs) [6][28] - SSRL demonstrates superior performance over traditional methods that rely on external search engines, achieving effective transfer from simulation to real-world applications (Sim2Real) [6][28] Group 1 - SSRL utilizes structured prompts and format rewards to effectively extract world knowledge from models, leading to improved performance across various benchmarks and reduced hallucination [2][6] - The research highlights the high costs and inefficiencies associated with current RL training methods for Search Agents, which include full-real and semi-real search approaches [7][13] - The introduction of SSRL allows for a significant increase in training efficiency, estimated at approximately 5.6 times, while maintaining a continuous increase in training rewards without collapse [31][32] Group 2 - Experiments show that models trained with SSRL outperform those relying on external engines, particularly in real-world search scenarios, indicating the importance of integrating real-world knowledge [28][31] - The article presents findings that suggest the combination of self-generated knowledge and real-world knowledge can enhance model performance, particularly through entropy-guided search strategies [34] - The integration of SSRL with TTRL (Task-Driven Reinforcement Learning) has shown to improve generalization and effectiveness, achieving up to a 67% performance increase in certain tasks [38][39]
维持推荐小盘成长,风格连续择优正确
2025-09-02 00:42
Summary of Key Points from the Conference Call Industry or Company Involved - The conference call primarily discusses the investment strategies and market outlook of CICC (China International Capital Corporation) focusing on small-cap growth stocks and various asset classes. Core Insights and Arguments - CICC maintains a positive outlook on small-cap growth style for September, despite a slight decline in overall indicators. Market conditions, sentiment, and macroeconomic factors support the continued superiority of small-cap growth in the coming month [1][2] - In asset allocation, CICC is optimistic about domestic equity assets, neutral on commodity assets, and cautious regarding bond assets. The macro expectation gap indicates a bullish stance on stocks, particularly small-cap and dividend stocks, while being bearish on growth stocks [3][4] - The industry rotation model for September recommends sectors such as comprehensive finance, media, computer, banking, basic chemicals, and real estate, based on price and volume information. The previous month's recommended sectors achieved a 2.4% increase [5] - The "growth trend resonance" strategy performed best in August with a return of 18.1%, significantly outperforming the mixed equity fund index for six consecutive months [7] - Year-to-date (YTD) performance of CICC's various strategies is strong, with an overall return of 43%, surpassing the Tian Gu Hang operating index by 15 percentage points. The XG Boost growth selection strategy has a YTD return of 47.1% [8] Other Important but Possibly Overlooked Content - The small-cap strategy underperformed expectations due to extreme market conditions led by large-cap stocks, which created a positive feedback loop for index growth. This indicates a potential phase of inefficacy for the strategy [6] - The active quantitative stock selection strategies include stable growth and small-cap exploration, with the latter showing mixed results in August. Despite positive absolute returns, small-cap exploration strategies lagged behind other indices [8] - CICC's quantitative team has developed various models based on advanced techniques like reinforcement learning and deep learning, with notable performance in stock selection strategies. The Attention GRU model, for instance, has shown promising results in both the market and specific indices [10]
吴恩达最新来信:是时候关注并行智能体了
具身智能之心· 2025-09-01 04:02
编辑丨量子位 点击下方 卡片 ,关注" 具身智能之心 "公众号 >> 点击进入→ 具身 智能之心 技术交流群 更多干货,欢迎加入国内首个具身智能全栈学习社区 : 具身智能之心知识星球 (戳我) , 这里包含所有你想要的。 人多,好办事。agent多,照样好办事! 在最新的Andrew's Letters中, 吴恩达 老师 就指出: 并行智能体正在成为提升AI能力的新方向。 信中,他描绘了这样的一些场景: 在这些场景中,多个agent协作,就像一支高效的agent team同时处理不同任务,速度快、效率高。 此外, 大语言模型token成本的不断下降,也让多个agent并行处理的方法变得可行。 多个agent 并行抓取分析网页, 更快速地生 成深度研究报告 。 多个agent 协同处理代码库的不同部分, 加快编程任务完成速度。 多个agent 在后台并行工作, 同时由一个 监督agent向用户提供反馈, 实现并行异步控制。 但就像网友指出的:如何协调多个agent呢? 这为我们理解AI能力的提升提供了新视角—— 不仅仅依靠更多的数据和算力,更重要的是让多个智能体 协同并行 工作。 并行智能体才是未来 以往,当我 ...
中金:维持百融云-W跑赢行业评级 上调目标价至14.8港元
Zhi Tong Cai Jing· 2025-09-01 03:03
Core Viewpoint - CICC maintains the non-GAAP net profit forecast for Bairong Cloud-W (06608) for 2025 and 2026, considering uncertainties in business operations due to regulatory tightening in the second half of the year. The target price is raised by 15% to HKD 14.8, reflecting a 20% upside potential based on adjusted P/E ratios of 19.6x and 15.3x for 2025 and 2026 respectively [1] Group 1 - Bairong Cloud's non-GAAP net profit for 1H25 exceeded CICC's expectations, with revenue increasing by 22% year-on-year to CNY 1.61 billion and gross profit also up by 22% to CNY 1.18 billion, resulting in a gross margin of 73.4% [2] - The non-GAAP net profit for 1H25 rose by 29% year-on-year to CNY 254 million, with a net profit margin of 15.8%, driven by the rapid growth in the financial industry's cloud supply and demand [2] Group 2 - The MaaS business showed a double-digit growth recovery, with revenue increasing by 19% year-on-year to CNY 502 million, and a customer retention rate of 98% [3] - The average revenue per customer increased by 14% year-on-year to CNY 2.28 million, indicating strong growth potential as the company plans to expand its product applications into non-financial sectors [3] Group 3 - The BaaS service revenue grew by 23% year-on-year to CNY 1.1 billion, with the financial industry cloud segment achieving a 45% increase in revenue to CNY 857 million, accounting for 77% of BaaS revenue [4] - The growth in the BaaS segment is attributed to improved cost conversion efficiency and increased client budget allocations, while the insurance industry cloud revenue declined by 19% year-on-year to CNY 253 million due to regulatory impacts [4]
中金:维持百融云-W(06608)跑赢行业评级 上调目标价至14.8港元
智通财经网· 2025-09-01 03:00
Core Viewpoint - The report from CICC maintains the profit forecast for Bairong Cloud (06608) for 2025 and 2026, considering uncertainties in business operations due to tightening regulations in the second half of the year, while raising the target price by 15% to HKD 14.8, reflecting a potential upside of 20% based on adjusted P/E ratios for 2025 and 2026 [1] Group 1: Financial Performance - Bairong Cloud's non-GAAP net profit for the first half of 2025 exceeded expectations, with revenue increasing by 22% year-on-year to CNY 1.61 billion, gross profit also up by 22% to CNY 1.18 billion, and gross margin rising by 0.2 percentage points to 73.4% [2] - The non-GAAP net profit for the same period increased by 29% year-on-year to CNY 254 million, with a net profit margin of 15.8%, up by 0.8 percentage points, driven by a strong demand-supply resonance in the financial cloud sector [2] Group 2: Business Segments - The MaaS business showed a recovery with a 19% year-on-year revenue increase to CNY 502 million, maintaining a high customer retention rate of 98% and a 14% increase in average revenue per customer to CNY 2.28 million, contributing to stable revenue growth [3] - The BaaS service revenue grew by 23% year-on-year to CNY 1.1 billion, with the financial cloud segment achieving a 45% increase to CNY 857 million, accounting for 77% of BaaS revenue, driven by improved cost efficiency and increased client budgets [4] - The BaaS insurance cloud segment faced challenges, with revenue declining by 19% year-on-year to CNY 253 million, despite a 9% increase in premium scale, primarily due to regulatory impacts leading to product withdrawals and reduced commission rates [4]
科研智能体「漫游指南」—助你构建领域专属科研智能体
机器之心· 2025-09-01 02:49
欢迎关注中国科学院自动化研究所 & 北京中关村学院 & 芝加哥大学 & 西湖大学 & 腾讯带来的科研智能体方面的最新综述调研。 当前基于大语言模型(LLM)的智能体构建通过推动自主科学研究推动 AI4S 迅猛发展,催生一系列科研智能体的构建与应用。然而人工智能与自然科学研 究之间认知论与方法论的偏差,对科研智能体系统的设计、训练以及验证产生着较大阻碍。 与传统综述不同,本篇综述为大家呈现了科研智能体的 「 漫游指南 」 ,旨在提供构建科研智能体的 「 说明指南 」 :从科学研究的全周期出发,概述了科 研智能体的分级策略,并详细阐述了对应等级的构建策略与能力边界;同时该 「 漫游指南 」 详细阐明了如何从头构建科研智能体,以及如何对科研智能体 的定向能力进行增强。同时 「 指南 」 中详细涵盖了科研智能体的概念阐述、构建方案、基线评估以及未来方向。 希望本 「 漫游指南 」 能启发 AI 研究者与具体自然科学研究者,促进 AI 与自然科学之间的深度融合。 综述的核心贡献如下: 论文地址:https://doi.org/10.36227/techrxiv.175459840.02185500/v1 仓库地址:ht ...
硬蛋创新(00400.HK)中期经营溢利2.76亿元 同比增加约20.8%
Ge Long Hui· 2025-08-29 16:56
Group 1 - The company reported a revenue of approximately RMB 6.677 billion for the six months ending June 30, 2025, representing a year-on-year increase of about 54.5% [1] - Operating profit was approximately RMB 276 million, an increase of about 20.8% year-on-year [1] - Net profit after tax was approximately RMB 190 million, reflecting a year-on-year increase of 12.4% [1] - Earnings per share stood at RMB 0.086 [1] Group 2 - The rapid penetration of AI applications has become a core driver of growth in the global semiconductor market [1] - According to the World Semiconductor Trade Statistics (WSTS), the global semiconductor market size reached USD 346 billion in the first half of the year, marking an 18.9% year-on-year growth [1] - The demand related to AI has been particularly significant, with a substantial increase in the need for high-performance GPUs, dedicated AI accelerators, and advanced storage chips [1] - Major global cloud service providers have significantly increased capital expenditures to expand AI training and inference server clusters, further driving the growth in shipments of high-end AI chips [1]
吴恩达最新来信:是时候关注并行智能体了
量子位· 2025-08-29 11:37
Core Viewpoint - The article emphasizes the emerging importance of parallel agents in enhancing AI capabilities, suggesting that collaboration among multiple agents can significantly improve efficiency and speed in task execution [1][3][4]. Summary by Sections Parallel Agents as the Future - The traditional approach to improving AI performance has relied heavily on scaling laws, which focus on increasing data and computational power. However, the article argues that the future lies in the ability of multiple agents to work in parallel [4][8]. Validation of Parallel Agents - Andrew Ng cites his previous work at Baidu and OpenAI as evidence that parallel agent methodologies can yield faster results compared to conventional methods that often require lengthy processing times [5][6]. Challenges in Coordination - The article highlights the inherent challenges in coordinating multiple agents to perform complex tasks, such as web analysis or software development, which can be difficult even for human teams [9][10]. Recent Research Developments - Two recent papers are mentioned that contribute to the understanding of parallel agents: - The first paper discusses how large language models can generate multiple trajectories during inference to enhance problem-solving efficiency in programming [11][13]. - The second paper introduces the Together Mixture Of Agents (MoA) architecture, which utilizes multiple large language models simultaneously to improve performance and allows for adjustments in the hierarchical structure of agents [14][15]. Future Research Directions - Ng concludes that there is still much research and engineering work needed to optimize the use of parallel agents, suggesting that the number of agents capable of working efficiently in parallel could be substantial [18]. Historical Context - The article references Ng's 2009 paper that demonstrated the large-scale application of GPUs in deep learning, marking a significant milestone in the field and underscoring the importance of parallel processing [19][20].