Large Language Models
NeurIPS 2025 | CAKE: A new LLM-driven recipe for Bayesian optimization that makes black-box optimization smarter and more efficient
机器之心· 2025-12-02 06:47
Core Insights
- The article discusses Context-Aware Kernel Evolution (CAKE), a new method for Bayesian optimization that uses large language models (LLMs) to design Gaussian Process (GP) kernel functions dynamically during the optimization process [5][6][14].
Group 1: Methodology
- CAKE recasts kernel design as an "evolutionary process," using LLMs to generate new kernel functions from the observations collected so far [17].
- The system maintains a "population" of kernel functions and evolves it with genetic operations such as crossover and mutation [19].
- BIC-Acquisition Kernel Ranking (BAKER) ranks kernel functions by both their model fit and their sampling potential, balancing optimization and exploration [21][22].
Group 2: Experimental Results
- CAKE was compared against three types of baseline: Fixed (a single SE, i.e. squared exponential, or M5, i.e. Matérn-5/2, kernel), Adaptive (random selection or BIC selection), and Compositional methods [25].
- In hyperparameter optimization tasks, CAKE achieved the highest final accuracy across all tested machine learning models and was especially sample-efficient in the early stages of optimization [27].
- In dynamic simulation tasks, CAKE outperformed all baseline methods, proved robust to environmental changes, and achieved high scores on challenging tasks [28].
Group 3: Advantages and Future Directions
- CAKE is highly interpretable: it can give human-readable explanations of the kernel structures generated during optimization [34][37].
- The framework is expected to evolve further by adopting a more general kernel function syntax and by extending its core ideas to other kernel-based machine learning methods, such as SVM and kernel PCA [42].
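The ranking idea in the methodology summary above can be illustrated with a small sketch. This is not the authors' BAKER implementation: the min-max normalization, the linear weighting, the `weight` parameter, the function names, and every number in `CANDIDATES` are assumptions made up for illustration. Only the general idea, scoring each kernel by a BIC-based fit term together with an acquisition-based term, comes from the summary.

```python
import math


def bic(log_likelihood, num_params, num_obs):
    """Standard Bayesian Information Criterion; lower is better."""
    return num_params * math.log(num_obs) - 2.0 * log_likelihood


def min_max(values):
    """Rescale values to [0, 1]; a constant list maps to 0.5 everywhere."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.5 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def rank_kernels(candidates, num_obs, weight=0.5):
    """Return kernel names ordered from best to worst.

    ASSUMPTION: the score is weight * fit + (1 - weight) * acquisition,
    with both terms min-max normalized. The source does not give the
    exact formula, so this combination is only a plausible stand-in.
    """
    bics = [bic(c["log_likelihood"], c["num_params"], num_obs) for c in candidates]
    fit_scores = [1.0 - s for s in min_max(bics)]  # lower BIC -> higher fit score
    acq_scores = min_max([c["acquisition"] for c in candidates])
    scored = [
        (weight * f + (1.0 - weight) * a, c["name"])
        for f, a, c in zip(fit_scores, acq_scores, candidates)
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [name for _, name in scored]


# Hypothetical population of kernels with made-up fit statistics.
CANDIDATES = [
    {"name": "SE", "log_likelihood": -12.0, "num_params": 2, "acquisition": 0.30},
    {"name": "SE+PER", "log_likelihood": -6.0, "num_params": 5, "acquisition": 0.55},
    {"name": "LIN*SE", "log_likelihood": -10.0, "num_params": 3, "acquisition": 0.10},
]

if __name__ == "__main__":
    print(rank_kernels(CANDIDATES, num_obs=20))
```

Setting `weight=1.0` ranks purely by BIC, which corresponds to the "BIC selection" baseline mentioned in the experimental results; lower values give more say to the acquisition term. In an evolutionary loop, the top-ranked kernels would be the ones kept as parents for the next round of crossover and mutation.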
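Shenyan Intelligence (深演智能) pushes for a Hong Kong listing: 2024 net profit plunged 64.6%; customer concentration soared to 70.2% in the first half of 2025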
Xin Lang Cai Jing· 2025-12-02 00:26
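Shenyan Intelligence (深演智能) positions itself as a decision-AI technology company for marketing and sales scenarios. Its core products include the intelligent ad placement platform AlphaDesk and the intelligent data management platform AlphaData, and in 2025 it added the AI agent system Deep Agent. The business mix, however, is markedly unbalanced: intelligent ad placement rose from 82.1% of revenue in 2022 to 93.3% in the first half of 2025 (after a dip to 80.5% in 2023), making it the overwhelmingly dominant business, while intelligent data management shrank from 17.9% to 6.7%, indicating that the diversification strategy has failed.
Source: 新浪港股-好仓工作室 (Sina Hong Kong Stocks, Haocang Studio)
Main business: deepening reliance on ad placement, rising risk of an unbalanced business structure
Table: Shenyan Intelligence revenue composition by main business (unit: RMB 10,000)
Business segment | 2022 revenue share | 2023 revenue share | 2024 revenue share | H1 2025 revenue share
Intelligent ad placement | 82.1% | 80.5% | 85.5% | 93.3%
Intelligent data management | 17.9% | 19.5% | 14.5% | 6.7%
Total | 100% | 100% | 100% | 100%
Notably, the newly added Deep Agent system has not yet generated meaningful revenue and cannot ease the risk of relying on a single business. The intelligent ad placement business depends heavily on purchasing media resources: in the first half of 2025, media resource procurement accounted for as much as 87.1% of cost of sales, which reflects weak cost control and limited bargaining power with upstream media agencies.
Financial performance: sharp swings in net profit 盈 ...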
DeepSeek releases the official version of V3.2
Xin Jing Bao· 2025-12-01 15:01
Core Insights
- DeepSeek announced the release of two official model versions: DeepSeek-V3.2 and DeepSeek-V3.2-Speciale [1]
Model Overview
- DeepSeek-V3.2 aims to balance reasoning capability and output length, making it suitable for everyday use, such as Q&A scenarios and general agent tasks [1]
- In benchmark tests for reasoning, DeepSeek-V3.2 achieved performance comparable to GPT-5, slightly below Gemini-3.0-Pro [1]
- Compared to Kimi-K2-Thinking, V3.2 significantly reduced output length, leading to lower computational costs and reduced user wait times [1]
Special Features
- DeepSeek-V3.2-Speciale is designed to push the reasoning capabilities of open-source models to the limit, exploring the boundaries of model performance [1]
- This version is an enhanced long-thinking variant of DeepSeek-V3.2, incorporating theorem-proving capabilities from DeepSeek-Math-V2 [1]
- The model exhibits excellent instruction-following, rigorous mathematical proof, and logical verification abilities, performing comparably to Gemini-3.0-Pro in mainstream reasoning benchmark tests [1]
OpenAI's big rout: GPT-5 is a "reskinned" GPT-4o, with zero pre-training breakthroughs in two and a half years
36Kr· 2025-12-01 02:12
Core Insights
- OpenAI is facing significant challenges with its pre-training processes, particularly for the GPT-5 model, which reportedly still relies on the foundation of GPT-4o [1][3][12]
- The company has not achieved substantial progress in scaling its pre-training efforts since the release of GPT-4o, leading to concerns about the performance of GPT-5 [7][12][20]
- Google's TPU technology is emerging as a strong competitor, potentially undermining NVIDIA's dominance in AI hardware, which OpenAI has heavily relied upon [5][26]
Pre-training Challenges
- OpenAI's pre-training for GPT-5 has been described as a failure, with the internal project "Orion" being downgraded to GPT-4.5 after it fell short of expectations [11][12]
- The pre-training phase is critical for developing generative AI models, and OpenAI's struggles in this area have raised questions about the capabilities of GPT-5 compared to its predecessors [29][39]
- Although algorithmic advances have reduced the physical computation required for training, OpenAI's Orion project took over 3 months, exceeding the typical training duration of 1-2 months [14][36]
Performance Comparisons
- The performance improvements of GPT-5 have been perceived as modest, with industry reactions indicating it is an enhancement of GPT-4o rather than a revolutionary upgrade [20][35]
- Benchmark comparisons show that Google's Gemini 3 has outperformed GPT-5 in several areas, highlighting the competitive landscape in AI model performance [31]
Strategic Shifts
- OpenAI is reportedly shifting focus towards a new model, codenamed "Shallotpeat," aimed at addressing the pre-training issues encountered with previous models [46][50]
- The company acknowledges the need for specialized models rather than a single "super model," reflecting a broader industry consensus on the diversification of AI applications [54][60]
- OpenAI's internal discussions indicate a recognition of Google's advancements in pre-training, marking a significant shift in the competitive dynamics of the AI landscape [27][29]
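Securities research report, morning meeting focus: Financial engineering, Wu Xianxing: What investment opportunities will the December A-share index rebalancing bring? (20251130)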
ZHONGTAI SECURITIES· 2025-11-30 12:54
Group 1: Investment Opportunities in A-Share Index Adjustment
- The upcoming December index adjustment is expected to create significant investment opportunities, particularly for stocks with a positive impact coefficient above 2, such as Tapa Group, Jiangzhong Pharmaceutical, and Zhengbang Technology [3][4]
- The report highlights the importance of focusing on stocks that are newly added to major indices, with particular attention to Guangqi Technology and Zhongtian Technology, which are expected to experience substantial liquidity changes [4]
- The passive fund outflows from stocks like Zhongji Xuchuang and Xinyi Sheng are projected to be limited due to their strong liquidity, despite their weights being reduced in various indices [4][5]
Group 2: Animation and Film Industry Insights
- The film industry is experiencing a recovery, with total box office revenue expected to exceed 50 billion yuan, driven by high-quality imported films and a resurgence in audience engagement [6][7]
- The market is shifting towards high-quality content, with a notable increase in the contribution of narrative films to box office performance, indicating a growing demand for deep content [7]
- Regulatory policies are expected to support the film industry, with initiatives aimed at expanding the understanding of mainstream themes and enhancing the supply of animated films and imported content [7][8]
Group 3: Public REITs Market Development
- The introduction of commercial real estate REITs marks a significant shift in China's public REITs market, moving from a focus solely on infrastructure to a dual focus on infrastructure and commercial real estate [8][9]
- The potential market size for commercial real estate REITs is estimated to be between 800 billion and 1.5 trillion yuan, indicating a substantial opportunity for asset securitization in the commercial property sector [9][10]
- The development of commercial real estate REITs is expected to enhance the liquidity and operational efficiency of the real estate market, addressing long-standing challenges in asset management [10][11]
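Confucius Institute at Khon Kaen University, Thailand, actively aligns with the HSK 3.0 (Chinese Proficiency Test) standard
People's Daily Online (人民网), International Channel, original article· 2025-11-30 04:01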
Core Viewpoint
- The Confucius Institute at Khon Kaen University in Thailand has launched a new academic year of Chinese elective courses, aligning with the HSK 3.0 standard, which is a significant reform in international Chinese education set to be implemented on November 18, 2025 [1][2].
Group 1: HSK 3.0 Standard Implementation
- The HSK 3.0 standard integrates artificial intelligence and large language models into the curriculum, adjusting vocabulary, grammar, topics, and task outlines, while also adding a character outline [1].
- The teaching team at Khon Kaen University has systematically organized vocabulary and grammar points based on the HSK 2.0 exam outline, ensuring that the new curriculum aligns closely with the updated HSK 3.0 standard [2].
Group 2: Course Development and Structure
- The Confucius Institute has developed a four-category course system covering core Chinese language skills, including comprehensive Chinese, Chinese listening and speaking, character reading and writing, and Chinese cultural knowledge [2].
- The upgraded 101 comprehensive Chinese course resources, which include student books, teacher manuals, and supporting materials, have been well received by students, enhancing teaching professionalism and efficiency [2].
Group 3: Future Plans
- Starting in March 2025, the Confucius Institute will continue to update and develop the Chinese elective courses in line with the HSK 3.0 standard, with plans to complete the curriculum for levels one to three by 2027 [3].
Bank of America responds to Google's TPU taking business from NVIDIA's GPUs: share will certainly fall, but not overnight
Zhi Tong Cai Jing· 2025-11-28 12:57
Core Insights
- Google's TPU has gained significant recognition, leading to claims that it has surpassed NVIDIA in the market, prompting NVIDIA to assert its continued dominance in GPU technology [1]
Market Share and Competition
- Bank of America projects NVIDIA's market share will decline from approximately 85% to around 75% due to increased competition, particularly from Google's TPU, although this shift will occur gradually [2][6]
- The current supply chain constraints and NVIDIA's scale advantages hinder competitors from rapidly capturing market share [7]
AI Model Development
- The competition in large language models (LLMs) is described as a long-term marathon, with recent releases from Google and Anthropic indicating a dynamic landscape [4]
- Google's TPU has been integral to the training of its Gemini models, with plans to potentially lease TPU to Meta by 2026, which could intensify competition for existing GPU suppliers [4]
Company Valuations
- AMD's target price is set at $300, reflecting anticipated growth in AI and CPU market share [9]
- Broadcom's target price is $400, supported by strong earnings growth and profitability in the semiconductor sector [10]
- NVIDIA's target price is $275, justified by its leading position in the rapidly growing AI computing market [11]
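BaiRong YunChuang (百融云创) loan-facilitation unit repeatedly draws complaints over 36% interest rates; company responds that its contracts are compliant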
Zhong Guo Jing Ji Wang· 2025-11-28 06:13
Core Insights
- BaiRong YunChuang (百融云创), a financial technology service provider, has demonstrated strong growth in the financial digitalization sector, with loan service revenue exceeding 800 million yuan in the first half of the year [1]
- Despite the revenue growth, there are ongoing complaints regarding high interest rates associated with its loan services, with reported annualized rates reaching as high as 35.95% [1]
- The company claims that it has not received complaints regarding interest rates reaching 36%, asserting that all loan contracts are compliant and clearly state the applicable rates [1]
Financial Performance
- BaiRong YunChuang reported a revenue of 2.929 billion yuan for 2024, reflecting a year-on-year growth of 9% [4]
- The revenue from Model as a Service (MaaS) was 932 million yuan, up 5% year-on-year, while Business as a Service (BaaS) revenue reached 1.997 billion yuan, growing by 12% [4]
- The net profit for the year was 266 million yuan, a decline of 21%, with the net profit margin decreasing from 13% to 9% [4]
Regulatory Context
- The National Financial Regulatory Administration has issued guidelines requiring commercial banks to clearly define service fees in cooperation agreements and ensure that the total financing costs align with legal standards [2]
- The Supreme People's Court has emphasized the need to regulate high-interest loans and support borrowers in reducing excessive interest rates that exceed 24% annually [3]
- BaiRong YunChuang's subsidiary, Rongshu Loan, operates under these regulatory frameworks, providing intelligent financial services [3]
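Tencent Advertising Algorithm Competition concludes successfully; several contestants receive Tencent offer letters of intent on site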
Sou Hu Cai Jing· 2025-11-28 04:16
Core Insights
- The 2025 Tencent Algorithm Competition held its finals in Shenzhen, with over 2800 teams participating globally on the theme of "multi-modal generative recommendation" [1][5]
- The champion team "Echoch," consisting of members from Huazhong University of Science and Technology, Peking University, and University of Science and Technology of China, received a Tencent offer and cash prizes [1]
- The competition attracted over 8400 participants from nearly 30 countries, marking a historical high for overseas registrations [5]
Competition Overview
- The finals featured 20 teams that came through a rigorous selection process, showcasing innovative generative recommendation algorithms [1]
- A special technical innovation award of 200,000 yuan was granted to the team "料峭春风吹酒醒" from the Institute of Computing Technology, Chinese Academy of Sciences [1]
Technological Insights
- The competition emphasized the application of advanced technologies such as LLM (Large Language Models) and MLLM (Multi-modal Large Language Models), leading to significant innovations in model performance [3]
- Generative recommendation technology is seen as crucial for improving advertising precision and user experience, allowing for personalized ad recommendations [5]
Industry Implications
- Tencent's Vice President, Jiang Jie, highlighted the competition's role in attracting young talent to AI, reinforcing Tencent's commitment to technological innovation and collaboration between academia and industry [3]
- The competition's dataset will be open-sourced after the event to foster further technical exchange between academia and industry [5]
Business Development
- Tencent's Q3 financial report introduced the "Tencent Advertising AIM+" smart advertising product matrix, which optimizes marketing returns for advertisers [6]
- The ongoing exploration of generative recommendation technologies within Tencent's advertising business aims to enhance user experience and drive commercial growth [6]
Amazon Research Awards recipients announced: Wang Jindong (王晋东) among 26 Chinese scholars selected
机器之心· 2025-11-28 04:11
Core Insights
- The Amazon Research Awards (ARA), which fund research across multiple disciplines, announced 63 recipients from 41 universities across 8 countries, 26 of them Chinese scholars [1][2].
AI Information Security
- Eight researchers in AI information security received awards, three of them Chinese scholars [3].
- Zhou Li from the University of California, Irvine, focuses on using LLM for precise and analyst-friendly attack tracing in audit logs [4].
- Yu Meng from the University of Virginia studies weakly supervised RLHF, modeling ambiguity and uncertainty in human preferences [5].
- Ziming Zhao from Northeastern University specializes in system and software security, network security, and human-centered security research [6].
Amazon Ads
- Both awardees in the Amazon Ads research area are Chinese [8].
- Xiaojing Liao from the University of Illinois Urbana-Champaign investigates attack methods on large language models, focusing on interpretable vulnerability detection and remediation [10][11].
- Tianhao Wang from the University of Virginia works on differential privacy and machine learning privacy, designing practical algorithms [14].
AWS Agentic AI
- Thirty researchers were awarded in the Agentic AI category, including several Chinese scholars [16].
- Cong Chen from Dartmouth College aims to drive global energy transition through engineering methods based on optimization, economics, and modern machine learning [19].
- Chunyang Chen from the Technical University of Munich focuses on the intersection of software engineering, human-computer interaction, and AI [21].
Trainium Development
- Twenty awardees are involved in research related to Amazon's Trainium AI chips, with several being Chinese researchers [49].
- Kuan Fang from the University of Minnesota works on NetGenius for autonomous configuration and intelligent operation of next-generation wireless networks [50].
- Shizhong Han from the Lieber Institute focuses on revealing the genetic basis of brain diseases and translating genetic discoveries into new treatments [55].
Think Big Initiative
- Three researchers, including one Chinese scholar, were awarded under the Think Big initiative, which supports transformative ideas in scientific research [85].
- Tianlong Chen from the University of North Carolina at Chapel Hill utilizes molecular dynamics to empower protein AI models [88].