Workflow
推荐系统
icon
Search documents
突发!快手AI掌舵人周国睿即将离职,下一站爆出
Sou Hu Cai Jing· 2025-12-30 19:13
Core Insights - The news reports that Zhou Guorui, the head of Kuaishou's large model division, is set to leave the company, with his future plans currently unknown [2][4]. Company Overview - Zhou Guorui is a significant figure at Kuaishou, having held the position of Vice President and head of foundational large models and recommendation models [2][4]. - His LinkedIn profile indicates that he holds both bachelor's and master's degrees from Beijing University of Posts and Telecommunications, specializing in information and communication engineering [6]. Career Background - Prior to joining Kuaishou in 2021, Zhou worked at Alibaba's advertising division, focusing on deep learning applications in advertising ranking and model optimization [7][10]. - At Kuaishou, he advanced from the position of recommendation algorithm vice president to leading the large model and recommendation model teams [10]. Key Contributions - Zhou was instrumental in the development of the OneRec architecture, which significantly restructured the recommendation system, achieving larger models with lower costs [11][12]. - The OneRec system reportedly reduced operational costs to about one-tenth of previous levels while enhancing performance across various core business scenarios, including short video recommendations and e-commerce [12][14]. Future Implications - The immediate impact of Zhou's departure on Kuaishou's AI strategy is expected to be limited due to the established stability of the OneRec architecture and the company's commitment to self-developed recommendation models [18]. - However, the long-term effects may include challenges in technology iteration speed and potential instability in technical direction due to the loss of core talent [18].
NeurIPS 2025 | Language Ranker:从推荐系统的视角反思并优化大模型解码过程
机器之心· 2025-11-30 03:19
在大语言模型(LLM)的研究浪潮中,绝大多数工作都聚焦于优化模型的输出分布 —— 扩大模型规模、强化分布学习、优化奖励信号…… 然而, 如何将这些输 出分布真正转化为高质量的生成结果 —— 即解码(decoding)阶段,却没有得到足够的重视。 论文指出, LLM 可以被看作一种特殊的推荐系统,它把输入当作 "用户信息",在庞大的候选响应空间中为每位用户挑选最合适的响应。 如下图所示,大模型的关键组件与推荐系统可一一对应: 北京大学林宙辰、王奕森团队的论文《Language Ranker: A Lightweight Ranking Framework for LLM Decoding》提出了一种全新的视角: 将大模型的解码过程类比 为推荐系统中的排序阶段(Ranking Stage) 。这一视角揭示了现有解码方法的局限,并据此提出了高效、轻量的改进方案。 | Tianqi Du1* | Chenheng Zhang1* | Jizhe Zhang1 | Mingqing Xiao1,4 | Yifei Wang3 | | --- | --- | --- | --- | --- | | Yisen Wang1 ...
当推荐系统真正「懂你」:快手团队在NeurIPS 2025提出新成果TagCF
机器之心· 2025-11-27 04:09
论文标题: Who You Are Matters: Bridging Topics and Social Roles via LLM-Enhanced Logical Recommendation 背景和动机 用户理解:A Missing Formulation 论文: http://arxiv.org/abs/2505.10940 代码: https://github.com/Code2Q/TagCF 每天,推荐系统都在捕捉我们的兴趣与偏好。从刷过的视频到停留的直播间,算法总是聚焦在「内容」的理解上,推断用户喜欢哪类视频、哪种话题,擅长 在「内容层」识别用户喜欢什么,却很少真正理解「你是谁」。 快手消费策略算法团队注意到了这一问题,他们想让推荐系统不止「会猜」,而是「懂你」。为弥补这一缺失的建模角度, 快手消费策略算法团队 联合快 手基础大模型与应用部及武汉大学,提出了 TagCF 框架,让推荐系统从「知其然」迈向「知其所以然」。 该研究成果已被 NeurIPS 2025 接收,相关代码与实验框架已全面开源,旨在为学术界与工业界提供一套以「理解驱动」为核心的推荐系统方法论。 图 2 当推荐系统通过统计模型 ...
2018 - 2020,抖音超越快手的关键三年|42章经
42章经· 2025-11-16 12:59
Core Insights - The article discusses the rise of Douyin (TikTok) and its strategic decisions that led to its success, as shared by Yu Beichuan, a former employee who joined during its early days [2][3][11]. Group 1: Douyin's Growth Phases - Douyin was officially launched in 2016, with significant growth starting in mid-2017, leading to surpassing Kuaishou in daily active users (DAU) by early 2019 [3][11]. - The growth can be divided into several phases: initial growth from 2017 to 2018, rapid expansion from 2018 to 2019, and a focus on commercialization post-2020 [12][13][15]. - By the end of 2018, Douyin's DAU reached 30 million, and by early 2019, it had surpassed Kuaishou, becoming the leading short video platform [11][21]. Group 2: Key Strategic Decisions - Douyin's initial strategy involved not directing users from Toutiao, which allowed it to build a unique user base [46]. - The brand's youthful and independent aesthetic, along with strong content operations, attracted a younger audience [46][49]. - Significant marketing efforts included sponsoring the Spring Festival Gala in 2019, which resulted in a peak DAU of 470 million during the event [87][88]. Group 3: Challenges and Learnings - Despite rapid growth, there were internal concerns about the sustainability of user engagement and the potential DAU ceiling [21][22]. - Attempts to integrate social features were largely unsuccessful, highlighting the challenges of fostering user interaction in a primarily content-driven platform [24][27]. - The company learned that maintaining a balance between rapid growth and user retention was crucial, leading to a focus on enhancing user interaction [81][82]. Group 4: Organizational Culture and Impact - ByteDance's flat organizational structure allowed for direct communication across levels, fostering a culture of ambition and opportunity for young talent [100][106]. - The company's emphasis on extreme execution and strategic thinking contributed to its innovative approach and competitive edge in the market [114][121]. - As the company grew, maintaining its original culture became a challenge, leading to concerns about losing its competitive spirit [108][109].
小红书RecSys 2025最佳论文提名背后:破解视频时长预测难题
机器之心· 2025-10-20 04:50
Core Insights - The article highlights the impressive capabilities of Xiaohongshu's recommendation system, which has gained recognition at the RecSys 2025 conference for its innovative research and technology [4][6][7]. Group 1: Xiaohongshu's Recognition - Xiaohongshu's recommendation algorithm team received a "Best Paper Candidate" nomination at the prestigious RecSys 2025 conference for their paper on video watch time prediction [4][6]. - The conference is recognized as a leading academic event in the field of recommendation systems, attracting top scholars and industry experts from around the world [6][7]. - Xiaohongshu's technology and product have become focal points at the conference, with many attendees praising its recommendation capabilities as industry-leading [9][10]. Group 2: Research and Methodology - The paper titled "Multi-Granularity Distribution Modeling for Video Watch Time Prediction via Exponential-Gaussian Mixture Network" addresses the critical issue of predicting user watch time, which is essential for enhancing user engagement [17][22]. - The research identifies complex user behavior patterns in watch time, highlighting the challenges of skewed distributions and diverse viewing habits [30][31]. - The proposed Exponential-Gaussian Mixture Network (EGMN) model combines classic probability distributions to predict the complete probability distribution of watch time rather than a single value [33][35]. Group 3: Performance and Validation - EGMN demonstrated superior performance in offline experiments, achieving a 14.11% reduction in Mean Absolute Error (MAE) and a 7.76% increase in ranking consistency [39]. - Online A/B testing covering 15 million users over seven days showed significant improvements in key metrics, with a 19.94% decrease in KL divergence, indicating strong distribution fitting capabilities [40][41]. - Ablation studies confirmed the effectiveness of EGMN's components, validating the contributions of both the exponential and Gaussian components to the model's performance [42]. Group 4: Future Directions - The article emphasizes Xiaohongshu's commitment to a pragmatic approach in technology development, focusing on real user problems and continuous exploration of cutting-edge recommendation algorithms [46][47]. - The success at RecSys 2025 is seen as a starting point for further advancements in the recommendation system field, with the team actively seeking talent to enhance their research efforts [47].
ICML spotlight | 一种会「进化」的合成数据!无需上传隐私,也能生成高质量垂域数据
机器之心· 2025-07-11 09:22
Core Viewpoint - The article discusses the challenges of data scarcity in the context of large models and introduces the PCEvolve framework, which aims to generate synthetic datasets while preserving privacy and addressing the specific needs of vertical domains such as healthcare and industrial manufacturing [1][2][10]. Group 1: Data Scarcity and Challenges - The rapid development of large models has exacerbated the issue of data scarcity, with predictions indicating that public data generation will not keep pace with the consumption rate required for training these models by 2028 [1]. - In specialized fields like healthcare and industrial manufacturing, the availability of data is already limited, making the data scarcity problem even more severe [1]. Group 2: PCEvolve Framework - PCEvolve is a synthetic data evolution framework that requires only a small number of labeled samples to generate an entire dataset while protecting privacy [2]. - The evolution process of PCEvolve is likened to DeepMind's FunSearch and AlphaEvolve, focusing on generating high-quality training data from existing large model APIs [2]. Group 3: Limitations of Existing Large Models - Existing large model APIs cannot directly synthesize domain-specific data, as they fail to account for various characteristics unique to vertical domains, such as lighting conditions, sampling device models, and privacy information [4][7]. - The inability to upload local data due to privacy and intellectual property concerns complicates the prompt engineering process and reduces the quality of synthetic data [9][11]. Group 4: PCEvolve's Mechanism - PCEvolve employs a new privacy protection method based on the Exponential Mechanism, which is designed to adapt to the limited sample situation in vertical domains [11]. - The framework includes an iterative evolution process where a large number of candidate synthetic data are generated, followed by a selection process that eliminates lower-quality data based on privacy-protected scoring [11][19]. Group 5: Experimental Results - PCEvolve's effectiveness was evaluated through two main approaches: the impact of synthetic data on downstream model training and the quality of the synthetic data itself [21]. - In experiments involving datasets such as COVIDx and Came17, PCEvolve demonstrated significant improvements in model accuracy, with the final accuracy for COVIDx reaching 64.04% and for Came17 reaching 69.10% [22][23].
特想聊聊快手这次的变化
Hu Xiu· 2025-06-25 00:48
Core Viewpoint - Kuaishou has fully launched its AI model-driven recommendation system, OneRec, which is the first industrial-grade recommendation solution in the industry, setting a new standard globally [1][15]. Group 1: Technological Advancements - Kuaishou's technology has reached a top-tier level, particularly in video generation models [2]. - The company has made significant underlying technological advancements that surpass initial perceptions of it being merely a short video platform [3]. Group 2: Recommendation System Overview - Recommendation systems are a major technological innovation of the mobile internet era, utilized by popular platforms like Kuaishou, Douyin, and Pinduoduo [4]. - Traditional recommendation systems typically rely on user-based collaborative filtering and content-based collaborative filtering [4][6]. Group 3: Challenges in Traditional Systems - Traditional multi-stage recommendation systems face issues such as low overall GPU utilization and inefficiencies due to independent model operations [10][11]. - The complexity of user interests and the conflicting goals of increasing click-through rates while maintaining content diversity lead to decreased recommendation accuracy [9][10]. Group 4: OneRec's Innovations - OneRec shifts from a multi-stage filtering approach to an end-to-end model that directly generates a list of recommended videos based on user interests [16]. - The system employs a multi-modal semantic tokenizer to deeply understand video content beyond surface-level tags, enhancing content comprehension [21][24]. Group 5: User Modeling and Interest Tracking - OneRec integrates user behavior over time to create a comprehensive "interest sequence," allowing for more accurate recommendations that adapt to changing user preferences [28][30]. - The model uses deep neural networks to automatically learn complex interest changes from large datasets, improving recommendation accuracy [30]. Group 6: Recommendation Generation - The system utilizes an encoder-decoder structure, where the encoder compresses user interest trajectories into vectors, and the decoder generates a sequence of recommended content [32][33]. - The introduction of a Mixture of Experts (MoE) architecture enhances model capacity and efficiency, allowing for personalized recommendations while maintaining content diversity [34][36]. Group 7: Reinforcement Learning Integration - OneRec incorporates a reward mechanism using reinforcement learning to align user preferences with recommendation outcomes, enhancing the overall effectiveness of the system [38][44]. - The model's training includes various reward signals to ensure a balanced distribution of content types and to adapt to real-world business complexities [41][42]. Group 8: Performance Metrics - During the testing phase, OneRec demonstrated performance metrics comparable to existing complex systems, with user engagement metrics such as watch time and user lifecycle showing positive growth [46][47]. - In local life scenarios, OneRec achieved a 21% increase in GMV and significant growth in order volume and new customer acquisition [48]. Group 9: Future Considerations - Despite its advancements, OneRec still faces challenges related to inference speed, resource consumption, and further optimization of the reward mechanism [49]. - The introduction of OneRec marks a new phase in recommendation systems, aligning them with the latest advancements in AI and machine learning [49][50].
打破推荐系统「信息孤岛」!中科大与华为提出首个生成式多阶段统一框架,性能全面超越 SOTA
机器之心· 2025-06-20 10:37
Core Viewpoint - The article discusses the innovative UniGRF framework, which unifies retrieval and ranking tasks in recommendation systems using a single generative model, addressing inherent issues in traditional multi-stage recommendation paradigms [1][3][16]. Group 1: Pain Points of Traditional Recommendation Paradigms - Traditional recommendation systems typically employ a multi-stage approach, where a recall phase quickly filters a large item pool, followed by a ranking phase that scores and orders the candidates. This method, while efficient, often leads to information loss and performance bottlenecks due to the independent training of each phase [3][4]. - The separation of tasks can result in the premature filtering of potential interests outside the user's information bubble, causing cumulative biases and difficulties in inter-stage collaboration [3][4]. Group 2: Advantages of UniGRF - UniGRF integrates retrieval and ranking into a single generative model, allowing for full information sharing and reducing information loss between tasks [7]. - The framework is model-agnostic and can seamlessly integrate with various mainstream autoregressive generative model architectures, enhancing its flexibility [8]. - By maintaining a single model instead of two independent ones, UniGRF potentially improves efficiency in both training and inference processes [9]. Group 3: Key Mechanisms of UniGRF - The framework includes a Ranking-Driven Enhancer, which promotes effective collaboration between the recall and ranking phases by leveraging the high precision of the ranking outputs to guide the recall process [10][11]. - It also features a Gradient-Guided Adaptive Weighter that dynamically adjusts the weights of the loss functions for the two tasks based on their learning rates, ensuring synchronized optimization and overall performance enhancement [12]. Group 4: Experimental Results - Extensive experiments on three large public recommendation datasets (MovieLens-1M, MovieLens-20M, Amazon-Books) demonstrated that UniGRF significantly outperforms state-of-the-art (SOTA) models, highlighting the advantages of its unified framework [14][18]. - The framework shows particularly notable improvements in ranking performance, which is crucial as it directly impacts the quality of recommendations presented to users [18]. - Initial tests indicate that UniGRF adheres to the scaling law, suggesting potential performance gains with increased model parameters [18]. Group 5: Future Directions - The introduction of UniGRF offers a novel and efficient solution for generative recommendation systems, overcoming traditional multi-stage paradigm issues. Future research aims to expand the framework to include more recommendation stages and validate its large-scale applicability in real-world industrial scenarios [16][17].
推荐大模型来了?OneRec论文解读:端到端训练如何同时吃掉效果与成本
机器之心· 2025-06-19 09:30
Core Viewpoint - The article discusses the transformation of recommendation systems through the integration of large language models (LLMs), highlighting the introduction of the "OneRec" system by Kuaishou, which aims to enhance efficiency and effectiveness in recommendation processes [2][35]. Group 1: Challenges in Traditional Recommendation Systems - Traditional recommendation systems face significant challenges, including low computational efficiency, conflicting optimization objectives, and an inability to leverage the latest AI advancements [5]. - For instance, Kuaishou's SIM model shows a Model FLOPs Utilization (MFU) of only 4.6%/11.2%, which is significantly lower than LLMs that achieve 40%-50% [5][28]. Group 2: Introduction of OneRec - OneRec is an end-to-end generative recommendation system that utilizes an Encoder-Decoder architecture to model user behavior and enhance recommendation accuracy [6][11]. - The system has demonstrated a tenfold increase in effective computational capacity and improved MFU to 23.7%/28.8%, significantly reducing operational costs to just 10.6% of traditional methods [8][31]. Group 3: Performance Improvements - OneRec has shown substantial performance improvements in user engagement metrics, achieving a 0.54%/1.24% increase in app usage duration and a 0.05%/0.08% growth in the 7-day user lifecycle (LT7) [33]. - In local life service scenarios, OneRec has driven a 21.01% increase in GMV and an 18.58% rise in the number of purchasing users [34]. Group 4: Technical Innovations - The system employs a multi-modal fusion approach, integrating various data types such as video titles, tags, and user behavior to enhance recommendation quality [14]. - OneRec's architecture allows for significant computational optimizations, including a 92% reduction in the number of key operators, which enhances overall efficiency [27][28]. Group 5: Future Directions - Kuaishou's technical team identifies areas for further improvement, including enhancing inference capabilities, developing a more integrated multi-modal architecture, and refining the reward system to better align with user preferences [38].
特征工程、模型结构、AIGC——大模型在推荐系统中的3大落地方向|文末赠书
AI前线· 2025-05-10 05:48
Core Viewpoint - The article discusses the significant impact of large models on recommendation systems, emphasizing that these models have already generated tangible benefits in the industry rather than focusing on future possibilities or academic discussions [1]. Group 1: Impact of Large Models on Recommendation Systems - Large models have transformed the way knowledge is learned, shifting from a closed system reliant on internal data to an open system that integrates vast external knowledge [4]. - The structure of large models, typically based on transformer architecture, differs fundamentally from traditional recommendation models, which raises questions about whether they can redefine the recommendation paradigm [5]. - Large models have the potential to create a "new world" by enabling personalized content generation, moving beyond mere recommendations to directly creating tailored content for users [6]. Group 2: Knowledge Input Comparison - A comparison highlights that large models draw knowledge from an open world, while traditional systems rely on internal user behavior data, creating a complementary relationship [7]. - Large models possess advantages in knowledge quantity and embedding quality over traditional knowledge graph methods, suggesting they are the optimal solution for knowledge input in recommendation systems [8]. Group 3: Implementation Strategies - Two primary methods for integrating large model knowledge into recommendation systems are identified: generating embeddings from large language models (LLMs) and producing text tokens for input [10][11]. - The integration of multi-modal features through large models allows for a more comprehensive representation of item content, enhancing recommendation capabilities [13][15]. Group 4: Evolution of Recommendation Models - The exploration of large models in recommendation systems has progressed through three stages, from initial toy models to more industrialized solutions that significantly improve business metrics [20][24]. - Meta's generative recommendation model (GR) exemplifies a successful application of large models, achieving a 12.4% increase in core business metrics by shifting the focus from click-through rate prediction to predicting user behavior [24][26]. Group 5: Content Generation and Future Directions - The article posits that the most profound impact of large models on recommendation systems lies in the personalized generation of content, integrating AI creators into the recommendation process [28][29]. - Current AI-generated content still requires human input, but the potential for fully autonomous content generation based on user feedback is highlighted as a future direction [41][43]. Group 6: Industry Insights and Recommendations - The search and recommendation industry is viewed as continuously evolving, with the integration of large models presenting new growth opportunities rather than a downturn [45]. - The article suggests that the key to success in the next phase of recommendation systems lies in the joint innovation and optimization of algorithms, engineering, and large models [46].