Workflow
社交大模型
icon
Search documents
小红书提出社交大模型RedOne 2.0:兼听、敏行
量子位· 2025-11-18 00:59
Core Insights - The article discusses the launch of RedOne 2.0, a large model designed for social networking services (SNS), which utilizes reinforcement learning (RL) and lightweight supervised fine-tuning (SFT) to enhance user intent understanding and adaptability to diverse languages and cultures [1][6][35]. Group 1: Model Performance and Training Framework - RedOne 2.0 outperforms its predecessor in the SNS-Bench, demonstrating higher knowledge density and requiring less training data while achieving superior overall performance [2][20]. - The training framework of RedOne 2.0 is based on a three-stage progressive training method: exploration, targeted fine-tuning, and continuous optimization, which addresses the limitations of traditional SFT methods [8][23]. - The model shows significant improvements in various benchmarks, including General-Bench, SNS-Bench, and SNS-TransBench, indicating its strong generalization and domain-specific capabilities [18][20][21]. Group 2: Addressing Traditional Model Limitations - Traditional SFT methods often lead to performance imbalances, where improvements in one area can degrade performance in others, a challenge that RedOne 2.0 aims to overcome [5][8]. - The model's RL-driven approach allows for rapid adaptation to new trends and policies in the SNS environment, addressing the issue of slow model updates associated with traditional methods [5][6]. - RedOne 2.0's training strategy significantly reduces the need for large-scale labeled data, making it more efficient for deployment in various scenarios [7][8]. Group 3: User Experience and Business Value - The implementation of RedOne 2.0 has led to a 0.43% increase in core business metrics, indicating a measurable enhancement in user engagement and community activity [27][28]. - The model has improved content quality, with a reduction in vague titles by 11.9% and increases in practical, authentic, and interactive titles by 7.1%, 12.9%, and 25.8% respectively [27][28]. - Case studies demonstrate that RedOne 2.0 generates more engaging and interactive content compared to baseline models, effectively aligning with user preferences [31][34]. Group 4: Future Prospects - The team plans to expand RedOne 2.0's capabilities in multi-modal and multi-language contexts, exploring applications in complex scenarios such as cross-cultural communication [35][36]. - There is an intention to apply the RL-based training framework to other verticals like finance, healthcare, and education, addressing the balance between domain adaptation and general capabilities [35][36].
小红书宣布推出首个社交大模型“RedOne”
Zhi Tong Cai Jing· 2025-08-01 01:56
Core Insights - The company Xiaohongshu has launched its first social large model "RedOne," which is a customized LLM aimed at enhancing performance in the SNS (Social Network Service) sector [1] - RedOne shows an average improvement of 14.02% across eight major SNS tasks compared to baseline models, and a 7.56% enhancement in bilingual evaluation benchmarks [1] - In online testing, RedOne reduces harmful content detection exposure by 11.23% and increases post-view search click-through rates by 14.95% compared to single-task fine-tuned baseline models [1]
小红书提出首个社交大模型:八大社交任务平均提升14.02%
量子位· 2025-08-01 00:46
Core Viewpoint - Xiaohongshu has launched RedOne, a customized large language model (LLM) designed for the social networking service (SNS) sector, aiming to enhance performance across various SNS tasks and improve user interaction and content governance [1][5]. Summary by Sections Introduction of RedOne - RedOne is the first social model that integrates social understanding with platform rules, achieving an average performance improvement of 14.02% across eight major SNS tasks compared to baseline models [1][5]. - In online tests, RedOne reduced harmful content exposure by 11.23% and increased post-view search click-through rates by 14.95% [1][32]. Challenges in SNS Data - SNS data is characterized by high non-standardization, strong contextual dependence, and significant emotional variability, posing challenges for traditional NLP platforms [3][4]. - Existing models often focus on single-task performance, leading to diminishing returns and inadequate adaptability in multi-language and multi-cultural contexts [4]. Training Framework of RedOne - RedOne employs a three-stage training strategy: Continue Pretraining (CPT), Supervised Fine-Tuning (SFT), and Preference Optimization (PO) [5][19]. - The model is trained on a large-scale dataset exceeding 100 billion tokens, combining general high-quality data and SNS-specific data [9][12]. Data Collection and Quality Control - Data is collected from high-quality open-source corpora and SNS platforms, ensuring a diverse representation of social communication styles [8][10]. - A rigorous data filtering process is implemented to maintain high quality, resulting in a final dataset of 20 billion tokens for training [11][12]. Supervised Fine-Tuning (SFT) - SFT focuses on aligning the model's capabilities with real-world application needs, incorporating various tasks such as content understanding and user behavior modeling [15][16]. - The training process emphasizes retaining typical SNS language styles to ensure the model's relevance in real-world scenarios [16][17]. Preference Optimization (PO) - PO enhances the model's alignment with human preferences and platform rules by utilizing a preference dataset constructed through expert annotations and structural information from data labels [20][21]. - Direct Preference Optimization (DPO) is employed to refine the model's outputs, improving its adaptability to SNS environments [22]. Performance Comparison - RedOne outperforms baseline models in various tasks, demonstrating significant improvements in both general and SNS-specific benchmarks [23][26]. - The model's performance continues to improve with increased scale, indicating its potential for further advancements [26][28]. Online Experiment Results - In real-world applications, RedOne significantly reduced harmful content exposure and improved user engagement metrics, showcasing its practical value [32]. Generalization and Robustness - Experiments indicate that incorporating general domain data enhances the model's generalization capabilities, particularly in out-of-distribution tasks [35]. Future Outlook - RedOne represents a significant advancement in addressing the challenges of content governance and interaction quality in SNS, providing a reference for future specialized LLM applications [37].