自然语言处理
Search documents
报名开启!别再一个人刷论文了,来ACL 2025论文分享会一起面对面交流
机器之心· 2025-06-24 01:46
2025 年已经过半,AI 领域依旧保持着高速发展的势头。从大模型的演化,到多模态系统的融合,再到推理能力与可解 释性的持续突破,AI 正以前所未有的节奏快速前进。 然而,AI 的发展速度之快,也让人几乎难以跟上节奏。新模型、新框架层出不穷,几乎每隔数周就有突破性进展刷新 人们的认知。 在这样的背景下,如何掌握最前沿的技术动态,已成为每一位 AI 从业者面临的共同挑战。仅靠零散的信息获取已远远 不够,系统地参与权威学术交流、深入学习最新研究成果、与顶尖研究者保持对话,正变得愈发重要。 学术会议,尤其是 ACL、NeurIPS、ICML、CVPR 等全球顶级会议,正是这些技术交汇的核心场域。无论是深入研讨 的论文,还是引发热议的前沿报告,都为我们提供了观察 AI 发展脉络的绝佳窗口。 作为 NLP 领域最具影响力的会议之一,ACL 每年都吸引了广大学者参与。今年 ACL 总投稿数高达 8000 多篇,创历 史之最。今年 ACL 2025 将于 7 月 27 日 - 8 月 1 日在奥地利维也纳开幕。 时间:北京时间 7 月 19 日 09:00-17:30 更多详细日程,敬请关注机器之心后续公告。 合作伙伴介绍 ...
研判2025!中国自然语言处理行业产业链、相关政策及市场规模分析:技术突破推动行业增长,低成本算力与小样本学习加速技术落地[图]
Chan Ye Xin Xi Wang· 2025-06-08 02:10
Core Insights - The natural language processing (NLP) industry in China is projected to reach a market size of approximately 12.6 billion yuan in 2024, reflecting a year-on-year growth of 14.55% [1][15] - The cost of model training has significantly decreased due to the "East Data West Computing" initiative, which provides low-cost computing power, and the adoption of few-shot learning frameworks has reduced the demand for training data by 90% [1][15] - Major companies in the NLP sector include Baidu, iFlytek, and Alibaba, each leveraging their technological strengths to capture market share in various applications [2][17][21] Industry Overview - NLP is a crucial branch of computer science and artificial intelligence, aimed at enabling computers to understand, interpret, and generate human language [1][8] - The technology types in NLP are primarily categorized into rule-based methods, statistical methods, and deep learning methods [1][8] Industry Development History - The development of NLP in China has gone through four main stages: the initial phase (1950s-60s) focused on machine translation, the rule-dominated phase (1970s-80s) involved complex rule systems, the statistical learning phase (1990s-2012) integrated statistical models with machine learning, and the deep learning phase (2013-present) is characterized by the dominance of deep learning models and pre-trained language models [4][5][6] Industry Value Chain - The upstream of the NLP industry chain includes hardware devices, data services, open-source models, and cloud services, while the midstream focuses on NLP technology research and development, and the downstream encompasses applications in finance, healthcare, education, and smart manufacturing [1][8] Market Size - The NLP industry in China is experiencing significant growth, with a projected market size of 12.6 billion yuan in 2024, driven by advancements in pre-trained language models and reduced training costs [1][15] Key Companies' Performance - Baidu leads the NLP industry with a strong technological foundation and extensive commercialization, maintaining the largest market share [17][21] - iFlytek excels in voice recognition and machine translation, particularly in the education and healthcare sectors [17][20] - Alibaba has made breakthroughs in machine reading comprehension and natural language understanding, integrating its technology into various business scenarios [17][20] Industry Development Trends - The NLP industry is witnessing a trend towards the integration of large models and multimodal capabilities, enhancing performance and user interaction [24] - There is a growing focus on vertical applications in sectors like healthcare and finance, as well as the integration of NLP with smart hardware [26] - Data security and ethical standards are becoming increasingly important, driving sustainable development in the NLP sector [27]
Gemini2.5弯道超车背后的灵魂人物
Hu Xiu· 2025-06-05 03:14
《硅谷101》创始人泓君邀请了Energent.ai联合创始人Kimi Kong和HeyRevia创始人Shaun Wei,一起和两 位前Google的技术专家聊聊Gemini模型登顶背后的底层逻辑。 以下是这次对话内容的精选: 一、Gemini2.5崛起背后的底层逻辑 泓君:谷歌此次发布的Gemini 2.5 Pro,在当前各项评测中的数据都是所有大模型中最好的,Kimi你可 以分析一下它是如何做到的吗? 从去年在大会前夜被OpenAI的4o模型"精准狙击",到今年Gemini 2.5 Pro全面霸榜。短短一年时间, Gemini是如何完成从追赶者到领跑者的逆转? Kimi:我已经离开DeepMind快一年时间了,也不太清楚我的前同事们在这一年中又做了哪些新的创 新。但大语言模型训练根本的步骤是不变的,包括以下三点:Pre-training(预训练)、SFT(Supervised Fine-tuning,监督微调)和利用RLHF(基于人类反馈的强化学习)技术做的Alignment(对齐)。 大概在去年的NeurIPS(神经信息处理系统大会)上,业内已经普遍承认,公开网络数据基本都已经抓 完了,就像化石燃料已 ...
消失的人工客服,“智障”的AI客服
3 6 Ke· 2025-06-04 10:33
Core Insights - The increasing reliance on AI customer service in various industries has led to significant consumer dissatisfaction, particularly during high-demand periods like the e-commerce 618 shopping festival [1][2] - Complaints related to AI customer service have surged, with a reported 56.3% year-on-year increase in complaints related to "intelligent customer service" in the e-commerce after-sales service sector [2] - The effectiveness of AI customer service is being questioned, as many consumers prefer human customer service, which they find more effective despite longer wait times [2][3] Group 1: Consumer Experience - Many consumers report that AI customer service struggles to understand human language, leading to ineffective communication and frustration [1][2] - A significant portion of users (30.98%) feel that AI customer service does not adequately cater to vulnerable groups such as the elderly and disabled [2] - A test of 30 commonly used apps revealed that 40% failed to connect users to human customer service, with many requiring long wait times to reach a representative [3] Group 2: AI Customer Service Limitations - The primary shortcomings of AI customer service include an inability to resolve personalized issues, mechanical responses, and poor comprehension of inquiries [2] - The current implementation of AI customer service often prioritizes efficiency and cost-cutting over quality service, leading to a decline in overall consumer experience [6] - There is a need for businesses to enhance AI technology through better algorithms and natural language processing to improve service quality [7] Group 3: Recommendations for Improvement - Companies should not view AI and human customer service as mutually exclusive; instead, they should integrate both to enhance customer experience [6][7] - Providing a straightforward option for customers to reach human representatives is essential, especially for complex issues that AI cannot handle effectively [6] - Businesses should focus on optimizing AI for specific service scenarios to ensure it meets customer needs without compromising service quality [7]
微信ai客服怎么处理咨询?哪里查看记录?
Sou Hu Cai Jing· 2025-06-04 09:36
Group 1 - The core viewpoint of the article emphasizes the importance of WeChat AI customer service as a vital tool for communication between businesses and customers, enhancing customer satisfaction through quick responses and efficient problem handling [1][4]. Group 2 - The process of handling inquiries by WeChat AI customer service is highly automated, utilizing natural language processing to understand customer queries, search for relevant answers in its knowledge base, and escalate complex issues to human agents when necessary [4]. Group 3 - Viewing consultation records is crucial for assessing service quality and efficiency, with records accessible through the ChatWave backend management system, allowing businesses to analyze customer interactions and identify areas for improvement [5]. Group 4 - Strategies for optimizing inquiry handling include regularly updating the knowledge base, utilizing data analysis tools to understand common customer issues, incorporating user feedback for improving dialogue processes, and ensuring seamless transitions between AI and human customer service [6]. Group 5 - ChatWave offers significant advantages in inquiry handling, including strong natural language processing capabilities, support for multi-turn conversations, automation to enhance efficiency, and valuable insights from customer consultation data for product and service optimization [7][9].
腾讯申请一种文本处理模型训练等专利,提升模型改写能力
Jin Rong Jie· 2025-05-28 04:44
Group 1 - Tencent Technology (Shenzhen) Co., Ltd. has applied for a patent related to natural language processing technology, specifically for a text processing model training method and device [1] - The patent application, published as CN120045650A, was filed on November 2023 and aims to enhance the efficiency and quality of training datasets for text processing models [1] - The proposed method involves using multiple sample conversation data and preset rewriting instructions to generate annotated rewriting correlation data, which is then used to create a rewriting training set [1] Group 2 - Tencent Technology (Shenzhen) Co., Ltd. was established in 2000 and is primarily engaged in software and information technology services [2] - The company has a registered capital of 2 million USD and has made investments in 15 enterprises, participated in 254 bidding projects, and holds 5000 trademark and patent records [2] - Additionally, the company possesses 439 administrative licenses, indicating a robust operational framework [2]
以科技赋能传统文化,豆神动漫开拓传统文化交互体验新范式
Qi Lu Wan Bao Wang· 2025-05-23 16:19
Core Viewpoint - The development of the "Confucius Digital Human" 2.0 version by Dou Shen Animation represents a significant advancement in digital cultural products, utilizing cutting-edge technologies such as artificial intelligence, 3D modeling, and natural language processing to create an interactive and conversational digital representation of Confucius [1][3]. Group 1: Technology and Innovation - The "Confucius Digital Human" is not merely a virtual image but a highly intelligent interactive digital cultural carrier, capable of deep interaction and realistic expressions [3]. - The development team employed high-precision 3D modeling technology to accurately recreate Confucius's historical features, enabling the digital figure to speak, nod, blink, and express various emotions [3]. Group 2: Applications and Impact - The digital product can be widely applied in education, cultural exhibitions, academic research, tourism, and museums, serving as a powerful tool for the integration of digital economy and tourism industries [5]. - The company aims to leverage digital technology to transcend temporal boundaries, making Confucius accessible as a cultural mentor, and plans to continuously upgrade the technology for more refined and professional services [5].
人工智能专题:2025年中国人工智能与商业智能发展白皮书
Sou Hu Cai Jing· 2025-05-22 00:55
Core Insights - The report highlights the limitations of traditional Business Intelligence (BI) systems, which struggle to meet the demands for real-time and dynamic decision-making due to their closed architectures and static processing capabilities [1][21][24] - The integration of Artificial Intelligence (AI) with BI, termed Artificial Intelligence and Business Intelligence (ABI), is driving a shift from reactive to proactive decision-making, with ABI expected to experience explosive growth in China, reaching a market size of 800 million yuan in 2024 and a CAGR of 42% from 2024 to 2028 [1][11][13] - Key drivers for ABI growth include deepening enterprise reliance on data, breakthroughs in AI technology, and supportive policies [1][11] Industry Overview - ABI leverages technologies such as Natural Language Processing (NLP) and machine learning to enable conversational interactions, multimodal data analysis, and complex reasoning, enhancing decision-making across various sectors including finance, retail, manufacturing, government, and energy [2][3] - The financial sector utilizes ABI for intelligent risk control and quantitative trading, while retail benefits from dynamic pricing and inventory optimization [2][3] - Manufacturing employs predictive maintenance and process optimization to reduce downtime, and government sectors enhance service efficiency through smart traffic and urban governance [2][3] Market Dynamics - The ABI market in China is projected to grow from 300 million yuan in 2023 to 800 million yuan in 2024, driven by the increasing complexity of decision-making needs and the inadequacies of traditional BI tools [1][11][13] - ABI's core challenges include data governance lag, algorithm opacity, fragmented scenarios, and high technical costs, with future trends focusing on edge computing, real-time analysis, generative AI penetration, and privacy computing technologies [3][11] Technological Advancements - ABI employs advanced techniques such as Text2SQL and Text2DSL to convert natural language into data queries, enhancing the depth of analysis through external knowledge integration and multi-agent collaboration [2][3][30] - The integration of AI allows for the automation of data processing, significantly improving efficiency and enabling strategic decision-making by providing deeper insights and optimizing resource allocation [40][42] Future Outlook - The ABI landscape is evolving towards democratization and intelligence, reshaping the decision-making paradigm driven by data within enterprises [3][11] - Major global players like Microsoft and Salesforce focus on ecosystem integration, while domestic firms like Alibaba Cloud and Fanruan emphasize lightweight deployment and localized innovation [3][11]
一个「always」站在大模型技术C位的传奇男子
量子位· 2025-05-10 02:39
Core Viewpoint - The article highlights the significant contributions of Noam Shazeer in the AI field, particularly in the development of large language models (LLMs) and the Transformer architecture, emphasizing his role as a key figure in the evolution of AI technologies [9][10][12]. Group 1: Contributions to AI Technology - Shazeer is recognized as one of the most influential authors of the Transformer model, credited with pivotal advancements such as the introduction of the Mixture of Experts (MoE) architecture [10][18][24]. - His work on the paper "Attention Is All You Need" in 2017 is considered a foundational moment for LLMs, leading to widespread adoption and further innovations in the field [18][23]. - Shazeer has consistently anticipated technological trends, contributing to various breakthroughs, including the GShard framework for scaling models and the Switch Transformers, which achieved a parameter count of 1.6 trillion [30][33][41]. Group 2: Career and Achievements - Shazeer has a remarkable academic and professional background, having achieved a perfect score at the International Mathematical Olympiad in 1994 and later studying at Duke University [50][52]. - He joined Google as employee number 200 and made significant contributions to various projects, including Google's search spelling correction and the development of machine learning systems for ad ranking and spam detection [55][56]. - After a brief period away from Google, he co-founded Character.AI, which gained a valuation of $1 billion before being acquired by Google for $2.7 billion, leading to his return to the company [67][69]. Group 3: Impact on the Industry - Shazeer's innovations have laid the groundwork for current AI models, with many contemporary systems, including GPT-4 and others, building upon his research [41][44]. - His development of the Adafactor optimizer and Multi Query Attention (MQA) has been crucial for enhancing the efficiency of large models [43][44]. - The article concludes that Shazeer's foresight and contributions have positioned him as a defining figure in the current era of AI, with his work continuing to influence the direction of the industry [11][12][40].
海能投顾大数据中心打造精准投资决策支持系统
Sou Hu Cai Jing· 2025-05-08 11:57
Group 1 - The core infrastructure driving investment research upgrades is the financial big data center built by Haineng Investment Advisory, which has invested over 200 million yuan in a distributed computing cluster capable of processing 10PB of financial data daily, providing strong data support for investment decisions [1] - The "Data Cube" system integrates traditional financial data, alternative data, and satellite remote sensing information, with a proprietary commercial vitality index that analyzes mobile signaling data from 3,800 business districts to predict consumption trends 2-3 quarters in advance, achieving an excess return of 15.2% in the 2023 consumer sector layout [1] - The natural language processing engine can analyze financial news in 76 languages in real-time, with an accuracy rate of 92.4% for sentiment analysis, and it can structure 300 pages of documents in 30 seconds, improving efficiency by 400 times compared to manual analysis [1] - The "Factor Factory" platform has accumulated over 1,200 effective alpha factors, achieving an annualized stable return of 21.3% in the A-share market through a multi-factor model optimized by genetic algorithms, notably capturing three major turning points in the new energy sector through the unique "industry chain transmission factor" [1] Group 2 - The data middle platform of Haineng Investment Advisory adopts a microservices architecture, supporting agile development for business departments, allowing investment managers to build analysis models independently with visual tools, reducing strategy backtesting time from 3 days to 2 hours [2] - In 2023, the platform produced 187 effective investment strategies, with 63 strategies already implemented in practice and achieving excellent performance [2] - Future testing of quantum computing applications in portfolio optimization is expected to reduce the solving time for large-scale asset allocation problems from several hours to minutes, marking a revolutionary improvement in investment decision efficiency [2]