Natural Language Processing
Registration is open! Stop grinding through papers alone and join the ACL 2025 paper sharing session for face-to-face exchange
机器之心· 2025-06-24 01:46
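With 2025 already more than half over, the AI field is still developing at high speed. From the evolution of large models, to the fusion of multimodal systems, to continued breakthroughs in reasoning ability and interpretability, AI is advancing at an unprecedented pace.
That pace, however, also makes it almost impossible to keep up. New models and frameworks emerge one after another, and a breakthrough resets expectations every few weeks.
Against this backdrop, keeping track of the latest technical developments has become a shared challenge for every AI practitioner. Piecemeal information gathering is no longer enough; taking part systematically in authoritative academic exchange, studying the latest research in depth, and staying in dialogue with top researchers is becoming ever more important.
Academic conferences, especially top global venues such as ACL, NeurIPS, ICML, and CVPR, are where these technologies converge. Both the in-depth papers and the widely discussed frontier talks offer an excellent window onto the trajectory of AI.
As one of the most influential conferences in NLP, ACL attracts a large number of scholars every year. This year ACL received more than 8,000 submissions, a record high. ACL 2025 will be held in Vienna, Austria, from July 27 to August 1.
Time: July 19, 09:00-17:30 Beijing time. For the detailed schedule, please follow later announcements from 机器之心.
Partner introduction ...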
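2025 analysis! China's natural language processing industry: value chain, relevant policies, and market size. Technical breakthroughs drive industry growth, while low-cost computing power and few-shot learning accelerate deployment [figures]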
Chan Ye Xin Xi Wang· 2025-06-08 02:10
Core Insights
- The natural language processing (NLP) industry in China is projected to reach a market size of approximately 12.6 billion yuan in 2024, reflecting a year-on-year growth of 14.55% [1][15]
- The cost of model training has significantly decreased due to the "East Data West Computing" initiative, which provides low-cost computing power, and the adoption of few-shot learning frameworks has reduced the demand for training data by 90% [1][15]
- Major companies in the NLP sector include Baidu, iFlytek, and Alibaba, each leveraging their technological strengths to capture market share in various applications [2][17][21]
Industry Overview
- NLP is a crucial branch of computer science and artificial intelligence, aimed at enabling computers to understand, interpret, and generate human language [1][8]
- The technology types in NLP are primarily categorized into rule-based methods, statistical methods, and deep learning methods [1][8]
Industry Development History
- The development of NLP in China has gone through four main stages: the initial phase (1950s-60s) focused on machine translation, the rule-dominated phase (1970s-80s) involved complex rule systems, the statistical learning phase (1990s-2012) integrated statistical models with machine learning, and the deep learning phase (2013-present) is characterized by the dominance of deep learning models and pre-trained language models [4][5][6]
Industry Value Chain
- The upstream of the NLP industry chain includes hardware devices, data services, open-source models, and cloud services, while the midstream focuses on NLP technology research and development, and the downstream encompasses applications in finance, healthcare, education, and smart manufacturing [1][8]
Market Size
- The NLP industry in China is experiencing significant growth, with a projected market size of 12.6 billion yuan in 2024, driven by advancements in pre-trained language models and reduced training costs [1][15]
Key Companies' Performance
- Baidu leads the NLP industry with a strong technological foundation and extensive commercialization, maintaining the largest market share [17][21]
- iFlytek excels in voice recognition and machine translation, particularly in the education and healthcare sectors [17][20]
- Alibaba has made breakthroughs in machine reading comprehension and natural language understanding, integrating its technology into various business scenarios [17][20]
Industry Development Trends
- The NLP industry is witnessing a trend towards the integration of large models and multimodal capabilities, enhancing performance and user interaction [24]
- There is a growing focus on vertical applications in sectors like healthcare and finance, as well as the integration of NLP with smart hardware [26]
- Data security and ethical standards are becoming increasingly important, driving sustainable development in the NLP sector [27]
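As a quick sanity check on the market-size figures above, the implied 2023 base can be backed out from the 2024 size and the year-on-year growth rate. This is a minimal sketch: the function name is illustrative, and the 2023 value is derived from the two quoted numbers, not stated in the report.

```python
def implied_prior_year(current_size: float, yoy_growth_pct: float) -> float:
    """Back out last year's size from this year's size and YoY growth (in %)."""
    return current_size / (1 + yoy_growth_pct / 100)


# Figures quoted in the summary above.
size_2024 = 12.6        # billion yuan
growth_pct = 14.55      # year-on-year growth, percent

# Derived value (an inference, not a figure from the report).
size_2023 = implied_prior_year(size_2024, growth_pct)
print(f"Implied 2023 market size: {size_2023:.2f} billion yuan")
```

Under these two inputs the implied 2023 base comes out at roughly 11 billion yuan.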
The key people behind Gemini 2.5's come-from-behind overtake
Hu Xiu· 2025-06-05 03:14
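Hong Jun (泓君), founder of 硅谷101 (Silicon Valley 101), invited Kimi Kong, co-founder of Energent.ai, and Shaun Wei, founder of HeyRevia, to talk with these two former Google engineers about the underlying logic behind the Gemini models reaching the top.
Selected excerpts from the conversation follow:
I. The underlying logic behind the rise of Gemini 2.5
Hong Jun: Google's newly released Gemini 2.5 Pro currently posts the best numbers of any large model across benchmarks. Kimi, can you analyze how it got there?
From being "precisely sniped" by OpenAI's 4o model on the eve of its conference last year to Gemini 2.5 Pro sweeping the leaderboards this year: how did Gemini go from chaser to front-runner in just one year?
Kimi: I left DeepMind almost a year ago, so I do not really know what new innovations my former colleagues have made over the past year. But the fundamental steps of training a large language model have not changed. There are three: pre-training, SFT (supervised fine-tuning), and alignment using RLHF (reinforcement learning from human feedback).
Around last year's NeurIPS (Conference on Neural Information Processing Systems), the industry broadly acknowledged that publicly available web data had essentially all been scraped, just like fossil fuels already ...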
The vanishing human agents and the "dim-witted" AI customer service
3 6 Ke· 2025-06-04 10:33
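AI can already write poetry, paint, and drive, but in some settings it reveals its "artificial stupidity" the moment it acts, spoiling the mood and showing what it means that "AI does not share humanity's joys and sorrows."
Recently, as the 618 e-commerce sales festival kicked off, complaints about shopping and logistics followed, with many consumers saying that the mass rollout of AI customer service has made it ever harder to communicate with platforms and merchants. On one hand, AI agents cannot understand plain language and often end up talking past the customer, failing to answer even simple questions. On the other hand, human agents are kept well hidden: getting transferred to one means clearing hurdle after hurdle, and even after reaching the human agent's "door" with great effort, the customer is told to "wait in the queue"... Related topics have repeatedly trended on social media, drawing collective complaints from users.
In recent years, AI customer service has been adopted ever more widely in e-commerce, finance, logistics, education, telecommunications, healthcare, and other industries. At the same time, poor communication, irrelevant answers, and difficulty reaching a human agent have drawn heavy criticism from consumers. When AI customer service turns "dim-witted," it often adds to consumers' frustration instead of relieving it, sets service back, becomes a stumbling block to efficient communication, and damages the overall shopping and service experience.
According to data released by the State Administration for Market Regulation, complaints related to "intelligent customer service" in e-commerce after-sales service rose 56.3% year on year in 2024. iiMedia Research (艾媒咨询), in its 2024 "China Intelligent Customer Service Market Development Status and Consumer Behavior Survey ...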
How does WeChat AI customer service handle inquiries? Where can the records be viewed?
Sou Hu Cai Jing· 2025-06-04 09:36
Group 1
- The core viewpoint of the article emphasizes the importance of WeChat AI customer service as a vital tool for communication between businesses and customers, enhancing customer satisfaction through quick responses and efficient problem handling [1][4].
Group 2
- The process of handling inquiries by WeChat AI customer service is highly automated, utilizing natural language processing to understand customer queries, search for relevant answers in its knowledge base, and escalate complex issues to human agents when necessary [4].
Group 3
- Viewing consultation records is crucial for assessing service quality and efficiency, with records accessible through the ChatWave backend management system, allowing businesses to analyze customer interactions and identify areas for improvement [5].
Group 4
- Strategies for optimizing inquiry handling include regularly updating the knowledge base, utilizing data analysis tools to understand common customer issues, incorporating user feedback for improving dialogue processes, and ensuring seamless transitions between AI and human customer service [6].
Group 5
- ChatWave offers significant advantages in inquiry handling, including strong natural language processing capabilities, support for multi-turn conversations, automation to enhance efficiency, and valuable insights from customer consultation data for product and service optimization [7][9].
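The inquiry flow described above (understand the query, look it up in a knowledge base, escalate to a human when nothing fits) can be sketched roughly as follows. This is a toy illustration under stated assumptions: the knowledge base, the keyword-overlap matching, and every name in it are hypothetical, and it does not reflect the actual WeChat or ChatWave implementation, which the article does not describe.

```python
from dataclasses import dataclass

# Hypothetical knowledge base: keyword sets mapped to canned answers.
KNOWLEDGE_BASE = [
    ({"refund", "return"}, "Refunds are processed within 7 days of approval."),
    ({"shipping", "delivery", "track"}, "You can track your parcel from the order page."),
]


@dataclass
class Reply:
    text: str
    escalated: bool  # True when the query was handed to a human agent


def tokenize(query: str) -> set:
    """Lowercase the query and strip basic punctuation from each word."""
    return {word.strip("?.,!").lower() for word in query.split()}


def handle_inquiry(query: str, min_overlap: int = 1) -> Reply:
    """Answer from the knowledge base, or escalate when no entry matches."""
    words = tokenize(query)
    best_answer, best_score = None, 0
    for keywords, answer in KNOWLEDGE_BASE:
        score = len(words & keywords)  # crude stand-in for NLP understanding
        if score > best_score:
            best_answer, best_score = answer, score
    if best_answer is None or best_score < min_overlap:
        return Reply("Transferring you to a human agent.", escalated=True)
    return Reply(best_answer, escalated=False)


print(handle_inquiry("How do I get a refund?"))
print(handle_inquiry("My invoice has the wrong company name"))
```

The keyword overlap is only a placeholder for the natural language understanding step; the point of the sketch is the explicit escalation branch, which is what keeps unmatched queries from looping inside the bot.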
Tencent applies for patents including one on text processing model training, improving the model's rewriting ability
Jin Rong Jie· 2025-05-28 04:44
Group 1
- Tencent Technology (Shenzhen) Co., Ltd. has applied for a patent related to natural language processing technology, specifically for a text processing model training method and device [1]
- The patent application, published as CN120045650A, was filed in November 2023 and aims to enhance the efficiency and quality of training datasets for text processing models [1]
- The proposed method involves using multiple sample conversation data and preset rewriting instructions to generate annotated rewriting correlation data, which is then used to create a rewriting training set [1]
Group 2
- Tencent Technology (Shenzhen) Co., Ltd. was established in 2000 and is primarily engaged in software and information technology services [2]
- The company has a registered capital of 2 million USD, has invested in 15 enterprises, has participated in 254 bidding projects, and holds 5,000 trademark and patent records [2]
- Additionally, the company holds 439 administrative licenses [2]
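Empowering traditional culture with technology: Dou Shen Animation pioneers a new paradigm for interactive experiences of traditional culture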
Qi Lu Wan Bao Wang· 2025-05-23 16:19
Core Viewpoint
- The development of the "Confucius Digital Human" 2.0 version by Dou Shen Animation represents a significant advancement in digital cultural products, utilizing cutting-edge technologies such as artificial intelligence, 3D modeling, and natural language processing to create an interactive and conversational digital representation of Confucius [1][3].
Group 1: Technology and Innovation
- The "Confucius Digital Human" is not merely a virtual image but a highly intelligent interactive digital cultural carrier, capable of deep interaction and realistic expressions [3].
- The development team employed high-precision 3D modeling technology to accurately recreate Confucius's historical features, enabling the digital figure to speak, nod, blink, and express various emotions [3].
Group 2: Applications and Impact
- The digital product can be widely applied in education, cultural exhibitions, academic research, tourism, and museums, serving as a powerful tool for the integration of digital economy and tourism industries [5].
- The company aims to leverage digital technology to transcend temporal boundaries, making Confucius accessible as a cultural mentor, and plans to continuously upgrade the technology for more refined and professional services [5].
AI special topic: 2025 White Paper on the Development of Artificial Intelligence and Business Intelligence in China
Sou Hu Cai Jing· 2025-05-22 00:55
Core Insights
- The report highlights the limitations of traditional Business Intelligence (BI) systems, which struggle to meet the demands for real-time and dynamic decision-making due to their closed architectures and static processing capabilities [1][21][24]
- The integration of Artificial Intelligence (AI) with BI, termed Artificial Intelligence and Business Intelligence (ABI), is driving a shift from reactive to proactive decision-making, with ABI expected to experience explosive growth in China, reaching a market size of 800 million yuan in 2024 and a CAGR of 42% from 2024 to 2028 [1][11][13]
- Key drivers for ABI growth include deepening enterprise reliance on data, breakthroughs in AI technology, and supportive policies [1][11]
Industry Overview
- ABI leverages technologies such as Natural Language Processing (NLP) and machine learning to enable conversational interactions, multimodal data analysis, and complex reasoning, enhancing decision-making across various sectors including finance, retail, manufacturing, government, and energy [2][3]
- The financial sector utilizes ABI for intelligent risk control and quantitative trading, while retail benefits from dynamic pricing and inventory optimization [2][3]
- Manufacturing employs predictive maintenance and process optimization to reduce downtime, and government sectors enhance service efficiency through smart traffic and urban governance [2][3]
Market Dynamics
- The ABI market in China is projected to grow from 300 million yuan in 2023 to 800 million yuan in 2024, driven by the increasing complexity of decision-making needs and the inadequacies of traditional BI tools [1][11][13]
- ABI's core challenges include data governance lag, algorithm opacity, fragmented scenarios, and high technical costs, with future trends focusing on edge computing, real-time analysis, generative AI penetration, and privacy computing technologies [3][11]
Technological Advancements
- ABI employs advanced techniques such as Text2SQL and Text2DSL to convert natural language into data queries, enhancing the depth of analysis through external knowledge integration and multi-agent collaboration [2][3][30]
- The integration of AI allows for the automation of data processing, significantly improving efficiency and enabling strategic decision-making by providing deeper insights and optimizing resource allocation [40][42]
Future Outlook
- The ABI landscape is evolving towards democratization and intelligence, reshaping the decision-making paradigm driven by data within enterprises [3][11]
- Major global players like Microsoft and Salesforce focus on ecosystem integration, while domestic firms like Alibaba Cloud and Fanruan emphasize lightweight deployment and localized innovation [3][11]
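To make the Text2SQL idea mentioned above concrete, here is a deliberately tiny sketch that turns one narrow question pattern into a parameterized SQL query and runs it with Python's built-in `sqlite3`. Everything specific here is an assumption for illustration: the `sales` table, its data, and the regex grammar are hypothetical, and a rule-based template is swapped in for the model-driven translation that ABI products would presumably use.

```python
import re
import sqlite3

# Hypothetical toy grammar: "total <column> in|for <region>".
PATTERN = re.compile(r"total (\w+) (?:in|for) (\w+)", re.IGNORECASE)
ALLOWED_COLUMNS = {"amount"}  # whitelist so column names are never free text


def build_demo_db() -> sqlite3.Connection:
    """Create an in-memory database with made-up sales rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE sales (region TEXT, amount REAL)")
    conn.executemany(
        "INSERT INTO sales VALUES (?, ?)",
        [("north", 120.0), ("south", 80.0), ("north", 30.0)],
    )
    return conn


def text_to_sql(question: str):
    """Translate a supported question into (sql, params), or raise ValueError."""
    match = PATTERN.search(question)
    if not match or match.group(1).lower() not in ALLOWED_COLUMNS:
        raise ValueError("question not covered by the toy grammar")
    column, region = match.group(1).lower(), match.group(2).lower()
    # The column comes from the whitelist; the region is bound as a parameter.
    return f"SELECT SUM({column}) FROM sales WHERE region = ?", (region,)


def answer(conn: sqlite3.Connection, question: str):
    sql, params = text_to_sql(question)
    return conn.execute(sql, params).fetchone()[0]


conn = build_demo_db()
print(answer(conn, "What is the total amount in north?"))
```

Whatever produces the SQL, keeping user-supplied values as bound parameters and restricting identifiers to a whitelist is a reasonable default, since generated queries run against real data.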
The legendary man who is "always" at center stage of large-model technology
量子位· 2025-05-10 02:39
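Xifeng and Hengyu, reporting from Aofeisi. 量子位 | WeChat official account QbitAI
"How come it's always you???" (怎么老是你, jokingly rendered word for word as "How old are you")
This is the soul-searching question netizens have lately kept putting to Noam Shazeer, one of the eight Transformer authors (for readability, we will call him 沙哥, "Brother Sha").
In particular, after Meta FAIR researcher Zhu Zeyuan (朱泽园) recently shared a series of new results from their "Physics of Language Models" project, netizens noticed that the 3-token causal convolution mentioned there was something Shazeer and colleagues had already studied three years earlier.
Yes, "again."
Because if you go through his career, it is easy to see that his name keeps turning up behind AI breakthroughs large and small.
"Not to start a personality cult, but why is it always Noam Shazeer?"
△ According to netizens, the image of Shazeer in the lower right was generated by GPT-4o
Zhu himself also spoke up to say that Shazeer's work was ahead of its time:
I also think Shazeer might be a time traveler. I originally did not believe in their gated MLP (when writing Part 3.3, because the gated MLP made training unstable), but now I am convinced (after adding Canon layers, we compared MLP and gated MLP in Part 4.1).
Let's get formally acquainted: who is Shazeer?
He is one of the eight Transformer ...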
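For readers unfamiliar with the term used above, a 3-token causal convolution mixes each position with the two positions before it and never looks ahead. The following is a minimal sketch on a plain list of numbers, with made-up weights; it illustrates the general operation only and is not the exact layer from either piece of work mentioned in the article.

```python
def causal_conv3(xs, weights=(0.5, 0.3, 0.2)):
    """y[t] = w0*x[t] + w1*x[t-1] + w2*x[t-2], treating missing history as 0."""
    w0, w1, w2 = weights
    out = []
    for t, x in enumerate(xs):
        prev1 = xs[t - 1] if t >= 1 else 0.0  # one token back
        prev2 = xs[t - 2] if t >= 2 else 0.0  # two tokens back
        out.append(w0 * x + w1 * prev1 + w2 * prev2)
    return out


# An impulse at position 0 spreads forward over three positions only.
print(causal_conv3([1.0, 0.0, 0.0, 0.0]))
```

Because each output depends only on the current and earlier inputs, changing a later token leaves all earlier outputs unchanged, which is the property that makes the operation usable inside an autoregressive language model.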
Haineng Investment Advisory's big data center builds a precision investment decision support system
Sou Hu Cai Jing· 2025-05-08 11:57
Group 1
- The core infrastructure driving investment research upgrades is the financial big data center built by Haineng Investment Advisory, which has invested over 200 million yuan in a distributed computing cluster capable of processing 10PB of financial data daily, providing strong data support for investment decisions [1]
- The "Data Cube" system integrates traditional financial data, alternative data, and satellite remote sensing information, with a proprietary commercial vitality index that analyzes mobile signaling data from 3,800 business districts to predict consumption trends 2-3 quarters in advance, achieving an excess return of 15.2% in the 2023 consumer sector layout [1]
- The natural language processing engine can analyze financial news in 76 languages in real-time, with an accuracy rate of 92.4% for sentiment analysis, and it can structure 300 pages of documents in 30 seconds, improving efficiency by 400 times compared to manual analysis [1]
- The "Factor Factory" platform has accumulated over 1,200 effective alpha factors, achieving an annualized stable return of 21.3% in the A-share market through a multi-factor model optimized by genetic algorithms, notably capturing three major turning points in the new energy sector through the unique "industry chain transmission factor" [1]
Group 2
- The data middle platform of Haineng Investment Advisory adopts a microservices architecture, supporting agile development for business departments, allowing investment managers to build analysis models independently with visual tools, reducing strategy backtesting time from 3 days to 2 hours [2]
- In 2023, the platform produced 187 effective investment strategies, with 63 strategies already implemented in practice and achieving excellent performance [2]
- Future testing of quantum computing applications in portfolio optimization is expected to reduce the solving time for large-scale asset allocation problems from several hours to minutes, marking a revolutionary improvement in investment decision efficiency [2]