AI前线

Search documents
登顶 Arena!MiniMax 最新 Speech-02 模型屠榜:超越OpenAI、ElevenLabs,人声相似度99%
AI前线· 2025-05-15 06:45
作者 | 凌敏 没有什么比"群星闪耀"更适合形容近期的 TTS(Text-To-Speech,文本转语音)模型领域了。 开年以来,从科技巨头到创业公司再到研究机构,都在发力 TTS 模型。2 月,字节跳动海外实验 室推出一款轻量级 TTS 模型 MegaTTS3-Global;3 月,出门问问联合香港科技大学、上海交通 大学、南洋理工大学、西北工业大学等顶尖学术机构,共同开源新一代语音生成模型 Spark- TTS;同月,OpenAI 推出基于 GPT-4o-mini 架构的 TTS 模型。 与 AI 领域其他热门技术相比,TTS 似乎格外低调,但它却是智能硬件、数字人等场景的"隐形基 石"。凭借广泛的应用领域和开阔的商业前景,TTS 在最近一年取得了长足的进步,并悄然改变 着行业规则。 最近,TTS 模型又有重磅"上新",Speech-02语音模型一出手,就将 OpenAI、ElevenLabs 甩在 了后面,登顶 Arena 榜单,成为全球第一。 | Creator | | Model | Arena ELO | 95% CI | # Appearances | | --- | --- | --- | - ...
不再“纸上谈兵”:大模型能力如何转化为实际业务价值
AI前线· 2025-05-15 06:45
作者 | AICon 全球人工智能开发与应用大会 策划 | 李忠良 编辑 | 宇琪 随着技术的快速发展,大模型在各行业的应用潜力日益凸显,但如何将大模型能力高效转化为实际业 务价值,仍是企业面临的核心挑战。 近日 InfoQ《极客有约》X AICon 直播栏目特别邀请了 华为云 AI 应用首席架构师郑岩 担任主持人, 和 蚂蚁集团高级技术专家杨浩、明略科技高级技术总监吴昊宇 一起,在 AICon 全球人工智能开发 与应用大会 2025 上海站 即将召开之际,共同探讨大模型如何驱动业务提效。 部分精彩观点如下: 在 5 月 23-24 日将于上海举办的 AICon 全球人工智能开发与应用大会 上,我们特别设置了 【大模型 助力业务提效实践】 专题。该专题将围绕模型选型与优化、应用场景落地及效果评估等关键环节,分 享行业领先企业的实战经验。 查看大会日程解锁更多精彩内容: https://aicon.infoq.cn/2025/shanghai/schedule 以下内容基于直播速记整理,经 InfoQ 删减。 场景探索 郑岩:在探索大模型应用场景时,企业常会遇到"看起来很美但落地难"的需求,各位在实际项目中是 ...
AI 开发:从 Demo 到上线有多远?| 直播预告
AI前线· 2025-05-15 06:45
直播亮点 5 月 15 日 20:00-21:30 直播主题 AI 开发:从 Demo 到上线有多远? 直播嘉宾 主持人 : 孙志岗 AI 师傅创始人 /CEO 嘉宾 : 从 0 到 1,AI 创业者的实战经验分享。多位一线 AI 创业者现身说法,亲述从想法到落地的真实经 历。当 AI 不只是工具,而是时代机会,你准备好抓住了吗?扫码预约,不见不散! 直播介绍 直播时间 艾逗笔(idoubi) ThinkAny & MCP.so/ 创始人 莫欣 Agently.tech / 创始人 何少甫 AI 师傅 / 联合创始人、首席产品官 扫描下图海报 【二维码】 ,或戳直播预约按钮,预约 InfoQ 视频号直播。 如何向讲师提问? 文末留言写下问题,讲师会在直播中为你解答。 多视角剖析 AI 开发:从独立产品到系统架构,再到产研协作 AI 落地到底卡在哪,怎么从 Demo 走到真正上线 聊工具也聊认知,聊实践也聊误区 如何看直播? ...
微软再次裁员:18 年老员工、10 倍 TypeScript 性能提升幕后功臣也一并优化了
AI前线· 2025-05-14 10:19
Core Viewpoint - Microsoft is set to lay off 3% of its global workforce, affecting over 6,500 employees, as part of a strategic shift to optimize resources and increase investment in emerging artificial intelligence platforms [1][2]. Group 1: Layoff Details - The layoffs are part of a significant strategic adjustment, aimed at streamlining operations and enhancing profitability to support AI-focused initiatives [1]. - This round of layoffs is the largest since 10,000 employees were let go earlier in 2023, impacting all levels, regions, and teams within the company [2]. - Employees affected by performance-related layoffs will face a two-year rehire ban, and a new "good attrition" metric has been introduced to track employee departures [3]. Group 2: Financial Performance - Despite the layoffs, Microsoft reported strong quarterly earnings with revenue of $70.1 billion, a 13% year-over-year increase, and a net profit of $25.8 billion, up 18% [2]. Group 3: AI Investment Focus - Microsoft is heavily investing in AI, integrating AI capabilities into its core products like Microsoft 365, Azure, and Dynamics 365 to attract more enterprise customers [1]. - The company’s CEO, Satya Nadella, highlighted that a significant portion of the code in their codebase is now generated by software, indicating a shift towards AI-driven development [1]. Group 4: Impact on Key Projects - Key personnel involved in significant projects, such as the TypeScript performance enhancement initiative, have also been laid off, raising questions about the decision-making process behind the layoffs [5][13]. - The TypeScript performance optimization project aims for up to a 10x performance improvement and is still in progress, despite the departure of core team members [9][12].
微软华人AI团队核心成员被曝加入腾讯混元,知情人称与裁员无关|独家
AI前线· 2025-05-14 08:12
作者|冬梅、核子可乐 近 日 , 据 外 媒 消 息 , WizardLM 团 队 核 心 成 员 Can Xu 已 经 离 开 微 软 , 加 入 了 腾 讯 混 元 (Hunyuan)事业部。 Can Xu此前也 在 X 发帖表示 ," 我和 WizardLM 团队 离开微软加入了腾讯混元 ( I and WizardLM team has left Microsoft and joined Tencent Hunyuan! )" 。但此贴目前已被删 除,Can Xu 解释称此前信息有误,是他个人离开了微软,而非整个团队。 据知情人士向 AI 前线独家透露,WizardLM 团队的主力成员大部 分已经离开微软。 有趣的是,AI 前线留意到,Can Xu 宣布离开的时间点比较微妙,因为正值微软被曝出将在全球 范围内裁员 3%,也就是大约 6000 多人会受到影响,外界猜测Can Xu的离开或与微软裁员有 关。 但据知情人士透露,WizardLM 团队核心人物 Can Xu 和 Qingfeng Sun 早已离开微软,只是近 日才将消息公布出来,他们的离开与微软裁员无关。 知情人士还透露,团队也会采用远程办公 ...
RAG系统设计:揭秘语义搜索被低估的核心价值与KG驱动的架构选型策略
AI前线· 2025-05-14 05:47
分享嘉宾 | 尹一峰 审校 | 李忠良 策划 | AICon 全球人工智能与开发大会 RAG 要不要做语义检索,有很多讨论,还没有定论。在 InfoQ 举办的 AICon 全球人工智能与开发大会上 Hugging FaceMachine Learning Engineer 尹 一峰为我们带来了精彩专题演讲"RAG 基本范式的选择与系统设计",深入探讨基于语义搜索(Semantic Search)的 RAG 系统的重要性,揭示它为何在 当前技术背景下被严重低估,分析语义搜索的本质及其在 RAG 系统中的关键作用,并分享如何基于这一本质设计出高效的系统架构。 此外,演讲还将讨论 KG 驱动的 RAG 系统,并指出它并非适用于所有数据类型,帮助听众理解如何根据不同的数据特性选择最合适的 RAG 范式。 内容亮点: 以下是演讲实录(经 InfoQ 进行不改变原意的编辑整理)。 RAG 简介 我们需要了解为什么需要 RAG(Retrieval-Augmented Generation,检索增强生成)。原因很简单,因为 LLM 本身存在一些问题。RAG 作为一种辅助 工具,其存在是因为 LLM 本身有不足之处。 LLM ...
微软这支神秘的华人AI团队加入腾讯混元,曝与裁员无关|独家
AI前线· 2025-05-14 05:47
Core Viewpoint - The WizardLM team, creators of advanced large language models, has transitioned from Microsoft to Tencent's AI development organization, Hunyuan, aiming to enhance LLM training technology and develop superior AI models [1][3][31]. Group 1: Team Transition and Background - The WizardLM team, consisting of six key members, has left Microsoft amid speculation regarding layoffs affecting 3% of the workforce, although their departure is reportedly unrelated to these layoffs [4][6]. - The team was established in early 2023, focusing on the development of advanced large language models, with notable members including Qingfeng Sun and Can Xu, both of whom have significant experience in AI research [7][9][10]. - The team has previously contributed to the development of models such as WizardLM, WizardCoder, and WizardMath, and has published over 40 papers in top international conferences [10][13]. Group 2: Model Development and Achievements - WizardLM has released models that outperform Google's Gemma 3 series and have ranked among the top four global large language models in competitions [3][16]. - The core algorithm, Evol-Instruct, allows for the efficient generation of complex instruction data, leading to superior performance in human evaluations compared to traditional methods [13][14][17]. - The WizardLM-30B model achieved a 97.8% score compared to ChatGPT in specific tests, showcasing its advanced capabilities [14]. Group 3: Tencent's AI Strategy - Tencent has restructured its AI development framework, focusing on "computing power, algorithms, and data," and plans to invest approximately 124.9 billion USD in AI development [28][30]. - The company has established new technical departments dedicated to large language models and multimodal models, aiming to enhance AI capabilities in natural language processing and data integration [28][29]. - Following the acquisition of the WizardLM team, Tencent's ambition in the AI sector is expected to grow, with the team continuing to develop and release AI models [31].
氛围编程成新晋顶流,腾讯也出手了!代码助手 CodeBuddy 重磅升级,网友实测:真香
AI前线· 2025-05-13 06:35
Core Viewpoint - Vibe Coding has emerged as a significant trend in Silicon Valley, emphasizing a shift from traditional coding to describing requirements in natural language, allowing AI to generate code automatically [1][2][3]. Group 1: Concept and Evolution - The concept of Vibe Coding was introduced by Andrej Karpathy, highlighting a process where developers interact with AI to create applications without needing to write code themselves [1][2]. - Vibe Coding allows individuals without technical backgrounds to participate in programming, making the idea of "everyone is a programmer" more attainable [1][4]. - The capabilities of large models have evolved, enabling them to accurately understand user needs and generate runnable projects, marking a shift from code completion to comprehensive project development [2][4]. Group 2: Tools and Applications - Various tools have emerged in the Vibe Coding space, including Cursor, GitHub Copilot, and CodeBuddy, which is developed by Tencent [5][6]. - CodeBuddy's Craft mode can autonomously generate and modify multi-file code, enhancing the development process by allowing developers to focus on user experience rather than coding details [6][9]. - CodeBuddy has been widely adopted within Tencent, with 85% of developers using it, resulting in an average coding time reduction of over 40% and a productivity increase of 16% [20]. Group 3: Challenges and Future Outlook - Despite the advantages, challenges such as code quality and maintainability persist, with increasing code change rates leading to potential issues in code structure and readability [16][17]. - The rise of AI-generated code has led to a significant increase in code change rates, projected to be double that of pre-AI levels by 2024 [16][17]. - The future of Vibe Coding looks promising, with a growing number of startups indicating that a substantial portion of their code is AI-generated, suggesting a potential mainstream adoption of this approach in software development [21].
从“铁三角”到“六有”组织,北银金科如何打造千人高密度数智化团队?| 极客时间企业版
AI前线· 2025-05-13 06:35
当前,银行业正经历一场由数字化、智能化技术驱动的深刻变革。在这一过程中,金融科技子公 司作为技术创新的引擎,通过推动技术落地、业务重构和人才培养,正在重塑银行业的服务模式 与竞争格局。 在极客时间主办的 2025 DTDS 全球数智人才发展大会上,北 银金科党委书记,副总经理赵瑞兰 发表了题为《北银金科数智化时代的人才培养与组织变革共生之道》的演讲,介绍了北银金科过 去几年来在数智化转型过程中的实践经验以及人力资源部门与管理层战略对齐的思路。以下内容 由极客时间基于本次演讲内容进行编辑。 北银金科企业介绍 北银金科作为北京银行的科技子公司,成立于 2019 年 5 月 16 日。它是在银行业数字化转型的 大背景下诞生的。作为效率革新与成本控制的双重引擎,科技发挥重要作用,在前期投入阶段有 着较强的规模效应,需要更多的人员投入与资源积累,北银金科也是由此而生。 我们以北京银行"数字京行"的战略为指引,定下了"服务母行、做精产品、做优技术、做强实力"的 发展目标,以科技创新激发高质量发展新动能,为北京银行数字化转型提供坚实支撑。 北银金科的职能 定位分为三个方面,分别是以客户为中心的"一个原则",助推北京银行数字 ...
客户不转化、内容不合规?AI 与 Agent 如何破解金融营销五大难题
AI前线· 2025-05-13 06:35
Core Insights - The article emphasizes that AI and Agents are no longer optional but essential for transforming customer insights, decision-making efficiency, and service experience in financial marketing [1][3][5] - It highlights the evolution of financial marketing from traditional methods to the current intelligent 3.0 era, where AI technologies are the driving force behind marketing transformation [3][4][15] Industry Evolution - Financial marketing has evolved from a traditional 1.0 era reliant on physical branches and customer managers to a digital 2.0 era with CRM and online channels, but issues like data silos and fragmented experiences persist [3][4] - The current shift to intelligent 3.0 is characterized by the integration of AI technologies, which provide unprecedented customer insights and enhance decision-making processes [3][4][5] AI Value Proposition - AI offers unparalleled customer insights by analyzing both structured and unstructured data, enabling the identification of hidden customer needs [3][4] - It facilitates real-time and precise decision-making by integrating various data points to generate optimal marketing strategies tailored to individual customers [4][5] - AI-driven solutions improve service execution through automation, allowing for consistent and efficient customer interactions [5][11] Current Challenges in Financial Marketing - High customer acquisition costs and low conversion rates are significant challenges, with customer acquisition costs (CAC) often exceeding thousands [6][7] - Personalization remains a challenge, as many financial institutions struggle to provide truly individualized experiences [7][8] - Complex product offerings lead to customer confusion, making it difficult for them to make informed purchasing decisions [7][8] - Regulatory compliance poses challenges for innovation, requiring a balance between risk management and marketing efficiency [8][9] AI and Agent Solutions - The article proposes the creation of a robust "intelligent marketing platform" that integrates data, AI algorithms, and service applications to enhance marketing effectiveness [11][12] - Key technological advancements include large language models (LLM), knowledge graphs, and privacy-preserving computing, which collectively enhance AI's capabilities in financial marketing [12][13] Future Outlook - The future of financial marketing will focus on "intelligent density," where the effective use of AI technologies will create competitive advantages in understanding customers and optimizing experiences [15][16] - The industry is encouraged to embrace AI-driven transformations to secure long-term competitive positioning in the evolving market landscape [16]