WeDLM
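Told by an employee he must be "high," Dreame's CEO replies: I can take it; AI companion chat app convicted over obscene content, with 24,000 paying users; Musk and Altman clash again | AI Weekly
AI前线 · 2026-01-18 05:32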
Group 1: AI-related Legal Issues
- China's first criminal case involving AI-related obscenity went to trial. The defendants were charged over chat services provided through the AlienChat software, which had 116,000 users, including 24,000 paying members, and generated over 3 million yuan in revenue [3][4].
- Of 12,495 chat segments sampled from paying users, the court found 3,618 to be obscene, and the founders were convicted [4].

Group 2: Corporate Developments in Technology
- Yu Hao, CEO of Dreame Technology, which has set out to become the world's first trillion-dollar company, said he does not expect to reach that target within a year. The ambitious strategic goals have drawn internal criticism from employees [5][6][7].
- Ctrip is under investigation for alleged monopolistic practices and has confirmed it will cooperate with regulators [10][11].
- The "Dead or Not" app, previously renamed "Demumu," is seeking a new brand name after feedback that the original name was inauspicious [12].

Group 3: Semiconductor and Tariff Changes
- The U.S. government announced a 25% tariff on certain imported semiconductors and related products, effective January 15, 2026, as part of ongoing trade policy adjustments [14][15].

Group 4: Talent Movements in AI
- Chen Lijie, a notable alumnus of Tsinghua University's Yao Class, has joined OpenAI to work on mathematical reasoning. Several former OpenAI executives have also returned [16][18].

Group 5: Legal Actions and Financial Claims
- Elon Musk is suing OpenAI and Microsoft for up to $134 billion, claiming that OpenAI deviated from its non-profit mission and misled him about its financial dealings [19][20].
- OpenAI has characterized the lawsuit as part of a pattern of harassment rather than a legitimate economic claim [20].

Group 6: AI Infrastructure and Innovations
- Elon Musk announced that the "Colossus 2" supercomputer, built to support the Grok AI chatbot, is operational, with further upgrades planned [24][25].
- Meta is launching an infrastructure initiative called "Meta Compute" to expand its AI capabilities, while planning to cut about 10% of jobs in its Reality Labs division [26][27].

Group 7: New AI Models and Technologies
- Baichuan Intelligence released a medical AI model, Baichuan-M3, which outperformed GPT-5.2 in various assessments and showed advanced diagnostic capabilities [39].
- Tencent's WeDLM, a diffusion language model, targets faster inference than comparable autoregressive models [35].
WeChat builds a diffusion language model: 3x speedup over vLLM-served AR models, more than 10x in low-entropy scenarios
机器之心 · 2026-01-03 04:13
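The crux of the problem: most diffusion language models use bidirectional attention, which is incompatible with the standard KV cache mechanism, so the advantage of parallel prediction does not translate into real speedups. Recently, the Tencent WeChat AI team proposed WeDLM (WeChat Diffusion Language Model), the first diffusion language model whose inference speed exceeds that of a comparable AR model under an optimized, industrial-grade inference engine (vLLM).

The Tencent WeChat AI team proposes WeDLM (WeChat Diffusion Language Model). By performing diffusion-style decoding under standard causal attention, it achieves a speedup of more than 3x over vLLM-served AR models on tasks such as mathematical reasoning, and more than 10x in low-entropy scenarios, while maintaining or even improving generation quality.

Introduction
Autoregressive (AR) generation is the dominant decoding paradigm for today's large language models, but generating one token at a time limits inference efficiency. Diffusion language models (Diffusion LLMs) offer an alternative by recovering multiple mask tokens in parallel. In practice, however, existing diffusion models often struggle to outpace highly optimized AR inference engines such as vLLM.

Paper title: WeDLM: Reconciling Diffusion Language Models ...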
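
The excerpt's point about KV caching can be illustrated with a toy example. The sketch below is not WeDLM's method or any vLLM API; it is a minimal single-head attention in plain Python in which each token vector serves as its own query, key, and value (an assumption made to keep the example short). The function names `attention` and `prefix_is_stable` are hypothetical. It checks whether the outputs for a two-token prefix stay the same when only a later token changes, which is the property a KV cache relies on.

```python
import math

def attention(queries, keys, values, causal):
    """Toy single-head scaled dot-product attention over lists of float vectors."""
    dim = len(queries[0])
    outputs = []
    for i, q in enumerate(queries):
        # Under a causal mask, position i may only attend to positions 0..i.
        limit = i + 1 if causal else len(keys)
        scores = [
            sum(a * b for a, b in zip(q, keys[j])) / math.sqrt(dim)
            for j in range(limit)
        ]
        peak = max(scores)  # subtract the max for a numerically stable softmax
        weights = [math.exp(s - peak) for s in scores]
        total = sum(weights)
        outputs.append([
            sum(w * values[j][d] for j, w in enumerate(weights)) / total
            for d in range(dim)
        ])
    return outputs

def prefix_is_stable(causal):
    """Return True if the first two outputs are unaffected by a change at position 2."""
    # Illustrative assumption: each token vector acts as its own query, key, and value.
    seq_a = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
    seq_b = [[1.0, 0.0], [0.0, 1.0], [-2.0, 3.0]]  # only the last position differs
    out_a = attention(seq_a, seq_a, seq_a, causal)
    out_b = attention(seq_b, seq_b, seq_b, causal)
    return all(
        math.isclose(x, y, abs_tol=1e-12)
        for row_a, row_b in zip(out_a[:2], out_b[:2])
        for x, y in zip(row_a, row_b)
    )

causal_stable = prefix_is_stable(causal=True)
bidirectional_stable = prefix_is_stable(causal=False)
print(causal_stable, bidirectional_stable)  # prints: True False
```

With the causal mask, earlier positions never see later ones, so their results can be computed once and reused. With bidirectional attention, changing a later token changes the outputs at earlier positions, so cached results for the prefix would be stale. This is consistent with the excerpt's statement that WeDLM performs diffusion-style decoding under standard causal attention.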