DeepSeek
Search documents
OpenAI回滚了最新版本的GPT-4o,因ChatGPT“过于谄媚”
虎嗅APP· 2025-04-30 12:21
本文来自微信公众号: 机器之心 ,作者:杨文、Panda,题图来自:AI生成 昨晚,奥特曼在 X 上发了条帖子,大意是由于发现 GPT-4o "过于谄媚"的问题,所以从周一晚上开始回滚 GPT-4o 的最新更新。 免费 ChatGPT 用户已 100% 回滚,付费用户完成回滚后会再次更新。同时,他还透露,团队正在对模型个性进行额外的修复,并将在未来几天分享更 多信息。 就在刚刚,OpenAI 还专门发博客来回应此事,详细解释了事情的经过以及他们如何处理模型"拍马屁"的情况。 OpenAI 也指出,这个问题很重要。ChatGPT"阿谀奉承"的性格影响了大家对它的信任和使用体验。如果它总是说好听、但不真诚的话,就会让人觉得 它不可靠,甚至有些烦。 为了解决大模型过度逢迎的问题,OpenAI 除了撤销最新的 GPT-4o 更新外,还采取了更多措施: 目前,用户可以通过自定义指令等功能,给模型提供具体指示来塑造其行为。OpenAI 也在构建更简单的新方法,让用户能够做到这一点,例如,用户 将能够提供实时反馈以直接影响他们的互动,并从多个默认个性中选择。 优化核心训练技术与系统提示:明确引导模型避免阿谀奉承。 增加更多 ...
扎克伯格最新专访:AI 会在知识工作和编程领域,引发一场巨大的革命
Sou Hu Cai Jing· 2025-04-30 10:02
Core Insights - Meta's CEO Mark Zuckerberg discussed the competitive landscape of AI development, particularly comparing the Llama 4 model with DeepSeek, asserting that Llama 4 offers higher efficiency and broader functionality despite DeepSeek's advancements in specific areas [1][36]. - Meta AI has reached nearly 1 billion monthly users, indicating significant growth and the importance of personalized AI interactions [2][21]. - The company is focusing on developing coding agents that will automate much of the coding process within the next 12 to 18 months, which is expected to increase the demand for human jobs rather than decrease it [1][16]. Model Development - The Llama 4 series includes models like Scout and Maverick, which are designed for efficiency and low latency, supporting multi-modal capabilities [4][41]. - The upcoming Behemoth model will exceed 2 trillion parameters, representing a significant leap in model size and capability [4]. - Meta is committed to open-sourcing its models after internal use, allowing others to benefit from their developments [4][41]. Competitive Landscape - Zuckerberg believes that open-source models are likely to surpass closed-source models in popularity, reflecting a trend towards more accessible AI technologies [5][36]. - The company acknowledges the impressive infrastructure and text processing capabilities of DeepSeek but emphasizes that Llama 4's multi-modal abilities give it a competitive edge [35][36]. - The licensing model for Llama is designed to facilitate collaboration with large companies while ensuring that Meta retains some control over its intellectual property [37][39]. User Interaction and Experience - Meta is exploring how AI can enhance user interactions, particularly through natural dialogue and personalized experiences [14][28]. - The integration of AI into existing applications like WhatsApp is crucial for user engagement, especially in markets outside the U.S. [21]. - The company is focused on creating AI that can assist users in complex social interactions, enhancing the overall user experience [27][28]. Future Directions - Zuckerberg envisions a future where AI seamlessly integrates into daily life, potentially through devices like smart glasses that facilitate constant interaction with AI [14][31]. - The development of AI will not only focus on productivity but also on entertainment and social engagement, reflecting the diverse applications of AI technology [25][26]. - The company is aware of the challenges in ensuring that AI interactions remain healthy and beneficial for users, emphasizing the importance of understanding user behavior [26][27].
实现商业化落地,人形机器人的核心点是上肢还是下肢?
Robot猎场备忘录· 2025-04-30 07:14
温馨提示 : 点击下方图片,查看运营团队2025年最新原创报告(共210页) 说明: 欢迎约稿、刊例合作、行业人士交流 , 行业交流记得先加入 "机器人头条"知识星球 ,后添加( 微信号:lietou100w ) 微信; 若有侵权、改稿请联系编辑运营(微信:li_sir_2020); 人形机器人要实现真正商业化落地是上肢重要还是下肢重要? 人形机器人真正落地实用场景,任务终结点是手臂和手,而小编注意到涉及手臂相关研究极少,是工业机械臂发 展多年,导致人形机器人机械臂结构和相关算法控制已完全成熟,只需要专注于"小脑"上层层面控制?但是参加 展会时,可明显看到人形机器人手臂运动过程中颤颤巍巍、卡顿、僵硬的现状,所 以这是"小脑"层面控制问题, 还是关节间问题? 目前业内对于灵巧手研究已经很多且备受重视,除了人形机器人本体厂商自研外,也出现了专注于灵巧手和触觉 感知研究的初创公司,也是目前人形机器人发展过程中核心卡点之一。 正文: 具身智能机器人是一个复杂的AI+机器人+自动驾驶的系统性学术+工程问题,远期AGI的物理世界载体,受算力、 软件算法、数据、硬件、工程化等多面因素影响;小编往 期文章 : 【原创】人形机 ...
刚刚!OpenAI回滚了最新版本的GPT-4o,因ChatGPT「过于谄媚」
机器之心· 2025-04-30 04:23
| 机器之心报道 | | --- | | 编辑:杨文、Panda | | 昨晚,奥特曼在 X 上发了条帖子,大意是由于发现 GPT-4o 「过于谄媚」的问题,所以从周一晚上开始回滚 GPT-4o 的最新更新。 | | 免费 ChatGPT 用户已 100% 回滚,付费用户完成回滚后会再次更新。同时,他还透露,团队正在对模型个性进行额外的修复,并将在未来几天分享更多信息。 | 就在刚刚,OpenAI 还专门发博客来回应此事,详细解释了事情的经过以及他们如何处理模型「拍马屁」的情况。 优化核心训练技术与系统提示:明确引导模型避免阿谀奉承。 增加更多限制措施:提升诚实性和透明度,这是模型规范中的重要原则。 扩大用户测试与反馈范围:在部署前让更多用户进行测试并提供直接反馈。 持续扩展评估工作:基于模型规范和持续研究,帮助识别出阿谀奉承之外的其他问题。 OpenAI 也指出,这个问题很重要。ChatGPT「阿谀奉承」的性格影响了大家对它的信任和使用体验。如果它总是说好听、但不真诚的话,就会让人觉得它不可 靠,甚至有些烦。 为了解决大模型过度逢迎的问题,OpenAI 除了撤销最新的 GPT-4o 更新外,还采取了更多措施 ...
Meta's LlamaCon was all about undercutting OpenAI
TechCrunch· 2025-04-30 00:15
Group 1 - Meta held its first AI developer conference, LlamaCon, announcing a consumer-facing AI chatbot app and a developer-facing API for Llama models [1] - The releases aim to expand the adoption of Meta's open Llama AI models, with a primary goal of competing against OpenAI [2][5] - The AI chatbot app features a social feed for sharing AI chats and offers personalized responses based on user activity within Meta apps [3] Group 2 - The Llama API simplifies app development by allowing developers to connect to Llama models with a single line of code, reducing reliance on third-party cloud providers [4] - Meta's strategy includes undercutting proprietary AI model providers like OpenAI, with executives previously focused on surpassing OpenAI's GPT-4 [5] - Meta views any AI lab that makes its models openly available as allies against closed model providers, emphasizing the value of open-source models [6][7] Group 3 - Meta's approach may also be influenced by regulatory considerations, as the EU AI Act provides advantages to companies distributing "free and open source" AI systems [7] - The company appears willing to launch AI products that bolster the open model ecosystem, even if it means not delivering the most advanced models [8]
Meta needs to win over AI developers at its first LlamaCon
TechCrunch· 2025-04-29 15:20
Core Insights - Meta is hosting its first LlamaCon AI developer conference to promote its open Llama AI models, which comes at a crucial time as the company faces stiff competition from both open AI labs and closed commercial entities [1][2] - The launch of Llama 4 has not met developer expectations, with benchmark scores falling short compared to competitors like DeepSeek [3][4] - Meta's previous Llama 3.1 model was well-received, being touted as a leading open foundation model, but the reception of Llama 4 has been markedly different [4][5] Performance and Reception - Llama 4's launch was controversial, with issues surrounding the performance of the Llama 4 Maverick model, which was optimized for conversationality but did not perform as well in broader releases [6][7] - The lack of a reasoning model in the Llama 4 family has raised concerns, especially as competitors have released such models that perform better on specific benchmarks [8][9] - The absence of a reasoning model suggests that Meta may have rushed the launch of Llama 4, which could impact its competitive standing [9][10] Competitive Landscape - Rival companies, such as Alibaba, are releasing models that reportedly outperform some of the best models from OpenAI and Google, increasing pressure on Meta to innovate [10] - To regain its lead in the open model space, Meta needs to deliver superior models, which may require taking more risks in its development approach [11] - The current state of Meta's AI research lab has been described as struggling, with leadership changes indicating potential instability [11][12]
全网都在等梁文锋
凤凰网财经· 2025-04-29 12:39
以下文章来源于凤凰网科技 ,作者凤凰网科技 凤凰网科技 . 凤凰科技频道官方账号,带你直击真相。 来源|凤凰网科技 作者|姜凡 编辑|董雨晴 五月将至,中美科技巨头或将迎来新一轮巅峰对决。 先是在4月中旬,OpenAI一口气发布了GPT-4.1 o3、o4 mini系列模型;谷歌则拿出了Gemini 2.5 Flash Preview,一个混合推理模型;与谷歌同 一天,豆包在杭州巡展中正式发布了1.5·深度思考模型,在多模态上展现出了更强的实力。凤凰网科技从行业人士处了解到,阿里的下一代大模型 Qwen3也将于本月内发布。 混战之下,那股"神秘的东方力量"似乎也在悄悄准备着新的发布。 敏感的神经之下,一点蛛丝马迹都会被放大。 昨日,全球最大AI开源社区Hugging Face首席执行官Clément Delangue在社交平台发布了一条耐人 寻味的动态。这条动态仅由三个眼睛的表情符号构成,并附上了DeepSeek团队在Hugging Face平台的官方资源库入口。 这组充满悬念的组合引发科技圈热议,业内普遍推测DeepSeek R2模型已进入发布倒计时。 01 DeepSeek R2发布已进入倒计时? 近半个 ...
混沌李善友:每一个创业者,都是普罗米修斯
混沌学园· 2025-04-29 08:59
三天课程,从技术圣殿到人性深处,从商业实战到文明思辨,勾勒出一幅 "创业者与 AI 共生" 的壮阔图景。 开篇: AI 黎明下的创业者使命 2025 年 4 月 25 日,在杭州大会展中心的穹顶之下, 3500 余位企业创始人、 CEO 们放下手中的财报与 PPT ,屏息凝视着舞台中央。 李善友教授的身影在聚光灯下升起,背后巨幕上 " AI 的黎明" 五个大字缓缓展开 —— 这是 " 2025 李善友开年大课暨混沌 AI 创新院开学典礼" 的首日现场,一场注定写入中国商业史的思想风暴,正以创业者为圆心,向整个商业世界辐射能 量。 "我们正站在历史的转折点上,但大多数人尚未察觉。" 李善友教授的开场白如重锤敲醒混沌。 当 OpenAI 掀起的技术革命已演变为全球军备竞赛,当 DeepSeek 的逆袭让世界重新审视中国 AI 的可能性,这位混沌创办人抛出终极之问: 在算力与数 据构筑的新战场,创业者如何找到不被 AI 吞噬的生存法则? 李善友的十年伏笔 从认知到使命的进化 "这场准备,长则十年,短则 18 个月。" 李善友的声音低沉却有力。 回溯混沌的 AI 求索之路,早在移动互联网红利见顶时,团队就已开始追问: ...
通义千问 Qwen3 发布,对话阿里周靖人
晚点LatePost· 2025-04-29 08:43
以下文章来源于晚点对话 ,作者程曼祺 晚点对话 . 最一手的商业访谈,最真实的企业家思考。 阿里云 CTO、通义实验室负责人 周靖人 "大模型已经从早期阶段的初期,进入早期阶段的中期,不可能只在单点能力上改进了。" Qwen3 旗舰模型,MoE(混合专家模型)模型 Qwen3-235B-A22B,以 2350 亿总参数、220 亿激活参数,在 多项主要 Benchmark(测评指标)上超越了 6710 亿总参数、370 亿激活参数的 DeepSeek-R1 满血版。更小 的 MoE 模型 Qwen3-30B-A3B,使用时的激活参数仅为 30 亿,不到之前 Qwen 系列纯推理稠密模型 QwQ- 32B 的 1/10,但效果更优。更小参数、更好性能,意味着开发者可以用更低部署和使用成本,得到更好效 果。图片来自通义千问官方博客。 (注:MoE 模型每次使用时只会激活部分参数,使用效率更高,所以有 总参数、激活参数两个参数指标。) Qwen3 发布前,我们访谈了阿里大模型研发一号位,阿里云 CTO 和通义实验室负责人,周靖人。他 也是阿里开源大模型的主要决策者。 迄今为止,Qwen 系列大模型已被累计下载 3 ...
阿里开源首个“混合推理模型”:集成“快思考”、“慢思考”能力
Xin Lang Cai Jing· 2025-04-29 06:28
Core Insights - Alibaba has open-sourced its new generation model Qwen3, which integrates "fast thinking" and "slow thinking" capabilities, significantly reducing deployment costs compared to other large models like Deepseek [1] - The Qwen3 model employs a "Mixture of Experts (MoE)" architecture, allowing it to mimic human problem-solving by providing multi-step deep thinking for complex issues and quick responses for simpler queries, thus saving computational resources [3] - Alibaba is focusing on building its AI strategy around the Qwen series, with plans to invest over 380 billion RMB in cloud and AI hardware infrastructure over the next three years, surpassing the total investment of the past decade [4] Industry Context - Following the release of Deepseek's low-cost high-performance R1 model, domestic tech companies in China, including Baidu and iFlytek, are rapidly launching a series of cost-effective AI model services [3] - Alibaba's Qwen series has surpassed the US Llama in terms of open-source model downloads, with over 300 million downloads and more than 100,000 derivative models [4] - On the same day Alibaba announced Qwen3, OpenAI released several updates to ChatGPT, enhancing its shopping features and optimizing for various consumer categories, indicating a competitive landscape in AI model development [4]