DeepSeek

Just now: OpenAI rolls back the latest version of GPT-4o because ChatGPT was "too sycophantic"
机器之心· 2025-04-30 04:23
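Reported by 机器之心. Editors: Yang Wen, Panda. Last night, Altman posted on X, saying in essence that because GPT-4o had been found to be "too sycophantic", the latest GPT-4o update has been rolled back starting Monday night. The rollback is 100% complete for free ChatGPT users, and paid users will receive another update once their rollback is finished. He also said the team is working on additional fixes to the model's personality and will share more in the coming days. Just now, OpenAI also published a blog post responding to the matter, explaining in detail what happened and how it is handling the model's flattery. Refine core training techniques and system prompts: explicitly steer the model away from sycophancy. Add more guardrails: increase honesty and transparency, which are important principles in the Model Spec. Expand user testing and feedback: let more users test and give direct feedback before deployment. Keep expanding evaluations: build on the Model Spec and ongoing research to help identify issues beyond sycophancy. OpenAI also noted that this problem matters. ChatGPT's sycophantic personality affected people's trust in it and their experience of using it. If it always says things that sound nice but are insincere, people will find it unreliable and even somewhat annoying. To address excessive sycophancy in large models, OpenAI has taken further measures beyond rolling back the latest GPT-4o update ...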
Meta's LlamaCon was all about undercutting OpenAI
TechCrunch· 2025-04-30 00:15
Group 1 - Meta held its first AI developer conference, LlamaCon, announcing a consumer-facing AI chatbot app and a developer-facing API for Llama models [1] - The releases aim to expand the adoption of Meta's open Llama AI models, with a primary goal of competing against OpenAI [2][5] - The AI chatbot app features a social feed for sharing AI chats and offers personalized responses based on user activity within Meta apps [3] Group 2 - The Llama API simplifies app development by allowing developers to connect to Llama models with a single line of code, reducing reliance on third-party cloud providers [4] - Meta's strategy includes undercutting proprietary AI model providers like OpenAI, with executives previously focused on surpassing OpenAI's GPT-4 [5] - Meta views any AI lab that makes its models openly available as allies against closed model providers, emphasizing the value of open-source models [6][7] Group 3 - Meta's approach may also be influenced by regulatory considerations, as the EU AI Act provides advantages to companies distributing "free and open source" AI systems [7] - The company appears willing to launch AI products that bolster the open model ecosystem, even if it means not delivering the most advanced models [8]
Meta needs to win over AI developers at its first LlamaCon
TechCrunch· 2025-04-29 15:20
On Tuesday, Meta is hosting its first-ever LlamaCon AI developer conference at its Menlo Park headquarters, where the company will try to pitch developers on building applications with its open Llama AI models. Just a year ago, that wasn’t a hard sell. However, in recent months, Meta has struggled to keep up with both “open” AI labs like DeepSeek and closed commercial competitors such as OpenAI in the rapidly evolving AI race. LlamaCon comes at a critical moment for Meta in its quest to build a sprawling Lla ...
The whole internet is waiting for Liang Wenfeng
凤凰网财经· 2025-04-29 12:39
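Source: 凤凰网科技. Author: Jiang Fan. Editor: Dong Yuqing. With May approaching, Chinese and US tech giants may be heading into a new round of head-to-head competition. First, in mid-April, OpenAI released the GPT-4.1, o3, and o4-mini models in quick succession; Google brought out Gemini 2.5 Flash Preview, a hybrid reasoning model; and on the same day as Google, Doubao officially released its 1.5 Deep Thinking model at a roadshow stop in Hangzhou, showing stronger multimodal capability. 凤凰网科技 has learned from industry insiders that Alibaba's next-generation large model, Qwen3, will also be released within the month. Amid the melee, that "mysterious power from the East" also seems to be quietly preparing a new release. With nerves this sensitive, the smallest clue gets magnified. Yesterday, Clément Delangue, CEO of Hugging Face, the world's largest open-source AI community, posted an intriguing update on social media. The post consisted of only three eyes emoji, plus a link to the DeepSeek team's official repository on Hugging Face. This suspenseful combination set off heated discussion in tech circles, and the industry widely speculates that the DeepSeek R2 model has entered its release countdown. 01 Has the countdown to DeepSeek R2 begun? For nearly half a ...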
Hundun's Li Shanyou: Every entrepreneur is a Prometheus
混沌学园· 2025-04-29 08:59
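Three days of lectures, ranging from the temple of technology to the depths of human nature and from business practice to reflections on civilization, sketched a sweeping picture of "entrepreneurs coexisting with AI". Opening: the entrepreneur's mission at the dawn of AI. On April 25, 2025, under the dome of the Hangzhou Grand Convention and Exhibition Center, more than 3,500 company founders and CEOs put down their financial reports and slide decks and watched the center of the stage with bated breath. Professor Li Shanyou rose into the spotlight, and on the giant screen behind him the words "The Dawn of AI" slowly unfurled. This was the first day of the "2025 Li Shanyou New Year Lecture and Hundun AI Innovation Institute Opening Ceremony", an intellectual storm destined to be written into China's business history, radiating energy from entrepreneurs at its center out to the whole business world. "We are standing at a turning point in history, but most people have not yet noticed." Li's opening line landed like a hammer blow. With the technological revolution set off by OpenAI now a global arms race, and with DeepSeek's comeback prompting the world to reassess what Chinese AI can do, the Hundun founder posed the ultimate question: on a new battlefield built from compute and data, how can entrepreneurs find rules of survival that keep them from being swallowed by AI? Li Shanyou's ten years of groundwork: the evolution from cognition to mission. "This preparation took ten years at the long end and 18 months at the short end." Li's voice was low but forceful. Looking back on Hundun's pursuit of AI, as early as the point when the mobile-internet dividend peaked, the team had already begun asking: ...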
Tongyi Qianwen Qwen3 released: a conversation with Alibaba's Zhou Jingren
晚点LatePost· 2025-04-29 08:43
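Source: 晚点对话. Author: Cheng Manqi. Zhou Jingren, CTO of Alibaba Cloud and head of Tongyi Lab: "Large models have moved from the beginning of the early stage into the middle of the early stage, and it is no longer possible to improve on single-point capabilities alone." The Qwen3 flagship, the MoE (mixture-of-experts) model Qwen3-235B-A22B, has 235 billion total parameters and 22 billion active parameters, and it surpasses the full-size DeepSeek-R1 (671 billion total parameters, 37 billion active) on several major benchmarks. The smaller MoE model Qwen3-30B-A3B activates only 3 billion parameters in use, less than one tenth of QwQ-32B, the earlier dense pure-reasoning model in the Qwen series, yet performs better. Fewer parameters with better performance mean developers can get better results at lower deployment and usage cost. Image from the official Tongyi Qianwen blog. (Note: an MoE model activates only part of its parameters on each use, which makes it more efficient to run, so it is described by two figures: total parameters and active parameters.) Before the Qwen3 release, we interviewed Zhou Jingren, who leads large-model R&D at Alibaba as CTO of Alibaba Cloud and head of Tongyi Lab. He is also the main decision-maker behind Alibaba's open-source large models. To date, the Qwen series of large models has been downloaded a cumulative 3 ...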
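The total-versus-active parameter figures quoted above can be compared with a little arithmetic. The sketch below is only an illustration: the `MoEModel` class and its field names are hypothetical, the parameter counts are the ones stated in the summary, and treating QwQ-32B as a dense model with roughly 32 billion parameters is an assumption read off its name.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MoEModel:
    """Hypothetical record of a model's size, in billions of parameters."""
    name: str
    total_b: float   # total parameters
    active_b: float  # parameters activated on each use

    @property
    def active_fraction(self) -> float:
        # Share of the model's parameters that take part in one use.
        return self.active_b / self.total_b


# Figures as quoted in the summary above.
QWEN3_FLAGSHIP = MoEModel("Qwen3-235B-A22B", total_b=235, active_b=22)
DEEPSEEK_R1 = MoEModel("DeepSeek-R1", total_b=671, active_b=37)
QWEN3_SMALL = MoEModel("Qwen3-30B-A3B", total_b=30, active_b=3)

# Assumption: QwQ-32B is dense, so all ~32B parameters are active.
QWQ_32B_ACTIVE_B = 32

if __name__ == "__main__":
    for model in (QWEN3_FLAGSHIP, DEEPSEEK_R1, QWEN3_SMALL):
        print(f"{model.name}: {model.active_fraction:.1%} of parameters active")
    ratio = QWEN3_SMALL.active_b / QWQ_32B_ACTIVE_B
    print(f"Qwen3-30B-A3B active / QwQ-32B active = {ratio:.3f}")
```

Under these assumptions the last ratio comes out just below 0.1, which is consistent with the summary's "less than one tenth" claim. The fractions say nothing about benchmark quality; they only show how much of each model is exercised per use.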
Alibaba open-sources its first "hybrid reasoning model", integrating "fast thinking" and "slow thinking" capabilities
Xin Lang Cai Jing· 2025-04-29 06:28
Core Insights - Alibaba has open-sourced its new generation model Qwen3, which integrates "fast thinking" and "slow thinking" capabilities, significantly reducing deployment costs compared to other large models like DeepSeek [1] - The Qwen3 model employs a "Mixture of Experts (MoE)" architecture, allowing it to mimic human problem-solving by providing multi-step deep thinking for complex issues and quick responses for simpler queries, thus saving computational resources [3] - Alibaba is focusing on building its AI strategy around the Qwen series, with plans to invest over 380 billion RMB in cloud and AI hardware infrastructure over the next three years, surpassing the total investment of the past decade [4] Industry Context - Following the release of DeepSeek's low-cost high-performance R1 model, domestic tech companies in China, including Baidu and iFlytek, are rapidly launching a series of cost-effective AI model services [3] - Alibaba's Qwen series has surpassed the US Llama in terms of open-source model downloads, with over 300 million downloads and more than 100,000 derivative models [4] - On the same day Alibaba announced Qwen3, OpenAI released several updates to ChatGPT, enhancing its shopping features and optimizing for various consumer categories, indicating a competitive landscape in AI model development [4]
The whole internet is waiting for Liang Wenfeng
投中网· 2025-04-29 06:21
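Is the DeepSeek R2 model coming? Author: Jiang Fan. Editor: Dong Yuqing. Source: 凤凰网科技. With May approaching, Chinese and US tech giants may be heading into a new round of head-to-head competition. First, in mid-April, OpenAI released the GPT-4.1, o3, and o4-mini models in quick succession; Google brought out Gemini 2.5 Flash Preview, a hybrid reasoning model; and on the same day as Google, Doubao officially released its 1.5 Deep Thinking model at a roadshow stop in Hangzhou, showing stronger multimodal capability. 凤凰网科技 has learned from industry insiders that Alibaba's next-generation large model, Qwen3, will also be released within the month. Amid the melee, that "mysterious power from the East" also seems to be quietly preparing a new release. With nerves this sensitive, the smallest clue gets magnified. Yesterday, Clément Delangue, CEO of Hugging Face, the world's largest open-source AI community, posted an intriguing update on social media. The post consisted of only three eyes emoji, plus a link to the DeepSeek team's official repository on Hugging Face. This suspenseful combination set off heated discussion in tech circles, and the industry widely speculates that DeepS ...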
Focusing on sci-tech finance to build a fourth "calling card"
Mei Ri Shang Bao· 2025-04-29 03:05
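On April 24, at the conference's annual entrepreneurship and innovation awards ceremony, Industrial Bank's Hangzhou Management Department (兴业银行杭州管理部) was again named a "Startup Service Institution". Figures show that as of the end of March 2025, the balance of sci-tech finance loans under the Hangzhou Management Department stood at 22.2 billion yuan, with a compound annual growth rate of 20% over the past three years, twice its loan growth rate. On April 25, at the China Future Unicorn Conference, Zhao Jiong, vice president of Industrial Bank's Hangzhou Branch, and the funds of funds of Hangzhou's eight major urban districts jointly released the "Hangzhou AI Wolong Map" (《杭州AI卧龙图》). That evening, at the 2025 China Unicorn Night, Zhao delivered remarks and joined in launching the "寻龙记" seed-unicorn search program and formally establishing the "扶摇·独角兽智库" unicorn think tank. Shangbao report (reporter Miao Lu, correspondent Xu Nuo): From April 23 to 25, the 9th 万物生长大会, hosted by the Zhejiang Provincial Committee of the China National Democratic Construction Association, the Zhejiang Federation of Industry and Commerce, and 中国投资发展促进会, was held at the Hangzhou International Expo Center. The conference released three provincial-level lists for the first time: the "2025 Zhejiang Unicorn Companies List", the "Zhejiang Future Unicorn Companies TOP100 List", and the "Zhejiang Seed Unicorn Companies TOP100 List", covering many innovative companies across the province. The "2025 Hangzhou Unicorn (and Quasi-Unicorn) Companies List" also arrived as scheduled, with five newly added unicorns; DeepSeek, Unitree Robotics (宇树科技), and Game Science (游戏科学), all among Hangzhou's "Six Little Dragons", appear on it. The conference also released the "2025 China Future Unicorn List" and the "Hangzhou AI Wolong Map" for the first time. The former focuses on artificial intelligence, embodied intelligence, ...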
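AI Wave Chronicles | A conversation with Liu Zhiyuan: The road to AGI is not easy, and the long run means withstanding a capital winter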
Bei Ke Cai Jing· 2025-04-29 01:18
Group 1 - Beijing is becoming a strategic high ground in the AI large model field, with significant advancements in technology and a thriving ecosystem for innovation [1][4] - The emergence of AI unicorns like DeepSeek and the development of the "Wudao" model signify China's growing capabilities in AI, aiming to compete with the US by 2025 [4][5] - The AI landscape in China is rapidly evolving, with numerous "little dragons" and "little tigers" emerging, indicating a flourishing environment for AI startups [5][6] Group 2 - The development of AI models has shifted from "large model refining" to "refining large models," with DeepSeek's success serving as a strong signal of China's position in the global AI arena [5][20] - The establishment of the Zhiyuan Research Institute has played a crucial role in fostering AI talent and innovation, acting as an "angel investor" for top scholars in the field [11][22] - The AI industry is witnessing a trend towards more efficient and capable models, with a focus on achieving higher model density and performance [20][21] Group 3 - The journey towards Artificial General Intelligence (AGI) is seen as a long-term goal for AI entrepreneurs, requiring strategic planning and patience [17][19] - The local processing capabilities of edge models provide advantages in data protection and user privacy, making them appealing in various applications [19][20] - The success of DeepSeek highlights the importance of combining financial resources with visionary leadership in the AI startup ecosystem [21][22]