Model Distillation
Core model allegedly distilled from DeepSeek? An ex-girlfriend's accusation exposes the truth behind the collapse of "Europe's OpenAI"
36Ke · 2025-08-18 12:12
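Mistral AI, once hailed as "Europe's OpenAI," is caught up in a "plagiarism" scandal. In a breakup essay, a former employee alleges that the company's core technology was distilled from DeepSeek, while the public was misled into believing it was the result of in-house RL.

Has Mistral been caught red-handed wrapping DeepSeek? A few days ago, someone claimed on X that Mistral's new model was distilled directly from DeepSeek and that its benchmark results had been misrepresented. The company is seen as the "hope of the whole village" for a European OpenAI, with a standing comparable to DeepSeek's in China. Has it now really fallen from grace? It all seems surreal.

Even more explosive, this bombshell came out of a "breakup essay" written by a female Mistral employee. The original text reads:

"You knew all along that Mistral was acting unethically: distilling DeepSeek and passing it off as its own model, using OpenAI's data, misleading the public by claiming that RL was doing the work when it was actually just a product of DS3, and misrepresenting benchmark results. You not only knew all this, you actively took part in it. When I raised these issues, you took no responsibility and instead chose to ignore me and give me the cold shoulder."

A relationship-dispute essay exposes a model-wrapping scandal

In other words, this former female Mistral employee not only revealed in her essay her emotional entanglement with her ex-boyfriend, a Mistral colleague, but also exposed the scandal of Mistral wrapping DeepSeek. As soon as this news ...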
Accused of distilling DeepSeek and faking results! "Europe's OpenAI" has fallen from grace
猿大侠 (Yuan Daxia) · 2025-08-15 04:11
Core Viewpoint - Mistral, a prominent player in the open-source AI sector, is accused of distilling its latest model from DeepSeek and of misleading the public about the model's performance and test results [3][22][24].

Group 1: Allegations and Evidence
- A former Mistral employee alleged in a mass email that the company's latest model may have been distilled directly from DeepSeek and then presented as a reinforcement learning success story [2][3].
- Analysis by Twitter user Sam Peach found a surprising similarity between Mistral-small-3.2 and DeepSeek-v3, suggesting that the resemblance is more likely the result of distillation than coincidence [7][14].
- The analysis identified overused words and n-grams in the models' outputs and used them to build a similarity map, on which Mistral-small-3.2 and DeepSeek-v3 sat close together, indicating highly similar outputs [16][18].

Group 2: Company Background and Market Position
- Mistral, founded in 2023 and based in Paris, was co-founded by former Google DeepMind and Meta employees and is often referred to as the European version of OpenAI [24].
- The company has attracted significant attention, with a valuation reaching $10 billion and plans for a new $1 billion funding round, following a previous round that raised €600 million (approximately $645 million) [25].
- Mistral has maintained an open-source approach, releasing models such as Mistral Small and Mistral Code, and has developed a chatbot named LeChat to compete with ChatGPT [27][28].
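To make the n-gram comparison described in the summary more concrete, here is a minimal Python sketch. It is not the code behind the cited analysis: the function names (`ngrams`, `overuse_profile`, `cosine_similarity`), the add-one smoothing, the baseline corpus, and the use of cosine similarity are all assumptions chosen for illustration. The idea is to score each n-gram by how much more often a model uses it than a reference corpus does, then compare the resulting profiles between two models.

```python
from collections import Counter
import math


def ngrams(text, n):
    """Return the word n-grams of a lowercased, whitespace-tokenized text."""
    words = text.lower().split()
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]


def overuse_profile(outputs, baseline, n=2, top_k=50):
    """Score n-grams by how much more often a model uses them than a baseline.

    `outputs` and `baseline` are lists of strings (hypothetical sample data).
    Returns the top_k n-grams mapped to their overuse ratio.
    """
    model_counts = Counter(g for t in outputs for g in ngrams(t, n))
    base_counts = Counter(g for t in baseline for g in ngrams(t, n))
    model_total = sum(model_counts.values()) or 1
    base_total = sum(base_counts.values()) or 1
    scores = {}
    for gram, count in model_counts.items():
        model_rate = count / model_total
        # Add-one smoothing so n-grams missing from the baseline never divide by zero.
        base_rate = (base_counts[gram] + 1) / (base_total + len(model_counts))
        scores[gram] = model_rate / base_rate
    top = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)[:top_k]
    return dict(top)


def cosine_similarity(a, b):
    """Cosine similarity between two sparse {ngram: score} profiles."""
    dot = sum(v * b[k] for k, v in a.items() if k in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)
```

With profiles computed for several models against the same baseline, the pairwise similarities could be fed into a clustering or 2D embedding step to draw a map like the one the summary mentions. A high similarity is suggestive at most; on its own it does not prove that one model was distilled from another.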
On 618's home turf, I chatted with three top technical PhDs
量子位 (QbitAI) · 2025-06-18 07:49
Core Viewpoint - The article discusses the evolution of technology and its impact on the shopping experience during the 618 shopping festival, highlighting the advancements in logistics, customer service, and product recommendation systems that enhance user experience and operational efficiency [1][2][3].

Group 1: Technology and User Experience
- Technology serves to improve quality of life rather than complicate it, as evidenced by advancements in logistics and customer service [3][4].
- The improvements in user experience and error reduction are attributed to the efforts of technical teams working behind the scenes to optimize systems and processes [4][20].
- The implementation of a "same product identification system" allows for better product comparison and competitive pricing, enhancing the shopping experience [8][9].

Group 2: Case Studies of Technical Teams
- Chang Lin from the retail division focuses on optimizing the same product identification system using model distillation to improve efficiency and reduce costs [11][13][16].
- Xing Yan from the logistics division emphasizes the importance of understanding specific operational scenarios, leading to the development of intelligent distribution models for delivery personnel [33][38].
- Chu Xue from the technology division works on voice recognition systems, ensuring high accuracy in applications like smart customer service and AI-driven calls [42][51].

Group 3: Talent Development and Company Culture
- The company emphasizes a solid technical foundation and long-term investment in talent, as seen in the TGT (Tech Genius Team) program aimed at recruiting top technical talent [57][59].
- The TGT program offers a unique structure with no salary cap based on potential, mentorship from experienced professionals, and access to real-world data for practical applications [59][60].
- The company fosters a collaborative environment where technical personnel are encouraged to engage with real-world problems while developing their skills [61][62].
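The summary above mentions model distillation as a way to make the same product identification system cheaper to run, but does not say which variant is used. As a hedged illustration only, the sketch below shows the classic logit-based form of knowledge distillation, in which a small student model is trained to match a large teacher's temperature-softened output distribution. The function names and the default temperature are assumptions, and this is not a description of any company's actual pipeline.

```python
import math


def log_softmax(logits, temperature=1.0):
    """Numerically stable log-softmax of logits divided by a temperature."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    log_total = m + math.log(sum(math.exp(x - m) for x in scaled))
    return [x - log_total for x in scaled]


def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by T^2.

    A higher temperature flattens both distributions so the student also
    learns from the teacher's relative preferences among unlikely classes.
    """
    log_p = log_softmax(teacher_logits, temperature)  # teacher
    log_q = log_softmax(student_logits, temperature)  # student
    kl = sum(math.exp(lp) * (lp - lq) for lp, lq in zip(log_p, log_q))
    # The T^2 factor keeps gradient magnitudes comparable across temperatures.
    return kl * temperature ** 2
```

In training, this term is typically combined with an ordinary cross-entropy loss on the ground-truth labels. For large language models, "distillation" often means something looser, namely fine-tuning a student on text generated by the teacher, which requires no access to the teacher's logits.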