Workflow
Mistral Small
icon
Search documents
被曝蒸馏DeepSeek还造假!欧版OpenAI塌房了
猿大侠· 2025-08-15 04:11
Core Viewpoint - Mistral, a prominent player in the open-source AI sector, is accused of distilling its latest model from DeepSeek, misleading the public about its model's performance and testing results [3][22][24]. Group 1: Allegations and Evidence - A former employee of Mistral revealed through a mass email that the company's latest model may have directly distilled from DeepSeek, misrepresenting it as a successful reinforcement learning case [2][3]. - Analysis by Twitter user Sam Peach indicated a surprising similarity between Mistral-small-3.2 and DeepSeek-v3, suggesting that the resemblance is likely a result of distillation rather than coincidence [7][14]. - The analysis involved identifying overused words and n-grams in the models' outputs, leading to a similarity map that showed Mistral-small-3.2 and DeepSeek-v3 were closely positioned, indicating high output similarity [16][18]. Group 2: Company Background and Market Position - Mistral, founded in 2023 and based in Paris, is often referred to as the European version of OpenAI, co-founded by former Google DeepMind and Meta employees [24]. - The company has gained significant attention, with a valuation reaching $10 billion and plans for a new funding round of $1 billion, following a previous round that raised €600 million (approximately $645 million) [25]. - Mistral has maintained an open-source approach, releasing models like Mistral Small and Mistral Code, and has developed a chatbot named LeChat to compete with ChatGPT [27][28].
被曝蒸馏DeepSeek还造假!欧版OpenAI塌房了
量子位· 2025-08-14 07:34
Core Viewpoint - Mistral, a prominent AI company, is accused of distilling its latest model from DeepSeek, misrepresenting it as a successful reinforcement learning case while distorting benchmark test results [3][21]. Group 1: Company Background - Mistral is recognized as the European version of OpenAI and has gained a strong reputation in the open-source AI community [4][5]. - Founded in 2023 in Paris, Mistral was established by former Google DeepMind and Meta employees [24]. - The company has maintained an open-source approach, releasing models like Mistral Small and Mistral Code, which are competitive in multilingual processing and reasoning capabilities [27]. Group 2: Recent Developments - A recent Twitter leak from a former employee revealed that Mistral's model, Mistral-small-3.2, shows a high similarity to DeepSeek-v3, suggesting possible distillation [12][19]. - The leak indicates that Mistral may have concealed this fact, misleading the public about the effectiveness of its model [21]. - Mistral's valuation reached $10 billion in August 2023, with ongoing fundraising efforts [25]. Group 3: Controversy and Community Reaction - The allegations have sparked controversy, particularly due to Mistral's significant standing in the open-source AI sector [24]. - Many in the community argue that distilled models should be transparently labeled to maintain integrity [22]. - As of now, Mistral has not publicly responded to these allegations, despite recently launching a new model, Mistral Medium V3.1 [29].
深度|英伟达黄仁勋对话欧洲最大AI独角兽Mistral CEO: 开源是技术民主化的基石;AI将对每个国家的GDP产生双位数影响
Z Potentials· 2025-04-11 04:20
Core Viewpoints - The discussion emphasizes the strategic value of sovereign AI and the necessity for countries to actively invest in AI development as a common human endeavor rather than a privilege of a few tech companies [3][5][4] - AI is recognized as a general-purpose technology that can transform various sectors, including public services, agriculture, and defense, necessitating tailored national AI strategies [4][6] - Open-source collaboration is highlighted as a cornerstone for technological democratization, accelerating progress and enabling countries to deploy models on their own infrastructure [3][27] Group 1: Sovereign AI and National Strategy - Sovereign AI is essential for countries to maintain cultural identity and digital sovereignty, as no one understands a nation's needs better than its own citizens [3][5] - The limitations of centralized AI models are discussed, emphasizing the need for customization based on local preferences and requirements [3][6] - Countries must view AI as a new layer of infrastructure, akin to electricity, and invest in building their own capabilities to avoid dependency on foreign technologies [7][8] Group 2: Open Source and Collaboration - Open-source models are crucial for accelerating technological advancements and fostering a collaborative ecosystem among research labs [27][28] - The partnership between NVIDIA and Mistral in developing models like Mistral NeMo illustrates the benefits of combining resources and expertise to create superior AI solutions [28][29] - Open-source technology enhances security through transparency and community involvement, reducing risks associated with centralized control [31][32] Group 3: Economic Impact and Digital Workforce - AI is projected to have a significant impact on GDP, similar to the historical influence of electricity, making it imperative for nations to develop their own AI capabilities [6][7] - The concept of digital labor is introduced, where AI systems are seen as integral to the workforce, requiring nations to actively shape and optimize these technologies [8][9] - The importance of local talent development and infrastructure is emphasized to ensure that AI systems align with national values and regulations [7][10] Group 4: Company Strategies and Ecosystem - Companies like NVIDIA prioritize developer-centric strategies, focusing on creating an ecosystem that supports innovation and collaboration [42][43] - The flexible organizational structure of NVIDIA allows it to adapt to rapid technological changes while minimizing bureaucracy [33][34] - The unique positioning of companies in the AI landscape is crucial for establishing partnerships with cloud service providers and driving mutual success [40][41]