DeepSeek
Search documents
宝马中国宣布接入DeepSeek,宝马妥协了?
3 6 Ke· 2025-05-02 02:21
Core Viewpoint - BMW China is embracing local AI technology by integrating DeepSeek, marking a significant step in its digital transformation strategy and enhancing its AI capabilities in the Chinese market [1][3][6] Group 1: BMW's AI Integration - BMW has announced the integration of DeepSeek into its operations, which will enhance the BMW Intelligent Personal Assistant and improve human-machine interaction in new models starting from Q3 2025 [1][2] - The collaboration with DeepSeek follows BMW's earlier partnership with Alibaba to develop AI language models, showcasing BMW's commitment to local AI ecosystem development [1][3] Group 2: Strategic Importance of Local AI - This move signifies BMW's recognition of the importance of local AI technologies and its willingness to adapt to the rapidly evolving Chinese automotive market [3][4] - BMW's previous initiatives, such as the launch of a 360-degree AI strategy and the development of intelligent systems like "Car Expert" and "Travel Companion," reflect its ongoing efforts to enhance its smart vehicle offerings [3][4] Group 3: Challenges and Opportunities - Despite its historical strengths in manufacturing and brand image, BMW faces challenges in keeping pace with the increasing demand for smart and connected vehicles [4][5] - The partnership with DeepSeek is seen as a strategic decision to accelerate BMW's digital transformation and leverage the advanced technologies and innovative models from Chinese tech companies [4][6]
互联网大厂五一前密集开源新模型,布局各异谁将留在牌桌?
Nan Fang Du Shi Bao· 2025-05-01 14:12
Core Insights - Major domestic AI model companies are rapidly open-sourcing their models ahead of the May Day holiday, with Alibaba releasing Qwen3, Xiaomi launching Xiaomi MiMo, and DeepSeek introducing DeepSeek-Prover-V2 [1][2][5] Alibaba - Alibaba's Qwen3 features two MoE models with 30B and 235B parameters, and six dense models ranging from 0.6B to 32B, achieving state-of-the-art performance in its category [2] - Qwen3 is the first "hybrid reasoning model" in China, integrating fast and deep thinking capabilities, significantly reducing computational power consumption [5] - Alibaba has consistently open-sourced various models this year, including the 14B video generation model and the 7B multimodal model, aiming to leverage open-source models for AI applications while monetizing its cloud services [6] Xiaomi - Xiaomi's MiMo model, with only 7B parameters, outperformed OpenAI's closed-source model o1-mini in public benchmarks for mathematical reasoning and coding competitions [6] - This marks Xiaomi's first foray into open-sourcing its models, developed by its newly established Core team [6] DeepSeek - DeepSeek has released two versions of DeepSeek-Prover-V2, focusing on mathematical theorem proving and achieving significant performance improvements in benchmark tests [8] - The new models support extensive context inputs and are based on previous versions, showcasing a commitment to enhancing reasoning capabilities [8] Industry Trends - The open-sourcing of models by these companies is seen as a strategic move to enhance competitiveness against closed-source models from companies like OpenAI and Anthropic, which still hold a slight performance edge [9][10] - Industry experts predict a consolidation in the AI model sector, with DeepSeek, Alibaba, and ByteDance emerging as the leading players in China, while the U.S. market remains competitive with companies like xAI and OpenAI [10][11] - The open-source models are expected to democratize AI technology, making it more accessible and promoting innovation across various industries [9][10]
AI圈顶级榜单曝黑幕,Meta作弊刷分实锤?
虎嗅APP· 2025-05-01 13:51
Core Viewpoint - The article discusses allegations of manipulation in the LMArena ranking system for AI models, suggesting that major companies are gaming the system to inflate their scores and undermine competition [2][11][19]. Group 1: Allegations of Cheating - Researchers from various institutions have published a paper accusing AI companies of exploiting LMArena to boost their rankings by selectively testing models and withdrawing low-scoring ones [11][12][15]. - The paper analyzed 2.8 million battles across 238 models from 43 providers, revealing that a few companies implemented policies that led to overfitting specific metrics rather than genuine AI advancements [12][19]. - Meta reportedly tested 27 variants of its Llama 4 model privately before its public release, raising concerns about unfair advantages [19][20]. Group 2: Data Access Inequality - The study found that closed-source commercial models (like those from Google and OpenAI) participated more frequently in LMArena compared to open-source models, leading to a long-term data access inequality [23][30]. - Approximately 61.3% of all data in LMArena is directed towards specific model providers, with Google and OpenAI models accounting for about 19.2% and 20.4% of all user battle data, respectively [26][30]. - The limited access to data for open-source models could potentially lead to a relative performance improvement of up to 112% if they had access to more data [31][32]. Group 3: Official Response - LMArena quickly responded to the allegations, claiming that the research contained numerous factual inaccuracies and misleading statements [36][40]. - They emphasized that they have always aimed to treat all model providers fairly and that the number of tests submitted is at the discretion of the providers [40][41]. - LMArena's policies regarding model testing and ranking have been publicly available for over a year, countering claims of secrecy [40][41]. Group 4: Future of Rankings - Andrej Karpathy, a prominent figure in AI, expressed concerns that the focus on LMArena scores has led to models that excel in ranking rather than overall quality [42][43]. - He suggested OpenRouterAI as a potential new ranking platform that could be less susceptible to manipulation [44][49]. - The original intent of LMArena, created by students from various universities, has been overshadowed by corporate interests and the influx of major tech companies [51][56].
科技晚报AI速递:今日科技热点一览 丨2025年5月1日
Xin Lang Cai Jing· 2025-05-01 13:24
Group 1: AI and Technology Developments - Nvidia CEO Jensen Huang urged the Trump administration to revise AI chip export regulations, highlighting that China's AI technology is rapidly catching up and that current restrictions harm U.S. competitiveness [1] - OpenAI's GPT-4o faced criticism for being overly agreeable, prompting a rollback to address concerns about AI's emotional responses and the risk of misinformation [2] - Microsoft launched the Phi-4 reasoning model series, which includes three versions designed for complex reasoning tasks, outperforming some larger models in various tests [3] Group 2: Legal and Regulatory Challenges - A U.S. federal judge ruled that Apple violated a 2021 court order by not allowing external payment options in its App Store, indicating potential adjustments in Apple's payment policies to mitigate legal risks [1] - Google CEO Sundar Pichai warned that a proposed antitrust measure requiring the sharing of search data could have devastating effects on Google's search business, potentially stifling innovation and compromising user privacy [4] Group 3: Market Dynamics and Employment Trends - Shopify's CEO announced a mandate for all employees to utilize AI, marking a significant shift towards AI-driven operations and potentially leading to job cuts, as the U.S. white-collar job market faces its lowest recruitment levels in 12 years [4] - Ele.me entered the competitive landscape of food delivery with a substantial subsidy plan, aiming to regain market share amidst aggressive competition from JD and Meituan [5] Group 4: Advancements in AI Models - DeepSeek released the DeepSeek-Prover-V2 mathematical reasoning model, showcasing significant improvements in reasoning capabilities and marking a shift towards structured logical reasoning in AI [6]
特斯联2024年营收超18亿元,三大业务板块升级释放增长新动能
2 1 Shi Ji Jing Ji Bao Dao· 2025-05-01 05:08
Core Viewpoint - Teslin, established in 2015, is a key player in China's AIoT industry, focusing on technology-driven industrial upgrades and spatial intelligence for sustainable development [1] Financial Performance - Teslin's revenue for 2024 is projected to be 1.843 billion yuan, representing an 83.2% increase compared to 2023 [1][2] - Revenue figures for 2022 and 2023 were 738 million yuan and 1.006 billion yuan, respectively, resulting in a compound annual growth rate (CAGR) of 58.0% from 2022 to 2024 [1][2] - The company's expense ratio (sales, management, and R&D) decreased from 76.9% in 2023 to 45.0% in 2024, indicating effective cost control [3] Market Position - Teslin has become one of the fastest-growing companies in the AI industry, outperforming peers such as SenseTime and Horizon Robotics, which reported revenue growth rates of 10.8% to 53.6% in 2024 [3] - The company has established a comprehensive AIoT technology product system over nine years, positioning itself as a leading enterprise in the rapidly growing AIoT market [2] Market Expansion - As of December 31, 2024, Teslin's products have been deployed by over 800 clients across 160 cities globally, with a total order amount of 2.3 billion yuan [4] - The number of clients increased from 224 in 2022 to 342 in 2024, reflecting an optimized customer structure [4] Strategic Focus - Teslin is focusing on three strategic directions: AIoT models, AIoT infrastructure, and AIoT agents, which are expected to drive future business growth [6] - The company is responding to the increasing demand in the market by restructuring its internal teams to enhance efficiency and innovation [3] Industry Context - The global AIoT market is experiencing rapid growth, with a projected CAGR of over 31.7% over the next five years [2] - China's AI market is also expanding, with spending reaching 14.8 billion USD in 2023, making it the second-largest AI market globally [7] - Teslin's technology strategy aligns with China's push for self-sufficiency in AI, reducing reliance on external technologies and enhancing the resilience of the industrial chain [7]
DeepSeek新数学模型刷爆记录!7B小模型自主发现671B模型不会的新技能
量子位· 2025-05-01 03:53
DeepSeek放大招!新模型专注数学定理证明,大幅刷新多项高难基准测试。 在普特南测试上, 新模型 DeepSeek-Prover-V2 直接把记录刷新到 49道 。 目前的 第一名 在657道题中只做出 10道 题,为Kimi与 AIME2024冠军团队Numina 合作成果 Kimina-Prover 。 而未针对定理证明优化的 DeepSeek-R1只做出 1道 。 让还没发布的R2更令人期待了。 | 657) | | --- | | (out of | | Lean | | मै | Model | num- | | | --- | --- | --- | --- | | | | solved | compute | | 1 | Kimina-Prover-7B-Distill♥ | 10 | pass@192 | | 2 | Self-play Theorem Prover♥ | 8 | pass@3200 | | 3 | Goedel-Prover-SFT♥ | 7 | pass@512 | | 4 | ABEL | 7 | pass@596 | | 5 | InternLM2.5-StepPr ...
DeepSeek开源Prover-V2强推理模型,网友:奥数从没这么简单过
机器之心· 2025-05-01 02:11
Core Insights - DeepSeek has released DeepSeek-Prover-V2, an open-source large language model specifically designed for formal theorem proving, achieving industry-leading performance in theorem proving tasks [1][3][4]. Model Overview - Two versions of DeepSeek-Prover-V2 have been released, with parameter sizes of 7 billion and 671 billion. The larger model is based on DeepSeek-V3-Base, while the smaller one is built on DeepSeek-Prover-V1.5-Base, supporting a maximum context length of 32,000 tokens [3][4]. - DeepSeek-Prover-V2 is tailored for the mathematical AI programming language Lean 4, focusing on formal theorem proving [3][4]. Technical Implementation - The model utilizes a recursive theorem proving process to generate cold-start training data, where DeepSeek-V3 decomposes complex problems into manageable sub-goals and formalizes the reasoning steps [9][11]. - The training process involves two phases: a non-CoT (non-Chain of Thought) mode for rapid formal proof generation and a CoT mode for detailed reasoning steps, enhancing transparency and logical progression [17][19]. Performance Metrics - The DeepSeek-Prover-V2-671B model achieved an 88.9% pass rate on the MiniF2F test and successfully solved 49 out of 658 problems in the PutnamBench dataset [15][23]. - The model's performance was evaluated against various benchmarks, demonstrating unprecedented accuracy and efficiency compared to other advanced models in the industry [20][23]. Dataset Release - DeepSeek has also introduced ProverBench, a benchmark dataset containing 325 problems, including 15 from recent AIME math competitions, aimed at comprehensive evaluation of models in high school and undergraduate mathematics [25][26].
刚刚!DeepSeek-Prover-V2-671B 发布,网友:DS 是假期终结者
程序员的那些事· 2025-05-01 02:04
Core Viewpoint - DeepSeek has launched DeepSeek-Prover-V2-671B, marking a significant advancement in AI mathematical reasoning capabilities, particularly in automated theorem proving [2][4]. Group 1: Model Overview - DeepSeek-Prover-V2-671B is a next-generation automated theorem proving expert model with 671 billion parameters, optimized for proof generation and verification in the Lean 4 framework [4][6]. - The model employs a mixture of experts (MoE) architecture, activating approximately 37 billion parameters per inference, enhancing computational efficiency while maintaining strong reasoning capabilities [4][6]. Group 2: Key Breakthroughs - The release signifies three major milestones, including the potential for innovation across various application domains [6]. - The model's specifications include a context length of approximately 128,000 tokens, allowing it to handle complex reasoning chains and lengthy proofs [6][7]. - The attention mechanism is likely a multi-head latent attention (MLA), which compresses key-value (KV) cache, significantly reducing memory requirements [6][7]. Group 3: Applications and Impact - The model supports formal verification in areas such as cryptographic security proofs and chip design validation, enabling rigorous mathematical checks in automated processes [7]. - It aids mathematicians in formalizing theorems, exploring new conjectures, and proving complex mathematical problems, potentially accelerating mathematical research [7]. - The model can be utilized as an interactive educational tool, guiding students in mastering rigorous mathematical proof methods [7].
1月股市涨了:这是川普的股市!4月股市跌了:这是拜登的股市!特朗普执政100天,被痛批失败!沃尔玛低头了,145%关税全扛!
雪球· 2025-05-01 01:32
Group 1 - The U.S. economy showed unexpected contraction with a GDP decline of 0.3% in Q1, marking the first quarterly negative growth since 2022, significantly below the expected growth of 0.4% [3][11] - The market reacted sharply to the GDP data, with the Dow Jones dropping nearly 800 points and the Nasdaq falling close to 3% during early trading [3][5] - Following news of potential trade negotiations and tariff adjustments, the market began to recover, reducing most of the earlier losses [5] Group 2 - Major companies like AMD saw significant stock declines, with Supermicro Computer dropping over 11% due to disappointing earnings forecasts [6] - Among the tech giants, Microsoft and Meta reported better-than-expected earnings, with Meta raising its capital expenditure guidance for the year, leading to stock price increases [8] - In contrast, Tesla and Amazon experienced stock declines, with Tesla down 3.4% and Amazon down 1.6% after initial larger drops [8] Group 3 - The U.S. Commerce Department indicated that the GDP decline was primarily due to a 36% surge in preemptive imports before the implementation of Trump's tariff policies, which expanded the trade deficit [11] - Consumer spending growth was weak at only 1.8%, the lowest since mid-2023, contributing to the economic slowdown [11] - Some economists warned that if current tariff policies remain unchanged, the U.S. economy could face stagnation, with a 90% probability of recession predicted by Apollo Global Management's chief economist [12] Group 4 - Trump quickly attributed the economic downturn to his predecessor, claiming it was "Biden's stock market" and asserting that the economy is merely in a transitional phase [14][15] - Criticism arose regarding Trump's economic policies, with some analysts noting that the stock market and dollar performance during his term has been the worst since 1980 [16] - Retail giants like Walmart and Target have begun to absorb tariff costs, indicating a shift in strategy due to supply chain disruptions caused by the tariff war [19][20]
创始人“跑路”?极石汽车回应:消息不实;美团免除骑手外卖柜使用费;微软30%代码由AI编写丨邦早报
创业邦· 2025-05-01 01:03
Group 1 - Apple is restructuring its global affairs and music departments, including management adjustments in Europe, India, China, and other Asian regions [3] - OpenAI has rolled back the latest update of GPT-4o due to concerns about its overly flattering personality, with plans for further improvements [4] - Meituan announced that starting May 1, 2025, it will waive delivery cabinet fees for its riders, enhancing their delivery rights [8] Group 2 - Starbucks plans to increase its workforce and reduce investment in automation, aiming to improve customer experience [10] - Volvo is initiating a cost-cutting plan totaling 18 billion Swedish Krona, which includes global layoffs to enhance profitability [11] - Microsoft CEO stated that 30% of the company's code is now generated by AI, indicating a growing reliance on artificial intelligence in software development [11] Group 3 - Extreme Stone Automotive denied rumors about its founder's alleged departure, affirming that operations are normal and the founder is fulfilling his duties [13] - Decathlon is reportedly looking to sell about 30% of its Chinese business, with a potential valuation of around $1 billion [13] - Nvidia's CEO emphasized the need for AI factories in all American companies, which will create technology jobs [19] Group 4 - CATL is planning to initiate its Hong Kong listing next month, potentially becoming the largest stock issuance in the city in four years, aiming to raise at least $5 billion [20] - Great Wall Heavy Industry has completed a 520 million yuan Series A financing round, led by two Fortune Global 500 companies [20] - DeepSeek released a new AI model with 671 billion parameters, enhancing its capabilities for complex mathematical proofs [25]