Workflow
Seek .(SKLTY)
icon
Search documents
DeepSeek新版R1模型实际性能如何?第三方评测来了
Nan Fang Du Shi Bao· 2025-06-05 12:26
Core Insights - DeepSeek has released an upgraded version of its R1 model, which shows improved performance compared to its predecessor and surpasses OpenAI's o3 model, although it still lags behind o4-mini(high) and Google's Gemini 2.5 Pro Preview 05-06 [1][2] Model Performance - The new R1 model achieved a total score of 63.55, an increase of 1.61 points from the previous version, placing it fourth in the rankings [2] - The highest score was obtained by o4-mini(high) at 70.51, followed by Gemini 2.5 Pro preview 05-06 at 66.48 [2] Reasoning and Instruction Following - The instruction-following capability of the new R1 model improved significantly, scoring 48.46, which is 17.09 points higher than the old version, but still falls short of international top models like o3 (66.95) and o4-mini(high) (68.07) [4] - The reasoning task scores showed a decline of 1.7 points compared to the old R1 model, with the main differences observed in mathematical and scientific reasoning tasks, while performing better in coding tasks [4] Reduction in Hallucination Rate - The updated R1 model has optimized its performance regarding "hallucination" issues, with a reduction in hallucination rates by approximately 45%-50% in tasks such as rewriting, summarization, and reading comprehension [4] - The hallucination rate for the new R1 model is now at 13.86%, a decrease of 7.16 percentage points, although it still has a significant gap compared to the best-performing model, doubao-1.5-pro-32k, which has a hallucination rate of only 4.11% [5] - The most notable improvements in hallucination rates were observed in text summarization and reading comprehension tasks, with reductions of 9.27% and 14.49%, respectively [5]
DeepSeek发源地再推人工智能创新高地方案!科创板人工智能ETF(588930)现涨超2%,实时成交额突破6000万元
Mei Ri Jing Ji Xin Wen· 2025-06-05 06:55
Group 1 - The core viewpoint of the news is the significant development and investment in artificial intelligence (AI) in Hangzhou, with specific targets set for 2025, including a market-scale computing power exceeding 50 EFLOPS and a revenue target for the AI core industry exceeding 390 billion yuan [1] - The implementation plan for AI innovation in Hangzhou aims to cultivate two internationally leading foundational models and over 25 industry-specific influential models, alongside establishing more than 700 large-scale enterprises in the AI sector [1] - The A-share market showed a slight fluctuation, but AI-related stocks surged, with notable increases in companies like Yuke Technology and Chipone Technology, indicating a high market interest in AI themes [1] Group 2 - Shanxi Securities highlighted the growing global demand for AI computing power, particularly driven by large model training and inference, presenting significant opportunities for domestic AI and server manufacturers [2] - The domestic demand for AI computing power remains strong, especially from major internet companies and intelligent computing centers, with IDC predicting the accelerated server market in China to reach $25.3 billion by 2028, growing at a compound annual growth rate of over 20% from 2024 to 2028 [2] - The introduction of DeepSeek R1 is expected to lower the barriers for AI application development and deployment, making inference demand a primary growth driver for AI computing power, thus expanding market space for domestic manufacturers [2]
美的空调怎么样?DeepSeek看起来是真的香!
Cai Fu Zai Xian· 2025-06-04 06:39
Core Viewpoint - The Midea Fresh Air Machine T6 is positioned as a multifunctional air management solution that prioritizes health and comfort, particularly for families with children, by addressing various air quality concerns and providing a holistic approach to indoor air management [1][10]. Group 1: Product Features - The Midea Fresh Air Machine T6 integrates six functions: air conditioning, fresh air, air purification, disinfection, dehumidification, and humidification, making it a versatile "air steward" [3]. - The air conditioning feature utilizes a unique design with irregular micro-holes to soften strong winds, creating a comfortable airflow experience without the sensation of direct wind [3]. - The device includes a 3-liter water tank for independent humidification at a rate of 450ml/h, ensuring continuous moisture for up to 6 hours, and a powerful dehumidification capability of 5.03kg/h to combat humidity [5]. Group 2: Health and Safety - The air purification and fresh air functions can quickly restore a clean atmosphere in the home, effectively removing odors and airborne particles [7]. - The device is capable of eliminating common bacteria such as E. coli and H1N1, ensuring a healthy air environment [8]. - It features DeepSeek technology that automatically senses and adjusts air humidity, temperature, and airflow, enhancing user convenience and health [8]. Group 3: User Experience - The Midea Fresh Air Machine T6 is designed for ease of use, with strong voice interaction capabilities that allow users, including children, to operate it effortlessly [8]. - The product is perceived not just as a machine but as a comprehensive approach to air quality management, reflecting a growing awareness of the importance of air quality in daily life [10].
DeepSeek与ChatGPT:免费与付费背后的选择逻辑
Sou Hu Cai Jing· 2025-06-04 06:29
Core Insights - The emergence of DeepSeek, a domestic open-source AI model, has sparked discussions due to its free advantages, yet many still prefer to pay for ChatGPT, raising questions about user preferences and the quality of AI outputs [1][60]. - The output quality of AI tools is significantly influenced by user interaction, with 70% of the output quality depending on how users design their prompts [4][25]. Technical Differences - DeepSeek utilizes a mixed expert model with a training cost of $5.5 million, making it a cost-effective alternative compared to ChatGPT, which has training costs in the hundreds of millions [2]. - In the Chatbot Arena test, DeepSeek ranked third, demonstrating competitive performance, particularly excelling in mathematical reasoning with a 97.3% accuracy rate in the MATH-500 test [2]. Performance in Specific Scenarios - DeepSeek has shown superior performance in detailed analyses and creative writing tasks, providing comprehensive insights and deep thinking capabilities [3][17]. - The model's reasoning process is more transparent but requires structured prompts for optimal use, indicating that user guidance is crucial for maximizing its potential [7][12]. Cost and Efficiency - DeepSeek's pricing is 30% lower than ChatGPT, with a processing efficiency that is 20% higher and energy consumption reduced by 25% [8][9]. - For enterprises, private deployment of DeepSeek can be cost-effective in the long run, with a one-time server investment of around $200,000, avoiding ongoing API fees [9][10]. Deployment Flexibility - DeepSeek offers flexibility in deployment, allowing individual developers to run the 7B model on standard hardware, while enterprise setups can support high concurrency [11][10]. - The model's ability to run on lightweight devices significantly lowers the barrier for AI application [11]. Advanced Prompting Techniques - Mastery of advanced prompting techniques, such as "prompt chaining" and "reverse thinking," can significantly enhance the effectiveness of DeepSeek [13][14]. - The model's performance can be optimized by using multi-role prompts, allowing it to balance professionalism and readability [15][16]. Language Processing Capabilities - DeepSeek demonstrates a 92.7% accuracy rate in Chinese semantic understanding, surpassing ChatGPT's 89.3%, and supports classical literature analysis and dialect recognition [17]. Industry Applications - In finance, DeepSeek has improved investment decision efficiency by 40% for a securities company [18]. - In the medical field, it has achieved an 85% accuracy rate in disease diagnosis, nearing the level of professional doctors [19]. - For programming assistance, DeepSeek's error rate is 23% lower than GPT-4.5, with a 40% faster response time [20]. Complementary Nature of AI Tools - DeepSeek and ChatGPT are not mutually exclusive but serve as complementary tools, each suited for different tasks based on user needs [21][22]. - DeepSeek is preferable for deep reasoning, specialized knowledge, and data privacy, while ChatGPT excels in multi-modal interaction and creative content generation [24][22]. Importance of Prompting Skills - The ability to design effective prompts is becoming a core competency in the AI era, influencing the quality of AI outputs [54][55]. - The book "DeepSeek Application Advanced Tutorial" aims to enhance users' prompting skills and unlock the model's full potential [61].
DeepSeek-R1 再进化,这次的更新好强啊...
3 6 Ke· 2025-06-04 03:32
Core Viewpoint - DeepSeek has released an upgraded version of its R1 model, named DeepSeek-R1-0528, which shows significant improvements in reasoning, programming, and reducing hallucinations compared to its predecessor [1][3][22]. Model Improvements - The new version retains the base model from December 2024 but has enhanced computational power, allowing for deeper reasoning and more detailed problem-solving [4][6]. - The average token usage for the AIME 2025 test increased from 12K to 23K tokens, resulting in an accuracy improvement from 70% to 87.5% [4][5]. Benchmark Performance - In various benchmarks, DeepSeek-R1-0528 achieved notable scores, such as 87.5% in the AIME 2025 math competition, outperforming its predecessor and showing competitive results against models like OpenAI's and Gemini 2.5 [5][15]. - The model's performance in coding tasks has reached levels comparable to OpenAI's models, with successful outputs in complex coding challenges [10][14]. Reduction of Hallucinations - The hallucination rate in the new model has decreased by 45% to 50%, leading to more reliable outputs in tasks such as summarization and reading comprehension [18]. Creative Writing Capabilities - DeepSeek-R1-0528 has shown improvements in creative writing, producing coherent and logical narratives without the previous issues of "getting stuck" [19][21]. User Reception - While some users express skepticism about the update's impact, many remain optimistic about DeepSeek's potential as a representative of domestic AI technology [22][23].
中国创新药的“DeepSeek时刻”!可T+0交易的港股创新药ETF(159567)现涨3.7%,实时换手率突破32%排名同指数第一
Mei Ri Jing Ji Xin Wen· 2025-06-04 02:30
Group 1 - The Hong Kong stock market showed positive trends on June 4, with the innovative drug sector experiencing significant gains, including over 15% increase for Innovent Biologics and over 7% for Zai Lab and Tigermed [1] - The Hong Kong innovative drug ETF (159567) recorded a trading volume exceeding 1.1 billion yuan for three consecutive trading days, indicating high market enthusiasm [1] - The cost and speed advantages of Chinese teams in drug development are highlighted, with a 2-3 times cost advantage and approximately 2 times speed advantage compared to traditional methods, achieving comparable performance to U.S. teams using AI [1] Group 2 - The innovative drug ETF (159567) tracks the National Index of Hong Kong Innovative Drugs, with 90% of its weight in innovative drug companies, positioning it to benefit from trends such as AI-enabled drug development and the expansion of domestic innovative drugs [2] - The innovative drug ETF (159992) encompasses leading companies in the innovative drug industry chain, benefiting from both AI advancements and the introduction of new healthcare policies [2] - Head innovative drug companies have entered a profitability cycle, with multiple products commercializing and driving rapid revenue growth, supported by the upcoming introduction of a new healthcare payment policy in 2025 [2]
中国创新药,正让美国担心会是下一个DeepSeek、无人机、电动车
Hu Xiu· 2025-06-04 01:57
Core Insights - The article highlights the increasing recognition among U.S. pharmaceutical insiders of China's innovative drug sector reaching a "DeepSeek moment," where U.S. companies are investing heavily in Chinese new drugs as a strategic bet on their potential [1][5] - The competitive advantage of Chinese innovation still hinges on breakthroughs in core innovation, despite the current advantages in cost and efficiency [1][4] Group 1: Investment Trends - Pfizer acquired overseas rights for a PD-1/VEGF dual antibody from 3SBio for over $6 billion, including an upfront payment of $1.25 billion [1] - Bristol-Myers Squibb secured co-development rights for a similar drug from BioNTech for $11.1 billion, with an upfront payment of $1.5 billion [1] - Chinese companies have sold overseas rights for innovative drugs to U.S. firms, with total commitments nearing $30 billion [3][12] Group 2: Competitive Landscape - There are currently 35 PD-1/VEGF dual antibodies in development globally, with 20 originating from China, indicating a strong clinical advancement from Chinese firms [2] - Chinese companies have evolved from being five years behind in PD-1 monoclonal antibodies to being approximately three years ahead in the dual antibody space [3] Group 3: Research and Development Efficiency - Chinese firms are leveraging engineering optimization as a competitive advantage, combining different monoclonal antibodies and small molecules to create innovative therapies [4] - The cost and speed advantages of Chinese teams in drug development are significant, with estimates suggesting 2-3 times lower costs and about double the speed compared to U.S. teams [6] Group 4: Market Dynamics - In Q1 of this year, 37% of innovative drug transactions with upfront payments over $50 million originated from Chinese companies, nearly doubling from two years ago [7] - Chinese companies now account for 67% of global transactions in oncology, 60% in cardiovascular diseases, and 50% in endocrine and autoimmune diseases [7] Group 5: Future Outlook - The trend of Chinese innovation in pharmaceuticals is expected to continue expanding into other therapeutic areas, with increasing competition anticipated between Chinese and U.S. firms [10] - The "DeepSeek moment" in Chinese biopharmaceuticals is raising awareness among U.S. markets, prompting significant acquisitions and strategic partnerships [10][12]
为什么DeepSeek还未能撼动OpenAI
Hu Xiu· 2025-06-04 00:27
Core Insights - Mary Meeker's AI trends report spans 339 pages, covering various aspects of AI technology, applications, and innovations [1] - The report identifies the launch of DeepSeek's reasoning model R1 in January 2025 as a significant event marking the global competition in AI [2] Company Developments - DeepSeek's R1 model achieved performance comparable to OpenAI's models at a lower cost, disrupting the traditional paradigm of expensive closed-source models [3] - Despite initial competition from DeepSeek, OpenAI has seen significant growth, with its valuation reaching $300 billion and active users increasing from 400 million to 800 million [10] - OpenAI's annual revenue surged from $3.7 billion to $12.7 billion, indicating strong market demand for its offerings [10] Market Dynamics - DeepSeek's R1 initially outperformed OpenAI in website traffic and app downloads, but its metrics have since declined [11] - The AI market is experiencing a phase of homogenization and commoditization, with performance differences among leading models becoming minimal [12] - Users prioritize performance and differentiation, with OpenAI maintaining a strong brand presence and Anthropic emerging as a preferred choice for programming models [12] Future Trends - The next phase of AI commercialization may not be a "winner-takes-all" scenario but rather a fusion and reconstruction of platforms and applications [14] - DeepSeek aims to achieve AGI by integrating feedback from its infrastructure, products, and user interactions, positioning itself as a potential horizontal platform for developers [15]
BIM应用领军者分享:前瞻探索基于 DeepSeek 的BIM与人工智能融合新机遇
Cai Fu Zai Xian· 2025-06-03 03:40
Core Insights - The event focused on the integration of digital technologies such as BIM, big data, IoT, and AI within the construction industry to foster innovation and high-quality development [1][2] - The conference gathered over 160 representatives from various sectors, emphasizing the importance of intelligent construction and the transformation towards industrialization, digitalization, and sustainability [1] Group 1: BIM and AI Integration - Expert Xue Xiang shared insights on the deep integration of BIM and AI, highlighting the role of DeepSeek in enhancing project management efficiency and addressing industry challenges [2] - The combination of BIM and AI is seen as a significant advantage throughout the entire lifecycle of construction projects, providing innovative solutions and insights for future development [2][7] Group 2: Case Study - Huainan Financial Plaza - The Huainan Financial Plaza project, with a total construction area of approximately 280,000 m² and a total cost of about 713 million yuan, faced significant challenges due to its complexity and tight schedule of 1200 days [3] - The project utilized BIM technology to enhance project management efficiency, optimize design solutions, and ensure timely completion despite the challenging conditions [3][4] Group 3: Implementation and Standards - A specialized team was formed to implement BIM technology, establishing modeling standards and guidelines to ensure effective collaboration among various stakeholders [4] - The project adopted a layered modeling approach and conducted simulations to improve efficiency and address potential issues before construction began [5] Group 4: Achievements and Recognition - The project received multiple prestigious awards, including the "Special Award" at the 14th BIM Alliance "Svir Cup" National Excellent Engineering Application Competition, showcasing its exemplary application of BIM technology [6] - A talent cultivation system was established to train professionals in BIM technology, contributing to the sustainable development of the industry [7] Group 5: Future Directions - The integration of BIM with AI is expected to enhance work efficiency, improve project quality, and reduce human errors, paving the way for more diverse applications of BIM technology [7] - The industry anticipates further exploration of the deep integration of BIM and AI to drive the construction sector towards greater digitalization, intelligence, efficiency, and sustainability [7]
百度AI搜索全面接入DeepSeek R1-0528,推理能力升级
Sou Hu Cai Jing· 2025-06-01 18:15
Core Viewpoint - Baidu AI Search has fully integrated the DeepSeekR1-0528 model, enhancing its search service's intelligence and user experience [1][2][3] Group 1: Model Integration and Features - The DeepSeekR1-0528 model is now available for free on both PC and App platforms, following its launch on Baidu Smart Cloud on May 30 [1] - The model has shown significant improvements in reasoning capabilities, allowing for a more precise understanding of user intent, leading to personalized and accurate search results [1][2] - The integration of DeepSeekR1-0528 is expected to set a new benchmark in the intelligent search field, promoting further industry development [3] Group 2: User Experience Enhancements - The model generates content in a more humanized style, providing rich information and clearer formatting for better readability [2] - DeepSeekR1-0528 demonstrates strong logical reasoning abilities, efficiently completing complex tasks with clear logical steps [2] - The model can quickly outline research trends and pinpoint key literature in academic searches, as well as generate personalized plans for lifestyle inquiries like travel and food recommendations [2]