DeepSeek R1 0528

Search documents
现在“最强”的AI模型,能不能替代医生门诊?一个AI产品经理的实际测试
3 6 Ke· 2025-07-27 00:46
2025年,我带着我的团队在做AI与空间计算产品研发,同时自己也是AI模型的重度使用者。因为博士研究的科研需求,我付费了Gemini、X、GPT这类模 型主流国际模型,将日常的博士研究工作、个人的产品研发工作,以及生活健康诊断都放在AI大模型上。 如下是7月份的模型排行分数,可以看到Grok4领先,随后就是国际模型,国内模型中,开源模型DEEPSEEK得到最高分。 | MODEL TJ | CREATOR 14 | CONTEXT | ARTIFICIAL ANALYSIS | BLENDED | MEDIAN | MEDIAN 11 | | --- | --- | --- | --- | --- | --- | --- | | | | WINDOW | INTELLIGENCE INDEX | USD/1M Tokens | Tokens/s | First Chunk (s) | | Grok 4 | ×1 | 256k | 73 | $6.00 | 74.5 | 12.12 | | o3-pro | OpenAl | 200k | 71 | $35.00 | | | | Gemini 2.5 Pro | ...
马斯克新发布的“全球最强模型”含金量如何?
第一财经· 2025-07-10 15:07
Core Viewpoint - The article discusses the launch of Grok 4, an AI model developed by xAI, which is claimed to be the most powerful AI model globally, surpassing existing top models in various benchmarks [1][2]. Group 1: Grok 4 Performance - Grok 4 achieved a perfect score in the AIME25 mathematics competition and scored 26.9% in the "Human Last Exam" (HLE), which consists of 2,500 expert-level questions across multiple disciplines [1]. - The AI analysis index for Grok 4 reached 73, making it the top-ranked model, ahead of OpenAI's o3 and Google's Gemini 2.5 Pro, both at 70 [2]. - Grok 4 set a historical high score of 24% in the HLE, surpassing the previous record of 21% held by Google's Gemini 2.5 Pro [5]. Group 2: Development and Training - Grok 4's training volume is 100 times that of Grok 2, with over 10 times the computational power invested in the reinforcement learning phase compared to other models [5]. - The subscription fee for Grok 4 is set at $30 per month, while a more advanced version, Grok 4 Heavy, costs $300 per month [5]. Group 3: Financial Aspects and Funding - xAI has raised a total of $10 billion in its latest funding round, which includes $5 billion in debt and $5 billion in equity, bringing its total funding since 2024 to $22 billion [10]. - Despite the substantial funding, xAI faces high operational costs, reportedly spending $1 billion per month, with only $4 billion in cash remaining as of March 2025 [11]. - xAI's projected revenue for 2025 is $5 billion, significantly lower than OpenAI's expected $12.7 billion, indicating a lag in commercial progress [11]. Group 4: Future Outlook - xAI aims to leverage the vast data from X to train its models, potentially avoiding high data costs, with a goal to achieve profitability by 2027 [12]. - Upcoming releases include a programming model in August, a multi-agent model in September, and a video generation model in October, although previous delays raise questions about these timelines [12].
DeepSeek开源新版R1,媲美OpenAI最高o3模型
news flash· 2025-05-28 21:41
金十数据5月29日讯,今天凌晨,全球著名开源大模型平台DeepSeek开源了R1最新0528版本。DeepSeek 目前没有对该版本进行任何说明,又只是"悄悄"地开放了模型。估计很快会放出模型卡介绍更多功能。 但已经有网友迫不及待的对新版R1进行测试,在著名代码测试平台Live CodeBench中显示,其性能可以 媲美OpenAI最新的o3模型高版本。也有网友对新版R1的风格进行了测试,几乎和OpenAI的o3差不多。 (AIGC开放社区) DeepSeek开源新版R1,媲美OpenAI最高o3模型 | Rank | Model | Pass@1 ↓ | Easy-Pass@1 | Medium-P | | --- | --- | --- | --- | --- | | 1 | 04-Mini (High) | 80.2 | 99.1 | 80 | | 2 | 03 (High) | 75.8 | 99.1 | | | 3 | 04-Mini (Medium) | 74.2 | 98.2 | 8 | | 4 | DeepSeek-R1-0528 | 73.1 | 98.7 | 8 | | 5 | 03-Mi ...