Battle Report: Musk's Grok 4 Dominates the AI Chess Tournament, DeepSeek Loses to o4-mini, and Fans Cry Foul for Kimi K2
36Kr· 2025-08-06 08:41
Core Insights
- The first Kaggle AI Chess Competition, initiated by Google, showcased various AI models, with Grok 4 emerging as the top performer after the first round of matches [6][13][30]
- The competition aims to test the "emergent" capabilities of AI, rather than just focusing on winning or losing [5][21]
- The event features live commentary from chess grandmaster Hikaru Nakamura, enhancing the viewing experience [7]

Group 1: Competition Overview
- The competition runs from August 5 to August 7, with daily live broadcasts at 10:30 AM Pacific Time [6]
- Participants include OpenAI's o3 and o4-mini, DeepSeek R1, Kimi K2 Instruct, Gemini 2.5 Pro and 2.5 Flash, Claude Opus 4, and Grok 4 [6][10]
- After the first day, the models advancing to the semifinals are Gemini 2.5 Pro, Grok 4, o4-mini, and o3 [9][10]

Group 2: Performance Analysis
- Grok 4 demonstrated superior tactical strategy and speed, leading to high praise from observers [13][30]
- In the match between Grok 4 and Gemini 2.5 Flash, Grok 4's performance was likened to that of a true grandmaster [14]
- The match between OpenAI's o4-mini and DeepSeek R1 highlighted o4-mini's ability to capitalize on R1's mistakes, showcasing its strategic insight [16]

Group 3: AI Capabilities and Chess
- Chess was chosen for this competition due to its clear rules and high complexity, making it an ideal scenario for testing AI decision-making abilities [21][24]
- The competition serves as a reliable method for evaluating AI capabilities, with Grok 4's performance indicating a significant advancement in AI's emergent abilities [23][24]
- Observers noted that traditional AI relies on domain-specific training, while cutting-edge AI models like Grok 4 exhibit consistent generalization across various tasks [24]
DeepSeek Has Finally Pushed OpenAI into a Corner
Feng Huang Wang· 2025-08-06 08:21
Core Insights
- OpenAI has launched gpt-oss, its first open-source language models since GPT-2, a release expected to set off a new wave of open-source development in the tech industry [1][6]
- The release marks a significant shift from OpenAI's previous closed-source, paid-model strategy toward an open and collaborative ecosystem [6][9]

Model Specifications
- gpt-oss-120b uses a mixture-of-experts (MoE) architecture with 117 billion parameters, runs on a single 80GB GPU, and performs comparably to the closed-source o4-mini [4]
- gpt-oss-20b also uses an MoE architecture, has 21 billion parameters, runs smoothly on devices with 16GB of memory, and performs similarly to o3-mini [4]

Market Impact
- OpenAI's decision to make gpt-oss available for free commercial use is seen as a significant advantage for AI startups in China and globally [5]
- The rapid development of Chinese open-source models, such as DeepSeek and Tongyi Qwen, has prompted OpenAI to reconsider its strategy, as these models have gained significant traction and market presence [7][8]

Competitive Landscape
- The emergence of competitive Chinese models has raised concerns within Silicon Valley and may lead to strategic shifts among companies like Meta, which may abandon its open-source approach in favor of closed-source models [9]
- OpenAI's recent actions indicate a heightened focus on protecting its intellectual property and maintaining a competitive edge in the evolving AI landscape [9]
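The hardware figures in the model specifications can be sanity-checked with a back-of-the-envelope weight-memory estimate. The sketch below is only an illustration: the summary does not state the numeric precision of the released weights, so the bit widths tried here (16, 8, and 4 bits per parameter) are assumptions, and the estimate ignores activations, the KV cache, and runtime overhead.

```python
def weight_memory_gb(num_params: float, bits_per_param: float) -> float:
    """Approximate memory needed for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


def fits(num_params: float, bits_per_param: float, budget_gb: float) -> bool:
    """True if the weights alone fit inside the given memory budget."""
    return weight_memory_gb(num_params, bits_per_param) <= budget_gb


# Parameter counts and memory budgets as quoted in the summary above.
MODELS = {
    "gpt-oss-120b": (117e9, 80.0),  # 117B parameters, single 80GB GPU
    "gpt-oss-20b": (21e9, 16.0),    # 21B parameters, 16GB device
}

if __name__ == "__main__":
    for name, (params, budget) in MODELS.items():
        for bits in (16, 8, 4):  # assumed precisions, not taken from the source
            size = weight_memory_gb(params, bits)
            verdict = "fits" if fits(params, bits, budget) else "does not fit"
            print(f"{name} @ {bits}-bit: {size:.1f} GB, {verdict} in {budget:.0f} GB")
```

Under these assumptions, only a weight format of roughly 4 bits per parameter puts both models inside their quoted memory budgets, which suggests the reported figures rely on aggressive quantization.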
Are You Kidding? DeepSeek and Kimi Knocked Out in the First Round of the Inaugural Large-Model Tournament
36Kr· 2025-08-06 08:01
Group 1
- The article covers the first international chess competition for large models, in which Grok 4 is highlighted as a leading contender for the championship [1][24].
- Gemini 2.5 Pro, o4-mini, Grok 4, and o3 each advanced to the semifinals with a 4-0 victory in their opening matches [1][9].
- The event is hosted on the Kaggle Game Arena platform, which aims to evaluate the performance of large language models (LLMs) in dynamic, competitive environments [1].

Group 2
- Kimi K2 lost 0-4 to o3, struggling to find legal moves after the opening phase, which points to potential technical issues [3][6].
- DeepSeek R1 lost 0-4 to o4-mini, showing a pattern of strong initial moves followed by significant errors [10][13].
- Gemini 2.5 Pro beat Claude Opus 4 by 4-0, but its true strength remains uncertain because of the opponent's mistakes [14][18].
- Grok 4 was particularly impressive, winning 4-0 against Gemini 2.5 Flash and demonstrating a strong ability to capture unprotected pieces [21][27].

Group 3
- The article notes three main weaknesses of current AI models at chess: insufficient visualization of the whole board, limited understanding of piece interactions, and trouble executing legal moves [27].
- Grok 4's success suggests it may have overcome these limitations, raising the question of whether the models' strengths and shortcomings will hold up in future matches [27].
- The article also mentions a poll in which 37% of participants picked Gemini 2.5 Pro as the likely winner before the competition began [27].
OpenAI Releases Open-Source Models in a "Return of the King": Will the DeepSeek Storyline Be Reversed?
Hu Xiu· 2025-08-06 03:47
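Clement Delangue, founder and CEO of Hugging Face, the largest open-source community, called it a "return of the king": "It's like a plot twist, like a return of the king, like the beginning of something big. Let's push open-source AI forward together."

OpenAI has finally released open-source models again: gpt-oss-120b and gpt-oss-20b. They are its first open-source language models since GPT-2.

This is also the first time a US AI lab has put out a top open-source model since the release of DeepSeek-R1 in the first half of the year set off an open-source frenzy in China, followed in July by a dense run of Chinese releases including K2, GLM-4.5, Step-3, and updated versions of Qwen3.

With the Llama 4 launch in the first half of the year having flopped, and with US government and industry alike anxious that American open-source AI is falling behind China's, OpenAI looks set to win one back.

gpt-oss vs. DeepSeek

Stability AI founder Emad Mostaque and others compared gpt-oss with DeepSeek:

Training efficiency: gpt-oss-120b activates about 5.1B parameters per token versus 37B for DeepSeek, more than 7 times fewer, so it can process more than 5 times as many tokens, roughly 80 trillion tokens (for reference, Qwen3 used 30 trillion).

Compute consumption: gpt-oss, compared with DeepSeek ...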
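The training-efficiency comparison above can be checked with simple arithmetic. The sketch below uses the common rule of thumb that training compute is roughly 6 × active parameters × tokens. That approximation is an assumption introduced here for illustration and is not taken from the article. Only the 5.1B, 37B, and 80-trillion-token figures come from the text.

```python
GPT_OSS_ACTIVE = 5.1e9   # active parameters per token, as quoted above
DEEPSEEK_ACTIVE = 37e9   # active parameters per token, as quoted above


def training_flops(active_params: float, tokens: float) -> float:
    """Rule-of-thumb training compute (assumption): 6 * active params * tokens."""
    return 6.0 * active_params * tokens


def tokens_for_budget(flops: float, active_params: float) -> float:
    """Tokens that a fixed compute budget covers, under the same rule of thumb."""
    return flops / (6.0 * active_params)


# Ratio of active parameters: about 7.25, matching "more than 7 times fewer".
active_ratio = DEEPSEEK_ACTIVE / GPT_OSS_ACTIVE

# At equal compute, the token count scales by the same ratio.
budget = training_flops(DEEPSEEK_ACTIVE, 1e12)  # compute for 1T DeepSeek tokens
equal_budget_tokens = tokens_for_budget(budget, GPT_OSS_ACTIVE)

if __name__ == "__main__":
    print(f"active-parameter ratio: {active_ratio:.2f}x")
    print(f"tokens at equal compute: {equal_budget_tokens / 1e12:.2f}T per 1T")
    print(f"compute for 80T tokens: {training_flops(GPT_OSS_ACTIVE, 80e12):.3e} FLOPs")
```

Under this approximation the 7.25x gap in active parameters is an upper bound on the extra tokens available at equal compute, which is consistent with the "more than 5 times" figure quoted above.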
OpenAI Releases Low-Cost Models, Competing Head-On with Meta (META.US) and DeepSeek
Zhitong Finance· 2025-08-06 01:53
Core Insights
- OpenAI has released gpt-oss-120b and gpt-oss-20b, its first open-weight language models since the launch of GPT-2 in 2019, aimed at providing low-cost options for developers, researchers, and businesses [1][2]
- The release follows multiple delays due to the need for additional safety testing and review of high-risk areas, with comprehensive safety training and testing implemented for the models [2]
- The models are designed to run on various hardware environments, from consumer-grade devices to cloud services, showcasing advanced reasoning, tool invocation, and chain-of-thought processing capabilities [2]

Company and Industry Developments
- OpenAI collaborates with companies like NVIDIA, AMD, Cerebras, and Groq to ensure stable operation of the models across different chips [1]
- The models can be downloaded via platforms like Hugging Face and GitHub under the Apache 2.0 license, with cloud services provided by Amazon, Baseten, and Microsoft [2]
- OpenAI's president expressed excitement about the growth of the AI ecosystem and the company's role in pushing technological boundaries [1]
Who Is Stuffing Ads into the Answers of DeepSeek and Its Peers?
36Kr· 2025-08-04 09:37
Core Viewpoint
- AI is transforming modern workplaces and daily life, shifting user behavior from "searching" to "asking AI" for solutions, leading to a significant increase in AI search users from 310 million in January 2024 to 1.98 billion by February 2025, a growth rate of 538.7% [1]

Group 1: User Experience and Concerns
- Users are increasingly questioning whether AI-generated answers contain advertisements, as seen in the experiences of individuals like Zhao Xinting, who noticed brand mentions in AI responses and expressed skepticism about their authenticity [1][4]
- Social media platforms are filled with users voicing concerns that AI responses are becoming "advertising spaces," with examples of AI tools like DeepSeek and Doubao incorporating promotional content in their answers [5][9]

Group 2: Marketing Opportunities
- The rise of AI has created new marketing opportunities, particularly through Generative Engine Optimization (GEO), which aims to influence AI outputs by producing content that aligns with AI preferences, similar to traditional Search Engine Optimization (SEO) [10]
- The GEO market is projected to grow significantly, with estimates suggesting a market size of approximately 2.1 billion yuan in 2023, expected to reach 24.2 billion yuan by 2027, indicating a potential market value transformation exceeding 300 billion yuan in the next five years [14]

Group 3: Service Providers and Pricing
- GEO service companies are emerging, offering services that optimize brand visibility in AI responses, with pricing models based on the number of keywords and entries, ranging from 6,000 yuan for 50 entries to 20,000 yuan for 500 entries per month [12][13]
- The effectiveness of GEO services is measured by the frequency of brand mentions in AI responses, with some companies offering guarantees of performance or refunds if results are unsatisfactory [14]
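The figures in this summary can be cross-checked with a few lines of arithmetic. The sketch below recomputes the user growth rate, the per-entry price of the two quoted GEO packages, and the compound annual growth rate implied by the market projection. The CAGR treats 2023 to 2027 as four annual periods, which is an assumption made here. The summary itself does not quote a CAGR.

```python
def growth_pct(start: float, end: float) -> float:
    """Percentage growth from start to end."""
    return (end - start) / start * 100


def price_per_entry(total_yuan: float, entries: int) -> float:
    """Monthly price per entry for a GEO package."""
    return total_yuan / entries


def cagr(start: float, end: float, periods: int) -> float:
    """Compound annual growth rate as a fraction (0.5 means 50% per year)."""
    return (end / start) ** (1 / periods) - 1


# AI search users: 310 million (Jan 2024) to 1.98 billion (Feb 2025).
user_growth = growth_pct(310e6, 1.98e9)

# Quoted GEO packages: 6,000 yuan for 50 entries, 20,000 yuan for 500 entries.
small_package = price_per_entry(6_000, 50)
large_package = price_per_entry(20_000, 500)

# GEO market: 2.1 billion yuan (2023) to 24.2 billion yuan (2027), 4 periods assumed.
geo_cagr = cagr(2.1e9, 24.2e9, 4)

if __name__ == "__main__":
    print(f"user growth: {user_growth:.1f}%")
    print(f"price per entry: {small_package:.0f} vs {large_package:.0f} yuan")
    print(f"implied GEO market CAGR: {geo_cagr:.1%}")
```

The recomputed growth rate agrees with the 538.7% quoted above, and the package prices work out to 120 yuan versus 40 yuan per entry, so the larger package costs a third as much per entry.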
Just Six Months After Its Boom, Has DeepSeek Become Just Another Model in Banking? Three Major Obstacles Stand in the Way
Feng Huang Wang· 2025-08-04 03:42
Core Insights
- The banking industry's initial enthusiasm for DeepSeek has diminished over the past six months, with many professionals indicating that the model's impact has not met expectations [1][4][5]
- DeepSeek faces significant challenges in the banking sector, primarily due to the complexity of financial data, which it struggles to process effectively [7][8][9]
- Despite the setbacks, the trend of increasing investment in financial technology within the banking sector is expected to continue [2][4]

Application Status
- DeepSeek has not produced any "killer applications" in the banking sector, as initially anticipated, with many banks reporting underwhelming results from its implementation [1][7]
- The model's general-purpose nature limits its compatibility with existing banking technologies, leading to difficulties in integration [8][9]
- Smaller banks have been more proactive in adopting DeepSeek, often for marketing purposes, while larger banks have shown reduced enthusiasm [3][4][5]

Industry Response
- The regulatory environment has shifted, with authorities advising large banks against extensive promotion of DeepSeek, emphasizing the importance of self-developed financial models [4][5]
- The emergence of new financial models from domestic tech giants has further diluted DeepSeek's uniqueness in the market [6][5]
- The banking sector's low tolerance for errors in financial applications has led to cautious approaches in deploying DeepSeek for critical functions like AI advisory and risk management [9]
AI Weekly | DeepSeek Wins ACL 2025 Best Paper; Cook Says Apple Plans to "Significantly" Increase AI Investment
Di Yi Cai Jing· 2025-08-03 01:16
Group 1: DeepSeek and ACL Conference
- DeepSeek, in collaboration with Peking University, won a Best Paper Award at the 63rd ACL conference for a paper introducing the Native Sparse Attention (NSA) mechanism, a significant achievement in natural language processing [1][2]
- The ACL conference received a record of more than 8,000 submissions, with a main conference acceptance rate of 20.3% and a Findings acceptance rate of 16.7% [1]

Group 2: Anthropic's Market Position
- Anthropic has surpassed OpenAI in popularity among enterprises, capturing 32% of the large language model market, while OpenAI's share has decreased to 25% [3][4]
- Two years ago, OpenAI held a dominant 50% market share, with Anthropic at only 12%, indicating a significant shift in the competitive landscape [3]

Group 3: AI Model Developments
- The AI startup StepFun has released Step 3, an open-source foundation model with 321 billion parameters, showcasing advanced capabilities in visual perception and complex reasoning [5]
- Multiple companies, including Tencent and Moonshot AI, have also released open-source models, indicating a trend toward open-source solutions in the AI industry [5]

Group 4: Baidu's AI Integration
- Baidu is testing an AI application entry point on its search homepage, allowing users to access various AI applications directly [6][7]
- This move follows a major redesign of Baidu's search platform and reflects the company's commitment to integrating AI into its services [6]

Group 5: Robotics Industry Insights
- Tencent's chief scientist, Zhang Zhengyou, stated that the embodied intelligence industry is still in its early stages, comparing it to the mobile phone industry's evolution [8]
- He emphasized that current humanoid robots are primarily used for data collection and research, and a significant breakthrough is needed for widespread adoption [8]

Group 6: Supernode Solutions
- Several companies showcased supernode solutions at WAIC, addressing the challenges of large-scale computing clusters [9]
- Supernodes aim to enhance performance by integrating computing chip resources, which is increasingly necessary as model parameter counts grow [9]

Group 7: Financial Performance of Major Tech Companies
- Meta reported a 22% year-over-year revenue increase in Q2, reaching $47.5 billion, with a net profit of $18.3 billion, up 36% [10]
- Microsoft reported revenue of $76.4 billion in its fiscal Q4, an 18% increase, with its market capitalization reaching $4 trillion, driven by demand for AI services [11]
Is DeepSeek Going Public? A Person Familiar with the Matter Responds
news flash· 2025-08-01 11:15
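The rumor-debunking column "辟谣财知道" has noted that a report claiming DeepSeek (深度求索) is going public has recently appeared on many authoritative news websites. According to Nanfang Daily, a person familiar with the matter said the report is untrue. ...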
Fake News of a DeepSeek IPO Is Being Published in Bulk by Authoritative Websites
Nan Fang Du Shi Bao· 2025-08-01 09:47
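A report claiming that DeepSeek (深度求索) is going public has recently appeared on many authoritative news websites. A person familiar with the matter told a reporter from Nandu N Video that the report is untrue. The false sourcing has also made DeepSeek's own AI application a "victim."

The fake DeepSeek IPO story comes in two versions. Version one, published on July 18, says DeepSeek is preparing to list on the STAR Market. It reads: "DeepSeek officially announced today (July 15) that the company has submitted an application to list on the STAR Market and plans to begin trading in November 2025. The IPO is intended to further expand its computing power leasing business."

The reporter's checks found, however, that the Shanghai Stock Exchange has no record of a listing application from DeepSeek, and that DeepSeek has not recently announced any listing plan through any official channel. More importantly, the company behind DeepSeek has not yet undergone a shareholding reform, which is a prerequisite for a company to go public. In addition, the services shown on DeepSeek's official website do not include any computing power leasing business.

Version two, published around July 30, instead claims that DeepSeek has submitted listing application materials to the Beijing Stock Exchange and plans to list in November 2025. The Beijing Stock Exchange's official website likewise shows no record of a listing application from DeepSeek.

What the DeepSeek IPO reports published by these news websites have in common is that they carry no clear byline and cite vague sources.

The false sources have also contaminated ...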