Seek .(SKLTY)
Search documents
闹玩呢,首届大模型对抗赛,DeepSeek、Kimi第一轮被淘汰了
3 6 Ke· 2025-08-06 08:01
Group 1 - The core focus of the article is the first international chess competition for large models, where Grok 4 is highlighted as a leading contender for the championship [1][24]. - The competition features various AI models, including Gemini 2.5 Pro, o4-mini, Grok 4, and others, all of which advanced to the semifinals with a 4-0 victory in their initial matches [1][9]. - The event is hosted on the Kaggle Game Arena platform, aiming to evaluate the performance of large language models (LLMs) in dynamic and competitive environments [1]. Group 2 - Kimi k2 faced o3 and lost 0-4, with Kimi k2 struggling to find legal moves after the opening phase, indicating potential technical issues [3][6]. - DeepSeek R1 lost to o4-mini with a score of 0-4, showcasing a pattern of initial strong moves followed by significant errors [10][13]. - Gemini 2.5 Pro achieved a 4-0 victory over Claude 4 Opus, but its true strength remains uncertain due to the opponent's mistakes [14][18]. - Grok 4's performance was particularly impressive, winning 4-0 against Gemini 2.5 Flash, demonstrating a strong ability to capture unprotected pieces [21][27]. Group 3 - The article notes that current AI models in chess exhibit three main weaknesses: insufficient global board visualization, limited understanding of piece interactions, and issues with executing legal moves [27]. - Grok 4's success suggests it may have overcome these limitations, raising questions about the consistency of these models' advantages and shortcomings in future matches [27]. - The article also mentions a poll where 37% of participants favored Gemini 2.5 Pro as the likely winner before the competition began [27].
OpenAI发布开源模型“王者归来”,DeepSeek剧情会反转吗
Hu Xiu· 2025-08-06 03:47
最大的开源社区Hugging Face创始人兼CEO Clement Delangue称之为"王者归来"。 "这就像剧情反转, 像是一场王者归来, OpenAI终于重新发布开源模型gpt-oss-120b和gpt-oss-20b。这是其自从GPT-2之后,首次发布开源语言模型。 这也是上半年DeepSeek-R1发布,引发中国掀起一股开源狂潮,7月份中国K2、GLM-4.5、Step-3及Qwen3更新版本等密集发布之后,美国AI实验室首次发 出最强开源模型。 Llama4上半年发布失败,美国朝野一致对开源AI落后于中国感到焦虑之际,OpenAI看起来要扳回一局。 像是某件大事的开端。 让我们一起推进开源AI吧" gpt-oss vs. DeepSeek StabilityAI创始人Emad Mostaque等人,对比了gpt-oss与DeepSeek: 训练效率:gpt-oss-120b每个token激活约5.1B参数,而DeepSeek是37B,少了7倍以上,因此可以处理超过5倍的tokens,即大约80万亿tokens(作为参考, Qwen3使用了30万亿)。 计算消耗:gpt-oss比DeepSeek ...
OpenAI发布低成本模型 与Meta(META.US)和DeepSeek正面竞争
智通财经网· 2025-08-06 01:53
Core Insights - OpenAI has released its first open-weight language models, gpt-oss-120b and gpt-oss-20b, since the launch of GPT-2 in 2019, aimed at providing low-cost options for developers, researchers, and businesses [1][2] - The release follows multiple delays due to the need for additional safety testing and review of high-risk areas, with comprehensive safety training and testing implemented for the models [2] - The models are designed to run on various hardware environments, from consumer-grade devices to cloud services, showcasing advanced reasoning, tool invocation, and chain-of-thought processing capabilities [2] Company and Industry Developments - OpenAI collaborates with companies like NVIDIA, AMD, Cerebras, and Groq to ensure stable operation of the models across different chips [1] - The models can be downloaded via platforms like Hugging Face and GitHub under the Apache 2.0 license, with cloud services provided by Amazon, Baseten, and Microsoft [2] - OpenAI's president expressed excitement about the growth of the AI ecosystem and the company's role in pushing technological boundaries [1]
谁在往“DeepSeek们”的回答里塞广告?
3 6 Ke· 2025-08-04 09:37
Core Viewpoint - AI is transforming modern workplaces and daily life, shifting user behavior from "searching" to "asking AI" for solutions, leading to a significant increase in AI search users from 310 million in January 2024 to 1.98 billion by February 2025, a growth rate of 538.7% [1] Group 1: User Experience and Concerns - Users are increasingly questioning whether AI-generated answers contain advertisements, as seen in the experiences of individuals like Zhao Xinting, who noticed brand mentions in AI responses and expressed skepticism about their authenticity [1][4] - Social media platforms are filled with users voicing concerns that AI responses are becoming "advertising spaces," with examples of AI tools like DeepSeek and Doubao incorporating promotional content in their answers [5][9] Group 2: Marketing Opportunities - The rise of AI has created new marketing opportunities, particularly through Generative Engine Optimization (GEO), which aims to influence AI outputs by producing content that aligns with AI preferences, similar to traditional Search Engine Optimization (SEO) [10] - The GEO market is projected to grow significantly, with estimates suggesting a market size of approximately 2.1 billion yuan in 2023, expected to reach 24.2 billion yuan by 2027, indicating a potential market value transformation exceeding 300 billion yuan in the next five years [14] Group 3: Service Providers and Pricing - GEO service companies are emerging, offering services that optimize brand visibility in AI responses, with pricing models based on the number of keywords and entries, ranging from 6,000 yuan for 50 entries to 20,000 yuan for 500 entries per month [12][13] - The effectiveness of GEO services is measured by the frequency of brand mentions in AI responses, with some companies offering guarantees of performance or refunds if results are unsatisfactory [14]
爆火仅半年,DeepSeek在银行业已泯然众模型?三大障碍成拦路虎
Feng Huang Wang· 2025-08-04 03:42
Core Insights - The banking industry's initial enthusiasm for DeepSeek has diminished over the past six months, with many professionals indicating that the model's impact has not met expectations [1][4][5] - DeepSeek faces significant challenges in the banking sector, primarily due to the complexity of financial data, which it struggles to process effectively [7][8][9] - Despite the setbacks, the trend of increasing investment in financial technology within the banking sector is expected to continue [2][4] Application Status - DeepSeek has not produced any "killer applications" in the banking sector, as initially anticipated, with many banks reporting underwhelming results from its implementation [1][7] - The model's general-purpose nature limits its compatibility with existing banking technologies, leading to difficulties in integration [8][9] - Smaller banks have been more proactive in adopting DeepSeek, often for marketing purposes, while larger banks have shown reduced enthusiasm [3][4][5] Industry Response - The regulatory environment has shifted, with authorities advising large banks against extensive promotion of DeepSeek, emphasizing the importance of self-developed financial models [4][5] - The emergence of new financial models from domestic tech giants has further diluted DeepSeek's uniqueness in the market [6][5] - The banking sector's low tolerance for errors in financial applications has led to cautious approaches in deploying DeepSeek for critical functions like AI advisory and risk management [9]
AI周报 | DeepSeek斩获ACL 2025最佳论文;库克称苹果计划“大幅”增加AI投资
Di Yi Cai Jing· 2025-08-03 01:16
Group 1: DeepSeek and ACL Conference - DeepSeek, in collaboration with Peking University, won the Best Paper Award at the 63rd ACL conference, highlighting a significant achievement in natural language processing with the introduction of the Native Sparse Attention (NSA) mechanism [1][2] - The ACL conference saw a record submission of over 8000 papers, with a main conference acceptance rate of 20.3% and a Findings acceptance rate of 16.7% [1] Group 2: Anthropic's Market Position - Anthropic has surpassed OpenAI in popularity among enterprises, capturing 32% of the large language model market, while OpenAI's share has decreased to 25% [3][4] - Two years ago, OpenAI held a dominant 50% market share, with Anthropic at only 12%, indicating a significant shift in the competitive landscape [3] Group 3: AI Model Developments - The AI startup Step 3 has released an open-source foundational model with 321 billion parameters, showcasing advanced capabilities in visual perception and complex reasoning [5] - Multiple companies, including Tencent and Moonlight, have also released open-source models, indicating a trend towards open-source solutions in the AI industry [5] Group 4: Baidu's AI Integration - Baidu is testing an AI application entry point on its search homepage, allowing users to access various AI applications directly [6][7] - This move follows a major redesign of Baidu's search platform and reflects the company's commitment to integrating AI into its services [6] Group 5: Robotics Industry Insights - Tencent's chief scientist, Zhang Zhengyou, stated that the embodied intelligence industry is still in its early stages, comparing it to the mobile phone industry's evolution [8] - He emphasized that current humanoid robots are primarily used for data collection and research, and a significant breakthrough is needed for widespread adoption [8] Group 6: Supernode Solutions - Several companies showcased supernode solutions at the WAIC, addressing the challenges of large-scale computing clusters [9] - Supernodes aim to enhance performance by integrating computing chip resources, which is increasingly necessary as model parameters grow larger [9] Group 7: Financial Performance of Major Tech Companies - Meta reported a 22% year-over-year revenue increase in Q2, reaching $47.5 billion, with a net profit of $18.3 billion, up 36% [10] - Microsoft achieved a revenue of $76.4 billion in Q4, an 18% increase, with its market capitalization reaching $4 trillion, driven by demand for AI services [11]
DeepSeek公司要上市了?知情人士回应
news flash· 2025-08-01 11:15
《辟谣财知道》注意到,近期一则关于DeepSeek(深度求索)公司上市的消息出现在诸多权威的新闻网 站。据南方日报报道,知情人士表示,该消息不实。 ...
DeepSeek上市的假新闻正被权威网站批量刊载
Nan Fang Du Shi Bao· 2025-08-01 09:47
近期,一则关于DeepSeek(深度求索)公司上市的消息出现在诸多权威的新闻网站。知情人士告诉南 都N视频记者,该消息不实。虚假信源也使得DeepSeek的AI应用成了"受害者"。 这则DeepSeek的IPO假新闻有两个版本:版本一是DeepSeek准备科创板上市,于7月18日发布。该版本 的消息中写道:"DeepSeek今日(7月15日)正式宣布,公司已递交科创板上市申请,计划于2025年11月 正式挂牌交易,此次IPO旨在进一步扩大算力租赁业务规模。" 然而经记者核实,上海证券交易所并无DeepSeek的上市申请记录,DeepSeek近期也从未在任何官方渠 道宣布过上市计划。更关键是,DeepSeek背后的公司迄今未进行过股改。股改是一家公司上市的必要 条件。此外,DeepSeek官网显示的服务内容中,并不包含所谓算力租赁业务。 版本二发布7月30日左右,改称DeepSeek提交了北交所上市申报材料,拟于2025年11月正式挂牌。然 而,北京证券交易所官网同样无法查询到DeepSeek的上市申请记录。 上述新闻网站发布的DeepSeek上市消息,共同点是没有明确的署名,消息来源模糊。 虚假的信源也污染了 ...
产学研联动!DeepSeek上市前夕与中科院共建“新一代算力实验室
Jiang Nan Shi Bao· 2025-08-01 03:09
Core Insights - DeepSeek is enhancing its technological barriers by collaborating with the Institute of Computing Technology, Chinese Academy of Sciences to establish a joint laboratory focused on cutting-edge technologies such as "storage-compute integration" [1] - The dual-driven model of "listing + R&D" is expected to accelerate the transformation of scientific research achievements into practical applications [1] - The laboratory has already filed three patents that have entered the PCT international application stage, which may lead to new profit growth points in the future [1]
看完妈妈和DeepSeek的聊天记录,我哭了
3 6 Ke· 2025-07-31 12:31
AI正在以一种意想不到的方式,嵌入中国家庭最私密的肌理。 它不再仅仅是工具,更开始扮演一个微妙的"第三方"角色——在因观念、代际和沟通方式差异而撕裂的家庭关系中,充当起"军师"或"翻译官"。 蔡考和程君,这两位年轻女性的家庭,都因AI的偶然介入,经历了一场充满试探、挫折与反复的、漫长的"沟通实验"。 AI如同一面镜子,照见了她们与母亲在亲密关系中的僵局,也意外地赋予了她们重建现实关系的力量。 这并非一个"科技改变生活"的乐观故事。它更像是一个粗糙的、关于两代人在巨大的认知鸿沟面前,如何借助一个陌生的工具,笨拙走向彼此的现实记 录。 交锋 2025年5月下旬,距离女儿蔡考的又一次相亲还有一周,妈妈张瑞芳特地从浙江赶到上海。她此行的目的,是监督女儿为这场"考试"做万全准备。 张瑞芳去上海之前,问蔡考需不需要带过去点护肤品。蔡考说:我这全有。 结果张瑞芳发现,蔡考唯一的"家当"是酒店拿来的免费润肤霜。她形容女儿匪夷所思。 蔡考第一次相亲见面后没了下文,张瑞芳很焦虑,把这一切都归咎于女儿"长得不像照片"。"再不减减肥、脸上抹点东西,别人就看不上你了。" 蔡考暴跳如雷,质问妈妈为什么要代入男人的目光审视、否定自己,为什 ...