Seek .(SKLTY)
Search documents
阿里千问与DeepSeek入选全球AI开源贡献榜前十
news flash· 2025-06-12 10:33
Core Insights - The latest AI open-source contribution ranking released by Hugging Face highlights significant achievements by Chinese teams, with Alibaba's Tongyi Qianwen ranking fifth globally and first in China [1] - DeepSeek is also recognized, placing ninth, making it one of the only two non-U.S. institutions in the top ten [1] Summary by Categories Ranking and Performance - Alibaba's Tongyi Qianwen is ranked fifth globally and first in China in the AI open-source contribution ranking [1] - DeepSeek is positioned ninth, showcasing strong performance among global competitors [1] Evaluation Criteria - The ranking is based on an assessment of various factors including open-source models, datasets, developer engagement, community activity, and global influence [1]
OpenAI开源模型发布推迟至夏末,为了狙击DeepSeek R2?
Hua Er Jie Jian Wen· 2025-06-11 02:37
Group 1 - OpenAI has postponed the release of its anticipated open-source model to "later this summer" instead of June, as announced by CEO Sam Altman [1] - The open-source model aims to match the complex reasoning capabilities of GPT-4o and surpass leading open-source models like DeepSeek's R1 [2] - The AI market competition is intensifying, with new models being launched by competitors such as Mistral and Qwen, which are capable of switching between deep reasoning and traditional quick responses [2] Group 2 - Altman acknowledged that OpenAI has historically made mistakes in its open-source strategy, and the new model is seen as a crucial step to repair developer relations [2] - There are speculations that the delay may be a strategic move to counter DeepSeek's upcoming R2 model, which is expected to be released soon [2][3] - DeepSeek R2 is anticipated to have significant upgrades in technical architecture, functionality, and resource efficiency, with a predicted 87% reduction in AI invocation costs [3] Group 3 - DeepSeek's founder, Liang Wenfeng, emphasizes the goal of making China a contributor to innovation rather than a passive participant [4] - DeepSeek's product iteration schedule is robust, with plans for major updates every quarter, including the upcoming V2.5 and V3 versions [4]
“千帆”系列昇腾DeepSeek技术沙龙重庆站成功举办
Sou Hu Cai Jing· 2025-06-10 23:57
Group 1 - The event "DeepSeek Technology Salon" was successfully held in Chongqing, focusing on the collaboration between Ascend AI and DeepSeek technology, with over 100 experts from more than 40 industry clients and partners participating [1][4] - A joint solution for educational intelligent applications was launched by JiFa Education and Huawei, integrating multi-modal AI and adaptive learning algorithms to support various educational scenarios [2] - The Chongqing Artificial Intelligence Innovation Center aims to leverage its computing power resources to create a leading public service platform for AI in the western region, contributing to the development of the AI industry nationwide [4][13] Group 2 - Huawei's strategy is shifting from traditional "software customization + services" to a model combining "computing power + data + large models," enhancing service value and reshaping the industry landscape [8][10] - The Chongqing OpenLab is designed to provide a comprehensive platform for innovation, integrating various centers to facilitate the successful implementation of industry solutions and business scenarios [11] - The Chongqing Artificial Intelligence Innovation Center's first phase has a computing power of 400P, focusing on promoting AI technology innovation, industry development, and talent cultivation [13]
十大推理模型挑战2025年高考数学题:DeepSeek-R1、腾讯混元T1并列第一,马斯克的Grok 3遭遇“滑铁卢”
Mei Ri Jing Ji Xin Wen· 2025-06-10 13:53
Core Insights - The discussion around the difficulty of mathematics in the 2025 college entrance examination continues to be a hot topic, with a focus on the performance of various AI reasoning models in a standardized test based on the new curriculum mathematics I paper [1] Group 1: AI Model Performance - The evaluation tested ten AI reasoning models, including DeepSeek-R1, Tencent's Mix Yuan T1, OpenAI's o3, Google's Gemini 2.5 Pro, and xAI's Grok 3, to assess their mathematical capabilities [1] - DeepSeek-R1 and Tencent's Mix Yuan T1 achieved perfect scores of 117, demonstrating exceptional performance in algebra and function problems [4] - The scores of other models included: iFlytek's Spark X1 with 112, Gemini 2.5 Pro with 109, OpenAI's o3 with 107, Alibaba's Qwen 3 with 106, and Doubao's Deep Thinking with 104 [2][7] Group 2: Evaluation Methodology - The assessment utilized a standardized test with a total score of 150, but excluded questions requiring graphical analysis to ensure a level playing field among the models [3] - Scoring was based on high school examination standards, with a focus on final answers for open-ended questions rather than the process [3] Group 3: Notable Failures - Grok 3, developed by xAI and touted as the "strongest AI," ranked third from the bottom with a score of 91, primarily due to its inability to correctly interpret multiple-choice questions [8] - The second lowest was the Zhiyu Qingyan reasoning model, scoring 78, which often faltered at the final step of reasoning, leading to lost points [8][10] - Kimi k1.5 ranked last, suffering significant score losses on the final two challenging questions [10]
一文了解DeepSeek和OpenAI:企业家为什么需要认知型创新?
Sou Hu Cai Jing· 2025-06-10 12:49
Core Insights - The article emphasizes the transformative impact of AI on business innovation and the necessity for companies to adapt their strategies to remain competitive in the AI era [1][4][40] Group 1: OpenAI's Journey - OpenAI was founded in 2015 by Elon Musk and Sam Altman with the mission to counteract the monopolistic tendencies of tech giants and promote open, safe, and accessible AI [4][7] - The development of large language models (LLMs) by OpenAI is attributed to the effective use of the Transformer architecture and the Scaling Law, which predicts a linear relationship between model size, training data, and computational resources [8][11] - The emergence of capabilities in models like GPT is described as a phenomenon of "emergence," where models exhibit unexpected abilities when certain thresholds of parameters and data are reached [12][13] Group 2: DeepSeek's Strategy - DeepSeek adopts a "Limited Scaling Law" approach, focusing on maximizing efficiency and performance with limited resources, contrasting with the resource-heavy strategies of larger AI firms [18][22] - The company employs innovative model architectures such as Multi-Head Latent Attention (MLA) and Mixture of Experts (MoE) to optimize performance while minimizing costs [20][21] - DeepSeek's R1 model, released in January 2025, showcases its ability to perform complex reasoning tasks without human feedback, marking a significant advancement in AI capabilities [23][25] Group 3: Organizational Innovation - DeepSeek promotes an AI Lab paradigm that encourages open collaboration, resource sharing, and dynamic team structures to foster innovation in AI development [27][28] - The organization emphasizes self-organization and autonomy among team members, allowing for a more flexible and responsive approach to research and development [29][30] - The company's success is attributed to breaking away from traditional corporate constraints, enabling a culture of creativity and exploration in foundational research [34][38]
重磅!中国团队发布SRDA新计算架构,从根源解决AI算力成本问题,DeepSeek“神预言”成真?
Xin Lang Cai Jing· 2025-06-09 13:27
作者 | 玉盘 AI 团队 审核 | 华卫 "大模型每生成 1 美元价值,需支付 3 美元算力成本",算力成本挑战已无争议。从软件层面的各类优化 方案层出不穷,真正从硬件源头着手的方案却屈指可数,市面上能看到的包括 Groq 在内的新计算硬件 也多数在大模型爆发前定型,难以充分匹配大模型本身的需求。 DeepSeek 从用户角度的不少构想与玉盘 SRDA 在做的事不谋而合,包括 IO 融合、3D 堆叠 DRAM 等, 而玉盘进一步提出了更完整的架构设计,或正式拉开下一代大模型专用计算架构的序幕。 今天,国内团队玉盘 AI 发布《SRDA AI 大模型专用计算架构》白皮书,提出了一种全新的计算架构: 系统级精简可重构数据流架构 SRDA (System-level Simplified Reconfigurable Dataflow Architecture), 从硬件源头解决当前 AI 算力的核心瓶颈。 与此同时,DeepSeek 于半个月前发表论文《Insights into DeepSeek-V3: Scaling Challenges and Reflections on Hardware for AI ...
报道:DeepSeek核心高管离职创业,瞄准Agent赛道
news flash· 2025-06-09 13:02
Core Insights - A core executive from DeepSeek has quietly left to start a new venture, planning to launch an Agent product around Christmas 2025 [1] - The departing executive is reported to be the former CTO of DeepSeek, although there is no official CTO position within the company [1] - The new startup has secured funding from a prominent venture capital firm [1]
DeepSeek核心高管离职创业,瞄准Agent赛道|独家
Hu Xiu· 2025-06-09 08:24
Core Insights - A core executive from DeepSeek has left the company to start a new venture focused on the Agent sector, with plans to launch a product by Christmas 2025 [1] - The executive, previously serving as the CTO, left during a peak period for DeepSeek, raising questions about the timing of the departure [1][2] - The AI industry is witnessing a trend of high-level talent leaving established companies to pursue entrepreneurial opportunities, often leveraging their previous experience and reputation to secure funding [2][3] Company Developments - DeepSeek has recently released and open-sourced its V3 model and R1 inference model, marking a significant period of activity for the company [1] - There are ongoing speculations regarding DeepSeek's potential financing or IPO plans, especially following the recruitment of several financial positions [4] - Despite the recruitment of a CFO, insiders suggest that this is not related to immediate financing or IPO plans, indicating a cautious approach from DeepSeek's leadership [4] Industry Trends - The rapid pace of technological iteration in the AI sector creates numerous opportunities for startups, particularly for those with experienced talent from leading companies [3] - The scarcity of AI talent with core technical expertise makes these individuals highly competitive in the entrepreneurial landscape [3] - The trend of executives leaving large firms to innovate in more flexible environments is becoming a common occurrence in the AI industry [3]
2025年第18期(总899期):开源大模型DeepSeek实现三个“首
Sou Hu Cai Jing· 2025-06-07 08:35
Core Insights - DeepSeek has established itself as a new benchmark in the global open-source AI model landscape, adhering to three core standards: complete code, public model parameters, and transparent training data, which sets it apart from traditional software open-source practices [1][13][14]. Group 1: DeepSeek's Innovations - DeepSeek has achieved three groundbreaking "firsts" in the AI model domain: 1. It has pioneered a second development path for large models through pure reinforcement learning (RL), demonstrating a viable "small but beautiful" approach that significantly reduces inference costs compared to mainstream models, thus aiding resource-limited countries [2][17]. 2. The application of DeepSeek has surged, with its app reaching 16 million downloads in just 18 days and daily active users surpassing 30 million, setting industry records and attracting global media attention [3][18]. 3. DeepSeek has initiated an "Android moment" in the AI field by fostering a comprehensive ecosystem that integrates models, chips, and systems, attracting numerous hardware and software manufacturers globally [4][20]. Group 2: Recommendations for AI Inclusivity - To promote AI inclusivity and equity, the following strategies are recommended: 1. Strengthen collaborative innovation by leveraging open-source platforms like GitHub and Hugging Face to encourage enterprises and research institutions to engage in secondary development based on DeepSeek's open-source achievements [5][21]. 2. Accelerate the application of open-source large models across various industries, developing specialized models and high-quality datasets to support the modernization of industries [6][21]. 3. Enhance public understanding of AI through educational initiatives, fostering partnerships between enterprises and educational institutions to build development platforms and organize events to raise awareness of AI technologies [7][22]. Group 3: Conclusion - The emergence of DeepSeek signifies a transition from technical exploration to ecosystem construction in open-source large models, with its low-cost, high-performance, and fully open characteristics reshaping the competitive landscape and providing a feasible path for global AI inclusivity and equity [8].
中国创新药的DeepSeek时刻:从“跟跑”到局部“领跑”
2 1 Shi Ji Jing Ji Bao Dao· 2025-06-06 08:31
Core Insights - The recent $1.25 billion upfront payment by 3SBio to Pfizer for the PD-1/VEGF bispecific antibody license marks a significant milestone in the Chinese pharmaceutical industry, reflecting a shift from "follower" to "leader" in innovation [1] - This transaction highlights the evolution of Chinese pharmaceutical companies from producing "me-too" products to developing "first-in-class" innovative drugs, allowing them to gain pricing power based on unique technologies [2] - The global pharmaceutical industry is witnessing a new value chain model where Chinese companies leverage their engineering and cost advantages for early-stage development, while multinational firms utilize their strengths in regulatory science and global market access [2] Industry Transformation - The integration of artificial intelligence (AI) in drug development is transforming the traditional, experience-based process into a data-driven, predictable, and optimized industrial process, significantly reducing time and costs [3] - China's large pool of high-quality engineering talent is being further amplified as drug design becomes more algorithmic, enhancing the country's competitive edge in pharmaceutical innovation [4] - The vast data resources available in China, due to its large patient base and improving healthcare information systems, are becoming a strategic asset for innovation in the AI era [4] Collaborative Ecosystem - China is building a comprehensive AI and biopharmaceutical innovation ecosystem, supported by policy reforms that shorten drug review times and improve market access for innovative drugs [4] - The dual drive of technological and policy innovation is enhancing the overall efficiency and commercialization success rates of the pharmaceutical industry [4] Future Outlook - The ongoing industrial revolution, driven by AI, presents unprecedented opportunities for the Chinese innovative pharmaceutical sector, with the potential for new industry leaders emerging from advancements in ADC, cell therapy, gene editing, and AI drug design [5] - The ability to seize these opportunities will shape the industry landscape for the next decade and beyond, making it a critical consideration for both entrepreneurs and the broader economic transformation in China [5]