Seek .(SKLTY)
Jensen Huang Praises DeepSeek, Voices Full Confidence in China's Innovation Capability
news flash· 2025-07-20 20:11
Core Insights
- The CEO of Nvidia, Jensen Huang, expressed strong confidence in China's innovation capabilities, stating that the pace of innovation in China is unstoppable [1][2]
- Huang emphasized the complexity of AI systems, likening them to a multi-layered cake, where Nvidia's chips serve as the foundational layer, supported by systems, network technologies, AI infrastructure, software, algorithms, and application services [1]
- The innovation demonstrated by DeepSeek, particularly with their R1 model, was highlighted as a significant advancement in AI, showcasing the ability to leverage the H20 architecture effectively [1]

Summary by Categories

Innovation Capability
- Huang believes that regardless of available resources, China can adapt and innovate effectively, showcasing remarkable capabilities in AI development [2]
- The H20 architecture, while not Nvidia's top product, is still considered highly capable and has played a crucial role in defining the current AI revolution [2]

AI System Complexity
- AI development requires innovation at every layer of the system, and if progress is slow in one area, engineers can compensate through innovations in other layers to drive overall system advancement [1]

DeepSeek's Contribution
- DeepSeek's R1 model represents a true innovation by redesigning many operational aspects of AI models, allowing them to fully utilize the advantages of the H20 architecture [1]
KIMI K2: The Most Forward-Looking Research! A New Online RL Paradigm, Another DeepSeek Moment for Large Models!
2025-07-19 14:02
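Summary (2025-07-18)
- Kimi K2 is the first MoE model in China with a publicly disclosed trillion-parameter scale. Its architecture resembles DeepSeek V3 but splits experts more finely, uses the CLIP optimizer to mitigate gradient output problems, and implements partial online reinforcement learning. By fusing data from multiple scenarios and selecting the best results with a reward model, it produces high-quality synthetic data and advances open-ended question scenarios.
- GPT2 caused a stir because tool use lifted its capability markedly (15% absolute, 80% relative) and because post-training compute consumption exceeded pre-training, signalling higher demands on compute scale and Skill-up and prompting the build-out of more large-node compute clusters overseas.
- The Kimi K2 model has sparked discussion overseas thanks to its paradigm innovation and strong capability. Even the pre-training version, once reinforcement learning is complete, is expected to match or surpass GPT-3 and may surpass next-generation models at home and abroad, improving the supporting base software and hardware and driving short-chain and long-chain applications.
- From an investment perspective, the second half of 2025 enters an expectation-realization phase; attention should go to the projects that land fastest and those with the largest long-term incremental value. Overseas data shows that cloud computing, supporting base software and hardware facilities, and implementation ...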
Yang Zhilin Crosses the River by Feeling for DeepSeek's Stones
36Kr · 2025-07-19 04:30
Core Insights
- The release of the Kimi K2 model has generated significant global interest, showcasing its capabilities in programming and agent-based tasks, outperforming competitors like DeepSeek-V3 and Alibaba's Qwen3 [1][5][6]
- K2's open-source model has quickly gained traction, with over 100,000 downloads within a week and ranking fourth in the LMSYS leaderboard, indicating strong developer engagement [1][4][10]
- Kimi's strategic shift towards focusing on model development rather than consumer applications reflects a response to market pressures and a commitment to advancing AGI [5][21]

Model Performance and Features
- K2 is a MoE model with 1 trillion parameters and 32 billion active parameters, specifically designed for high performance in agentic AI tasks [1][7]
- The model emphasizes practical applications, allowing users to generate complex outputs like 3D models and statistical analyses quickly, moving beyond simple chat interactions [8][9]
- K2's API pricing is significantly lower than competitors, with costs reduced by over 75%, making it an attractive option for developers in the AI programming space [10][11]

Market Impact and Community Engagement
- The release has been likened to a "DeepSeek moment," indicating its potential to reshape the AI landscape and challenge existing models [6][14]
- Kimi's approach to community engagement through social media has fostered a positive reception and increased visibility among developers [4][17]
- The model's introduction has led to a resurgence in Kimi's web traffic, with a 30% increase in visits, highlighting the effectiveness of its open-source strategy [20]

Technological Innovations
- Kimi has introduced a new optimizer, Muon, which reduces computational requirements by 48% compared to the previous AdamW optimizer, enhancing training efficiency [13][12]
- The focus on agentic capabilities and practical task completion sets K2 apart from other models, prioritizing real-world applications over theoretical reasoning [7][8]

Strategic Positioning
- Kimi's pivot towards enhancing model capabilities aligns with industry trends favoring technical advancements over consumer application growth, positioning it as a leader in the AGI pursuit [15][21]
- The competitive landscape has shifted, with Kimi adopting a strategy similar to that of established players like Anthropic, focusing on programming and agent capabilities [16][21]
Jensen Huang on DeepSeek and Tongyi Qianwen: Both Are World-Class Open-Source Large Models
Core Insights
- The third China International Supply Chain Promotion Expo highlighted the significance of open-source AI models like DeepSeek and Tongyi Qianwen, which are considered top-tier globally, showcasing China's excellence in open-source initiatives [1][2]
- NVIDIA's CEO emphasized the importance of the Chinese market for NVIDIA, describing it as one of the largest and most vibrant markets in the world [3]

Group 1: AI Technology Development
- AI technology has evolved from perception-based to generative AI, with significant advancements in computer vision, speech recognition, and language understanding surpassing human capabilities [1]
- The future trend of AI development is expected to penetrate the physical world, leading to the rise of physical AI applications in robotics [1][2]

Group 2: China's Role in AI
- China leads the world in the number of AI research papers published, indicating its pivotal role in the AI technology landscape [2]
- Open-source models are facilitating the formation of China's AI ecosystem and are also contributing to the development of AI ecosystems in other regions globally [2]

Group 3: NVIDIA's Strategic Position
- NVIDIA announced the resumption of H20 chip sales in China and the launch of a new GPU compatible with the Chinese market, signaling positive developments for the AI industry chain [3]
- The company's products are being utilized in various sectors in China, including supply chain digital management and training embodied intelligent models [3]

Group 4: Future Outlook
- NVIDIA's technology roadmap covers nearly a decade, with the CEO indicating that there is substantial work ahead, particularly in the context of AI and chip technology advancements [3]
- Innovations in silicon technology are anticipated in transistor structure, packaging technology, and silicon photonics, which will drive future developments in the chip sector [2]
Nature Website: Chinese AI Model Marks "Another DeepSeek Moment"
Xin Hua She· 2025-07-17 06:46
Core Insights
- The release of the Kimi K2 AI model by Beijing-based Moonshot AI (literally "Dark Side of the Moon" in Chinese) on July 11 has generated significant excitement, marking another pivotal moment in AI development following the release of DeepSeek-R1 in January [1][2]
- Kimi K2 demonstrates exceptional performance in programming tasks, achieving high scores on the LiveCodeBench dataset, and also shows strong writing capabilities in various professional tests [1]
- The model has a trillion-parameter scale (1T) but uses a mixture-of-experts architecture that activates only 32 billion parameters per token, which keeps the compute cost of each forward pass well below that of a dense model of the same size [1]
- Kimi K2 is released under an open-source license, allowing researchers to download and deploy it locally, and it is competitively priced compared to mainstream closed-source models like Claude 4 [1]

Industry Commentary
- Nathan Lambert, a machine learning researcher at the Allen Institute for AI, describes Kimi K2 as the best new open-source model globally, indicating its significance in the trajectory of AI development [2]
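The gap between total and active parameters comes from sparse routing: a gate scores every expert and only the top few run for a given input. The following is a minimal sketch of that idea, not Kimi K2's actual router. The eight-expert gate, `k=2`, and the function names `top_k_route` and `active_fraction` are illustrative assumptions; only the 1 trillion total and 32 billion active figures come from the summary above.

```python
import math

def top_k_route(gate_logits, k):
    """Pick the k highest-scoring experts and softmax-normalize their weights.

    Hypothetical helper for illustration; real routers add load balancing,
    shared experts, and batching that are omitted here.
    """
    top = sorted(range(len(gate_logits)),
                 key=lambda i: gate_logits[i], reverse=True)[:k]
    peak = max(gate_logits[i] for i in top)          # subtract max for stability
    exps = [math.exp(gate_logits[i] - peak) for i in top]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(top, exps)]

def active_fraction(total_params, active_params):
    """Share of the model's parameters that run for one input."""
    return active_params / total_params

# Toy gate scores for 8 experts (made-up numbers).
routes = top_k_route([0.1, 2.0, -1.0, 1.5, 0.3, 0.0, -0.5, 0.9], k=2)
frac = active_fraction(1_000_000_000_000, 32_000_000_000)

print(routes)          # indices of the two selected experts with their weights
print(f"{frac:.1%}")   # share of parameters active, from the quoted figures
```

With the quoted figures, roughly 3.2% of the parameters participate in each forward pass, which is why a trillion-parameter MoE model can cost far less to run than a dense model of the same size.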
Jensen Huang in Conversation with Wang Jian: Praises DeepSeek for an A+ Paper, Says He "Envies Young People"
Di Yi Cai Jing· 2025-07-17 04:41
Core Insights
- The discussion between Jensen Huang and Wang Jian highlights the transformative impact of AI and computing power on the younger generation, who are seen as "natives" of artificial intelligence [1][6]
- Wang Jian emphasizes that computing power is the foundational infrastructure for AI, which has evolved significantly over the past decade [4]
- Huang notes that AI has surpassed human capabilities in various tasks, and the next wave of AI will integrate more with the physical world, such as robotics [4]

AI and Computing Power
- Wang Jian identifies computing power as the most exciting technological change, stating that it underpins the development of AI [4]
- Huang reflects on the evolution of AI, mentioning that algorithms can now learn and predict outcomes from existing data, marking a shift from traditional coding methods [4]
- The emergence of generative AI has enabled machines to understand and generate information across different formats, indicating a significant leap in AI capabilities [4]

Open Source and Research
- Huang highlights the recent shift towards open-source models in AI research, which has led to a surge in publications, particularly from Chinese researchers [5]
- He praises the quality of AI-related papers, noting that open-source development ensures safety through global scrutiny [5]

Future of AI and Chip Technology
- Huang discusses the future of AI development, indicating a shift from traditional silicon-based chips to more advanced composite chips that can perform higher-level functions [5]
- He mentions that there is still a significant amount of work to be done in this area, with a timeline of 5 to 10 years for further advancements [5]

Opportunities for the Younger Generation
- The conversation emphasizes the lifelong opportunities that AI presents, particularly for the younger generation, who are encouraged to engage with AI technology [5][6]
- Huang advises that while AI can solve many problems, it is essential for individuals to develop critical thinking skills to interact effectively with AI [6]
- He believes that AI will promote equality across different demographics, urging everyone to adopt AI technologies swiftly [6]
DeepSeek Usage Plunges to 3%; Lack of a New Model May Be the Main Cause
Xi Niu Cai Jing· 2025-07-15 02:09
Core Insights
- DeepSeek's user engagement has significantly declined, with usage rates dropping from a peak of 7.5% at the beginning of the year to 3% currently [2]
- The anticipated launch of the new model, DeepSeek-R2, has been delayed multiple times, contributing to the decrease in user interest and engagement [2]
- Competitors like ChatGPT and Google Gemini have seen substantial growth in website traffic, with increases of 40.6% and 85.8% respectively during the same period [3]

Usage Statistics
- DeepSeek's usage rate fell from 7.5% in early January to 3% now, indicating a significant drop in user engagement [2]
- The usage rate of DeepSeek R1 also halved from 7% in February to 3% by the end of April [2]
- The share of token traffic hosted on third-party platforms dropped from 42% in March to 16% in May [2]

Model Development and Competition
- The delay in the release of DeepSeek-R2 is attributed to the CEO's dissatisfaction with the model's performance, leading to ongoing internal enhancements [2]
- The shortage of NVIDIA H20 chips has impacted both the release of the new model and the deployment of existing models [2]
- Despite DeepSeek's challenges, competitors are actively innovating and gaining market share [3]

Data Limitations
- There are concerns regarding the limitations of the data from SemiAnalysis and Poe, particularly in relation to the Chinese market and the scope of their coverage [3]
- Poe's usage data is based solely on its subscribers and does not account for third-party integrations with DeepSeek, such as Tencent and Baidu [3]
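The usage figures above mix percentage-point drops with relative declines, so a quick calculation helps keep them apart. This is a small sketch using only the numbers quoted in the summary; the helper name `relative_change` is an assumption introduced for illustration.

```python
def relative_change(old, new):
    """Fractional change from old to new (negative means a decline)."""
    return (new - old) / old

# Figures quoted in the summary above, all in percent.
overall = relative_change(7.5, 3.0)        # overall usage rate
r1 = relative_change(7.0, 3.0)             # DeepSeek R1 usage rate
third_party = relative_change(42.0, 16.0)  # third-party token traffic share

for label, change in [("overall usage", overall),
                      ("R1 usage", r1),
                      ("third-party token share", third_party)]:
    print(f"{label}: {change:+.1%}")
```

Read this way, the fall from 7.5% to 3% is a drop of 4.5 percentage points but a relative decline of 60%, and the R1 move from 7% to 3% is somewhat more than a halving.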
Will the Open-Source K2 Model Be Kimi's DeepSeek Moment?
Hu Xiu· 2025-07-14 03:20
Core Insights
- The article discusses Moonshot AI's latest open-source model, K2, which has a parameter scale of 1 trillion, making it the largest open-source model currently available [2]
- K2's performance in various benchmarks positions it as a strong competitor against established models like Claude 4 Opus and GPT-4.1, highlighting China's growing influence in the global AI landscape [2][4]
- The competitive landscape in the AI sector is intensifying, with Chinese companies like Moonshot AI and MiniMax leading the charge in open-source innovation, challenging Western counterparts [4][6]

Company Developments
- Moonshot AI's K2 model has quickly gained popularity, becoming the top trending open-source model on Hugging Face shortly after its release [4]
- The model's architecture uses fewer attention heads and more experts, improving efficiency on long contexts compared with previous models [8][10]
- Moonshot AI has disclosed total funding of approximately $1.5 billion, significantly less than its Western competitors, indicating a more efficient operational model [6]

Market Impact
- K2's compatibility with the OpenAI and Anthropic API formats positions it favorably in the AI application development market, potentially allowing it to capture a significant share of that market [7]
- The article notes that the competitive dynamics between Moonshot AI and DeepSeek have intensified, with both companies releasing multiple models aimed at various AI applications [5][12]
- The focus on multi-agent collaboration and the integration of various models into K2 may enhance its commercial viability and market appeal [12]
Half-Year Review | China's Innovative Drugs Have Their DeepSeek Moment as Out-Licensing Surges
Di Yi Cai Jing· 2025-07-12 05:13
Core Insights
- The number of approved innovative drugs in China has surged, with 43 new approvals in the first half of the year, indicating significant growth in the innovative drug industry [1][7]
- Chinese companies are increasingly engaging in licensing agreements with international partners, with transaction values exceeding $40 billion in the first half of the year [1][9]
- Key areas of focus for innovative drug licensing include GLP-1 weight loss drugs, bispecific antibodies, antibody-drug conjugates (ADCs), and AI-driven drug development [1][3][6]

Licensing Agreements
- Hansoh Pharma granted Regeneron global exclusive rights outside Greater China for its GLP-1/GIP dual receptor agonist HS-20094 [3]
- A $2 billion licensing deal was made between United Biomedical and Novo Nordisk for the GLP-1/GIP/GCG triple receptor agonist UBT251 [4]
- Pfizer entered a licensing agreement with 3SBio for the PD-1/VEGF bispecific antibody SSGJ-707, with an upfront payment of $1.25 billion and potential milestone payments of up to $4.8 billion [4]
- HBM7020, a bispecific T cell engager, was licensed to Otsuka Pharmaceutical for a total of $670 million [4]
- A strategic collaboration between Hansoh Pharma and AstraZeneca was established for two preclinical immunology projects, with a total upfront payment of $175 million and potential milestone payments of up to $4.4 billion [5]
- A new ADC, XNW27011, was licensed to Astellas for over $1.5 billion [5]

Market Trends
- The DeepSeek effect in China's biopharmaceutical sector is highlighted by significant transactions, such as BioNTech's acquisition of a drug from a Chinese company for over $10 billion [7]
- AstraZeneca is in talks to acquire Summit's lung cancer drug ivonescimab, which Summit previously acquired from a Chinese company for up to $5 billion [8]
- Goldman Sachs predicts that ivonescimab could reshape the $90 billion immuno-oncology market, with peak sales projected at $53 billion by 2041 [8]
- The expiration of patents for major drugs presents a significant opportunity for Chinese innovative drugs to fill the gap in the market [9]

Investment Climate
- Chinese biopharmaceutical companies are increasingly prioritizing licensing as a strategic goal, with nearly 30% of global drug development attributed to China [9][10]
- The rapid pace and lower costs of drug development in China have attracted attention from multinational pharmaceutical companies [10]
- The first half of 2025 saw a surge in IPOs in the Hong Kong biopharmaceutical market, with 10 companies successfully listed [11]
- The biopharmaceutical sector raised HKD 15.6 billion in IPOs, making it the second-highest fundraising industry on the Hong Kong Stock Exchange [11]
Fed Rate-Cut Outlook Shifts Abruptly; DeepSeek Analysis: Gold Will Fall to 600 Yuan in 2025
Sou Hu Cai Jing· 2025-07-12 04:19
Core Viewpoint
- The recent volatility in gold prices has raised concerns among investors, with predictions indicating a potential drop to 600 yuan per gram in 2025 [1][2].

Group 1: Price Movements and Predictions
- The spot gold price in Shanghai has fallen from 834.6 yuan per gram in April to 765 yuan per gram, with brand gold jewelry prices dropping below 900 yuan per gram [1].
- On July 8, spot gold dropped 1.02%, reaching a low of 3298.7 USD per ounce (approximately 780 yuan per gram), a 6% decline from the April peak of 3509 USD [2].
- Analysts predict that if gold prices fall below 3300 USD, it could trigger automatic stop-loss orders worth 20 billion USD, leading to panic selling [2].

Group 2: Influencing Factors
- The probability of a Federal Reserve rate cut in September decreased from 78% to 63%, reducing the market's demand for gold as a safe haven [2].
- Internal divisions within the Federal Reserve over monetary policy are widening, with "dovish" members advocating rate cuts while "hawkish" members warn of potential inflation from tariff policies [3].
- Geopolitical events, such as the Middle East drone attacks and rumors of a ceasefire in Ukraine, have been shown to move gold prices significantly, with the latter causing an 8% flash crash [5].

Group 3: Market Sentiment and Demand
- The demand for gold jewelry is being affected by Trump's tariff policies, which are raising inflation while simultaneously suppressing consumer spending [5].
- In the domestic market, while the People's Bank of China is expected to increase gold reserves by 219 tons in 2024, the pace of accumulation is anticipated to slow down in 2025 due to high prices [5].
- Current gold jewelry prices include over 30% in brand premiums and processing fees, so even if the base gold price drops to 600 yuan per gram, jewelry prices may not fall below 800 yuan per gram [6].

Group 4: Investment Strategies and Market Outlook
- Goldman Sachs has revised its gold price forecast for the end of 2025 from 3100 USD to 3000 USD, while UBS recommends gradual accumulation at 3250 USD (approximately 730 yuan per gram) [6].
- Zijin Mining is increasing overseas gold mine acquisitions, suggesting that a price range of 700-750 yuan per gram is currently seen as a bottom [6].
- Investors who purchased gold at higher prices are facing difficult decisions, while those who entered the market at lower prices are relatively calm [6].
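The summary switches between USD per troy ounce and yuan per gram, so it can help to make the conversion explicit. The sketch below assumes one troy ounce is about 31.1035 grams (a standard figure) and treats the exchange rate as an input; the 7.2 yuan per dollar used in the example is a placeholder assumption, not a rate given in the article, and the function names are hypothetical.

```python
GRAMS_PER_TROY_OUNCE = 31.1034768

def usd_per_oz_to_cny_per_gram(usd_per_oz, cny_per_usd):
    """Convert a USD-per-troy-ounce price to yuan per gram."""
    return usd_per_oz * cny_per_usd / GRAMS_PER_TROY_OUNCE

def implied_rate(usd_per_oz, cny_per_gram):
    """Exchange rate (yuan per dollar) implied by a quoted price pair."""
    return cny_per_gram * GRAMS_PER_TROY_OUNCE / usd_per_oz

# Decline from the April peak to the July 8 low, both quoted above.
decline = (3509 - 3298.7) / 3509
print(f"decline from peak: {decline:.1%}")

# Conversion under an assumed rate of 7.2 yuan per dollar (placeholder).
print(round(usd_per_oz_to_cny_per_gram(3298.7, 7.2), 1))

# Exchange rates implied by the two price pairs quoted in the summary.
rate_low = implied_rate(3298.7, 780)
rate_ubs = implied_rate(3250, 730)
print(round(rate_low, 2), round(rate_ubs, 2))
```

The two quoted pairs imply noticeably different exchange rates (about 7.35 and about 6.99 yuan per dollar), so the yuan-per-gram figures in the summary are best read as rough approximations rather than exact conversions.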