开源

Search documents
马斯克背叛理想
Sou Hu Cai Jing· 2025-08-24 12:52
并扬言,xAI很快就将超越除谷歌外的任何公司,甚至谷歌也挡不了俺多久了。 顺带日常吹下中国的工业实力,"但中国公司将是最强劲的竞争对手,因为他们的电力比美国多得多,而且在硬件建 设方面实力超强。" 继OpenAI之后,马斯克也开始兑现鸽了许久的承诺。 8月24日,马斯克在X上官宣:开源xAI去年最好的Grok-2模型,约六个月后拟开源Grok 3模型。 对于这件事,绝大多数人都是盛赞。 老马不愧是人类之光,就是有格局,言而有信,说开源就开源…… 但你仔细琢磨下,这件事是不是还可以翻译成一句大白话。 爷现在阔了,以前的那些旧破烂儿用不上了,送你们啦。 这味儿就变了…… 01变质的承诺 明显有些食言的意味。 即便现在虽迟但到,Gork-2终于开源,Gork-3也将于半年后开源。 看上去是兑现了承诺。 但细细琢磨,这份承诺,似乎已经变质。 时间过去了一年多,AI领域日新月异,大多数人可能早已忘记Gork 2到底是个什么水平。 早在去年,Gork-2刚发布的时候,马斯克就曾承诺:每次创建新版本,都将开源前一个版本的Gork。 Gork-1随即开源。 Gork 2系列刚发布时,在阅读理解、编程尤其是视觉相关任务上,都是 ...
在OpenAI炼Agent一年半,回国做出首个开源Agent训练框架!这个30岁清华天才却说:创业不是技术命
AI前线· 2025-08-23 05:32
姚班、伯克利、OpenAI、清华……年仅 30 多岁的吴翼身上已经聚集了众多亮眼的标签。 从小到大,似乎无论在哪个阶段、哪个领域,吴翼都可以交出一份不错的答卷:他是 ACM 世界奖牌得主,也是带队冲击 IOI 的教练;他亲历了 Facebook 2012 的崛起、字节跳动 2016–2018 的飞速成长,以及 OpenAI 爆火前的关键时期;他也自己参与了创业、全力做着开源项目。 吴翼创立的边塞科技在 2024 年被蚂蚁收购,团队积累 4 年的规模化强化学习成果如今都积累到了开源项目 AReaL 中,这是一个专为大型推理模型设 计的完全异步的强化学习训练框架。目前在在 Github 上已收获 2.4k stars。AReaL 完全围绕 Agent 打造。谈及定位,吴翼直言:"按照这个定位我们没 有竞品"。 在 10 月 23 日 -25 日的 QCon 上海站,吴翼将分享主题为《智能体时代的强化学习:AReaL 框架与 Agent 最佳实践》的演讲。在此之前,我们对吴翼 进行了一次采访,他详细阐述了自己求学、OpenAI 工作和创业的经历和感受。主要观点如下: 在 OpenAI,我学会了 编辑 | Tina、 ...
英伟达开源9B参数小模型,比Qwen3快6倍
量子位· 2025-08-19 05:25
Core Insights - The article discusses the emergence of small AI models, highlighting the launch of NVIDIA's new small language model, Nemotron Nano v2, which is designed to perform complex reasoning tasks efficiently [1][3][7]. Group 1: Model Features and Performance - Nemotron Nano v2 is a 9 billion parameter model that matches or exceeds the accuracy of the leading open-source model Qwen3-8B in complex reasoning benchmarks while being 6 times faster [1][7]. - The model supports a "reasoning trace" feature, allowing it to generate reasoning processes before providing final answers, which enhances the quality of responses, especially for complex tasks [8][11]. - Users can control the "thinking budget," specifying the number of tokens the model can use during reasoning, which helps in managing the model's performance [10][12]. Group 2: Training and Data - The model underwent extensive pre-training on over 20 trillion tokens, utilizing FP8 precision and a Warmup-Stable-Decay learning rate schedule [19]. - Post-training involved various techniques, including supervised fine-tuning and reinforcement learning from human feedback, with about 5% of the data containing intentionally truncated reasoning traces [21]. - NVIDIA has also released a significant portion of the data used for training, including a diverse pre-training dataset with 66 trillion tokens across multiple categories [26][23]. Group 3: Open Source Strategy - NVIDIA's approach contrasts with other tech giants moving towards closed-source models, emphasizing an open-source strategy with the Nemotron ecosystem [30][32]. - The company has made significant strides in open-sourcing its models, which may influence the competitive landscape in AI development [29][33].
深度|英伟达最新挑战者Cerebras创始人对话谷歌前高管:我们正处于一个无法预测拐点的阶段
Z Potentials· 2025-08-15 03:53
Core Insights - The article discusses the transformative impact of AI on industries, emphasizing the role of open-source and data in global AI competition, as well as the challenges of AI safety and alignment, and the limitations of power in the development of AGI [2][16]. Group 1: AI Hardware Innovations - Cerebras Systems, led by CEO Andrew Feldman, is focused on creating the fastest and largest AI computing hardware, which is crucial for the growing demand for AI technologies [2][3]. - The company’s chip is 56 times larger than the largest known chip, designed specifically for AI workloads that require massive simple computations and unique memory access patterns [8][9]. - The collaboration between hardware and software is essential for accelerating AGI development, with a focus on optimizing matrix multiplication and memory access speeds [11][12]. Group 2: Open Source and Global Competition - The open-source ecosystem is seen as a vital area for innovation, particularly benefiting smaller companies and startups in competing against larger firms with significantly more capital [18][19]. - The cost of processing tokens has dramatically decreased, from $100 per million tokens to as low as $1.50 or $2, fostering innovation and broader application of technology [19]. - The competition in AI is perceived to be primarily between the US and China, with emerging markets also adopting Chinese open-source models [18]. Group 3: Power Supply and AGI Development - Power supply is identified as a critical limitation for AGI development, with high electricity costs in Europe posing challenges [42][45]. - The discussion highlights the need for significant energy resources, such as nuclear power, to support large data centers essential for AI operations [44][46]. - The article suggests that the future of AGI may depend on the establishment of new nuclear power plants to meet the energy demands of advanced AI systems [46]. Group 4: AI Safety and Alignment - AI alignment refers to ensuring that AI systems reflect human values and norms, with ongoing efforts to develop testing methods to check for potential dangers in AI models [35][36]. - The challenge remains in maintaining alignment in self-improving systems, raising concerns about the potential risks of releasing advanced AI without proper oversight [37][38]. - The responsibility for AI safety is shared between hardware and software, emphasizing the need for collaboration in addressing these challenges [39].
大模型路线之争:中国爱开源 美国爱闭源?
2 1 Shi Ji Jing Ji Bao Dao· 2025-08-08 05:14
Core Viewpoint - The article discusses the contrasting approaches of China and the United States in the development of large AI models, highlighting China's preference for open-source models while the U.S. leans towards closed-source models [1][2][3]. Group 1: Open-source vs Closed-source Models - China's open-source models dominate the Hugging Face leaderboard, with major players like Tencent, Alibaba, and Zhiyuan consistently ranking high [1]. - Tencent's recently released multi-modal model has achieved significant recognition, including a top position in the Hugging Face paper rankings [1]. - In contrast, U.S. companies like Meta are moving away from open-source models, with experts noting that the U.S. is effectively withdrawing from the competitive landscape of open-source large language models [1][2]. Group 2: Reasons for the Divergence - The technological development stage in China is characterized by a need for rapid iteration and community involvement, which open-source models facilitate [1]. - Chinese enterprises are integrating large models with specific industries, making open-source models more accessible and accelerating implementation [2]. - U.S. companies, on the other hand, are investing heavily in closed-source models to maintain competitive advantages and create high barriers to entry, exemplified by companies like OpenAI and Anthropic [2]. Group 3: Future Outlook - Industry experts suggest that both open-source and closed-source models may coexist in the future, with a potential hybrid approach combining open-source foundational models and closed-source vertical applications [3]. - The competition between China and the U.S. in the AI model space is framed as a struggle between open-source and closed-source strategies, with China's open-source approach seen as a potentially advantageous decision [3].
AI浪潮下,VC/PE如何抢抓投资机遇?
Sou Hu Cai Jing· 2025-08-03 10:35
Core Insights - The rapid development of artificial intelligence (AI) is significantly transforming various industries, including investment, creating new opportunities for investors [1] - The 2024 AI industry investment report indicates a total investment of nearly 85 billion yuan, with 1,156 investment cases reported [2] - Key investment trends in the AI sector include a focus on early-stage investments, with nearly 70% of cases in A-round and earlier stages [2][3] Investment Trends - The AI industry is experiencing active investment in sectors such as AI+ healthcare, intelligent driving, AI infrastructure, humanoid robots, AI large models, and AI chips, which collectively account for 78.4% of investment cases [3] - The AI large model sector alone attracted approximately 26 billion yuan, representing over 30% of total investment [3] - Beijing leads in both the number of investment cases (326) and total investment amount (36.26 billion yuan), followed by Shanghai, Shenzhen, Jiangsu, and Zhejiang [2] Future Directions - Five major trends in the AI industry have been identified: 1. Increased establishment of AI industry funds and sustained investment intensity 2. Transition towards general intelligence with cost reduction and open-source models creating new opportunities 3. Rapid growth in AI computing power, fostering a "domestic computing power + large model" ecosystem 4. Emergence of multimodal large models enhancing AI agent capabilities and scene innovation 5. Transformation in AI content generation, highlighting the importance of ethical governance and privacy protection [3] Market Valuation - The valuation of AI innovation assets in China is undergoing a reassessment, with many GPU, semiconductor, and chip companies still valued at 2021 levels [4] - The significant rise in stock prices of companies like Nvidia and Cambrian indicates the potential for similar valuation adjustments in AI-related assets [4] Investment Strategies - Investment strategies in the current AI ecosystem should focus on small-scale investments that can yield substantial returns, with an emphasis on building resilient investment portfolios [5] - Identifying key segments within the industry and investing heavily in top-performing companies is recommended, as demonstrated by successful investments in companies like Hesai Technology [5][6] - A sustainable software ecosystem is crucial for the integration of AI and applications, with a focus on developing healthy business models that encourage software monetization [6]
促开放协作与跨界融合 2025CCF中国开源大会在上海召开
Zhong Guo Xin Wen Wang· 2025-08-02 13:15
Core Insights - The 2025 CCF China Open Source Conference opened in Shanghai, focusing on key directions such as open-source large models and embodied intelligence [1][3] - Experts from academia and industry shared forward-looking views on critical technology areas including large models, open-source hardware, and intelligent operating systems [3] Group 1: Key Developments - The conference featured the introduction of efficient inference systems Mooncake and KTransformers developed by a team led by Zheng Weimin, showcasing their core role in supporting workloads in the intelligent era [3] - Academician E Wei Nan emphasized the paradigm shift in AI from a "model-centric" to a "data-centric" approach, highlighting the need for high-quality data infrastructure to lower the barriers for AI implementation [3] Group 2: Community and Ecosystem Initiatives - The CCF Ubiquitous Operating System Open Community was established with participation from top universities and research institutions, focusing on technology research, project incubation, standard development, application promotion, and talent cultivation [4] - A series of strategic initiatives were launched, including the establishment of the CCF-Mulan Innovation Open Source Incubator and the Omni-Infer Cloud Co-Creation Plan [3][4] Group 3: Educational and Collaborative Efforts - Shanghai Jiao Tong University aims to integrate open-source concepts into its curriculum, fostering talent for next-generation operating systems [5] - The collaboration model between Shanghai Jiao Tong University and Huawei emphasizes shared goals and resources to support core technology breakthroughs [5]
AI 投资浪潮来袭 如何在变革中抢抓投资机遇?
Zheng Quan Shi Bao Wang· 2025-08-02 03:41
Group 1: Core Insights - The rapid development of artificial intelligence (AI) is significantly transforming various industries, including investment, creating new opportunities for investors [1] - The 2024 AI industry investment in China is projected to reach nearly 85 billion yuan, with 1,156 investment cases reported [2] - Investment in the AI sector is predominantly early-stage, with nearly 70% of cases in A-round and earlier stages, and average investment amounts exceeding 10 million yuan [2] Group 2: Investment Trends - Key investment areas in 2024 include AI+ healthcare, intelligent driving, AI infrastructure, humanoid robots, AI large models, and AI chips, accounting for 78.4% of total cases [3] - The AI large model sector alone is expected to attract around 26 billion yuan, representing over 30% of total investment [3] - Beijing leads in AI investment cases and amounts, with 326 cases and 36.26 billion yuan, followed by Shanghai, Shenzhen, Jiangsu, and Zhejiang [2] Group 3: Market Dynamics - The AI industry is experiencing a phase of asset revaluation, particularly in GPU, semiconductor, and chip companies, which have not fully reflected their market value [4] - The open-source movement is crucial for China's technological development, with Chinese companies leading in the global open-source model landscape [4] - Investment strategies should focus on long-term trends and cultivating sustainable business models, with an emphasis on key segments within the industry [5][6] Group 4: Future Directions - The importance of a healthy software ecosystem is highlighted, as AI applications are fundamentally software-driven, necessitating a sustainable development model [6] - Companies are encouraged to adopt a diversified investment strategy to build resilient portfolios while tolerating a certain level of failure [5] - The integration of AI with robotics and the development of next-generation computing architectures are identified as critical investment areas for the coming years [6]
拆箱开源版Coze:Agent核心三件套大公开,48小时揽下9K Star
量子位· 2025-07-28 03:25
Core Viewpoint - The article discusses the recent open-source release of Coze's products, which aims to facilitate the development and deployment of AI agents, marking a significant step towards making agent technology more accessible and practical for developers [1][45]. Group 1: Open Source Products - Coze has released two new open-source products: Coze Studio and Coze Loop, alongside the previously released Eino framework, creating a comprehensive open-source ecosystem for agent development [2][5][32]. - Coze Studio is a low-code platform designed to simplify the creation of AI workflows, while Coze Loop focuses on the development, evaluation, and monitoring of agents [12][21][25]. - The open-source products are licensed under the Apache 2.0 license, allowing for commercial use and modifications without the requirement to open-source changes [7][57]. Group 2: Market Trends and Challenges - The article highlights the growing popularity of agents, transitioning from novelty items to practical tools, as evidenced by the increasing support from major companies and the emergence of various successful agent applications [3][46]. - Despite the enthusiasm, the widespread adoption of agents faces challenges, including inconsistent user experiences and high development barriers, which Coze aims to address through its open-source offerings [47][50]. Group 3: Development and Evaluation Capabilities - Coze Studio provides a complete workflow engine, allowing developers to easily create agents by dragging and dropping functional components, thus lowering the technical barrier for entry [16][19]. - Coze Loop offers a comprehensive solution for prompt development, evaluation, and monitoring, enabling developers to assess agent performance across multiple dimensions [25][30]. - Eino, the earlier released framework, provides a unified component abstraction and flexible orchestration capabilities, enhancing the development process for AI applications [36][39]. Group 4: Future Implications - The open-source initiative is expected to accelerate the deployment of agents across various industries, particularly in internal automation, small teams, and vertical sectors like healthcare and finance [43][42]. - Coze's open-source strategy is seen as a proactive move to capitalize on the impending explosion of agent technology, aiming to create a robust ecosystem that fosters collaboration and innovation among developers [45][56].
扣子开源全家桶,Apache 2.0加持,AI Agent又一次卷到起飞
机器之心· 2025-07-28 02:47
Core Viewpoint - The company has launched two core open-source products, Coze Studio and Coze Loop, as part of its AI Agent development platform, aiming to enhance developer engagement and competition in the open-source space [4][6][39]. Group 1: Product Launch and Features - The open-source products have collectively garnered 9.5K stars on GitHub, indicating significant interest in the AI agent development field [7]. - Coze Studio is a no-code development platform that allows users to create functional AI agents through a visual interface, making it accessible even for those without coding skills [10][11]. - Coze Loop serves as a comprehensive lifecycle management tool for AI agents, facilitating development, evaluation, observation, and optimization [28][34]. Group 2: Technical Architecture and Performance - The platform's backend is built on Golang, while the frontend utilizes React and TypeScript, ensuring a robust and efficient architecture [19][21]. - The microservices architecture allows for clear responsibilities and modular development, enhancing maintainability and collaboration within the open-source community [22]. - The platform supports containerized deployment, simplifying the setup process for developers [23]. Group 3: Open Source Strategy and Community Engagement - The decision to open-source these products under the Apache 2.0 license reflects the company's commitment to fostering a developer-friendly ecosystem, allowing for commercial use without licensing fees [43][45]. - By opening up its technology, the company aims to attract a larger developer community to contribute to the ecosystem, enhancing product evolution through collaborative efforts [54][55]. - The open-source initiative is seen as a strategic move to establish a new standard in the AI agent development space, positioning the company as a leader in this emerging market [44][58].