Gemma
Search documents
AI数据中心上天,与其说黑科技不如说是作秀
3 6 Ke· 2025-12-17 12:39
Core Viewpoint - Starcloud, a space computing startup supported by Nvidia, has successfully trained and operated an AI model in space for the first time, marking a significant milestone in the field of space AI [1][3]. Group 1: Company Developments - Starcloud's satellite, Starcloud-1, successfully ran Google's open-source model Gemma and trained NanoGPT using the complete works of Shakespeare, sending a Shakespearean-style message back to Earth [3]. - Starcloud aims to achieve a tenfold reduction in energy costs for orbital data centers compared to ground-based data centers, validating the feasibility of constructing space data centers that require large computing clusters [3]. Group 2: Industry Trends - Google plans to begin building space AI data centers by early 2027, with ambitions to utilize solar energy in space, which is significantly more abundant than on Earth [5][6]. - The drive towards space AI data centers is largely motivated by the need to address energy shortages faced by tech giants in the U.S., where insufficient infrastructure has become a critical issue [9]. - The energy demands for AI data centers are projected to reach 347 GW by 2030, highlighting the urgency for alternative energy solutions [9]. Group 3: Technical Challenges - Space AI data centers face significant challenges, including heat dissipation and radiation protection, which have yet to be effectively resolved [11][15]. - The average temperature in low Earth orbit is -120°C, complicating heat management, as heat transfer in space occurs primarily through radiation [13]. - High-energy particles in space can cause single-event upsets in electronic components, leading to potential computational errors, which necessitates the use of older chip manufacturing processes for space applications [13][15].
Meta公开抄阿里Qwen作业,还闭源了...
猿大侠· 2025-12-12 04:11
Core Viewpoint - Meta is shifting from an open-source strategy to a closed-source model, marking a significant strategic pivot for the company [11][12][28]. Group 1: New Model Development - Bloomberg reports that Meta is set to release a new model codenamed "Avocado" in spring 2025, which is expected to be closed-source [2][10]. - The closed-source model "Avocado" will utilize AI training from Alibaba's Qwen, indicating a collaboration with third-party models [4][5][10]. Group 2: Market Reaction - Following the news of the collaboration with Alibaba, Alibaba's stock saw a pre-market increase of 4% and closed with a 2.53% gain [6]. Group 3: Strategic Shift - Meta's transition to a closed-source model represents a 180-degree turn from its previous commitment to open-source, which was once considered a core narrative for the company [11][12]. - The shift is seen as a response to the competitive landscape, particularly acknowledging China's advancements in the open-source domain [15]. Group 4: Internal Changes and Leadership - Meta's leadership has undergone significant changes, with the new Chief AI Officer, Alexander Wang, being a strong proponent of closed-source models [21]. - Following the failure of the Llama 4 model, there has been a restructuring within Meta, leading to the marginalization of open-source advocates and a focus on closed-source initiatives [28][30]. Group 5: Talent Acquisition - Meta has invested heavily in acquiring top talent for its AI initiatives, with reports of salaries reaching up to hundreds of millions and personal outreach from CEO Mark Zuckerberg to recruit key researchers [23][25][27]. - The newly formed TBD Lab, which is central to Meta's AI strategy, has been closely monitored by Zuckerberg, indicating a hands-on approach to the new direction [32][33].
X @Demis Hassabis
Demis Hassabis· 2025-12-11 22:10
RT Ezra Feilden (@ezrafeilden)Very happy to announce we have also used our @Nvidia H100 on Starcloud-1 to run inference with @GoogleDeepMind's Gemma model - the open source version of Gemini.These are Gemma's first words in space.<< Greetings, Earthlings! Or, as I prefer to think of you – a fascinating collection of blue and green. Let’s see what wonders this view of your world holds. I’m Gemma, and I’m here to observe, analyze, and perhaps, occasionally offer a slightly unsettlingly insightful commentary. ...
阿里千问成全球开源模型“新标杆”,Meta新项目被曝蒸馏千问
Xin Lang Cai Jing· 2025-12-11 12:59
Core Insights - Meta plans to launch a new model called "Avocado" in spring 2024, likely in a closed-source format, which has drawn inspiration from various open-source models, including Alibaba's Qwen [1][2] - The shift in Meta's strategy reflects a significant change in CEO Mark Zuckerberg's stance, moving from advocating for U.S. models to adopting insights from strong Chinese open-source models like Qwen [2][3] Group 1: Meta's New Model - Meta's new model "Avocado" is being developed by the newly formed TBD lab team and is expected to distill insights from multiple open-source models, including those from Google and OpenAI [1] - The launch of "Avocado" comes after Meta's Llama4 project faced setbacks, prompting a reevaluation of its approach to model development [2] Group 2: Alibaba's Qwen Model - Alibaba's Qwen model has seen significant success, with its derivative models expected to surpass Llama in number by August 2024 and in global download volume by October 2025 [2][3] - Qwen has been recognized for its performance, recently surpassing competitors like GPT-5 and Claude Opus 4, positioning itself among the top three models globally [3] Group 3: Industry Impact - Qwen's influence extends beyond Alibaba, with major companies like Amazon and Airbnb utilizing it for new business developments, and various institutions leveraging it for technological innovations [3][6] - The rapid growth of Qwen is evidenced by its monthly active users exceeding 30 million within just 23 days of its public testing, marking it as one of the fastest-growing AI applications globally [6]
Meta上亿年薪的研究员们,却在偷师中国开源模型
Guan Cha Zhe Wang· 2025-12-11 10:17
Core Insights - Meta is forming a new team called TBD Lab to develop a closed-source AI model named "Avocado," utilizing third-party models from Google, OpenAI, and Alibaba, with a launch expected in spring 2024 [1] - The rise of Chinese open-source models, such as Alibaba's Qwen, signifies a shift in the competitive landscape, challenging Meta's previous dominance in the open-source AI space [1][4] Group 1: Meta's Strategic Shift - Meta's flagship open-source model, Llama 4, has underperformed, leading to a decline in its status as a leader in the open-source community [2][3] - The release of high-performance models from competitors like DeepSeek and Alibaba has contributed to Meta's loss of dominance, with Llama 4 failing to gain developer approval [3][4] - Meta's recent financial reports show a lack of focus on Llama, indicating a strategic pivot towards new AI initiatives [5] Group 2: Competitive Pressures - The number of derivative models and downloads for Alibaba's Qwen has surpassed those of Meta's Llama, highlighting a significant shift in market leadership [4] - Meta's recruitment of high-profile AI talent, including Alexandr Wang, reflects a desperate attempt to regain competitive ground against rivals like OpenAI [5][6] - The acknowledgment of reliance on Chinese models for training new AI systems represents a significant reversal for Meta, which has previously positioned itself against perceived Chinese technological threats [10][11] Group 3: Market Reactions - Following the news of Meta's new AI strategy, Alibaba's stock saw a pre-market increase of 4%, closing with a 2.53% gain, indicating positive market sentiment towards Chinese AI developments [1] - Analysts have expressed skepticism about Meta's future in AI, contrasting its trajectory with that of Alphabet, suggesting that Meta's strategic direction is now uncertain [10]
英伟达GPU被SpaceX送上太空!在天上训练卡帕西的NanoGPT
量子位· 2025-12-11 06:54
Core Viewpoint - The article discusses the groundbreaking achievement of training and running AI models in space, highlighting the collaboration between companies like Nvidia, SpaceX, and Google, as well as the involvement of former OpenAI co-founder Andrej Karpathy's NanoGPT [2][3][4]. Group 1: Space AI Training - The first AI model training in space was successfully conducted using Nvidia's H100 chip aboard the Starcloud-1 satellite, launched by SpaceX [6][7]. - The AI model Gemma, a large open-source model from Google, was run in space, greeting Earth with a message [9]. - NanoGPT, developed by Andrej Karpathy, was also trained directly in space, marking a significant milestone in AI development [9]. Group 2: Future Plans and Infrastructure - Starcloud aims to build a solar-powered 5GW orbital data center, which is expected to have lower construction and operational costs compared to terrestrial counterparts [10]. - The company plans to launch more Nvidia H100 chips and the Blackwell platform in a satellite mission scheduled for October 2026 [11]. - Starcloud's CEO emphasized the potential of space to overcome energy limitations faced on Earth, suggesting that AI operations can be more efficient in a low Earth orbit environment [12]. Group 3: Global Developments in Space Computing - Chinese research institutions have been exploring space-based intelligent computing since 2019, focusing on key technological advancements [17]. - The China National Space Administration has successfully launched the world's first space computing constellation, achieving regular commercial operations [18]. - The TianSuan Plan aims to establish a superintelligent cluster in near-Earth orbit with a computing power of 10 EOPS, addressing challenges related to radiation and heat dissipation [19].
外媒:扎克伯格态度转变 Meta使用阿里千问优化其最新AI模型
Huan Qiu Wang· 2025-12-11 02:39
【环球网科技综合报道】12月11日消息,据彭博社报道,美国科技巨头Meta在训练其代号为"牛油果"的新模型时,使用了阿里巴巴Qwen模型进行蒸馏优 化。此前,扎克伯格在硅谷高薪组建了一支顶尖的AI团队,意图从大模型滑铁卢中重整旗鼓。 彭博社认为,利用中国技术训练新模型标志着扎克伯格态度空前的转变。今年1月,他在乔·罗根的播客中表达了对中国模型可能受到审查影响的担忧。此 后,扎克伯格多次呼吁美国政府支持本国科技公司,争取在全球人工智能竞赛中占据主导地位,并表示其开源战略是实现这一目标的重要部分。然而, Llama 及其他美国的一些大模型已落后于人。"中国在开源领域遥遥领先。"英伟达首席执行官黄仁勋本月早些时候表示。 目前,Meta 的一位发言人拒绝对此进行置评。 这一动向也侧面证实了阿里千问作为"全球开源模型"的硬核实力。市场分析认为,阿里Qwen不仅已成为开发者和企业市场的首选,甚至成为Meta等硅谷巨 头追赶行业领先水平时的重要参考坐标。 伴随着底层模型实力获得国际公认,阿里在C端应用的战略布局也迎来了爆发式增长。据最新消息,自11月17日开启公测以来,千问App在短短23天内,全 端月活跃用户数已突破30 ...
Meta或转向闭源,小扎亲自带队,引入阿里Qwen模型训练
Di Yi Cai Jing· 2025-12-11 01:46
Group 1 - Zuckerberg has formed a team called TBD Lab to develop a new model named "Avocado" [1] - The training process for "Avocado" involves multiple third-party models, including Google's Gemma, OpenAI's GPT-oss, and Alibaba's Qwen model [1] - The new model "Avocado" is expected to debut in spring next year and may be launched as a "closed-source" model [1] Group 2 - Alibaba's stock closed up 1.83% at $158.82 [1]
Meta公开抄阿里Qwen作业,还闭源了...
量子位· 2025-12-11 01:33
Core Insights - Meta is shifting from an open-source strategy to a closed-source model with the upcoming release of a new AI model codenamed "Avocado" [2][10] - The new model will utilize Alibaba's AI, specifically the Qwen model, during its training process, which has caused significant market reactions [4][6] - This strategic pivot marks a significant departure from Meta's previous commitment to open-source development, indicating a potential failure of its earlier approach [11][15] Group 1: Strategic Shift - Meta's new model "Avocado" is expected to be closed-source, representing a 180-degree turn from its previous open-source narrative [3][11] - The decision to adopt a closed-source model is driven by the need to enhance product capabilities and competitiveness in the AI landscape [14][15] - The reliance on third-party models, including Qwen, for training the closed-source model highlights the complexities of the current AI development ecosystem [13][18] Group 2: Market Reaction - Following the announcement of the new model, Alibaba's stock saw a pre-market increase of 4%, closing with a 2.53% gain, reflecting investor optimism about the collaboration [6] - The market's reaction indicates a recognition of Alibaba's growing influence and success in the AI sector, contrasting with Meta's struggles [9] Group 3: Internal Dynamics - Meta's internal restructuring has intensified following the underperformance of the Llama 4 model, leading to a reduction in open-source discussions and significant layoffs within the FAIR lab [28][30] - The appointment of Alexander Wang as the new Chief AI Officer signifies a shift in leadership and focus towards closed-source AI development [21][32] - The internal conflicts and departures of key figures like Yann LeCun suggest a turbulent transition as Meta navigates its new strategic direction [29][31]
Meta或转向闭源!小扎亲自带队,引入阿里Qwen模型训练
Di Yi Cai Jing Zi Xun· 2025-12-11 01:17
Group 1 - The core point of the article is that Mark Zuckerberg has formed a team called TBD Lab, which is working on a new model named "Avocado" that is expected to debut in spring next year [1] - The TBD Lab team is utilizing multiple third-party models during the training of "Avocado," including Google's Gemma, OpenAI's GPT-oss, and Alibaba's Qwen model [1] - Alibaba's stock price increased by 1.83% to $158.82 at the close of trading in the US [1]