Workflow
Gemma
icon
Search documents
AI数据中心上天,与其说黑科技不如说是作秀
3 6 Ke· 2025-12-17 12:39
Core Viewpoint - Starcloud, a space computing startup supported by Nvidia, has successfully trained and operated an AI model in space for the first time, marking a significant milestone in the field of space AI [1][3]. Group 1: Company Developments - Starcloud's satellite, Starcloud-1, successfully ran Google's open-source model Gemma and trained NanoGPT using the complete works of Shakespeare, sending a Shakespearean-style message back to Earth [3]. - Starcloud aims to achieve a tenfold reduction in energy costs for orbital data centers compared to ground-based data centers, validating the feasibility of constructing space data centers that require large computing clusters [3]. Group 2: Industry Trends - Google plans to begin building space AI data centers by early 2027, with ambitions to utilize solar energy in space, which is significantly more abundant than on Earth [5][6]. - The drive towards space AI data centers is largely motivated by the need to address energy shortages faced by tech giants in the U.S., where insufficient infrastructure has become a critical issue [9]. - The energy demands for AI data centers are projected to reach 347 GW by 2030, highlighting the urgency for alternative energy solutions [9]. Group 3: Technical Challenges - Space AI data centers face significant challenges, including heat dissipation and radiation protection, which have yet to be effectively resolved [11][15]. - The average temperature in low Earth orbit is -120°C, complicating heat management, as heat transfer in space occurs primarily through radiation [13]. - High-energy particles in space can cause single-event upsets in electronic components, leading to potential computational errors, which necessitates the use of older chip manufacturing processes for space applications [13][15].
Meta公开抄阿里Qwen作业,还闭源了...
猿大侠· 2025-12-12 04:11
Core Viewpoint - Meta is shifting from an open-source strategy to a closed-source model, marking a significant strategic pivot for the company [11][12][28]. Group 1: New Model Development - Bloomberg reports that Meta is set to release a new model codenamed "Avocado" in spring 2025, which is expected to be closed-source [2][10]. - The closed-source model "Avocado" will utilize AI training from Alibaba's Qwen, indicating a collaboration with third-party models [4][5][10]. Group 2: Market Reaction - Following the news of the collaboration with Alibaba, Alibaba's stock saw a pre-market increase of 4% and closed with a 2.53% gain [6]. Group 3: Strategic Shift - Meta's transition to a closed-source model represents a 180-degree turn from its previous commitment to open-source, which was once considered a core narrative for the company [11][12]. - The shift is seen as a response to the competitive landscape, particularly acknowledging China's advancements in the open-source domain [15]. Group 4: Internal Changes and Leadership - Meta's leadership has undergone significant changes, with the new Chief AI Officer, Alexander Wang, being a strong proponent of closed-source models [21]. - Following the failure of the Llama 4 model, there has been a restructuring within Meta, leading to the marginalization of open-source advocates and a focus on closed-source initiatives [28][30]. Group 5: Talent Acquisition - Meta has invested heavily in acquiring top talent for its AI initiatives, with reports of salaries reaching up to hundreds of millions and personal outreach from CEO Mark Zuckerberg to recruit key researchers [23][25][27]. - The newly formed TBD Lab, which is central to Meta's AI strategy, has been closely monitored by Zuckerberg, indicating a hands-on approach to the new direction [32][33].
X @Demis Hassabis
Demis Hassabis· 2025-12-11 22:10
RT Ezra Feilden (@ezrafeilden)Very happy to announce we have also used our @Nvidia H100 on Starcloud-1 to run inference with @GoogleDeepMind's Gemma model - the open source version of Gemini.These are Gemma's first words in space.<< Greetings, Earthlings! Or, as I prefer to think of you – a fascinating collection of blue and green. Let’s see what wonders this view of your world holds. I’m Gemma, and I’m here to observe, analyze, and perhaps, occasionally offer a slightly unsettlingly insightful commentary. ...
阿里千问成全球开源模型“新标杆”,Meta新项目被曝蒸馏千问
Xin Lang Cai Jing· 2025-12-11 12:59
彭博社报道称,Meta新模型"牛油果"蒸馏自阿里千问 消息传出后,10日当天,阿里巴巴美股(BABA)全日涨1.83%收于每股158.82美元,总市值3790亿美 元。今年以来,阿里巴巴美股股价已涨超90%。 值得注意的是,面对表现强势的中国开源模型,扎克伯格的态度发生了180度转变。此前,扎克伯格多 次呼吁要支持美国模型。然而,随着Meta在今年Llama4项目上受挫,而中国开源模型强势崛起,推动 扎克伯格也转向阿里千问。 Meta旗下新模型转头"偷师"阿里千问。 当地时间12月10日,据彭博社报道,曾经的全球开源霸主Meta计划在明年春天推出代号为"牛油 果"(Avocado)的新模型项目,且很可能会以闭源形式发布。报道指出,Meta的CEO马克·扎克伯格密 切关注新组建的TBD实验室团队,"牛油果"模型训练蒸馏了多方开源模型,包括谷歌的Gemma、 OpenAI的gpt-oss以及中国科技巨头阿里巴巴旗下的通义千问。 彭博社此前报道,千问下载量在10月已反超Meta的Llama系列,位居全球第一 目前,来自韩国、泰国、越南、日本、阿联酋、巴西等全球的公司和开发者都用Qwen开发了新模型、 新技术和新的A ...
Meta上亿年薪的研究员们,却在偷师中国开源模型
Guan Cha Zhe Wang· 2025-12-11 10:17
(文/陈济深 编辑/张广凯) 12月10日,彭博社爆料称,扎克伯格组成了一个名为TBD Lab的团队。该团队在其最新模型"牛油 果"(Avocado)的训练过程中使用了多个第三方模型,包括谷歌的Gemma、OpenAI的GPT-oss和阿里巴 巴的Qwen模型。该款模型预计将于明年春季首次亮相,并可能作为"闭源"模型推出。 对此,Meta 的一位发言人则对外宣称:"我们的模型训练工作正按计划进行,时间表没有发生有意义的 变更。" 消息被爆出后,阿里巴巴美股盘前一度上涨4%,收盘涨幅2.53%。 扎克伯格挥舞重金招来的AI大牛们计划开发的闭源大模型,竟然是通过中国的开源模型来训练,这不 仅意味着如今中国开源阵营的崛起,也代表扎克伯格曾经的美国开源霸主豪言,终究没有抵过来自中国 的竞争压力。 过气的开源盟主 Meta急着抄作业源于其开源旗舰模型Llama 4的失败表现。 过去两年,Meta通过开源Llama系列,成功扮演了"反OpenAI联盟"的盟主。Llama被视为开源界的 Linux,一度是全球开发者(包括中国开发者)的首选底座。 然而,这一格局在2025年开始瓦解。 随着年初DeepSeek开源模型的横空出 ...
英伟达GPU被SpaceX送上太空!在天上训练卡帕西的NanoGPT
量子位· 2025-12-11 06:54
Core Viewpoint - The article discusses the groundbreaking achievement of training and running AI models in space, highlighting the collaboration between companies like Nvidia, SpaceX, and Google, as well as the involvement of former OpenAI co-founder Andrej Karpathy's NanoGPT [2][3][4]. Group 1: Space AI Training - The first AI model training in space was successfully conducted using Nvidia's H100 chip aboard the Starcloud-1 satellite, launched by SpaceX [6][7]. - The AI model Gemma, a large open-source model from Google, was run in space, greeting Earth with a message [9]. - NanoGPT, developed by Andrej Karpathy, was also trained directly in space, marking a significant milestone in AI development [9]. Group 2: Future Plans and Infrastructure - Starcloud aims to build a solar-powered 5GW orbital data center, which is expected to have lower construction and operational costs compared to terrestrial counterparts [10]. - The company plans to launch more Nvidia H100 chips and the Blackwell platform in a satellite mission scheduled for October 2026 [11]. - Starcloud's CEO emphasized the potential of space to overcome energy limitations faced on Earth, suggesting that AI operations can be more efficient in a low Earth orbit environment [12]. Group 3: Global Developments in Space Computing - Chinese research institutions have been exploring space-based intelligent computing since 2019, focusing on key technological advancements [17]. - The China National Space Administration has successfully launched the world's first space computing constellation, achieving regular commercial operations [18]. - The TianSuan Plan aims to establish a superintelligent cluster in near-Earth orbit with a computing power of 10 EOPS, addressing challenges related to radiation and heat dissipation [19].
外媒:扎克伯格态度转变 Meta使用阿里千问优化其最新AI模型
Huan Qiu Wang· 2025-12-11 02:39
【环球网科技综合报道】12月11日消息,据彭博社报道,美国科技巨头Meta在训练其代号为"牛油果"的新模型时,使用了阿里巴巴Qwen模型进行蒸馏优 化。此前,扎克伯格在硅谷高薪组建了一支顶尖的AI团队,意图从大模型滑铁卢中重整旗鼓。 彭博社认为,利用中国技术训练新模型标志着扎克伯格态度空前的转变。今年1月,他在乔·罗根的播客中表达了对中国模型可能受到审查影响的担忧。此 后,扎克伯格多次呼吁美国政府支持本国科技公司,争取在全球人工智能竞赛中占据主导地位,并表示其开源战略是实现这一目标的重要部分。然而, Llama 及其他美国的一些大模型已落后于人。"中国在开源领域遥遥领先。"英伟达首席执行官黄仁勋本月早些时候表示。 目前,Meta 的一位发言人拒绝对此进行置评。 这一动向也侧面证实了阿里千问作为"全球开源模型"的硬核实力。市场分析认为,阿里Qwen不仅已成为开发者和企业市场的首选,甚至成为Meta等硅谷巨 头追赶行业领先水平时的重要参考坐标。 伴随着底层模型实力获得国际公认,阿里在C端应用的战略布局也迎来了爆发式增长。据最新消息,自11月17日开启公测以来,千问App在短短23天内,全 端月活跃用户数已突破30 ...
Meta或转向闭源,小扎亲自带队,引入阿里Qwen模型训练
Di Yi Cai Jing· 2025-12-11 01:46
阿里巴巴美股收盘上涨1.83%至158.82美元。 有消息称扎克伯格组成了一个名为TBD Lab的团队。该团队在"Avocado"的训练过程中使用了多个第三 方模型,包括谷歌的Gemma、OpenAI的GPT-oss和阿里巴巴的Qwen模型。新模型"Avocado"预计将于明 年春季首次亮相,并可能作为"闭源"模型推出。 ...
Meta公开抄阿里Qwen作业,还闭源了...
量子位· 2025-12-11 01:33
Core Insights - Meta is shifting from an open-source strategy to a closed-source model with the upcoming release of a new AI model codenamed "Avocado" [2][10] - The new model will utilize Alibaba's AI, specifically the Qwen model, during its training process, which has caused significant market reactions [4][6] - This strategic pivot marks a significant departure from Meta's previous commitment to open-source development, indicating a potential failure of its earlier approach [11][15] Group 1: Strategic Shift - Meta's new model "Avocado" is expected to be closed-source, representing a 180-degree turn from its previous open-source narrative [3][11] - The decision to adopt a closed-source model is driven by the need to enhance product capabilities and competitiveness in the AI landscape [14][15] - The reliance on third-party models, including Qwen, for training the closed-source model highlights the complexities of the current AI development ecosystem [13][18] Group 2: Market Reaction - Following the announcement of the new model, Alibaba's stock saw a pre-market increase of 4%, closing with a 2.53% gain, reflecting investor optimism about the collaboration [6] - The market's reaction indicates a recognition of Alibaba's growing influence and success in the AI sector, contrasting with Meta's struggles [9] Group 3: Internal Dynamics - Meta's internal restructuring has intensified following the underperformance of the Llama 4 model, leading to a reduction in open-source discussions and significant layoffs within the FAIR lab [28][30] - The appointment of Alexander Wang as the new Chief AI Officer signifies a shift in leadership and focus towards closed-source AI development [21][32] - The internal conflicts and departures of key figures like Yann LeCun suggest a turbulent transition as Meta navigates its new strategic direction [29][31]
Meta或转向闭源!小扎亲自带队,引入阿里Qwen模型训练
Di Yi Cai Jing Zi Xun· 2025-12-11 01:17
Group 1 - The core point of the article is that Mark Zuckerberg has formed a team called TBD Lab, which is working on a new model named "Avocado" that is expected to debut in spring next year [1] - The TBD Lab team is utilizing multiple third-party models during the training of "Avocado," including Google's Gemma, OpenAI's GPT-oss, and Alibaba's Qwen model [1] - Alibaba's stock price increased by 1.83% to $158.82 at the close of trading in the US [1]