Meta Llama系列模型
Search documents
倒反天罡,Meta抄阿里千问作业,没拿授权
3 6 Ke· 2025-12-11 11:51
据悉,阿里千问为开源模型,所有人都可以自主下载,无需授权。根据官方披露,截至目前,千问模型 的全球累计下载量已突破7亿次。 Meta引入阿里千问模型,曾是开源霸主 今年以来,由于Llama 4发布后表现拉胯,Meta与OpenAI、谷歌等竞争对手的差距逐渐拉大,在此背景 下,牛油果项目应运而生。 作为Meta下一代旗舰级AI大模型,"牛油果"大模型被视为Meta在AI军备竞赛中的"救命稻草",目标性能 直指GPT-5,计划于2026年第一季度发布。 当时就有海外网友恶搞做了张梗图:牛油果的芯是鲸鱼(DeepSeek)。 只不过最终牛油果切开不是DeepSeek,而是千问。 12月10日,据报道,Meta在新一代大模型"牛油果"的研发中,引入了阿里巴巴通义千问模型,来对新模 型进行微调优化。 科技每日推送从阿里云独家获悉,Meta事先没有找阿里索要授权,阿里昨晚也是刚知道。 要知道前两年,Meta的Llama模型在全球开源界是绝对的霸主地位,当时国内很多大模型都会被质疑是 套壳Llama,谁能想到几年过去,中国开源模型崛起,Meta反而成了真·套壳的那个。 并且,Meta彻底违背初心,抛弃开源路线。 牛油果大模 ...
GPU寿命,远超想象
半导体芯闻· 2025-11-20 10:49
Core Viewpoint - The prevailing concern regarding the depreciation of GPUs in the AI industry is largely unfounded, as the actual depreciation cycle is more favorable than many investors believe [1][2]. GPU Depreciation and Lifespan - Analysts suggest that the profit cycle for GPUs is approximately 6 years, and the depreciation accounting practices of major cloud computing firms are deemed reasonable [2]. - The cost of operating GPUs in AI data centers is significantly lower compared to the GPU rental market, allowing for a high marginal contribution rate when extending the lifespan of older GPUs [3]. - GPUs can have a practical lifespan of 7 to 8 years, with many companies still using GPUs that are over 5 years old and generating substantial profits [5]. Lifecycle Transition of GPUs - GPUs transition from high-performance tasks, such as training advanced AI models, to lower-demand inference workloads, allowing older GPUs to remain in active service [6]. - The variety of AI workloads enables older GPUs to be repurposed effectively, maintaining their profitability [6]. Cost Considerations - AI cloud computing companies often choose GPUs based on user expectations and budget, with older GPUs being utilized for lower-tier services while newer models are reserved for premium offerings [7]. - Many AI services can run on open-source models that require less computational power, further enhancing the utility of older GPUs [8]. Economic Advantages of Older GPUs - Despite higher energy consumption, older GPUs are often preferred due to their lower procurement costs, making them more cost-effective overall [10].
微软以Maia 280开启新局对垒英伟达,Meta/微美全息开源联动引领AI创新
Zhong Guo Chan Ye Jing Ji Xin Xi Wang· 2025-07-14 03:32
Group 1 - Microsoft has delayed the launch of its self-developed AI chip Braga to 2026 due to design issues, and will introduce a transitional product, Maia 280, which is expected to improve performance by 30% [1][2] - The delay of the Braga chip has also pushed back the release of subsequent chips, Braga-R and Clea, raising concerns that these products may be outdated upon release and struggle to compete with NVIDIA's latest AI chips [2][4] - Microsoft aims to reduce its reliance on NVIDIA's expensive AI chips and has been embedding AI technology into its products through early collaboration with OpenAI [4][5] Group 2 - NVIDIA has seen a tenfold increase in annual sales over the past three years, driven by the AI boom, and is expected to maintain an average annual growth rate of 32% over the next three years [5][7] - NVIDIA's market capitalization is approaching $4 trillion, solidifying its position as a leader in the AI chip market, while companies like Meta and Amazon are working to develop their own chips to reduce dependence on NVIDIA [7][8] - Meta is facing unprecedented challenges and opportunities in the AI wave, investing heavily in AI research and development, with the Llama series models being a significant outcome [8][10] Group 3 - Meta's Llama models still show a significant performance gap compared to advanced models like OpenAI's GPT-4o, prompting Zuckerberg to initiate a "superintelligence team" to attract top talent and overcome current technological bottlenecks [10] - Microsoft is adjusting its ambitious strategy in light of delays in internal AI chip development, shifting towards a more pragmatic and iterative design approach to maintain competitiveness with NVIDIA [10][12] - WIMI is seeking to leverage the growing demand for AI services by establishing a quantum research center in collaboration with universities and research institutions, focusing on quantum computing and edge chips [12][13]