Workflow
AI inference
icon
Search documents
Nvidia to sell 1 million chips to Amazon by end of 2027 in cloud deal
Reuters· 2026-03-19 21:31
Core Insights - Nvidia will sell 1 million graphics processing unit (GPU) chips to Amazon's cloud computing unit by the end of 2027, marking a significant partnership in the AI and cloud computing space [1][2]. Group 1: Deal Overview - The sales of the 1 million GPUs will commence this year and continue through 2027, aligning with Nvidia's projected sales opportunity of $1 trillion for its Rubin and Blackwell chip families [2]. - The deal includes a variety of Nvidia chips beyond the GPUs, such as Spectrum networking chips and Groq chips, which were recently released following a $17 billion licensing agreement with an AI chip startup [3]. Group 2: Technical Aspects - AWS plans to utilize a combination of Nvidia's Groq chips and six other chips for enhanced inference capabilities, which is crucial for AI systems to perform tasks effectively [4]. - The agreement also involves deploying Nvidia's Connect X and Spectrum X networking equipment in AWS data centers, which is significant given AWS's history of developing custom networking solutions [4][5].
The Artificial Intelligence (AI) Inference Market Could Reach $255 Billion by 2030. This Stock Is Best Positioned to Win.
Yahoo Finance· 2026-03-16 14:22
Artificial intelligence (AI) stock investors have earned massive returns. The best-performing stocks succeeded because they built a competitive advantage that other companies could not easily match, if they could match at all. This means the industry has produced several winners, and that makes choosing the "best" one to succeed the most difficult. Nonetheless, that designation should arguably go to one AI stock, and here's why. Will AI create the world's first trillionaire? Our team just released a repor ...
Bank of America has stark message for Nvidia investors ahead of GTC
Yahoo Finance· 2026-03-11 21:47
Core Viewpoint - Bank of America maintains a buy rating on Nvidia with a price target of $300, highlighting the upcoming GTC keynote as a potential catalyst for closing the current valuation gap [1][2]. Group 1: Current Valuation and Market Position - Nvidia shares are trading at a historically low forward price-to-earnings multiple of 17x, which is considered a trough following significant sales growth from the Blackwell architecture, totaling $500 billion [2]. - The GTC keynote is anticipated to be a key event that could initiate a recovery in Nvidia's stock valuation [2]. Group 2: Key Areas of Focus for GTC - Bank of America identifies three critical areas for investors to monitor during the GTC keynote, which extend beyond mere product announcements to indicate Nvidia's competitive advantage [3]. - The focus includes Nvidia's strategic shift towards inference, which is the process of running AI models at scale, marking a new battleground in AI infrastructure [5]. Group 3: Product Roadmap and Innovations - Nvidia is expected to outline its product roadmap from the current Vera Rubin platform to the Feynman GPUs by 2028, providing a three-generation visibility that secures commitments from developers and enterprises ahead of competitors [7]. - Anticipated announcements include a new range of customized products such as CPX chips for inference prefill workloads and a Language Processing Unit (LPU) for low-latency decoding, potentially integrated into next-generation rack systems [7]. - Bank of America is also looking for details on Nvidia's next-generation 102.4T Spectrum-6 switch and the 115T Quantum-X with co-packaged optics, which could be crucial for large-scale AI cluster deployments [7].
Super Micro Computer, Inc. (SMCI) Expands Support for Sovereign Artificial Intelligence Platforms
Yahoo Finance· 2026-03-10 18:38
Core Insights - Super Micro Computer, Inc. (NASDAQ:SMCI) is recognized as one of the best affordable growth stocks to consider for investment [1] Group 1: Company Developments - On March 2, Super Micro Computer Inc. expanded its support for infrastructure solutions that power sovereign artificial intelligence platforms and Radio Access Networks [2] - The company introduced three new systems designed specifically for telecom networks, including the ARS-111L-FR, a 1U system for RAN workloads featuring an NVIDIA Grace C1 CPU, and the ARS-221GL-NR, a high-performance 2U system with up to 2 double-width NVIDIA GPUs for demanding AI workloads [3] - The ARS-111GL-NHR is a 1U GH200 Grace Hopper Superchip system, versatile for both research and AI inference at the network edge, highlighting the company's focus on meeting the growing demand for sovereign AI platforms [4] Group 2: Strategic Positioning - The demand for sovereign AI platforms is creating strategic opportunities for telecom operators, and Super Micro Computer is positioned to enable rapid deployment and expansion of AI data centers through its innovative Data Center Building Block Solutions (DCBBS) [4][5] - The company's flexible DCBBS and collaborations within the ecosystem facilitate the quick deployment of high-performance, energy-efficient solutions that support data sovereignty and scalability [5] - Super Micro Computer designs and manufactures high-performance, energy-efficient server and storage systems, offering customizable solutions including AI-optimized servers and liquid-cooling technology for data centers and cloud computing [6]
Should You Chase the Perplexity-Driven Rally in CoreWeave Stock?
Yahoo Finance· 2026-03-04 20:49
Core View - CoreWeave (CRWV) has secured a significant multi-year strategic partnership with Perplexity AI, a rapidly growing search disruptor, to utilize its dedicated GB200 NVL72 clusters for scaling Perplexity's Sonar and Search API ecosystem [1][4] Partnership Details - In exchange, CoreWeave will integrate Perplexity's Enterprise Max across its operations, indicating a collaborative effort to enhance both companies' capabilities [2] Market Position and Performance - Despite the recent gains, CoreWeave's stock is still down over 25% from its year-to-date high, highlighting ongoing volatility in its stock performance [2] - The partnership with Perplexity shifts the narrative for CRWV from capital-intensive buildouts to high-margin execution, as Perplexity answers over 1.5 billion questions monthly, validating CoreWeave's AI infrastructure [4] Revenue and Growth Potential - The partnership is expected to diversify CoreWeave's revenue stream beyond experimental labs into daily consumer applications, potentially leading to further gains in 2026 [5] - CoreWeave reported a substantial revenue backlog of $67 billion and a recent $5 billion agreement with Meta Platforms, reinforcing its position as a primary provider of Nvidia-powered infrastructure [6] - The company has a 3.1-gigawatt power pipeline and a clear path to achieving $30 billion in annual recurring revenue (ARR), positioning it strongly for long-term dominance in the GPU-as-a-Service market [7] Analyst Sentiment - Wall Street analysts are optimistic about CRWV shares, with a projected 140% year-on-year revenue increase to over $12 billion this year, indicating strong market confidence [10]
Akamai Technologies CEO Details AI Inference Cloud Push, 45%-50% Cloud Growth at Conference
Yahoo Finance· 2026-03-04 14:07
Core Insights - Akamai Technologies is positioning itself as a cloud company with significant growth in cloud infrastructure services, expecting a revenue growth of 45%-50% in this segment [3][15] - The company is focusing on AI-driven solutions, particularly in AI inference and security, to meet the increasing demand for low-latency and high-performance computing [6][4] Commerce Performance - Commerce companies benefit from AI-driven user experiences that enhance performance and reliability, leading to higher conversion rates [1] Live Video and Streaming - Akamai is utilized by hyperscalers for live sports streaming to ensure synchronized viewing, which is crucial for betting applications [2] - The company's compute platform is preferred for latency-sensitive workloads, demonstrating its capability in delivering lower latency and improved scalability [2] Cloud Infrastructure Services - The cloud infrastructure segment finished Q4 with $94 million in revenue, marking a 45% year-over-year increase [3] - Akamai anticipates continued acceleration in growth for this segment throughout the year [3] Security Solutions - API security and Guardicore segmentation generated $90 million in revenue, up 35% year-over-year, and are seen as key growth drivers [4] - The company is investing in AI-related security capabilities to address the new attack surfaces created by AI adoption [4][5] Delivery Revenue Outlook - Akamai expects a mid-single-digit decline in delivery revenue for the year, influenced by traffic growth and pricing declines [10] - The company is maintaining pricing discipline and has opted not to pursue certain business opportunities if the economics are unfavorable [11] Capital Allocation and M&A Strategy - Akamai repurchased approximately $800 million in shares last year and continues to allocate capital for M&A and ongoing capital expenditures [13] - The company is actively looking for acquisitions in security and compute, focusing on product adjacencies that align with its current platform [14] Investor Perception - There is a shift in investor perception, with increasing recognition of Akamai as a cloud company rather than solely a CDN provider, highlighting its growth in cloud infrastructure services [15]
Digital Realty Trust (NYSE:DLR) 2026 Conference Transcript
2026-03-02 23:37
Summary of Digital Realty Trust (NYSE: DLR) 2026 Conference Call Company Overview - **Company**: Digital Realty Trust (DLR) - **Industry**: Data Center and Digital Infrastructure Key Points and Arguments Financial Performance - Digital Realty reported strong fourth quarter and full-year results in February, with record bookings in the 0 to 1 megawatt segment and a near-record backlog [3][4] - The company achieved nearly 300 basis points above its initial guidance of over 5% Core FFO growth for 2025, setting a target of approximately 8% growth for 2026 [4][6] Growth Drivers - Key growth drivers include: - Continued momentum in the 0 to 1 megawatt interconnection business, with over 30% growth in signings and over 20% in interconnection bookings [5][6] - A healthy backlog of over $600 million in expected revenue for 2026, de-risking the revenue profile [6][7] - Favorable supply-demand dynamics leading to improved pricing across segments [6][10] AI and Digital Transformation - The company sees long-term tailwinds from digital transformation and AI, particularly in inference workloads, which are still in early stages [5][19] - AI-related workloads have increased, with approximately 20% of bookings in the 0 to 1 megawatt category driven by AI inference in 2025 [17][19] Occupancy Metrics - Transitioning to power-based occupancy metrics to align with the power-oriented nature of the business, guiding an improvement of 50 to 100 basis points in occupancy [11][12] Pricing Dynamics - Current pricing environment is favorable due to high demand and constrained supply, with mark-to-market improvements expected [28][29] - Mark-to-market for the greater than 1 megawatt portfolio was over 10% last year, with expectations for similar performance in 2026 [30][33] International Expansion - Recent market entries include Malaysia, Portugal, Israel, and Indonesia, aimed at growing presence in APAC and enhancing global connectivity [34][35] - The company aims to build a connected campus ecosystem in new markets to support enterprise growth and AI deployment [35][36] Power Solutions - Digital Realty emphasizes long-term utility grid power as the primary source, while exploring bridging solutions like fuel cells and turbines to address power capacity constraints [38][39] - The company is positioned to manage stricter grid access requirements due to established relationships with utility providers [41][42] Supply Chain Management - The company has proactively managed supply chain risks, particularly in light of elevated hyperscaler CapEx plans, ensuring procurement aligns with expected megawatt deliveries [48][49] Sovereign AI Opportunities - Digital Realty sees growing demand for sovereign AI solutions, particularly in international markets, and has the capacity to meet these needs through partnerships with hyperscalers [26][27] Additional Important Insights - The company is focused on maintaining a diverse customer base with high credit quality, which supports its ability to raise capital and manage financial commitments effectively [46][47] - Liquid cooling solutions are being rolled out primarily in new deployments, with some retrofitting in existing facilities to meet increasing density demands [24][25] This summary encapsulates the key insights and strategic directions discussed during the conference call, highlighting Digital Realty's robust growth prospects and proactive management in a dynamic market environment.
5 biggest takeaways from Nvidia's Q4 earnings — from the new Vera Rubin chips to addressing an emerging risk
Business Insider· 2026-02-26 02:06
Core Viewpoint - Nvidia's recent earnings report highlights its strong position in the AI sector, surpassing Wall Street expectations and indicating sustained momentum in the AI boom [1][2]. Group 1: Nvidia's Role in AI - Nvidia is positioning itself as the backbone of the AI industry, with significant partnerships, including a multibillion-dollar deal with OpenAI and collaborations with Meta [3][4]. - The company aims to ensure that all forms of AI, from large language models to robotics, are built on its platform, capitalizing on the new computing era [4]. Group 2: Future Developments - Nvidia is integrating Groq's low-latency AI inference technology into its architecture, with more details expected at the upcoming GTC conference [5][6]. - Early samples of the next-generation Vera Rubin chips have been shipped, with broader shipments anticipated in the second half of 2026, promising significant performance improvements over the current Blackwell model [8][9]. Group 3: Strategic Investments and Partnerships - Nvidia is close to finalizing a deal with OpenAI, part of a larger AI infrastructure initiative potentially worth $100 billion, aimed at strengthening the AI ecosystem [13][15]. - The company's strategy involves investing in AI firms like Anthropic and OpenAI to ensure that future software and hardware developments are built on Nvidia's platform [14].
Bank of America drops a surprising Nvidia warning before earnings
Yahoo Finance· 2026-02-25 18:33
Core Viewpoint - Nvidia's upcoming earnings report is a critical event for the semiconductor industry, with analysts highlighting the need to consider broader implications beyond just GPU demand [1][5]. Group 1: Earnings Report Insights - Nvidia is set to report earnings on February 25, which is anticipated to significantly impact the AI trade and investor sentiment [1][2]. - Analysts are suggesting that the market may not fully grasp the implications of AI workloads transitioning from training to deployment, which could reshape perceptions of Nvidia and its competitors [3][6]. Group 2: Market Dynamics and Hardware Implications - The shift towards AI inference, which requires more CPUs for orchestration and scheduling, indicates a change in hardware demand, suggesting that Nvidia's future earnings may depend on its ability to capture more of the overall system rather than just focusing on GPUs [6][9]. - The debate around Nvidia's valuation centers on whether GPU demand can sustain its growth rate, with potential stability in earnings if the company expands its role in the compute, memory, and networking layers [7][9].
X @Starknet (BTCFi arc) 🥷
Starknet 🐺🐱· 2026-02-16 01:28
RT Victor Amaya (@vaamx_)STWO on @Starknet just shattered the ZK-ML record.In 2024, zkLLM proved 13B-parameter AI inference in 15 minutes on basic CUDA.That was the best in the world.In 2026, STWO does it 50x faster on the most advanced GPU proving backendever built. ...