Nemotron 3 - filings, earnings calls, financial reports, news

Truist Raises NVIDIA (NVDA) PT After Nemotron 3 AI Model Launch

Yahoo Finance· 2026-01-08 15:09

Group 1
- NVIDIA Corporation (NASDAQ:NVDA) is highlighted as a must-buy AI stock, with Truist raising its price target from $255 to $275 while maintaining a Buy rating [1]
- Truist's analysis indicates that AI infrastructure semiconductor stocks are currently undervalued relative to their growth potential, despite challenges in AI infrastructure and funding [3]
- The firm anticipates increased upward pressure on estimates for AI semiconductor stocks compared to diversified analog semiconductors as they approach 2026 [3]

Group 2
- NVIDIA recently launched the Nemotron 3 family of open models, which includes three variants: Nano (30 billion parameters), Super (100 billion parameters), and Ultra (500 billion parameters) [4]
- The Nemotron 3 models utilize a hybrid mixture-of-experts architecture, combining Mamba and Transformer technologies, resulting in a 4x increase in throughput for the Nano model compared to its predecessor [5]
- NVIDIA specializes in designing GPUs and data center solutions that are essential for training and running large-scale AI models, supported by its CUDA software platform [6]

Nvidia(US:NVDA)

CUDA software platform

GPUs

data center solutions

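Nvidia Becomes the US Benchmark for Open-Source Large Models: Nemotron 3 Publishes Even Its Training Recipe and Releases All 10 Trillion Tokens of Data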

QbitAI (量子位)· 2025-12-26 06:35

Core Viewpoint - Nvidia is aggressively advancing in open-source models with the introduction of the "most efficient open model family" Nemotron 3, utilizing a hybrid Mamba-Transformer MoE architecture and NVFP4 low-precision training [1][22].

Group 1: Model Architecture and Efficiency
- Nemotron 3 combines Mamba and Transformer architectures to maximize inference efficiency [7].
- The model architecture interleaves Mamba-2 layers with MoE layers, significantly reducing the reliance on self-attention layers [10].
- In a typical inference scenario with 8k input and 16k output tokens, Nemotron 3 Nano 30B-A3B achieves 3.3 times the throughput of Qwen3-30B-A3B, with the advantage becoming more pronounced as sequence length increases [12].
- The model demonstrates robust performance on long-context tasks, scoring 68.2 on the RULER benchmark with 1 million token input length, compared to only 23.43 for Nemotron 2 Nano 12B [14].

Group 2: LatentMoE Architecture
- For larger models, Nvidia introduces the LatentMoE architecture, which performs expert routing in a latent space [15].
- LatentMoE addresses bottlenecks in MoE layer deployment in two scenarios, low-latency and high-throughput, significantly reducing weight loading and communication costs [16][18].
- LatentMoE utilizes 512 experts with 22 activated, compared to the standard MoE's 128 experts with 6 activated, achieving better performance across various tasks [20].

Group 3: Training Innovations
- Nvidia employs the NVFP4 format for training, achieving a peak throughput three times that of FP8, and has successfully trained models on up to 250 trillion tokens [22].
- The training process retains high precision for certain layers to maintain model stability, while most layers are quantized to NVFP4 [23].
- Nemotron 3's post-training utilizes multi-environment reinforcement learning, covering a wide range of tasks simultaneously, which enhances stability and avoids common issues associated with phased training [24][26].

Group 4: Performance Metrics and Open Source
- The model shows consistent accuracy across various downstream tasks, with NVFP4-trained models closely matching BF16 versions in performance [28].
- The entire post-training software stack is open-sourced under the Apache 2.0 license, including the NeMo-RL and NeMo-Gym repositories [32].
- Nemotron 3 allows for reasoning budget control during inference, enabling users to specify the maximum number of tokens spent on chain-of-thought reasoning, thus balancing efficiency and accuracy [34].
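To make the expert counts in Group 2 concrete, the following is a minimal sketch of generic top-k expert routing together with the activation ratios implied by the quoted figures. It is not Nvidia's implementation and does not model LatentMoE's latent-space projection; the function names `top_k_route` and `active_fraction` and the toy logits are hypothetical, chosen only for illustration.

```python
import math


def top_k_route(router_logits, k):
    """Pick the k highest-scoring experts and softmax-normalize their weights.

    Generic top-k MoE routing for illustration only; LatentMoE's routing in a
    latent space is not modeled here.
    """
    ranked = sorted(range(len(router_logits)),
                    key=lambda i: router_logits[i], reverse=True)
    chosen = ranked[:k]
    # Subtract the max logit before exponentiating for numerical stability.
    peak = max(router_logits[i] for i in chosen)
    exps = [math.exp(router_logits[i] - peak) for i in chosen]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(chosen, exps)]


def active_fraction(num_experts, num_active):
    """Share of experts that run for a single token."""
    return num_active / num_experts


# Expert counts as quoted in the summary above.
standard_moe = active_fraction(128, 6)
latent_moe = active_fraction(512, 22)
print(f"standard MoE: {standard_moe:.2%} of experts active per token")
print(f"LatentMoE:    {latent_moe:.2%} of experts active per token")

# Toy example: 4 experts, route each token to the best 2.
routes = top_k_route([0.1, 2.0, -1.0, 1.5], k=2)
print(routes)
```

Under the quoted numbers both configurations activate under 5% of their experts per token, so the larger expert pool does not by itself imply proportionally more computation per token.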

Tencent Research Institute (腾讯研究院)· 2025-12-20 02:33

Group 1: Core Insights
- The article presents a weekly roundup of the top 50 keywords in the AI sector, highlighting significant developments and trends in the industry [2].
- Key players mentioned include Google, Apple, ByteDance, NVIDIA, and OpenAI, indicating a competitive landscape in AI technology and applications [3][4].

Group 2: Chip Developments
- Google is advancing its AI chip technology with the introduction of TorchTPU [3].
- Apple is focusing on AI server chips, which may enhance its capabilities in AI applications [3].

Group 3: Model Innovations
- Google has launched the Gemini 3 Flash model, while ByteDance introduced Seed1.8, showcasing ongoing innovation in AI models [3].
- Other notable models include MiMo-V2-Flash from Xiaomi and Nemotron 3 from NVIDIA, indicating a diverse range of AI model developments [3].

Group 4: Application Trends
- OpenAI is expanding its ecosystem with the ChatGPT application store and various applications like ChatGPT Images and SAM Audio [3][4].
- Companies like Tencent and xAI are also developing unique applications, such as the writing mode and Grok Voice, respectively [3][4].

Group 5: Technological Insights
- The article discusses various technological insights, including AI memory systems and recursive self-improvement, which are critical for future AI advancements [4].
- The AI adult content market and AGI predictions are also highlighted, reflecting the broader implications of AI technology [4].

AGI

TorchTPU

AI server chips

Gemini 3 Flash

As Nvidia Launches New Nemotron 3 Models, Should You Buy, Sell, or Hold NVDA Stock?

Yahoo Finance· 2025-12-18 13:46

The valuation indicates that investors are willing to pay a premium. NVDA's forward price-to-earnings (P/E) ratio is 40x versus the sector's 24.34x, which means the market is pricing in faster growth and stronger long-term earnings power than the average stock in the group. Over the past 52 weeks, the stock is up about 40%, so the bigger trend remains positive, although sentiment has cooled. Over the past month, shares have declined by about 8%, which appears more like a normal reset after a strong run than ...
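As a rough illustration of the size of the premium described above, the sketch below computes how far NVDA's forward P/E sits above the sector figure, using only the two ratios quoted in the excerpt. The helper name `premium_over` is hypothetical.

```python
def premium_over(value, benchmark):
    """Fractional premium of value over benchmark (0.25 means 25% higher)."""
    return value / benchmark - 1.0


# Forward P/E figures as quoted in the excerpt above.
nvda_forward_pe = 40.0
sector_forward_pe = 24.34

premium = premium_over(nvda_forward_pe, sector_forward_pe)
print(f"NVDA forward P/E premium over sector: {premium:.1%}")
```

On these figures the stock trades at roughly a 64% premium to the sector multiple.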

Nvidia(US:NVDA)

Agentic AI

Graphics Processing Units (GPUs)

GB300 platforms

Top 3 big tech stocks to buy in 2026

Finbold· 2025-12-16 12:34

Core Viewpoint - The technology sector presents a compelling investment opportunity, with analysts predicting continued momentum into 2026, highlighting Alphabet, Nvidia, and Tesla as the top three tech stocks to consider [1][14].

Group 1: Alphabet (GOOGL)
- Alphabet has significantly outperformed its peers and the S&P 500, with shares trading above $308, reflecting a nearly 63% year-to-date increase [2].
- The company has excelled in the AI sector with its Gemini models and Tensor Processing Unit (TPU), enhancing its competitiveness in the data center market [3].
- Potential partnerships are anticipated around TPUs, with companies like Meta showing interest, which could unlock new revenue streams [4].

Group 2: Nvidia (NVDA)
- Nvidia is closely associated with AI, achieving a 31.6% gain year-to-date, with shares trading around $176 [5].
- The company's GPUs are widely used by leaders in the AI field, making them essential for data centers [7].
- Nvidia's recent launch of open-source AI models, Nemotron 3, aims to democratize AI development, potentially solidifying its market position further by 2026 [8].

Group 3: Tesla (TSLA)
- Tesla, while primarily an automaker, is increasingly recognized as a tech stock, with shares at nearly $473, up 17% year-to-date [9].
- CEO Elon Musk's focus on automated driving and AI has attracted analyst attention, with a potential price target of $800 by 2026 suggested by Wedbush [11].
- Positive investor sentiment is supported by successful autonomous vehicle testing in Austin and efforts to improve sales in Europe with more affordable models [12][13].

Tensor Processing Unit (TPU)

AI Daily | Nvidia Acquires SchedMD; Skild AI Procures Dexterous Hands from Robotera (星动纪元)

US Stock Research Society (美股研究社)· 2025-12-16 10:11

Group 1
- The article highlights the rapid development of artificial intelligence technology, presenting significant opportunities in the market [3]
- Skild AI, a US-based robotics company valued at $14 billion, has adopted a Chinese company's advanced dexterous hand technology, marking a significant entry of Chinese components into the global humanoid robot supply chain [5]
- Ant Group has upgraded its AI health application AQ to "Ant Afu" (蚂蚁阿福), focusing on a "health+" strategy with new features for health companionship, inquiries, and services [6]

Group 2
- SenseTime has launched Seko 2.0, the first multi-episode generative AI agent, showcasing significant advantages in consistency for multi-episode video generation [7][8]
- NVIDIA has acquired SchedMD, a leading developer of open-source workload management systems for high-performance computing and AI, and plans to continue developing the Slurm software [10]
- NVIDIA has introduced the Nemotron 3 open model family, aimed at providing an efficient platform for building agent-based AI applications, with the first model already available and larger models expected in 2026 [11]

Nvidia(US:NVDA)

Humanoid Robot

Robotics

Fully direct-drive five-finger dexterous hand XHAND1

Slurm
