Workflow
COSMOS
icon
Search documents
X @Forbes
Forbes· 2025-08-14 19:18
374-foot-long COSMOS just launched at a Lurssen shipyard in Germany. (Photo: Tom Van Oossanen)https://t.co/Ej9NxtALW5 https://t.co/I9HRew36IE ...
X @Forbes
Forbes· 2025-08-13 20:02
374-Foot-Long COSMOS Launches A New Era Of Superyacht Design And Construction https://t.co/1P4xS8PUW1 https://t.co/1P4xS8PUW1 ...
LeCun出手,造出视频世界模型,挑战英伟达COSMOS
机器之心· 2025-07-29 09:58
Core Viewpoint - The article discusses the development and advantages of a new video world model called DINO-world, which aims to improve the efficiency and effectiveness of predicting future frames in various environments, particularly in the context of artificial intelligence and machine learning [9][10]. Data Challenges - The acquisition of large-scale, high-quality video datasets is costly, especially when action annotations are required. Current successful applications of world models are limited to specific fields like autonomous driving and video games [5]. - Accurately modeling physical laws and behaviors in unconstrained, partially observable environments remains a significant challenge, even for short time scales. Advanced pixel-based generative models consume enormous computational resources, with training times reaching up to 22 million GPU hours for models like COSMOS [6]. Model Development - DINO-world utilizes a frozen visual encoder (DINOv2) to pre-train the video world model in a latent space, followed by fine-tuning with action data for planning and control [9]. - The architecture of DINO-world significantly reduces resource consumption during both training and inference phases compared to current state-of-the-art models [10]. Training and Evaluation - DINO-world was trained on a large dataset of approximately 60 million uncleaned network videos, enabling it to learn transferable features across different domains [11]. - In the VSPW segmentation prediction task, DINO-world achieved a mean Intersection over Union (mIoU) improvement of 6.3% when predicting future frames, outperforming the second-best model [13]. Methodology - The model employs a frame encoder that does not directly model pixels but instead uses latent representations based on video patches, which significantly lowers the computational cost of training the predictor [19]. - The training objective is set as "next frame prediction," allowing for efficient parallelization and focusing on the most relevant tokens for loss calculation [27]. Action-Conditioned Fine-Tuning - DINO-world can be adapted for action-conditioned tasks by incorporating an action module that updates the query vector based on the corresponding actions, which can be trained on a small dataset of action-conditioned trajectories [30][33]. Experimental Results - DINO-world demonstrated superior performance in dense prediction tasks across various datasets, including Cityscapes, VSPW, and KITTI, validating the effectiveness of the proposed paradigm [37][38]. - The model's performance in intuitive physics tests showed a strong understanding of physical behaviors, comparable to larger models like V-JEPA [40][41]. Planning Evaluation - The action-conditioned model was trained on offline trajectories, showing significant performance improvements compared to models trained from scratch, particularly in more complex environments [44].
隔夜美股全复盘(6.26) | 英伟达涨逾4%,股价创新高再度成为全球市值最高的公司,黄仁勋称机器人技术是英伟达下一个万亿美元级别的增长机会
Sou Hu Cai Jing· 2025-06-25 23:04
Market Overview - US stock indices experienced volatility, with the Dow Jones down 0.25%, Nasdaq up 0.31%, and S&P 500 flat [1] - The VIX index decreased by 4.12% to 16.76, indicating reduced market fear [1] - The US dollar index fell by 0.28% to 97.7, while the yield on the 10-year Treasury bond dropped by 0.116% to 4.292% [1] - Spot gold increased by 0.27% to $3332.02 per ounce, and Brent crude oil fell by 0.61% to $66.4 [1] Industry & Stocks - In sector performance, semiconductor, technology, and healthcare sectors rose by 0.9%, 0.85%, and 0.09% respectively, while other sectors including real estate and consumer staples saw declines [2] - Chinese concept stocks showed mixed results, with TSMC up 1.2% and Alibaba down 2.1% [2] - Major tech stocks mostly rose, with Nvidia increasing by 4.33% to become the world's most valuable company [3] - Nvidia's CEO highlighted robotics as a significant growth opportunity, alongside AI, with potential applications in autonomous vehicles [6][8] Company Highlights - QuantumScape announced significant advancements in manufacturing processes for solid-state batteries, improving production speed by approximately 25 times [4] - AeroVironment reported Q4 earnings of $1.61 per share, exceeding expectations, with revenue reaching a record $275.1 million [5] - Nvidia's quarterly sales in the automotive and robotics sector grew by 72% year-over-year, contributing to overall revenue growth [6][8] - Goldman Sachs advised caution regarding Tesla's Robotaxi service, citing operational challenges and a significant drop in EU sales by 40.5% in May [9][14]