量子位 - filings, earnings calls, financial reports, news

量子位· 2025-10-06 05:42

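henry, reporting from Aofeisi. 量子位 | WeChat official account QbitAI

Have you ever seen a "blind" robot demo like this? Completely unable to see (no cameras, no radar, no perception units of any kind), it picks up a 4.5 kg (9 jin) chair on its own, climbs onto a 1-meter-high table, then somersaults back down. It is not just showing off, either: when it comes to real work, carrying boxes is no problem. It can also leap onto a table in a single bound, and climbing a slope on all fours works just as well.

These smooth combos come from OmniRetarget, the first humanoid (legged) robot research result released by Amazon's robotics team FAR (Frontier AI for Robotics). OmniRetarget lets reinforcement learning policies learn long-horizon loco-manipulation skills in complex environments and transfer zero-shot from simulation to a humanoid robot. One commenter wrote: it can do parkour and real work, so isn't that 10 times better than Tesla's Optimus?

In addition, preserving task-relevant interactions enables efficient data augmentation, so a single demonstration can be generalized to different robot embodiments, terrains, and object configurations, which reduces the cost of collecting data for each variant. Compared with other motion retargeting methods, OmniRetarget shows across-the-board advantages in every key aspect: hard constraints, object interaction, terrain interaction, and data augmentation. | Methods | Hard Ki ...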

Sora2 Is Still Stuck at 5 Seconds, While ByteDance's AI Video Generation Has Already "Taken Off" at 4 Minutes

量子位· 2025-10-06 05:42

Core Insights
- ByteDance has developed a new method called Self-Forcing++ that enables the generation of long videos up to 4 minutes and 15 seconds without compromising quality, a significant improvement over existing models that typically generate videos of only 5 to 10 seconds [1][2][28]

Group 1: Technology and Methodology
- Self-Forcing++ does not require changing the model architecture or collecting new long video datasets, yet still allows the generation of high-quality long videos [1][2]
- The method improves video generation by optimizing the training process through noise initialization, distribution matching distillation, and a rolling KV cache mechanism [13][14][15]
- The model learns to generate stable long videos by iteratively correcting its mistakes, enhancing its ability to produce coherent and high-fidelity content over extended durations [15][17]

Group 2: Performance Metrics
- In short-duration scenarios (5 seconds), Self-Forcing++ achieved a semantic score of 80.37 and a total score of 83.11, outperforming several existing models [22][23]
- For longer durations (50 seconds), it achieved a visual stability score of 90.94, significantly higher than competitors like CausVid and Self-Forcing [24]
- The model also performed well when generating videos of 75 to 100 seconds, maintaining high fidelity and consistency without common failure modes such as motion stagnation or quality degradation [26][28]

Group 3: Future Implications
- The advancements in long video generation suggest that the era of AI-generated films may be approaching, with potential applications in various media and entertainment sectors [6][28]
- The introduction of Self-Forcing++ could lead to new standards in video quality and generation capabilities, impacting how content is created and consumed in the digital landscape [6][28]
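The summary names a "rolling KV cache mechanism" but gives no details. As a rough illustration only, the general idea of a rolling cache is to keep the most recent entries and evict the oldest, so memory stays bounded however long generation runs. The class name `RollingKVCache`, the `window` parameter, and the string placeholders standing in for key/value tensors are all assumptions made for this sketch, not ByteDance's implementation.

```python
from collections import deque


class RollingKVCache:
    """Hypothetical fixed-size cache keeping only the newest `window` entries."""

    def __init__(self, window: int):
        if window <= 0:
            raise ValueError("window must be positive")
        # deque(maxlen=...) drops the oldest item automatically once full.
        self.keys = deque(maxlen=window)
        self.values = deque(maxlen=window)

    def append(self, key, value):
        # Store the key/value pair for the newest frame or token.
        self.keys.append(key)
        self.values.append(value)

    def __len__(self):
        return len(self.keys)


# Usage: simulate 10 generation steps with a window of 4.
cache = RollingKVCache(window=4)
for step in range(10):
    cache.append(f"k{step}", f"v{step}")

print(len(cache), list(cache.keys))
```

After ten steps only the last four entries remain, which is why a rolling cache lets generation continue past the length the model was originally trained on without the cache growing.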

AI video generation

Self-Forcing++ method

Artificial Intelligence

Self-Forcing++

Sora2

ByteDance Wan

Reborn as Sam Altman in Minecraft: A Netizen Hand-Builds ChatGPT

量子位· 2025-10-06 05:42

Core Viewpoint
- The article discusses the impressive achievement of creating a ChatGPT model within the game Minecraft, showcasing the potential of using redstone circuits to simulate complex computational tasks [1][2][4].

Group 1: Model Specifications
- The constructed ChatGPT model has approximately 5 million parameters, specifically 5,087,280 [16].
- It utilizes a TinyChat dataset for training, with an embedding dimension of 240 and a vocabulary of 1,920 tokens [18].
- The model features 6 layers and 5 attention heads, with a context window size of 64 tokens, suitable for very short conversations [19].

Group 2: Construction Process
- The process involves training a small GPT model on a personal computer, compressing weights to low precision, and exporting the model structure [25].
- The next steps include translating computational methods into pixel block language and defining reusable circuit modules [26][27].
- Finally, a "compiler" script is used to map the trained model to redstone modules, facilitating the construction of the entire setup [28][30].

Group 3: Redstone Circuit Functionality
- Redstone circuits in Minecraft operate on binary logic, where signals can be either on (1) or off (0), allowing players to build complex logic gates and circuits [32][34].
- This capability enables the construction of basic computational systems, such as adders and counters, leading to the potential for creating CPUs and neural networks [34].

Group 4: Broader Implications
- The article highlights that the development of computational systems in Minecraft is still in its infancy, with only about 1% of the potential explored [37].
- Other projects within Minecraft include building CNNs for digit recognition and creating various games and even an internet simulation [39][46].
- The narrative suggests that players in Minecraft may eventually surpass current AI capabilities, hinting at a future where Minecraft could play a role in advancing artificial general intelligence (AGI) [48][49].
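As a sanity check on the figures under Model Specifications, the sketch below counts parameters for one plausible GPT-style layout. The summary does not describe the layer structure, so everything beyond the quoted numbers is an assumption: no bias terms, a 4x MLP expansion, learned position embeddings, scale-only normalization (two per layer plus a final one), and an output head that is not tied to the token embedding. The function name `gpt_param_count` is hypothetical. Under these assumptions the total matches the reported 5,087,280, though other layouts could produce the same number.

```python
def gpt_param_count(vocab, d_model, n_layers, context, mlp_ratio=4):
    """Count parameters for an assumed bias-free GPT-style transformer."""
    token_embedding = vocab * d_model
    position_embedding = context * d_model          # assumed learned positions
    attention = 4 * d_model * d_model               # Q, K, V and output projections
    mlp = 2 * d_model * (mlp_ratio * d_model)       # up- and down-projection
    norms = (2 * n_layers + 1) * d_model            # assumed scale-only norms
    output_head = vocab * d_model                   # assumed untied from embedding
    return (token_embedding + position_embedding
            + n_layers * (attention + mlp) + norms + output_head)


# Values quoted in the summary: vocabulary 1,920, embedding dimension 240,
# 6 layers, context window 64. The 5 attention heads split the 240 dimensions
# into 48 per head and do not change the parameter count.
total = gpt_param_count(vocab=1920, d_model=240, n_layers=6, context=64)
head_dim = 240 // 5
print(total, head_dim)
```

Most of the budget sits in the six transformer layers (4,147,200 parameters), with the two vocabulary-sized matrices contributing 921,600 between them.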