Transformer Architecture

From GPT-2 to gpt-oss: A Deep Dive into the Evolution of OpenAI's Open Models
机器之心· 2025-08-18 05:15
Core Insights
- OpenAI has released gpt-oss-120b and gpt-oss-20b, its first open-weight models since GPT-2 in 2019; thanks to various optimizations they can be run locally [4][5]
- The article provides a detailed analysis of the architectural advances from GPT-2 to gpt-oss and compares gpt-oss with Qwen3 [4][5]

Model Architecture Overview
- gpt-oss-20b can run on consumer-grade GPUs with 16 GB of memory, while gpt-oss-120b requires a single H100 GPU with 80 GB of memory or more [10]
- The gpt-oss architecture looks fairly conventional, as leading LLM developers tend to build on similar foundational architectures with minor adjustments [10][11]

Changes Since GPT-2
- The article highlights the major changes since GPT-2, including the removal of Dropout, the adoption of RoPE for positional encoding, and the replacement of GELU with Swish/SwiGLU (see the code sketch after this summary) [20][22][29]
- Mixture of Experts (MoE) layers increase parameter capacity while maintaining efficiency by activating only a subset of experts for each token [39]
- Grouped Query Attention (GQA) is introduced as a more efficient alternative to Multi-Head Attention (MHA) [41]
- gpt-oss applies sliding-window attention to reduce memory usage and computational cost [47]
- RMSNorm replaces LayerNorm for better efficiency in large-scale LLMs [52]

Comparison with Qwen3
- gpt-oss-20b has a wider architecture with more attention heads, while Qwen3 is deeper, with more transformer blocks [69][70]
- gpt-oss uses fewer but larger experts, whereas Qwen3 uses a larger number of smaller experts [72]
- Both models use grouped query attention, but gpt-oss additionally applies sliding-window attention to limit the attended context [82]

Additional Insights
- The gpt-oss models are reasoning models, and users can easily adjust the reasoning effort [93]
- The training compute for gpt-oss is estimated at 2.1 million H100 GPU hours, comparable to other large models [92]
- MXFP4 quantization allows the gpt-oss models to run on a single GPU, improving accessibility [98]
- Benchmark results indicate that gpt-oss performs comparably to proprietary models, although it has not yet been extensively tested in the wild [101][106]
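To make two of the architectural changes listed above concrete, here is a minimal PyTorch sketch of RMSNorm in place of LayerNorm and a SwiGLU feed-forward block in place of the GELU MLP. This is an illustrative toy with made-up dimensions, not gpt-oss's actual implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class RMSNorm(nn.Module):
    """RMSNorm rescales by the root-mean-square of the activations.
    Unlike LayerNorm it skips mean subtraction and the bias term,
    which saves computation at large scale."""
    def __init__(self, dim: int, eps: float = 1e-6):
        super().__init__()
        self.eps = eps
        self.weight = nn.Parameter(torch.ones(dim))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        rms = torch.rsqrt(x.pow(2).mean(dim=-1, keepdim=True) + self.eps)
        return self.weight * x * rms

class SwiGLUFeedForward(nn.Module):
    """Gated feed-forward block: Swish(x @ W1) * (x @ W3), then project back.
    This is the SwiGLU variant that replaced the GELU MLP used in GPT-2."""
    def __init__(self, dim: int, hidden_dim: int):
        super().__init__()
        self.w1 = nn.Linear(dim, hidden_dim, bias=False)  # gate branch
        self.w3 = nn.Linear(dim, hidden_dim, bias=False)  # value branch
        self.w2 = nn.Linear(hidden_dim, dim, bias=False)  # output projection

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.w2(F.silu(self.w1(x)) * self.w3(x))

# Toy usage with arbitrary dimensions: normalize, then apply the gated MLP.
x = torch.randn(2, 8, 64)                       # (batch, tokens, hidden)
y = SwiGLUFeedForward(64, 256)(RMSNorm(64)(x))
print(y.shape)                                  # torch.Size([2, 8, 64])
```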
In Depth | Cerebras Founder, NVIDIA's Latest Challenger, in Conversation with a Former Google Executive: We Are at a Stage Where the Inflection Point Cannot Be Predicted
Z Potentials· 2025-08-15 03:53
Core Insights
- The article discusses the transformative impact of AI on industries, emphasizing the role of open source and data in global AI competition, the challenges of AI safety and alignment, and power supply as a limit on the development of AGI [2][16]

Group 1: AI Hardware Innovations
- Cerebras Systems, led by CEO Andrew Feldman, focuses on building the fastest and largest AI computing hardware, which is crucial for the growing demand for AI technologies [2][3]
- The company's wafer-scale chip is 56 times larger than the largest conventional chip and is designed specifically for AI workloads, which require massive amounts of simple computation and unusual memory-access patterns [8][9]
- Close collaboration between hardware and software is essential for accelerating AGI development, with a focus on optimizing matrix multiplication and memory-access speed [11][12]

Group 2: Open Source and Global Competition
- The open-source ecosystem is seen as a vital area for innovation, particularly benefiting smaller companies and startups competing against far better-capitalized firms [18][19]
- The cost of processing tokens has fallen dramatically, from $100 per million tokens to as low as $1.50 or $2, fostering innovation and broader application of the technology [19]
- Competition in AI is perceived to be primarily between the US and China, with emerging markets also adopting Chinese open-source models [18]

Group 3: Power Supply and AGI Development
- Power supply is identified as a critical limitation for AGI development, with high electricity costs in Europe posing particular challenges [42][45]
- The discussion highlights the need for significant energy resources, such as nuclear power, to support the large data centers essential for AI operations [44][46]
- The article suggests that the future of AGI may depend on building new nuclear power plants to meet the energy demands of advanced AI systems [46]

Group 4: AI Safety and Alignment
- AI alignment means ensuring that AI systems reflect human values and norms, with ongoing efforts to develop testing methods that check models for potential dangers [35][36]
- The hard problem is maintaining alignment in self-improving systems, raising concerns about the risks of releasing advanced AI without proper oversight [37][38]
- Responsibility for AI safety is shared between hardware and software, underscoring the need for collaboration in addressing these challenges [39]
GPT-5's Release Signals That Transformer-Based Large Language Models Are Nearing Their Limit; Where Is the Next Wave?
老徐抓AI趋势· 2025-08-15 03:00
Core Viewpoint
- The release of GPT-5 marks a significant moment for the AI industry: the era of transformative leaps in large language models is giving way to incremental improvement, suggesting that the Transformer architecture may be approaching its limits [6][56]

Performance Analysis
- GPT-5 improves on various core metrics, for example reaching 94.6% accuracy on the AIME math competition without tools and 100% with tools, but the gains over previous models are less dramatic than before [9][12]
- On Humanity's Last Exam (HLE), GPT-5 Pro scored 42%, a notable increase from the previous model's 24.3% [16]
- For programming, GPT-5 scored 74.9% on SWE-bench Verified, slightly surpassing Anthropic's Claude Opus 4.1 [21][24]
- GPT-5 is significantly cheaper to use than its competitors, with input priced at $1.25 per million tokens (see the cost illustration after this summary), pointing to potential price competition in the market [26][27]

Industry Trends
- The GPT-5 launch event was more elaborate yet lacked the excitement of earlier launches, reflecting a shift in how OpenAI presents its advances [8][9]
- The AI industry is moving into a phase where quality and user experience are prioritized alongside raw capability, indicating a maturing market [8][12]
- The likely saturation of training data and parameter scaling suggests the industry may soon struggle to achieve further breakthroughs with current architectures [34][37]

Future Directions
- Two potential directions for future AI development are algorithmic innovation, such as hierarchical reasoning models, and richer data types, including more complex modalities like video and sensor data [38][41]
- The industry is shifting from competing on "better quality" to competing on "lower prices," which could squeeze profit margins [43]

Conclusion
- The release of GPT-5 marks both a peak and a potential turning point for the AI landscape, with further advances likely requiring new architectures or new data modalities to sustain growth [56]
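As a rough illustration of the pricing point above, the snippet below computes the cost of a single request at the quoted $1.25-per-million-token input price; the output price and the token counts are hypothetical placeholders for illustration, not figures from the article.

```python
INPUT_PRICE_PER_M = 1.25    # USD per million input tokens (quoted in the article)
OUTPUT_PRICE_PER_M = 10.00  # hypothetical output price, not taken from the article

def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of one request at the prices above."""
    return (input_tokens * INPUT_PRICE_PER_M
            + output_tokens * OUTPUT_PRICE_PER_M) / 1_000_000

# e.g. an 8,000-token prompt with a 1,000-token completion
print(f"${request_cost(8_000, 1_000):.4f}")   # $0.0200
```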
A 10,000-Word Analysis of the DeepSeek MoE Architecture!
自动驾驶之心· 2025-08-14 23:33
Core Viewpoint
- The article gives a comprehensive overview of the Mixture of Experts (MoE) architecture, focusing on the evolution and implementation of DeepSeek's MoE models (V1, V2, V3) and their optimizations for token routing and load balancing [2][21][36]

Group 1: MoE Architecture Overview
- MoE (Mixture of Experts) is a model architecture that uses multiple expert networks to raise capacity; its sparse activation makes it well suited to large-scale cloud deployment [2][3]
- Interest in the MoE architecture surged with the release of Mistral AI's Mixtral model, which highlighted the potential of sparse architectures [2][3]
- The Switch Transformer introduced a routing mechanism in which each token selects its top-K experts, letting different experts specialize in different kinds of knowledge [6][10]

Group 2: DeepSeek V1 Innovations
- DeepSeek V1 addresses two main issues in existing MoE practice, knowledge mixing and knowledge redundancy, both of which hinder expert specialization [22][24]
- The model introduces fine-grained expert division and shared experts to improve specialization and reduce redundancy, allowing knowledge to be captured more efficiently [25][26]
- The architecture includes a load-balancing mechanism to spread tokens evenly across experts, mitigating training inefficiencies [32]

Group 3: DeepSeek V2 Enhancements
- DeepSeek V2 builds on V1's design with three key optimizations focused on load balancing [36]
- The model limits the number of devices a token's routed experts may span, reducing communication overhead during training and inference [37]
- A new communication load-balancing loss ensures tokens are distributed equitably across devices, further improving efficiency [38]

Group 4: DeepSeek V3 Developments
- DeepSeek V3 changes how the MoE layer computes routing scores, replacing the softmax with a sigmoid to improve computational efficiency (see the toy routing sketch after this summary) [44]
- The model drops the auxiliary load-balancing losses and instead uses a learnable bias term to steer routing, which balances load during training [46]
- A sequence-level auxiliary loss is added to prevent extreme imbalance within individual sequences, making training more stable [49]
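The routing ideas summarized above can be sketched in a few lines. The toy layer below combines a shared expert with sigmoid-scored top-K routing and a learnable bias that affects only expert selection, loosely in the spirit of the DeepSeek V3 description; it is a simplified illustration rather than DeepSeek's actual code, and all dimensions and module choices are arbitrary.

```python
import torch
import torch.nn as nn

class ToyMoELayer(nn.Module):
    """Toy MoE layer: one shared expert sees every token, and each token is
    routed to its top-K experts by sigmoid gate scores; a learnable bias
    steers which experts get picked but does not rescale their outputs."""
    def __init__(self, dim: int, n_experts: int = 8, top_k: int = 2):
        super().__init__()
        self.top_k = top_k
        self.router = nn.Linear(dim, n_experts, bias=False)
        self.route_bias = nn.Parameter(torch.zeros(n_experts))   # load-balancing bias
        self.experts = nn.ModuleList([nn.Linear(dim, dim) for _ in range(n_experts)])
        self.shared_expert = nn.Linear(dim, dim)                  # always active

    def forward(self, x: torch.Tensor) -> torch.Tensor:          # x: (tokens, dim)
        scores = torch.sigmoid(self.router(x))                   # sigmoid, not softmax
        _, top_idx = torch.topk(scores + self.route_bias, self.top_k, dim=-1)
        gates = torch.gather(scores, -1, top_idx)                 # bias-free gate weights
        gates = gates / gates.sum(dim=-1, keepdim=True)           # normalize chosen gates
        routed = torch.zeros_like(x)
        for slot in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = top_idx[:, slot] == e                      # tokens routed to expert e
                if mask.any():
                    routed[mask] += gates[mask, slot:slot + 1] * expert(x[mask])
        return self.shared_expert(x) + routed

tokens = torch.randn(16, 32)                  # 16 tokens, hidden size 32
print(ToyMoELayer(32)(tokens).shape)          # torch.Size([16, 32])
```

In a real system the per-expert loop would be replaced by batched dispatch across devices, which is exactly where the device-limited routing and communication-balancing losses described for V2 come in.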
A Thousand Teams Compete! The First "Qizhi Cup" Algorithm Competition Concludes Successfully, Accelerating Real-World AI Deployment
机器之心· 2025-08-14 04:57
Core Viewpoint
- Artificial intelligence is transitioning from theoretical exploration to large-scale application, becoming a new engine for high-quality economic and social development in China [1]

Group 1: Event Overview
- The "Qizhi Cup" algorithm innovation application challenge was officially launched on May 20, 2025, by Qiyuan Laboratory, aiming to promote the practical application of intelligent algorithms [1]
- The competition attracted 1,022 teams from universities, research institutions, and technology companies, with three teams winning across the different tracks [2][20]

Group 2: Competition Tracks
- The competition featured three main tracks: "Robust Instance Segmentation of Satellite Remote Sensing Images," "Drone Ground Target Detection for Embedded Platforms," and "Adversarial Challenges for Multimodal Large Models" [4][20]
- Each track focused on core capabilities such as robust perception, lightweight deployment, and adversarial defense [4]

Group 3: Track Summaries
Robust Instance Segmentation of Satellite Remote Sensing Images
- This track aimed at precise segmentation of complex targets in high-resolution remote sensing images, addressing challenges like occlusion and domain differences [6]
- The champion team from South China University of Technology utilized an optimized Co-DETR model, enhancing feature learning through multi-task training [8][9]
Drone Ground Target Detection for Embedded Platforms
- This track required algorithms to achieve high recognition accuracy while operating efficiently on resource-constrained platforms [9][21]
- The winning team, "Duan Yan Wu Ping," achieved high precision under hardware limitations by transitioning from YOLOv11 to a Transformer-based Co-DETR model [10][12]
Adversarial Challenges for Multimodal Large Models
- This track evaluated models on accuracy, robustness, and resistance to attacks in visible-light remote sensing scenarios [14]
- The winning team from Sun Yat-sen University developed a robust and reliable model using a systematic optimization approach [16][18]

Group 4: Industry Implications
- The "Qizhi Cup" serves as a platform for integrating cutting-edge algorithms with practical applications, emphasizing the adaptability and engineering feasibility of models in dynamic environments [20][21]
- The competition fosters AI talent development, enhancing participants' understanding of business and data while bridging the gap between theory and engineering [23]
Farewell to the Transformer, Reshaping the Machine Learning Paradigm: Shanghai Jiao Tong University's First "Brain-Like" Large Model Is Born
机器之心· 2025-08-13 09:29
Core Viewpoint
- The article discusses the introduction of BriLLM, a new language model inspired by human brain mechanisms, which aims to overcome the limitations of traditional Transformer-based models, such as high computational demands, lack of interpretability, and context size restrictions [3][8]

Group 1: Limitations of Current Models
- Current Transformer-based models face three main issues: high computational requirements, black-box interpretability, and context size limitations [6][8]
- The self-attention mechanism in Transformers has a time and space complexity of O(n²), leading to increased computational costs as input length grows [7]
- The internal logic of Transformers lacks transparency, making it difficult to understand the decision-making process within the model [7][8]

Group 2: Innovations of BriLLM
- BriLLM introduces a new learning mechanism called SiFu (Signal Fully-connected Flowing), which replaces traditional prediction operations with signal transmission, mimicking the way neural signals operate in the brain (a toy illustration follows this summary) [9][13]
- The model architecture is based on a directed graph in which all nodes are interpretable, unlike traditional models that offer limited interpretability only at the input and output layers [9][19]
- BriLLM supports unlimited context length without increasing model parameters, allowing long sequences to be handled efficiently [15][16]

Group 3: Model Specifications
- BriLLM comes in two versions, BriLLM-Chinese and BriLLM-English, with non-sparse model sizes of 16.90 billion parameters for both languages [21]
- The sparse version of the Chinese model has 2.19 billion parameters and the English version 0.96 billion, a parameter reduction of roughly 90% [21]
- The design allows multiple modalities to be integrated, so the model can process not just language but also visual and auditory inputs [25][26]

Group 4: Future Prospects
- The team aims to develop a multi-modal brain-inspired AGI framework that integrates perception and motion [27]
- BriLLM has been selected for funding under Shanghai Jiao Tong University's "SJTU 2030" plan, which supports groundbreaking research projects [27]
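As a rough intuition for the signal-flow idea described above, the toy below treats vocabulary tokens as nodes of a dense directed graph and picks the next token as the node receiving the strongest accumulated signal from the context. This is an assumption-laden sketch of the general concept only; it is not BriLLM's SiFu algorithm, its edge parameterization, or its training procedure.

```python
import numpy as np

# Toy setup: every vocabulary token is a graph node; edge_weight[i, j] is the
# strength of the directed edge from token i to token j (random here, learned
# in a real model). All names and numbers are illustrative assumptions.
vocab = ["the", "cat", "sat", "on", "mat"]
rng = np.random.default_rng(0)
edge_weight = rng.random((len(vocab), len(vocab)))

def next_token(context_ids: list[int]) -> str:
    """Propagate a unit signal from each context node along its outgoing
    edges and pick the node where the accumulated signal is largest."""
    signal = np.zeros(len(vocab))
    for node in context_ids:
        signal += edge_weight[node]        # signal flows to all neighbours
    signal[context_ids] = -np.inf          # toy simplification: skip context nodes
    return vocab[int(np.argmax(signal))]

print(next_token([0, 1]))                  # continuation after "the cat"
```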
A Deep Conversation on the GPT-5 Launch: The Backlash of Over-Marketing and the Impasse in AI Breakthroughs
Hu Xiu· 2025-08-12 09:05
Core Insights
- GPT-5 has been released, but it does not represent a significant step towards Artificial General Intelligence (AGI) [1]
- The launch event revealed several issues, including presentation errors and reliance on debunked theories, which highlighted weaknesses in the Transformer architecture [1]
- Despite these shortcomings, GPT-5 is still considered a competent AI product, and OpenAI plans to implement aggressive commercialization strategies in key sectors [1]

Technical Development
- The development of GPT-5 faced various technical bottlenecks, leading to the choice of a specific architecture to overcome these challenges [1]
- The limitations of the scaling law have been encountered, raising questions about future technological pathways for AI advancement [1]

Commercial Strategy
- OpenAI aims to rapidly establish a presence in three main application areas: education, healthcare, and programming [1]
- The company's approach suggests a focus on leveraging GPT-5's capabilities to solidify its market position [1]
Guotai Haitong | Industry: The Technical Evolution and Industry Insights of AI Agents
国泰海通证券研究· 2025-08-08 09:24
Core Insights
- The evolution of AI Agents is fundamentally driven by the paradigm shift towards large language models (LLMs) as the "brain," with commercial value demonstrated through vertical applications that address specific industry pain points with high precision [1][2]
- AI Agents are reshaping software development and human-computer interaction, transitioning from traditional architectures to modern LLM-based frameworks that enable autonomous planning, environmental perception, and tool invocation [1][2]

Technical Evolution
- The core of the AI Agent's technical advancement lies in the changes introduced by modern LLM-based architectures, moving away from traditional agent designs limited by hardware and pre-programmed rules [2]
- The modern LLM-based agent architecture consists of three main modules, brain, perception, and action, and allows multiple specialized agents to collaborate or compete to overcome the limitations of a single agent on complex tasks (see the sketch after this summary) [2]

Industry Chain Formation
- A complete industry chain is emerging: upstream is dominated by a few tech giants providing foundational models and computing power, while midstream sees the rise of open-source frameworks and platforms that lower development barriers [3]
- Downstream applications fall into general-purpose agents for complex multi-step tasks and vertical agents deeply integrated with industry knowledge, showing significant commercial value in sectors like software development, law, finance, and healthcare [3]

Challenges and Future Trajectory
- Despite rapid advances, AI Agents face challenges such as limits on LLM planning and reasoning, context window constraints, memory bottlenecks, multi-agent collaboration issues, and evaluation difficulties [3]
- The future development of AI Agents will depend on the continued evolution of foundational LLMs, the spread of multimodal perception, and the restructuring of the software and hardware ecosystem, moving closer to AGI [3]
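A minimal sketch of the brain/perception/action loop described above, assuming a hypothetical call_llm function standing in for the foundational model and toy tool functions in place of real integrations; the tool-call format is an arbitrary convention for illustration.

```python
from dataclasses import dataclass, field
from typing import Callable

def call_llm(prompt: str) -> str:
    """Placeholder for the LLM 'brain' (in practice an API call).
    Here it returns a canned tool-use decision so the loop runs end to end."""
    return "TOOL:search:latest LLM agent frameworks"

@dataclass
class Agent:
    tools: dict[str, Callable[[str], str]]              # action: available tools
    memory: list[str] = field(default_factory=list)     # perception / context store

    def step(self, observation: str) -> str:
        self.memory.append(observation)                  # perceive the environment
        decision = call_llm("\n".join(self.memory))      # plan with the LLM "brain"
        if decision.startswith("TOOL:"):                 # act by invoking a tool
            _, name, arg = decision.split(":", 2)
            result = self.tools[name](arg)
            self.memory.append(f"tool {name} returned: {result}")
            return result
        return decision                                   # or answer directly

agent = Agent(tools={"search": lambda q: f"3 results for '{q}'"})
print(agent.step("user asks about agent frameworks"))
```

Multi-agent setups repeat this loop per agent and route each agent's output into the others' memories, which is where the collaboration and evaluation challenges noted above arise.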
You Can Clearly Feel That Programmer Interviews Have Changed...
猿大侠· 2025-07-23 03:25
Core Viewpoint
- The article emphasizes the importance of integrating existing programming skills with large model technologies to enhance career prospects in the AI field, rather than abandoning current skills [1]

Summary by Sections
Course Overview
- A course titled "Large Model Application Development Practical Training" is designed to help developers master AI application development from scratch through practical projects and code breakdowns [1]
- The course includes insights from industry experts and real case studies from major companies, providing participants with high-paying job opportunities and internal referrals [1][15]
Course Content
- The curriculum covers essential concepts such as RAG (Retrieval-Augmented Generation), AI Agents, and the Transformer architecture, focusing on practical applications and fine-tuning techniques [9][11]
- It consists of five modules: basics, tools, advanced topics, competitions, and practical applications, ensuring a comprehensive learning path [9]
Target Audience
- The course is aimed at developers looking to connect with product teams, build technical barriers, avoid job insecurity, and enhance their skills for future career development [13]
- It is particularly relevant for programmers concerned about job stability as they age, especially those nearing the 35-year mark [13]
Success Metrics
- The course has served over 20,000 students, receiving positive feedback and helping many secure high-paying job offers [11]
- Participants learn to customize models for specific industries such as manufacturing, healthcare, and finance, improving task accuracy and efficiency [11]
Practical Experience
- The course includes detailed case studies of popular AI applications, allowing participants to gain hands-on experience and build a portfolio of practical projects [16]
- Students learn to implement AI technologies in various business scenarios, enhancing their employability [16]
Career Development
- The course offers insights into current job market trends for large model roles, including salary expectations and career growth opportunities [20]
- Continuous internal referral opportunities are provided, ensuring participants have a direct pathway to high-paying positions at leading companies [20]
Lately, the Hiring Market for Programmers Has Gone Crazy...
程序员的那些事· 2025-07-22 03:48
Core Viewpoint
- The article emphasizes the importance of integrating existing programming skills with large model technologies to enhance career prospects and salary opportunities in the AI field [1]

Group 1: Course Offerings
- A course titled "Large Model Application Development Practical Training" is designed to help developers master the complete AI application development process through practical projects and code breakdowns [1]
- The course covers essential technologies such as RAG, AI Agents, and the Transformer architecture, providing a comprehensive learning path from basics to advanced applications [8]
- The course has served over 20,000 students and has received positive feedback, with many participants securing high-paying job offers [10]

Group 2: Learning Outcomes
- Participants learn to fine-tune mainstream large models like DeepSeek and Qwen for specific scenarios, improving model performance and task accuracy [10]
- The course includes practical applications of RAG technology for efficient knowledge retrieval and generation in sectors such as law, healthcare, and finance [10]
- Students also learn to design and develop AI Agents for multi-task collaboration and complex problem-solving in industry-specific contexts [10]

Group 3: Career Development
- The course aims to help participants build technical barriers, avoid job insecurity, and support their career development over the next 20 years [12]
- It offers insights into current job market trends, salary expectations, and career paths from the perspective of hiring managers [19]
- The program provides reliable internal referral opportunities and direct hiring benefits, facilitating quicker access to high-paying job offers [19]