Seed3D 1.0
Search documents
传媒行业点评:头部厂商持续入局世界模型,关注影视、游戏环节应用潜力
China Post Securities· 2025-12-29 08:44
Industry Investment Rating - The industry investment rating is "Outperform the Market" and is maintained [1] Core Insights - The report highlights the continuous entry of leading companies into the world model space, with a focus on the film and gaming sectors [3] - The world model is identified as a significant direction in AGI research, with major companies actively investing in this area [4] - The capabilities of world models are expected to evolve, providing ongoing empowerment to the film and gaming industries [5] Summary by Relevant Sections Industry Overview - The closing index level is 802.63, with a 52-week high of 897.3 and a low of 590.32 [1] Investment Highlights - Major companies like Google, Runway, and ByteDance are developing world models that simulate real-world environments and generate content based on multimodal inputs [4] - Google’s latest model, Genie 3, can generate dynamic worlds based on text prompts, while Runway has released GWM-1, which includes variants for environment exploration, character dialogue, and robotics [4] - ByteDance has established a team focused on multimodal interaction and world models, recently launching the 3D generation model Seed3D 1.0 [4] Future Potential - In the film sector, world models aim to enhance video generation by creating physically accurate virtual environments, which could lead to advancements in long video production and complex storytelling [5] - In gaming, the three-dimensional world generation and interactivity of world models align well with game development processes, with companies like Tencent and xAI exploring these capabilities [5] Investment Recommendations - Companies to watch include Kunlun Wanwei for world model development, Huace Film & TV, Light Media, and Hengdian Film for AI in film production, and Perfect World and Giant Network for large-scale 3D game development [6]
95 后团队做 3D 大模型,拿下头部游戏重磅合作,正在定义 3D 生成的新规则
Founder Park· 2025-11-18 11:06
Core Insights - The article highlights the significant advancements made by Yingmou Technology in the field of 3D generation, particularly through their model Rodin and its latest iteration, Rodin Gen-2, which has achieved substantial improvements in generation quality and controllability [2][6][9]. Group 1: Company Achievements - Yingmou Technology's Rodin model was showcased at GDC, capturing the attention of top game developers and leading to the successful application of 3D generation technology in mobile gaming [2]. - The company recently completed a multi-million dollar funding round led by BlueRun Ventures, with participation from ByteDance and Sequoia China, positioning it as a leading startup in the 3D large model sector [2]. - The research paper "CLAY" received nominations for best papers at SIGGRAPH, marking a significant milestone for the young team that has been focused on 3D research since its inception [2][3]. Group 2: Technological Innovations - Rodin Gen-2 has been upgraded to utilize a dataset of millions and billions of parameters, resulting in a qualitative leap in generation quality, including smoother geometric surfaces and reduced post-processing costs [6][9]. - The introduction of the "Bang to Parts" feature allows users to decompose generated models into smaller components, enhancing the controllability of 3D models and streamlining workflows in various applications [9][12]. - The model's ability to generate clean and clear 3D meshes reduces the need for extensive repairs in software like Blender and Unity, making it more production-ready [8]. Group 3: Industry Trends - Major companies are increasingly investing in 3D generation technologies, with Roblox open-sourcing CUBE 3D and ByteDance releasing Seed3D 1.0, indicating a growing trend in the industry [6]. - The demand for rapid and accurate 3D model generation is driving innovations, with Yingmou's technology achieving model generation speeds of under 10 seconds, catering to diverse industry needs [24]. - The team believes that 3D generation will play a crucial role in future applications, serving as a foundational technology for various sectors, including digital content creation, industrial design, and AR/VR interactions [29].
发力人形具身机器人?字节跳动高薪招人
Guan Cha Zhe Wang· 2025-11-05 07:55
Core Insights - ByteDance's Volcano Engine team is launching a high-profile recruitment for a "Senior Algorithm Expert (Embodied Intelligence)" focused on humanoid robot development, with a monthly salary ranging from 95,000 to 120,000 RMB [1][4] Recruitment Details - The position is specifically for humanoid robot operation algorithms, requiring expertise in architecture, grasping, VLA models, and dexterous hands [4] - Candidates must have a master's or doctoral degree in computer science, automation, or related fields, and be familiar with mainstream technologies like VLM/VLA (e.g., BERT, CLIP) [4] - The role involves leading the development of algorithms, participating in large model training and evaluation, and optimizing embedded performance through system integration and simulation [4] Team and Project Overview - ByteDance's Seed team, established in 2023, focuses on general intelligence research, including large models, multi-modal perception, AI agents, and embodied intelligence [10] - The Seed team is actively recruiting for key positions related to robotics, including product and engineering leads, emphasizing multi-modal perception and reinforcement learning [10][11] - Recently, the Seed team released the 3D generation model Seed3D 1.0, which can seamlessly integrate with simulation engines like Isaac Sim for embodied intelligence model training [11] Industry Collaboration - ByteDance has partnered with Seres Group to explore the mass production and application of humanoid robots, forming a joint venture [14][15] - The collaboration focuses on intelligent robot decision-making, control, and human-machine enhancement technologies [16] Job Market Insights - Currently, there are 26 job openings related to "embodied intelligence" and 183 related to "robotics" within ByteDance [12]
传字节跳动发力人形机器人领域
Guan Cha Zhe Wang· 2025-11-05 07:17
Core Insights - ByteDance's subsidiary, Volcano Engine, is launching a high-profile recruitment drive for a "Senior Algorithm Expert (Embodied Intelligence)" focused on humanoid robotics, with a monthly salary ranging from 95,000 to 120,000 RMB [1][4] Recruitment Details - The position is specifically for developing operational algorithms for humanoid robots, including architecture, grasping, VLA models, and dexterous hands, as well as training, evaluating, and deploying large embodied models [4] - Candidates are required to have a master's or doctoral degree in computer science, automation, or related fields, or equivalent industrial experience, and must be familiar with mainstream technologies like VLM/VLA [4] Team and Project Overview - ByteDance's Seed team, established in 2023, is a core R&D team in AI, focusing on general intelligence, large models, multi-modal perception, AI agents, and embodied intelligence [5][11] - The Seed team has recently launched the Seed3D 1.0 model, which generates 3D models compatible with simulation engines like Isaac Sim, facilitating embodied intelligence model training [13] Collaboration and Future Plans - ByteDance has partnered with Seres Group to explore the mass production and application of humanoid robots, establishing a joint venture [15][16] - The collaboration will focus on intelligent robot decision-making, control, and human-machine enhancement technologies [17]
「AI新世代」巨头入场赛道升温,VAST打响3D大模型破圈之战
Hua Xia Shi Bao· 2025-11-04 12:56
Core Insights - VAST is a key player in the AI 3D model sector, which is gaining traction as a new competitive arena in the AI landscape, particularly as traditional modeling transitions to AI-driven 3D models [2][3] - The AI 3D model technology has vast application potential across various industries, including gaming and industrial sectors, but it faces challenges in reaching mainstream consumers [2][4] - VAST's annual recurring revenue (ARR) has surpassed $12 million, indicating strong commercial potential despite being a relatively new company [5][6] Company Overview - VAST was established in March 2023 and has rapidly developed its Tripo AI 3D model series, which includes over 200 billion parameters [6] - The company has served over 5 million professional users and 40,000 enterprise clients, generating more than 50 million 3D models [3][5] - VAST completed a multi-million dollar Pre-A+ funding round in June 2023, indicating investor confidence in its growth potential [6] Market Dynamics - The 3D vision market in China is projected to grow significantly, with a compound annual growth rate (CAGR) of approximately 25.73% from 2024 to 2028, reaching around 7 billion yuan [3] - Major competitors like Tencent and ByteDance are entering the AI 3D model space, posing challenges for VAST as it seeks to establish itself [8] - The industry is still in its early stages, with a lack of benchmark applications, which may hinder broader adoption of AI 3D models [7] Technological Challenges - Developing AI 3D models is more complex and resource-intensive than 2D models, requiring significant computational power and specialized talent [6][7] - The transition from traditional modeling to AI-driven solutions necessitates overcoming technical barriers and educating potential users about the benefits of AI 3D technology [7][8] Future Outlook - VAST aims to expand its user base by targeting professional modelers, amateur creators, and eventually casual consumers, with market potential scaling from millions to billions [7] - The company is focused on democratizing 3D content creation, allowing more individuals to participate in the 3D ecosystem [4][7]
人工智能周报(25年第43周):OpenAI 推出 AI 浏览器,DeepSeek 发布开源 DeepSeek-OCR 模型-20251028
Guoxin Securities· 2025-10-28 14:28
Investment Rating - The report maintains an "Outperform" rating for the AI industry, indicating expected performance above the market benchmark [3][4]. Core Insights - The AI sector has demonstrated significant impacts on the advertising business of internet giants, cloud computing scenarios, and corporate efficiency, as evidenced by Tencent's advertising growth of 20% in Q2 and Alibaba Cloud's acceleration to 26% [2][29]. - Recent developments include the launch of proprietary chips by companies like Baidu and Alibaba, which are expected to enhance market share through a complete chain layout of chips, models, and applications [2][29]. - Key companies recommended for investment include Tencent Holdings, Alibaba, Kuaishou, Baidu Group, Meitu, and Tencent Music, which is less correlated with macroeconomic fluctuations [2][29]. Company Dynamics - OpenAI launched the AI browser ChatGPT Atlas, integrating large models into web browsing processes, enhancing automation capabilities [15]. - Meta restructured its AI team, laying off 600 employees to focus on advanced model development while increasing its capital expenditure limit to $72 billion [17]. - Google upgraded its AI Studio with vibeCoding, streamlining the development process and enhancing its competitive edge in the AI ecosystem [18]. - Huawei released HarmonyOS 6, enabling cross-ecosystem data transfer and introducing AI capabilities for various applications [19]. - Alibaba's Quark launched a dialogue assistant, marking the first outcome of its internal "C Plan" aimed at enhancing AI capabilities for consumer applications [20]. - Tencent is set to release the ima2.0 version of its AI workbench, enhancing its productivity tools with new features [21]. Underlying Technologies - DeepSeek introduced the open-source DeepSeek-OCR model, achieving a 7-20 times increase in text token efficiency while maintaining over 97% accuracy [22]. - Tencent released the WorldMirror model, a unified 3D reconstruction model that significantly improves processing efficiency [23]. - Baichuan Intelligent launched the Baichuan-M2 Plus model, addressing the credibility of medical AI through a six-source evidence reasoning paradigm [24]. - The Hong Kong University of Science and Technology released the DreamOmni2 model, enhancing multi-modal creative capabilities [25]. Industry Policies - The 18th meeting of the 14th National People's Congress reviewed amendments to the cybersecurity law, proposing a framework for AI safety and development [27]. - The Ministry of Science and Technology outlined core directions for AI development during the 14th Five-Year Plan, focusing on foundational research and international cooperation [28].
计算机行业周报:HarmonyOS6发布,行业喜迎新机遇-20251027
Guoyuan Securities· 2025-10-27 03:44
Investment Rating - The report maintains a "Recommended" investment rating for the computer industry [6]. Core Insights - The computer industry index (Shenwan) rose by 3.58% during the week of October 20-24, 2025, outperforming the Shanghai Composite Index, which increased by 2.88% [1][11]. - The release of HarmonyOS 6 by Huawei on October 22 is a significant event, focusing on deep ecological collaboration and enhanced user experience, with a 15% improvement in smoothness compared to HarmonyOS 5 [4][22]. - The report highlights the strong performance of sub-sectors, with the computer equipment index rising by 4.74%, IT services II by 3.00%, and software development by 3.29% [1][12]. Summary by Sections 1. Index Performance - The computer industry index increased by 3.58%, ranking high among other indices, with notable performances from sub-sectors [1][11][12]. 2. Major Events - Huawei's launch of HarmonyOS 6 is a pivotal development, enhancing user experience and ecosystem collaboration [4][22]. - Other significant announcements include Kuaishou's AI programming products and ByteDance's 3D generation model [16][18]. 3. Key Announcements - Guangdian Yuntong obtained a Money Service Operator License in Hong Kong, marking a key advancement in cross-border payment services [2][20]. - Tonghuashun reported a 56.72% year-on-year increase in revenue for Q3 2025, reaching 1.481 billion yuan [2][20]. 4. Investment Perspective - The report suggests focusing on companies deeply involved in the HarmonyOS ecosystem, as it is expected to drive new momentum for domestic software development [4][22].
TMT行业周报(10月第4周):国内外AI应用生态迎来新进展-20251027
Century Securities· 2025-10-27 02:35
Investment Rating - The report provides a positive outlook on the TMT industry, particularly focusing on AI applications, suggesting a strong investment opportunity in this sector [1]. Core Insights - The TMT sector outperformed the Shanghai and Shenzhen 300 index, with significant weekly gains in sub-industries such as communication network equipment (17.85%) and printed circuit boards (14.05%) [3][5]. - OpenAI launched its first AI-native browser, ChatGPT Atlas, which integrates browsing, chatting, and task automation, aiming to enhance user engagement and expand commercial applications [3][18]. - Huawei's HarmonyOS 6 was released with AI as a core feature, showing improved performance and enhanced user experience, indicating a growing penetration of AI applications in mobile devices [3][18]. Market Weekly Review - The TMT sector's performance from October 20 to October 24 showed significant gains across various sub-industries, with communication leading the way [3][5]. - The overall TMT sector outperformed the broader market, indicating strong investor interest and potential for growth [3][5]. Industry News and Key Company Announcements - OpenAI's new browser and Huawei's HarmonyOS 6 release highlight the rapid advancements in AI applications, suggesting a competitive landscape among tech giants [3][18]. - The report notes various strategic partnerships and product launches in the AI space, indicating a robust ecosystem developing around AI technologies [3][17][21].
夸克“C计划”曝光,剑指豆包;OpenAI发布AI浏览器,挑战Chrome;美国女子AI生成号码中10万美元彩票丨一周AI要闻
36氪· 2025-10-25 09:27
Group 1 - OpenAI launched ChatGPT Atlas, an AI-driven web browser, challenging Google Chrome's dominance and traditional web browsing methods [2][9] - The browser integrates multiple OpenAI products, allowing paid users to control their mouse and keyboard through an "agent" feature [9] - The launch of ChatGPT Atlas led to a 3% drop in Alphabet's stock price shortly after the announcement [9] Group 2 - ByteDance's Seed team introduced Seed3D 1.0, a model that generates high-precision 3D models from a single image, showcasing superior performance in geometry and texture generation [3] - Tencent announced the upcoming internal testing of ima 2.0, which will feature a task mode based on agent capabilities [3] Group 3 - Baichuan released the M2 Plus model, a medical AI that significantly reduces hallucination rates compared to competitors, achieving a hallucination rate three times lower than DeepSeek [4] - Alibaba's Tongyi Qwen 3-VL expanded its model sizes, making it suitable for mobile devices and enhancing developer accessibility [4] Group 4 - Anthropic launched Claude Code, a web-based coding environment that allows developers to run coding tasks directly in the browser [5] - Google integrated real-time information from Google Maps into its Gemini API, enabling AI to access structured data for over 250 million locations [5] Group 5 - Meta confirmed a strategic restructuring involving the layoff of approximately 600 employees in its AI department to enhance operational efficiency [6][7] - Kuaishou announced its entry into the AI programming sector with a comprehensive product matrix, including tools and models for developers [7] Group 6 - Reddit filed a lawsuit against AI startup Perplexity for allegedly scraping data from its platform to train its AI systems [7] - Visual China partnered with several AI companies to develop commercially viable visual models, securing orders from major firms like Alibaba and Microsoft [7] Group 7 - Alibaba's Quark is advancing a significant AI initiative called "C Plan," targeting conversational AI applications and potentially competing with ByteDance's "Doubao" [8] - Netflix expressed its commitment to leveraging AI to enhance storytelling, emphasizing that AI will serve as a tool to improve creator efficiency rather than replace human creativity [8] Group 8 - LiblibAI completed a $130 million Series B funding round, marking the largest disclosed investment in China's AI application sector this year [9] - Shenzhen's Xingji Guangnian Technology launched a new dexterous robotic hand and completed a Pre-A funding round to support its development [9] Group 9 - A recent experiment on a decentralized trading platform showcased AI models competing in cryptocurrency trading, with DeepSeek leading the performance with a 130% increase in total assets [10] - Alibaba's Quark AI glasses are set to begin pre-sales, offering various promotional benefits to early buyers [10] Group 10 - A coalition of over 800 public figures, including AI experts, called for a ban on the development of "superintelligent" AI systems until safety and control measures are established [11] - China's legislative body is working on amendments to the cybersecurity law to enhance AI ethics and safety regulations [11] Group 11 - Huawei is actively recruiting top global AI talent to build a leading AI team and develop advanced models [12] - KuaFuAI launched AipexBase, China's first AI-native backend service platform, aimed at simplifying backend development for developers [12]
腾讯研究院AI每周关键词Top50
腾讯研究院· 2025-10-25 04:34
Core Insights - The article presents a weekly roundup of the top 50 keywords related to AI developments, highlighting significant advancements and trends in the industry [2]. Group 1: Computing Power - Oracle is recognized for its development of the largest AI supercomputer [3]. Group 2: Chips - NVIDIA is noted for its advancements in domestic wafer production in the United States [3]. Group 3: Models - The Glyph framework has been developed by Tsinghua University and Zhiyu [3]. - Google's Gemini 3.0 model is highlighted as a significant development [3]. - DeepSeek has introduced the DeepSeek-OCR model [3]. - Baidu has launched the PaddleOCR-VL model [3]. Group 4: Applications - Google Skills is a new application introduced by Google [3]. - Sora has upgraded its Sora2 application [3]. - Kuaishou has developed a matrix of AI programming products [3]. - Hong Kong University of Science and Technology has released DreamOmni2 [3]. - ByteDance has launched Seed3D 1.0 [3]. - OpenAI has introduced ChatGPT Atlas [3]. - Claude has released a desktop version of its application [3]. - Google AI Studio has developed Vibe Coding [3]. - Tencent has launched the Hunyuan World Model 1.1 [3]. - Baichuan has introduced Baichuan-M2 Plus [3]. - Huawei has released HarmonyOS 6 [3]. - X platform has integrated Grok [4]. - Adobe has introduced AI Foundry [4]. - The AI avatar application has been developed by Hunyuan [4]. - Yuanbao has launched an AI recording pen [4]. - Vidu has released Vidu Q2 [4]. - Google has integrated Gemini with Maps [4]. - Anthropic has introduced Agent Skills [4]. - RTFM has been developed by Fei-Fei Li [4]. - Manus has released Manus 1.5 [4]. - Microsoft has announced a major update for Windows 11 [4]. - Kohler has launched the Dekoda smart toilet [4]. Group 5: Technology - Google has developed a quantum echo algorithm [4]. - Dexmal has introduced Dexbotic [4]. - Original Force has launched Bumi [4]. - Samsung has released Galaxy XR [4]. - Anthropic has developed a specialized Claude for biological sciences [4]. - Yushu has introduced a bionic humanoid robot [4]. - DeepMind has been working on a project related to artificial suns [4]. Group 6: Perspectives - Vercel is noted for the Kimi K2 replacement [4]. - a16z discusses the specialization of video models [4]. - Manus has introduced cognitive processes for agents [4]. - Jason Wei shares key thoughts on AI advancements [4]. - Harvard University discusses the invasion of AI in the workplace [4]. - Reddit presents the theory of the death of the internet [4]. - Karpathy addresses expectations management for AGI [4]. Group 7: Events - Meta has announced layoffs in its AI department [4]. - McKinsey reports on token consumption [4]. - nof1.ai has conducted experiments in Alpha Arena [4].