Workflow
3D生成
icon
Search documents
3D生成补上物理短板!首个系统性标注物理3D数据集上线,还有一个端到端框架
量子位· 2025-07-23 04:10
Core Viewpoint - The article discusses the introduction of PhysXNet, the first systematically annotated physical property 3D dataset, which aims to bridge the gap between virtual 3D generation and physical realism [1][3]. Group 1: Introduction of PhysXNet - PhysXNet contains over 26,000 richly annotated 3D objects, covering five core dimensions: physical scale, materials, affordance, kinematic information, and textual descriptions [3][11]. - An extended version, PhysXNet-XL, includes over 6 million programmatically generated 3D objects with physical annotations [12]. Group 2: Current Research Landscape - Existing 3D generation methods primarily focus on geometric structure and texture, neglecting the modeling based on physical properties [2][8]. - The demand for physical modeling, understanding, and reasoning in 3D space is increasing, necessitating a comprehensive physical-based 3D object modeling system [8][9]. Group 3: Data Annotation Process - The team designed a human-in-the-loop annotation process to efficiently collect and annotate physical information [16][19]. - The annotation framework consists of two main phases: initial data collection and determination of kinematic parameters [19]. Group 4: Generation Methodology - PhysXGen is introduced as a novel framework for generating 3D assets with physical properties, utilizing pre-trained 3D priors to achieve efficient training and good generalization [13][26]. - The method synchronously integrates basic physical properties during the generation process, optimizing structural branches for dual objectives [29][30]. Group 5: Experimental Evaluation - The team conducted qualitative and quantitative evaluations of the model, comparing it against a baseline that uses a separate structure to predict physical properties [33][34]. - PhysXGen demonstrated significant performance improvements in generating physical attributes, achieving relative performance gains of 24%, 64%, 28%, and 72% across various dimensions [38]. Group 6: Future Directions - The article emphasizes the importance of addressing key challenges in physical 3D generation tasks and outlines future research directions [43].
直击CVPR现场:中国玩家展商面前人从众,腾讯40+篇接收论文亮眼
具身智能之心· 2025-06-18 10:41
Core Insights - The article highlights the significant participation of Chinese companies in CVPR 2025, showcasing their technological advancements and commitment to AI development [4][9][46] - Key trends identified include a focus on multimodal and 3D generation technologies, with Gaussian Splatting emerging as a prominent technique [8][15][17] Group 1: Event Overview - CVPR 2025 has gained increased attention and social engagement, with a record number of Chinese enterprises participating [2][4] - The conference is recognized as a leading event in the field of computer vision, with the acceptance of papers indicating cutting-edge technological trends [12][13] Group 2: Research Trends - Multimodal and 3D generation are highlighted as popular research directions, with Gaussian Splatting being a frequently mentioned keyword in accepted papers [8][15][17] - A total of 2878 papers were analyzed, revealing high-frequency terms such as "Multimodal" (75 occurrences) and "Diffusion Model" (153 occurrences) [16] Group 3: Chinese Companies' Participation - Chinese companies, particularly Tencent, have shown deep involvement, with Tencent alone having over 40 accepted papers across various research areas [33][34] - The participation of Chinese firms in sponsorship and workshops indicates their commitment to the conference and the broader AI landscape [36][38] Group 4: Technological Advancements - Tencent's investment in AI research is substantial, with R&D spending exceeding 70.686 billion RMB in 2024, reflecting a strong commitment to technological innovation [46] - The company has also made significant strides in patent applications, with over 85,000 applications filed globally [46] Group 5: Talent Attraction - The presence of Chinese companies at top conferences serves to attract talent, emphasizing the importance of technical recognition over salary for top-tier professionals [47] - Tencent's diverse application scenarios, including WeChat and gaming, provide a robust ecosystem that supports ongoing technological development [49][50]
直击CVPR现场:中国玩家展商面前人从众,腾讯40+篇接收论文亮眼
量子位· 2025-06-17 07:41
Core Insights - The CVPR 2025 conference showcased significant participation from Chinese companies, highlighting their growing influence in the global AI and computer vision landscape [3][7][30] - The conference emphasized advanced topics such as multimodal and 3D generation technologies, with Gaussian Splatting emerging as a key focus area [6][15][17] - The acceptance rate for papers at CVPR 2025 was 22.1%, indicating a competitive environment and increasing recognition for high-quality research [11][13] Group 1: Conference Highlights - The conference received a record number of submissions, with 13,008 valid papers and 2,878 accepted, reflecting a growing interest in cutting-edge research [11] - Key topics included multimodal models, diffusion models, and large language models, with "multimodal" appearing 175 times in accepted paper titles [14] - The integration of computer vision and graphics was noted, with a significant rise in 3D-related research due to advancements in neural rendering [17][18] Group 2: Chinese Companies' Participation - Chinese companies, particularly Tencent, demonstrated strong engagement, with Tencent alone having over 40 accepted papers across various research areas [32] - The participation of Chinese firms in sponsorship and workshops indicates their commitment to advancing technology and attracting talent [34][36] - Tencent's investment in R&D reached approximately 70.686 billion RMB in 2024, showcasing their dedication to AI and technology development [44] Group 3: Talent Acquisition and Development - The conference served as a platform for companies to attract top talent, with Tencent's "Qingyun Plan" offering competitive salaries and career advancement opportunities [50][51] - The focus on technical talent is evident, with 73% of Tencent's workforce in technology roles, emphasizing the importance of skilled personnel in driving innovation [51] - The initiative aims to create a positive cycle where talent is nurtured and retained, contributing to the company's long-term technological advancements [46][48]
3D大模型公司VAST再获数千万美元融资 全球首个AI 3D工作台Tripo Studio:从 “算法领先” 到 “工作流闭环”
智通财经网· 2025-06-11 10:52
Core Insights - VAST has successfully completed a multi-million dollar Pre-A+ funding round led by the Beijing Artificial Intelligence Industry Investment Fund, with participation from Jingya Capital and other investors [1][12] - The company has launched Tripo Studio, the world's first AI-driven all-in-one 3D workspace, and is set to release the new algorithm Tripo 3.0, focusing on the development of the Tripo series of large models and the construction of an ecosystem platform [1][2] - VAST aims to create a comprehensive product system that covers professional (PGC), influencer (PUGC), and general user (UGC) creator profiles, solidifying its global leadership in the 3D generation field [1][3] Funding and Investment - The recent funding round will primarily be invested in the research and development of the Tripo series and the Tripo Studio product [1] - The Beijing Artificial Intelligence Industry Investment Fund and Jingya Capital express confidence in VAST's potential in the 3D model generation sector, highlighting the company's innovative capabilities and market opportunities [11][12] Product Development - VAST has iterated on the Tripo large model series, launching versions from Tripo 1.0 to Tripo 2.5, and has developed widely recognized 3D foundational models [2] - Tripo Studio has received high praise from users, with a 2.5x increase in platform payment rates and an annual recurring revenue (ARR) surpassing $3 million [2] - The company has introduced several innovative features in Tripo Studio, including intelligent part segmentation, magic texture brushes, intelligent low-poly generation, and automatic rigging, significantly enhancing the 3D creation process [4][5][6][8] Market Position and User Engagement - VAST has provided services to over 2 million 3D creators, 20,000 small developers, and 700 large enterprises, generating nearly 30 million models [2] - The company aims to redefine the 3D content creation process, allowing non-professional users to independently complete the entire workflow [9] - VAST collaborates with various industries, including gaming, industrial design, and home 3D printing, to enhance user engagement and creativity in 3D content generation [10] Future Outlook - VAST's CEO emphasizes the shift from merely providing tools to delivering complete solutions that enhance creator control and creativity [11] - The company envisions a future where 3D content creation becomes as ubiquitous and creative as photography, transforming the industry landscape [12]
阶跃星辰×光影焕像联合打造超强3D生成引擎Step1X-3D!还开源全链路训练代码
机器之心· 2025-05-16 02:42
阶跃星辰携手光影焕像发布并开源 3D 大模型 ——Step1X-3D。Step1X-3D 模型总参数量达 4.8B(几何模块 1.3B,纹理模块 3.5B),凭借坚实的数据基础与先进的 3D 原生架构,可生成 高保真、可控 的 3D 内容。 Step1X-3D 不止于视觉「 好看」,更追求实现「好用」与「可控」 ,旨在为 3D 内容创作提供强大而可靠的技术引擎。这款模型可以广泛应用在游戏娱乐、影视 与动画制作、工业制造与设计等各种场景。 Step1X-3D 公布了完整的数据清洗策略,数据预处理策略,以及 800K 高质量的 3D 资产,3D VAE、3D Geometry Diffusion 以及 Texture Diffusion 的全链路训练代 码开源,助力 3D 生成社区发展。 欢迎大家上手体验: Online Demo(立即体验):https://huggingface.co/spaces/stepfun-ai/Step1X-3D 核心特性与技术支撑 Step1X-3D 尝试解决 3D 内容生成的关键挑战,在数据、生成质量与可控性上进行了创新实践。 1. 数据驱动与算法协同优化 好数据是好模型的基础。 ...
3D版DeepSeek卷起开源月:两大基础模型率先SOTA!又是VAST
量子位· 2025-03-28 10:01
衡宇 鱼羊 发自 凹非寺 量子位 | 公众号 QbitAI 3D生成版DeepSeek再上新高度! 国产、易用、性能强且开源—— 新模型一露面就刷新SOTA,并且 第一时间加入开源全家桶 。 顺时针转个圈圈给大家看,效果是这样: 加上"皮肤"是这样: 再来一个,效果是这样: 肉眼可见,这次妥妥升级变成了更细节的细节控~ 以上效果,都来自 3D大模型明星初创公司VAST ,其刚刚上新的两个基础模型,TripoSG和TripoSF,为团队的最新研发成果。该团队去年3 月开源了TripoSR,在开源3D生成基础模型中爆火全球。 TripoSG ,发布即开源,一露面就刷新开源3D生成模型SOTA,让广大开发者第一时间享受技术进步的成果。 TripoSF ,目前为开源第一阶段,已经用实力证明了自己:横扫一切开源和闭源的现有方法,拿下新SOTA。 你就说秀不秀吧 (手动狗头) ?! ——但基础模型还只是VAST最近大秀一波技术肌肉的上半程表演。 量子位获悉, 接下来VAST要连续开源一个月,每周都有新开源项目公布 。而TripoSG和TripoSF是开源月里第二周的项目。 在整个开源月里,除了第一波单张图像端到端生成三维 ...
上海隐秘大学,正排队宣布融资
投资界· 2025-01-15 07:46
高校,创新源头。 作者 I 刘博 陈晓 报道 I 投资界PEdaily 一群年轻面孔闯入创投圈。 投资界获悉,3D生成大模型公司影眸科技完成数千万美元 A 轮融资,由美团龙珠、字节 跳动领投,老股东红杉中国种子基金及奇绩创坛持续跟投。令人意外的是,公司团队平均 年龄只有2 4 岁。 这是一个孵化自上海科技大学的创业项目——2 0 2 0年,吴迪、张启煊、张龙文、曾初啸 等人创立影眸科技,团队与上科大共同提出的可控 3D 原生 DiT生成框架 CLAY 与 3D 服装生成框架 Dr e ssCod e,均获计算机图形学顶会 ACM SIGGRAPH 2 0 24 最佳论文提 名,被认为是新一代 3D 生成基础框架。 "现在我们经常跑上科大蹲项目。"此前一家知名早期投资机构的分享引起我们的注意。 不同于清华、上海交大、哈工大等传统名校,上科大乍听略显陌生,成立仅仅十余年,殊 不知已经累计孵化4 0多家科创企业。如此一幕,堪称中国科技成果转化大潮的一缕写 照。 团队平均24岁 刚刚,美团字节联手投了 影眸科技的故事,始于上科大的一间实验室。 出生于19 9 7年,吴迪在20 1 5年进入上科大学习,是该校招的第二届 ...