腾讯用AI把美术管线重新做了一遍，混元3D Studio架构曝光

Core Viewpoint - Tencent has developed a professional-grade AI workstation called Hunyuan 3D Studio, designed specifically for 3D designers, game developers, and modelers, which streamlines the entire design process from concept to final game assets, significantly reducing production time from days to minutes [3][4]. Group 1: Key Features of Hunyuan 3D Studio - The platform includes seven core technology modules that ensure a seamless and automated workflow throughout the asset creation process [6]. - The workflow encompasses component splitting, controllable image generation, high-fidelity geometry generation, low-poly topology generation, semantic UV unwrapping, texture generation and editing, and rigging and animation effects [9][10]. Group 2: Component Splitting - The component splitting module utilizes connectivity analysis and semantic segmentation algorithms to automatically decompose complex models into logically and functionally independent components, allowing for independent editing and animation [9][10]. - The process involves using a feature extractor and segmentation heads to predict masks for component boundaries, ensuring high accuracy in the segmentation results [15][18]. Group 3: Controllable Image Generation - The controllable image generation module allows users to generate 3D design images in various mainstream game art styles by providing input images and style instructions [33][34]. - The system employs a dataset constructed from pairs of images to achieve precise mapping between realistic images and stylized outputs, enhancing the model's ability to generate consistent and high-quality designs [34][41]. Group 4: High-Fidelity Geometry Generation - High-fidelity geometry generation is based on the Hunyuan 3D framework, which includes a variational autoencoder for compressing and reconstructing 3D geometries [43][45]. - The process utilizes a diffusion model to efficiently generate high-quality samples from single input images, ensuring that the generated geometries align closely with the input prompts [47][50]. Group 5: Low-Poly Topology Generation - The low-poly topology module aims to create clean and art-compliant topology structures from high-fidelity geometries, employing a self-regressive model to predict vertices and faces directly from point clouds [55][56]. - The module incorporates a tokenization method that enhances training and inference efficiency by modeling the mesh as a sequence [59][60]. Group 6: Texture Generation and Editing - The texture generation framework extends 2D diffusion models to support multi-view texture generation, addressing challenges such as cross-view consistency and the transition from RGB textures to physically-based rendering (PBR) materials [76][78]. - A text-guided texture editing model has been developed, allowing for robust texture synthesis and editing based on high-quality PBR material datasets [81][84]. Group 7: Rigging and Animation Effects - The rigging and animation module includes a humanoid character animation branch and a general character animation branch, ensuring accurate bone generation and skinning through a template-based approach [97][100]. - The system allows for parameterized control, enabling high-level artistic adjustments throughout the pipeline while maintaining the ability to incrementally update without complete recalculation [104][105].