Workflow
生成式AI
icon
Search documents
从训练到推理的「瘦身」演进:首篇高效扩散语言模型(dLLM)深度综述
机器之心· 2026-03-09 09:48
Core Insights - The article discusses the rise of Diffusion Language Models (dLLMs) as a promising alternative to Autoregressive (AR) models, highlighting their potential for parallel generation and improved controllability [2][3]. Training Efficiency: Leveraging Existing Models - dLLMs require significant data and computational resources for training, making it crucial to utilize existing pre-trained AR models [7]. - The paper identifies two main strategies for improving training efficiency: "migration from AR to dLLM" and "architecture optimization" [8]. - Techniques such as Block Diffusion allow for a hybrid approach that retains some AR structure while enabling parallel processing, thus reducing adaptation costs [9]. - Architectural innovations like the Encoder-Decoder structure and the introduction of Mixture of Experts (MoE) help to lower training costs by reusing features and reducing parameter calculations during inference [9]. Inference Acceleration: Parallel Decoding and Compression - Inference speed is a critical challenge for dLLMs, as the diffusion process involves multiple iterative steps [11]. - The paper categorizes inference acceleration strategies into "parallel decoding" and "compression techniques" [11]. - dLLMs can update multiple tokens simultaneously, but determining which tokens to update is essential for efficiency [14]. - Compression methods, including fine-grained quantization, are tailored to the unique characteristics of the diffusion process, achieving extremely low bit quantization [14]. KV Cache Management: Addressing Dynamic Challenges - The management of KV Cache is a significant difference between dLLMs and AR models, as the sequence changes at each denoising step [15]. - The paper outlines three strategies for managing KV Cache: architectural adjustments, adaptive refresh techniques, and heuristic methods [18][19]. - Architectural adjustments like Block Diffusion allow for fixed prefixes and dynamic suffixes, while adaptive refresh techniques utilize token stability to minimize cache updates [18]. - Heuristic methods leverage uncertainty to determine which tokens to retain, enhancing efficiency without retraining the model [19]. Speculative Decoding: Self-Game and Collaborative Strategies - Speculative decoding in dLLMs manifests in two unique forms: self-speculation and synergy with AR models [21][26]. - Self-speculation involves the model predicting intermediate states, while synergy combines the strengths of both dLLMs and AR models for improved throughput [26]. Summary and Future Directions - The paper emphasizes the need for a unified evaluation standard to compare efficiency across different models, considering training costs and memory usage [24]. - Hardware-aware kernel optimizations are necessary to translate theoretical acceleration into practical performance improvements [24]. - The potential for multimodal integration in dLLMs presents an exciting avenue for future research and application [25]. - The article serves as a roadmap for the transition of dLLMs from academic exploration to industrial application, indicating their growing relevance in high-quality, controllable generation scenarios [25].
一年一度最值得关注的AI榜单来啦!申报即日启动
量子位· 2026-03-09 04:13
Core Insights - The article discusses the transition of generative AI in China from a "new technology" to a "new tool" and now to a reality that businesses must confront, impacting various aspects such as content production, R&D efficiency, marketing methods, team collaboration, and decision-making processes [1] Group 1: Event Overview - The Fourth China AIGC Industry Summit will take place in May 2026, where Quantum Bit will announce the results of its evaluation of generative AI companies and products based on their performance and feedback over the past year [1][2] - The summit aims to invite millions of industry practitioners to witness the recognition of outstanding companies [2] Group 2: Evaluation Criteria for AIGC Companies - Companies eligible for evaluation must be based in China or have their main business operations in China, focus on generative AI or have widely applied AI in their core business, and have shown outstanding performance in technology/products and commercialization over the past year [7] - The evaluation will consider innovation, forward-looking potential, and scalability of the AI companies [4] Group 3: Evaluation Criteria for AIGC Products - Products eligible for evaluation must be based on generative AI capabilities, have mature technology already in the market with a certain user scale, and have significant technological innovations or functional iterations in the past year that impact the industry [13] - Evaluation dimensions for products include technical strength, innovation, market performance, and future potential [12] Group 4: Registration Information - Registration for the evaluation is open now and will close on April 27, with final results announced at the May summit [14] - Companies can register through a provided link or contact Quantum Bit staff for inquiries [14][16] Group 5: Summit Theme and Goals - The theme of the 2026 China AIGC Industry Summit is "Everyone, Let's AI Now," focusing on how to effectively utilize AI [17] - The summit aims to engage AI entrepreneurs, developers, and experienced players to clarify and implement AI, encouraging broader participation in AI technology [17]
腾讯研究院AI速递 20260309
腾讯研究院· 2026-03-08 16:01
Group 1: Generative AI Developments - OpenAI released the GPT-5.4 series, integrating Computer Use capabilities, combining code, reasoning, and desktop control into a unified model [1] - The OSWorld desktop control evaluation scored 75.0%, surpassing the human benchmark of 72.4%, while GDPval professional work evaluation reached 83.0% [1] - Standard API pricing is set at $2.50 per million inputs and $15 per million outputs, with a Pro version priced at a 12x premium targeting complex agent scenarios [1] Group 2: OpenAI Initiatives - Peter Steinberger, founder of OpenClaw, joined OpenAI and launched the "Codex for Open Source" project, offering free API credits and 6 months of ChatGPT Pro access to open-source maintainers [2] - The application criteria target core maintainers and widely used public project operators, with non-standard projects eligible if they play a significant role in the ecosystem [2] - Steinberger claims to balance responsibilities between OpenAI and OpenClaw, aiming to support as many open-source contributors as possible [2] Group 3: Tencent Innovations - Tencent's Mix Yuan introduced the HY-WU paradigm, generating personalized LoRA parameters in real-time during inference, replacing traditional static fine-tuning methods [3] - This approach was applied to an 800 billion parameter image editing model, outperforming closed-source models in multiple metrics, with only a 0.11 point gap from GPT Image 1.5 [3] - The paradigm is designed for cross-modal applicability, with plans to expand functional memory to video generation, multi-modal alignment, and edge deployment [3] Group 4: Xiaomi's AI Agent - Xiaomi launched the miclaw mobile AI Agent product based on the MiMo model, encapsulating over 50 system-level tools for autonomous task orchestration [4] - The AI can interact with the entire home IoT ecosystem and supports third-party applications through an SDK [4] - It features self-evolution capabilities, allowing it to create sub-agents and continuously adapt based on user preferences and experiences [4] Group 5: Karpathy's Autoresearch - Karpathy released the autoresearch project, consisting of only 630 lines of code, enabling an AI agent to autonomously execute code editing, model training, evaluation, and iteration without human intervention [5] - Each training session lasts 5 minutes, using val_bpb as a unified evaluation metric, with the agent submitting improvements via Git [6] - Karpathy is running an enhanced version on eight H100 GPUs, positioning the project as a proof of concept for self-evolving LLMs, with potential for expansion into various research fields [6] Group 6: Security Innovations - Illia Polosukhin, co-author of the Transformer paper, rewrote OpenClaw in Rust, launching the secure version IronClaw with a four-layer defense architecture [7] - Key security features include WASM sandbox isolation, AES-256-GCM encrypted credential vaults, and a trusted execution environment (TEE) [7] - The project aligns with NEAR Protocol's "user-owned AI" strategy, establishing an AI cloud platform and a marketplace for intelligent agents [7] Group 7: Multiplayer Video World Model - The team led by Xie Sainin introduced Solaris, the first multiplayer video world model capable of generating consistent first-person perspectives among multiple players, validated in Minecraft [8] - They developed the SolarisEngine for data collection, creating a dataset of 12.64 million frames, the first annotated dataset for training multiplayer world models [8] - The model incorporates a multi-player self-attention layer to facilitate information exchange among players, significantly outperforming previous solutions [8] Group 8: AI in Theoretical Physics - Google Research utilized Gemini Deep Think, tree search, and automatic numerical feedback to solve the unresolved problem of cosmic string gravitational radiation power spectrum [9] - The AI explored approximately 600 candidate paths, with 80% pruned by an automatic verifier, ultimately identifying six solutions, with the Gergenbauer method being the most elegant [9] - The final closed-form solution was achieved through human-AI collaboration, showcasing a reusable AI-driven research paradigm [9] Group 9: Labor Market Impact of AI - Anthropic's labor market report indicates that AI is subtly impacting young people's first jobs, with a 14% decrease in the proportion of 22-25-year-olds entering high AI exposure occupations [10] - The AI task coverage for computer programmers reached 74.5%, but actual coverage across industries is only about one-third of theoretical values, indicating significant untapped potential [10] - Companies are shifting investments from "future human assets" to "immediate computational assets," leading to the disappearance of entry-level positions and emphasizing decision-making, aesthetic engineering, and AI collaboration skills as core competencies [11] Group 10: OpenClaw's Global Impact - OpenClaw's global popularity surged, with over 1,300 attendees at a New York gathering, where Huang Renxun described it as "the most important software release in history" [12] - Observations from the event indicated users spending an average of $1,000 to $2,000 monthly on model costs, with some burning 1 billion tokens daily [12] - Security concerns emerged as the primary issue, with no one believing the system is 100% secure, highlighting the genuine demand for personal intelligent agents and marking the onset of the consumer AI agent era [12]
建材行业事件点评:普通布进一步提涨,看好高景气持续
Investment Rating - The report rates the construction materials industry as "Overweight" indicating a positive outlook for the sector compared to the overall market performance [2][3]. Core Insights - The ordinary fabric has seen its third price increase of the year, rising by 0.5 yuan to 5.7 yuan/meter, with a total increase of 1.5 yuan/meter since January, reflecting sustained industry demand and pricing power [3]. - The demand for ordinary electronic fabric is robust, driven by the continuous growth in integrated circuit board production, which has seen a year-on-year increase of over 10% for seven consecutive months from June to December 2025 [3][10]. - Supply growth for ordinary electronic fabric is expected to decline as the expansion cycle driven by high demand from 2021-2022 is nearing its end, and many manufacturers are shifting production to high-margin specialty fabrics [3]. - Specialty fabrics are experiencing high demand, particularly low dielectric and low expansion coefficient fabrics, with significant price increases announced by leading companies in the sector [3]. - Key investment recommendations include focusing on core suppliers of ordinary fabric such as China Jushi and monitoring specialty fabric companies like China National Materials, International Composites, and Honghe Technology [3]. Summary by Sections Price Trends - The price of electronic fabric has been on a continuous upward trend, with significant increases noted in recent months [4][5]. Production Capacity - The growth rate of electronic fabric production capacity is entering a declining phase, with projections indicating a slowdown in capacity expansion [6]. Integrated Circuit Production - The production of integrated circuits in China has been consistently increasing, with record monthly outputs and strong year-on-year growth rates [9][10]. Company Valuation Comparisons - Key companies in the sector have varying valuations, with China Jushi rated as "Buy" and showing strong earnings growth projections for 2024-2026 [12].
一年一度最值得关注的AI榜单来啦!申报即日启动
量子位· 2026-03-08 04:26
Core Insights - The article discusses the evolution of generative AI in China, highlighting its transition from a "new technology" to an essential tool for businesses, impacting content production, R&D efficiency, marketing methods, team collaboration, and decision-making processes [1] - The fourth China AIGC Industry Summit will evaluate generative AI companies and products based on their performance and feedback over the past year, with results to be announced in May 2026 [1][2] Evaluation Criteria for AIGC Companies - Companies must be based in China or have their main business operations in China [7] - The primary business should focus on generative AI or have widely applied AI in its core operations [7] - Companies should have demonstrated outstanding performance in technology/products and commercialization over the past year [7] Evaluation Dimensions for AIGC Companies - **Technical Dimension**: Focus on the company's technical strength, R&D capabilities, and innovation, including technological achievements, R&D investment, and talent reserves [12] - **Product Dimension**: Emphasizes the innovation, market adaptability, and user experience of core products, including product innovation, user scale, and user experience [12] - **Market Dimension**: Evaluates the company's market performance and growth opportunities, including business models, market size, revenue situation, and cooperative ecosystem [12] - **Potential Dimension**: Assesses the strength of the core team and brand potential, including core team capabilities, financing progress, and brand influence [12] Evaluation Criteria for AIGC Products - Products must be based on generative AI capabilities [13] - Products should have mature technology, be market-released, and possess a certain user scale [13] - Significant technological innovations or functional iterations should have occurred in the past year, promoting the application of AI technology and impacting the industry [13] Evaluation Dimensions for AIGC Products - **Product Technical Strength**: Focus on the product's technological advancement, maturity, and efficiency, including technical architecture and outcomes [13] - **Product Innovation**: Emphasizes the uniqueness and innovation in functionality, experience, and application scenarios [13] - **Product Performance**: Evaluates user feedback and market performance, including user scale, retention rates, and product influence [13] - **Product Potential**: Assesses future development and market expansion potential, including product ecosystem and strategic planning [13] Registration Information - The registration for the evaluation starts immediately and ends on April 27, with final results to be announced at the May China AIGC Industry Summit [14] - Companies can register through a provided link or contact Quantum Bit staff for inquiries [14] About the China AIGC Industry Summit - The summit will take place in Beijing in May 2026, themed "Everyone, Let's AI Now," focusing on how to effectively utilize AI [17] - The event aims to engage AI entrepreneurs, developers, and experienced players to clarify and implement AI technologies [17]
投顾周刊:开年吸金超百亿元,混合理财产品热销
Wind万得· 2026-03-07 22:30
Economic Growth and Policy Focus - The government aims for an economic growth target of 4.5%-5% for the year, with a focus on maintaining medium to high-speed growth while nurturing new productive forces and transforming the economic structure [3] - The report emphasizes strengthening housing security for newly married and childbearing families, and encourages multiple channels to revitalize existing housing stock [3] Artificial Intelligence Industry - By 2025, China's core artificial intelligence industry is projected to exceed 1.2 trillion yuan, with over 6,200 companies, indicating strong growth in embodied intelligence and computing infrastructure [4] - The rapid development of AI is seen as a core engine for the transformation and upgrading of China's real economy [4] Financial Products and Market Trends - In early 2026, mixed financial products have seen significant popularity, with over 60 new products issued, totaling nearly 15 billion yuan, reflecting a shift in investor preference towards stable and enhanced returns [4] - Public fund institutions have initiated a self-purchase trend, accumulating over 910 million yuan, primarily in equity funds, signaling confidence in long-term investment value [6] Global Market Insights - IDC predicts that the global intelligent robotics hardware market will approach 30 billion USD by 2026, with China leading the market, expected to surpass 11 billion USD [7] - Recent geopolitical tensions, particularly involving Iran, have heightened risk perceptions in international energy markets, impacting oil prices [7] Stock Market Performance - Major global stock markets have experienced declines, with the Shanghai Composite Index down 0.93% and the Hang Seng Index down 3.28% [8][9] - The recent week saw a general downturn across various indices, indicating a cautious market sentiment [8][9] Bond Market Trends - Recent bond yield movements show a mixed performance, with 1-year Chinese government bond yields decreasing by 3.10 basis points to 1.29% [12] - The 10-year U.S. Treasury yield increased by 18 basis points to 4.15%, reflecting differing market conditions [12] Fund Performance and Market Dynamics - The overall fund index has shown a downward trend, with the total index down 1.35% in the recent week [14] - Fixed income and pure bond funds dominate the market, accounting for nearly 70% of the total number of products and over 90% of the total scale [17] Commodity Market Movements - Precious metals have seen a pullback, with COMEX gold down 1.27% and silver down 9.21%, while international oil prices surged by 28.06% [15][16] - The dollar index has risen by 1.34%, indicating a stronger dollar against other currencies [15][16]
2025年中国AI+互联网媒体行业研究报告
艾瑞咨询· 2026-03-07 08:38
Core Viewpoint - The article emphasizes that AI technology is fundamentally transforming the internet media industry by enhancing content production, distribution, and consumption processes, leading to a more efficient and innovative media ecosystem [1][2]. Group 1: Industry Overview - The Chinese internet media industry is transitioning into an AI-enabled intelligent ecosystem, with user growth slowing and competition shifting towards existing markets [2][6]. - Generative AI is accelerating the integration of multimodal applications, reshaping the content ecosystem and user experience, and driving the industry towards quality and efficiency [2][4]. Group 2: Deep Empowerment of AI - AI technology is deeply empowering the internet media industry, promoting intelligent transformation across the entire value chain, from production to consumption [2][24]. - Major media and social platforms in China, such as People's Daily and Weibo, are actively applying AI technology to enhance content creation, review, and distribution processes [2][36]. Group 3: Challenges and Opportunities - The internet media industry faces challenges such as content authenticity issues, high technical costs, and privacy risks, which need to be addressed for sustainable growth [3][46][54]. - Opportunities exist for media platforms to build competitive advantages through self-developed technologies, data governance, and intelligent recommendations [3][54]. Group 4: AI's Role in Content Production - Generative AI is reshaping the content production landscape by enabling users to create diverse content forms from simple text prompts, highlighting a trend towards mass user-generated content [24][28]. - AI technologies are optimizing content review processes, enhancing efficiency and accuracy in identifying complex violations [26][28]. Group 5: AI's Impact on Content Distribution and User Engagement - AI technology enhances content distribution efficiency by analyzing user behavior and optimizing recommendation paths, thereby increasing user engagement and platform stickiness [28][31]. - The integration of AI in user operations allows for personalized content matching and improved customer service, expanding commercial opportunities for media platforms [28][31]. Group 6: AI's Influence on Content Consumption - The shift from one-way communication to interactive engagement is facilitated by AI, allowing consumers to evolve into co-creators in the content cycle [31][46]. - AI technologies lower barriers to content access and enhance user understanding through intelligent summarization and dialogue-based services [31][46]. Group 7: Technological Evolution and Historical Context - The internet media industry has undergone significant transformations over the past three decades, driven by technological advancements from early portals to the current AI-enabled ecosystem [4][21]. - The evolution of AI technology has progressed from symbolic logic to data-driven models, culminating in the current era of generative AI applications [10][11]. Group 8: Case Studies of AI Implementation - The People's Daily has utilized generative AI to enhance video content creation and streamline the media production process [36]. - The Paper has integrated AI tools to improve content production efficiency and establish a robust content safety framework [38][39]. - Douyin (TikTok) has embedded AIGC technology throughout its content lifecycle, creating a comprehensive ecosystem for content creation and monetization [40].
一年一度最值得关注的AI榜单来啦!申报即日启动
量子位· 2026-03-07 02:24
组委会 发自 凹非寺 量子位|公众号 QbitAI 中国生成式AI正在进入产业深水区。 这两年,AI从"新技术"变成了"新工具",又从"新工具"慢慢变成企业必须面对的现实。它不只在改变内容生产,也在影响研发效率、营销方 式、团队协作,甚至决策流程。 时值第四届中国AIGC产业峰会, 量子位将根据过去一年里生成式AI企业、产品的表现与反馈,结合对2026年技术与场景的观察与预判,评 选出: 量子位将结合对公司的深入调研及数十位行业知名专家的意见,评选结果将于2026年5月中国AIGC产业峰会上公布。 届时,量子位也将邀请数百万行业从业者,共同见证这些优秀企业的荣誉。 2026年度值得关注的AIGC企业 将评选出拥有最创新、最前瞻或最有规模落地潜力的AI企业。 【参选条件】 2026年度值得关注的AIGC企业 2026年度值得关注的AIGC产品 1. 公司主体在中国或主营业务在中国; 2. 主营业务是生成式AI及相关,或已将AI广泛应用于其主营业务; 3. 近一年在技术/产品、商业化有出色表现的企业。 【评选维度】 2026年度值得关注的AIGC产品 将评选出拥有最创新、最实用、最热门或最有应用潜力的AI产品。 ...
剑指千亿美元,博通AI芯片营收暴涨,能否撕开英伟达算力护城河?
3 6 Ke· 2026-03-06 07:48
Core Insights - The global AI chip market is currently dominated by Nvidia, which leads the AI data center chip market due to its CUDA ecosystem. However, major tech companies are increasing their investment in self-developed chips and customized chips through Broadcom to diversify their supply and reduce reliance on Nvidia [1][3]. Group 1: Broadcom's AI Chip Revenue Growth - Broadcom's customized AI chips are significantly impacting Nvidia's market position, with AI chip revenue expected to exceed $100 billion by 2027. In Q1 of FY2026, Broadcom reported revenue of $19.311 billion, a 29% year-over-year increase, with net profit rising 34% to $7.349 billion. AI chip revenue doubled, reaching $8.4 billion, a 106% increase year-over-year [2][4]. - Broadcom's CEO noted that the strong demand for customized AI accelerators and AI network products is driving this growth, with expectations for AI semiconductor revenue to reach $10.7 billion in Q2 [2][4]. Group 2: Market Dynamics and Competition - The AI chip landscape is changing, with Broadcom positioned as a strong competitor to Nvidia. Nvidia's data center revenue reached $193.7 billion in FY2026, a 68% increase, benefiting from a surge in demand for computing power since the launch of ChatGPT [3][5]. - Major tech companies are increasing capital expenditures to secure high-performance chips from Nvidia, leading to supply shortages. Companies like Google, Meta, OpenAI, and Anthropic are exploring customized chips from Broadcom to reduce dependency on Nvidia [3][5]. Group 3: Future Projections and Market Trends - Broadcom's AI revenue is projected to reach $120 billion by 2027, with some analysts estimating it could exceed $200 billion. The ongoing expansion of AI infrastructure spending among large tech firms is expected to drive the global semiconductor market towards a trillion-dollar scale [2][4][6]. - The demand for customized AI chips is expected to grow rapidly, with Broadcom's ASIC chips being recognized for their competitive advantages in the AI market. This shift towards customized solutions is anticipated to disrupt Nvidia's dominance in the AI chip sector [5][6].
OpenAI点赞转发的冠军项目,背后藏着一个国人3D生成团队
机器之心· 2026-03-06 03:28
Core Viewpoint - The article discusses the emergence of StoryWorld, an iOS application that allows users to create 3D characters and objects using their mobile camera and voice input, marking a significant shift in 3D generation technology from demonstration tools to production components [1][5][43]. Group 1: 3D Generation Transition - StoryWorld generates complete 3D assets that can be positioned, scaled, and viewed from multiple angles, enabling users to control scene composition like a film director [5][6]. - The application utilizes DeemosTech's Hyper3D Rodin, which provides stability and controllability, making it a key technology for rapid asset production in hackathon environments [6][8]. - The focus of 3D generation is shifting from merely creating visuals to constructing scenes and camera language, thus elevating the requirements for stability and controllability in 3D products [7][8]. Group 2: Engineering and Production Pipeline - Hyper3D Rodin has been integrated into the workflows of top developers, including its use in NVIDIA's keynote production at CES, showcasing its role in high-standard engineering processes [12][15]. - The production pipeline requires maintaining detail and consistency across multiple stages, with Hyper3D Rodin handling 3D model generation effectively [15]. - The integration of Hyper3D into both rapid development and engineering standards indicates a growing emphasis on stability, controllability, and reusability in 3D generation [11][15]. Group 3: From Generation to Editing - Hyper3D has introduced a 3D model editing feature, Rodin Gen-2 Edit, which allows for localized modifications based on natural language input, marking a transition to an editable 3D generation workflow [20][34]. - This editing capability extends beyond Hyper3D's own models, allowing third-party 3D assets to be imported and modified, positioning it as a foundational infrastructure for 3D editing [23][34]. - The ability to edit and iterate on 3D models enhances the workflow for creators, making it easier to adapt and refine assets over time [35][43]. Group 4: Evolution of Multimodal Technology - The development of 3D generation technology follows a clear path from generation to enhanced controllability and ultimately to editing capabilities, moving away from random generation methods [26][30]. - Hyper3D has incorporated 3D ControlNet from its inception, allowing users to define model parameters precisely, thus improving the controllability of 3D generation [30][31]. - The introduction of editing features further solidifies the transition from a random generation approach to a controlled design process, enhancing the overall usability of 3D tools [34][43]. Group 5: Industry Focus Shift - The focus in AI 3D is shifting from mere generation capabilities to editability and reusability, becoming integral components of the creative and production workflow [42][43]. - For developers, 3D assets are evolving from static deliverables to dynamic elements that can be iteratively refined and reused, impacting the positioning of 3D model companies in the global creative chain [42][43]. - The successful application of Hyper3D Rodin in various contexts demonstrates its transition from experimental technology to a practical production tool [43].