计算机视觉
Search documents
PSAI企业版发布:定义下一代AI全链路电商视觉生产平台
Jin Tou Wang· 2025-12-11 04:29
2025年12月12日,计算机视觉行业领先的算法服务提供商及解决方案供应商虹软科技正式宣布,旗下智能商拍平 台PhotoStudio AI(以下简称"PSAI")推出全新PSAI企业版。 此次发布并非简单的功能叠加,而是基于对服饰电商行业运营场景的深度洞察,打造的一款集团队协作、规模化 制作与视觉衍生创作于一体的AI全链路电商视觉生产平台,标志着视觉AI工具正从"单点提效"迈向"体系化赋 能"的新阶段。 PSAI企业版介绍页面展示 在全球电商竞争日益白热化、内容需求呈指数级增长的当下,视觉素材的生产效率与成本控制已成为品牌的核心 竞争力。传统商拍及视觉设计流程繁琐、耗时冗长、成本高昂。而市面上多数AI工具又存在风格不统一、功能不 全面、团队难协作等短板。"工具不少,流程依旧混乱;生成很快,落地依然困难"成为许多企业数字化转型中的真 实写照。 PSAI企业版直面这一产业性难题,旨在通过一套系统化解决方案,重新定义电商视觉的生产方式。 不止于"工具",PSAI定义企业级"智能生产线" PSAI企业版的核心突破,在于它首次系统化地打通了电商视觉从"原始素材"到"营销爆款"的完整生产链条,将过 去分散、割裂的多个环节 ...
工业界大佬带队!三个月搞定3DGS理论与实战
自动驾驶之心· 2025-12-09 19:00
Core Insights - The article discusses the rapid advancements in 3D Generative Synthesis (3DGS) technology, highlighting its applications in various fields such as 3D modeling, virtual reality, and autonomous driving simulation [2][4] - A comprehensive learning roadmap for 3DGS has been developed to assist newcomers in mastering both theoretical and practical aspects of the technology [4][6] Group 1: 3DGS Technology Overview - The core goal of new perspective synthesis in machine vision is to create 3D models from images or videos that can be processed by computers, leading to numerous applications [2] - The evolution of 3DGS technology has seen significant improvements, including static reconstruction (3DGS), dynamic reconstruction (4DGS), and surface reconstruction (2DGS) [4] - The introduction of feed-forward 3DGS has addressed the inefficiencies of per-scene optimization methods, making the technology more accessible [4][14] Group 2: Course Structure and Content - The course titled "3DGS Theory and Algorithm Practical Tutorial" covers detailed explanations of 2DGS, 3DGS, and 4DGS, along with important research topics in the field [6] - The course is structured into six chapters, starting from foundational knowledge in computer graphics to advanced topics like feed-forward 3DGS [10][11][14] - Each chapter includes practical assignments and discussions to enhance understanding and application of the concepts learned [10][15] Group 3: Target Audience and Prerequisites - The course is designed for individuals with a background in computer graphics, visual reconstruction, and programming, particularly in Python and PyTorch [19] - Participants are expected to have a GPU with a recommended computing power of 4090 or higher to effectively engage with the course material [19] - The course aims to benefit those seeking internships, campus recruitment, or job opportunities in the field of 3DGS [19]
渤海证券研究所晨会纪要-20251209
BOHAI SECURITIES· 2025-12-09 02:49
Fund Research - The market saw a majority of equity indices rise, with the largest increase in the ChiNext Index, which rose by 1.86% during the week from December 1 to December 5, 2025 [2] - Public funds experienced significant inflows, with the first ETF tracking the CSI 300 Quality Index closing its fundraising, and Moer Thread emerging as the biggest winner in the offline allocation results [2] - Various equity fund types performed well, with equity funds averaging a rise of 0.93% and a positive return ratio of 76.27% [3] Company Research: Hongsoft Technology (688088) - Hongsoft Technology specializes in AI visual algorithms, providing algorithm licensing and system solutions, with mobile intelligent terminal visual solutions being the main revenue source [5][6] - The company reported a net profit of 142 million yuan for the first three quarters of 2025, marking a year-on-year increase of 60.51% [6] - The global smartphone shipment reached 923 million units in the first three quarters of 2025, with an AI smartphone penetration rate expected to reach 34% [6] - The smart glasses market saw a shipment of 4.065 million units in the first half of 2025, a 64.2% increase year-on-year, with expectations for the market to exceed 40 million units by 2029 [6] - The automotive sector showed a 12.4% year-on-year increase in sales, with the domestic passenger car market's DMS function penetration rate reaching a historical high of 26.2% in September 2025 [6] - The company’s PSAI product has penetrated multiple e-commerce platforms, serving hundreds of thousands of small businesses and over 300 major apparel brands [7] - Earnings per share (EPS) forecasts for 2025-2027 are 0.63 yuan, 0.85 yuan, and 1.18 yuan, with a 2025 price-to-earnings (PE) ratio of 76.07, which is below the average valuation of comparable companies [7] Industry Research: Light Industry Manufacturing & Textile Apparel - The light industry manufacturing sector outperformed the CSI 300 Index by 0.58 percentage points from December 1 to December 5, 2025, while the textile and apparel sector underperformed by 2.88 percentage points [8][10] - Recent price increases in packaging paper have been noted, with companies like Nine Dragons and others announcing price hikes of 50 yuan per ton [10] - The government is actively engaging in trade discussions with the U.S., which may positively impact export chain companies [10] - The domestic real estate market is under pressure, but recent policies to enhance home purchase subsidies may improve the situation in the medium term [10]
少量视角也能得到完整3D几何,即插即用的语义增强重建插件来了
机器之心· 2025-11-02 01:37
Core Viewpoint - The article discusses the SERES (Semantic-Aware Reconstruction from Sparse Views) method, which addresses the challenges of geometric accuracy, detail restoration, and structural integrity in 3D reconstruction from sparse views, providing a low-cost solution to enhance clarity and completeness of geometry [4][27]. Summary by Sections Introduction to SERES - SERES is developed by a collaborative team from Shanghai Jiao Tong University, the University of Manchester, and the Chinese University of Hong Kong, and has been accepted by IEEE Transactions on Visualization and Computer Graphics [6]. Method Overview - The SERES design focuses on two main lines: semantic matching priors and region-level regularization, integrating these into existing frameworks like NeuS or Neuralangelo without altering the core rendering and implicit surface expressions [8]. Semantic Matching Priors - The method involves extracting stable semantic blocks and geometric primitives from input images, allowing for interactive alignment and aggregation across multiple views, which helps the model recognize corresponding details during training [10][12]. Region-Level Regularization - SERES introduces interpretable region consistency in image space, aligning segmented regions with the model's rendered semantic distribution, which provides strong signals for how shapes should align, effectively reducing noise and improving surface coherence [14][22]. Experimental Results - In sparse multi-view settings, SERES significantly improves reconstruction quality and new view synthesis quality, showing a consistent decrease in geometric error as the number of views increases, indicating stable benefits across varying sparsity levels [17][18]. Conclusion - SERES transforms cross-view semantic consistency and structural region constraints into a low-cost, interpretable, and reusable training prior, making it suitable for integration into current sparse 3D reconstruction workflows, achieving high-fidelity geometry with fewer views [27].
南凌科技股份有限公司关于公司
Shang Hai Zheng Quan Bao· 2025-10-29 23:32
Core Viewpoint - Nanling Technology Co., Ltd. plans to invest 100 million RMB in Zhongke Fangcun (Nanjing) Technology Co., Ltd. to acquire a 12.9521% stake, enhancing its market competitiveness and integrating advanced AI technology into its core business [6][7][22]. Group 1: Investment Overview - The investment consists of 30 million RMB for acquiring 4.9345% equity from Nanjing Qilin Venture Capital Co., Ltd. and 70 million RMB for capital increase to obtain an additional 8.4338% equity [9][12]. - The transaction has been approved by the company's board and does not require shareholder approval, as it falls within the board's authority [8][9]. Group 2: Target Company Information - The target company, Zhongke Fangcun, specializes in computer vision AI technology, providing intelligent operation solutions for industries like electric power and petrochemicals [22]. - The company has a registered capital of 11.775413 million RMB and focuses on technology services, software development, and integrated circuit design [11]. Group 3: Strategic Importance - The investment aims to integrate cutting-edge AI technology into Nanling's "Lingyun Service," enhancing its competitive edge in the "cloud-intelligent network security" service matrix [22][23]. - This strategic move is expected to improve the company's overall business resilience and sustainable competitiveness [23]. Group 4: Financial Aspects - The total investment of 100 million RMB is sourced from the company's own or self-raised funds, ensuring no adverse impact on daily operations or cash flow [23]. - The valuation for the equity acquisition aligns with the latest financing round of the target company, indicating a reasonable investment decision [12][22].
混元3D开源端到端全景深度估计器,代码+精选全景数据已上线,在线可玩
量子位· 2025-10-14 04:08
Core Insights - The article discusses the development of DA, a novel end-to-end panoramic depth estimator by Tencent's Mixed Reality 3D team, which addresses the challenges of panoramic data scarcity and zero-shot generalization capabilities [2][8]. Group 1: Background and Challenges - Panoramic images provide a 360°×180° immersive view, essential for advanced applications like AR/VR and 3D scene reconstruction [5][6]. - Traditional methods for depth estimation in panoramic images are limited due to the scarcity of panoramic depth data and the inherent spherical distortion of panoramic images [10][12]. - The team aims to expand panoramic data and build a robust data foundation for DA [8]. Group 2: Data Augmentation Engine - The team developed a data management engine to convert high-quality perspective depth data into panoramic data, significantly increasing the quantity and diversity of panoramic samples [11][14]. - Approximately 543K panoramic samples were created, expanding the total sample size from about 63K to approximately 607K, addressing the issue of data scarcity [14]. Group 3: Model Architecture and Training - The SphereViT architecture was introduced to mitigate the effects of spherical distortion, allowing the model to focus on the spherical geometry of panoramic images [16][17]. - The training process incorporates distance loss for global accuracy and normal loss for local surface smoothness, enhancing the model's performance [18]. Group 4: Experimental Results - DA demonstrated state-of-the-art (SOTA) performance, with an average improvement of 38% in AbsRel performance compared to the strongest zero-shot methods [23][24]. - Qualitative comparisons showed that DA's training utilized approximately 21 times more panoramic data than UniK3D, resulting in more accurate geometric predictions [27]. Group 5: Application Scenarios - DA's exceptional zero-shot generalization capabilities enable a wide range of 3D reconstruction applications, such as panoramic multi-view reconstruction [28]. - The model can reconstruct globally aligned 3D point clouds from panoramic images of different rooms in a house or apartment, ensuring spatial consistency across multiple panoramic views [29].
苹果拟收购 Prompt AI,后者 Seemour 应用专攻监控识别分析领域
Huan Qiu Wang Zi Xun· 2025-10-11 04:31
Core Viewpoint - Apple is in the final stages of negotiations to acquire Prompt AI, a startup specializing in computer vision, which is seen as a significant move to enhance its smart home security and visual perception ecosystem [1][3]. Group 1: Acquisition Details - The acquisition aims to integrate Prompt AI's core technology and talent into Apple, enhancing its capabilities in smart home security [1]. - Prompt AI's leadership has informed employees about the transaction, indicating that some employees not joining Apple will receive compensation and can apply for other positions within the company [1][3]. - Investors in Prompt AI will recover some funds from the transaction, but not all of their initial investment [1][3]. Group 2: Company Background - Prompt AI was founded in 2023 and raised $5 million in seed funding led by AIX and Abstract Ventures [3]. - Its main product, Seemour, connects with home security cameras and offers advanced recognition and analysis features, including real-time alerts for unusual activities [3]. - Despite the technology's success, Prompt AI's CEO stated that the current business model did not meet expectations, leading to the decision to discontinue the Seemour application and delete user data for privacy [3]. Group 3: Industry Trends - The trend among Silicon Valley tech giants is to acquire AI talent through "acquihire" strategies, which helps enhance R&D capabilities while mitigating regulatory pressures [3]. - Compared to other major tech companies, Apple's acquisition is relatively small; for instance, Meta invested $14.3 billion in Scale AI, and Google spent $2.4 billion on Windsurf [3]. Group 4: Apple's Acquisition Strategy - Historically, Apple has maintained a cautious acquisition strategy, with its largest deal being the $3 billion purchase of Beats in 2014 [4]. - Apple prefers to acquire smaller tech teams to integrate their technology and talent into its product lines for organic upgrades [4]. - The company has been slow in the generative AI space, partly due to its reluctance to engage in large-scale acquisitions, resulting in a 2% decline in its stock price this year [4]. Group 5: Future Implications - If the acquisition is completed, Prompt AI's technology and team are expected to be integrated into Apple's HomeKit smart home division, strengthening its visual perception and security ecosystem [4].
Insta360最新全景综述:全景视觉的挑战、方法与未来
机器之心· 2025-10-04 03:38
Core Insights - The article discusses the transition from perspective vision to panoramic vision, highlighting the "perspective-panorama gap" as a central theme for understanding the challenges and opportunities in this field [6][19]. - It emphasizes the need for a systematic upgrade across data, models, and applications to enhance the usability of panoramic vision technologies [16][19]. Research Background and Motivation - The paper titled "One Flight Over the Gap: A Survey from Perspective to Panoramic Vision" aims to systematically analyze the differences between perspective and panoramic vision, covering over 300 papers and 20 representative tasks [4][19]. - The article provides a comprehensive overview of the challenges faced in panoramic vision, which are categorized into three main gaps: geometric distortion, non-uniform sampling, and boundary continuity [6][9]. Strategies Overview - Four main strategies are identified for adapting tasks to panoramic vision: 1. **Geometric Distortion**: Issues arise when spherical images are projected onto a plane, leading to shape distortion [7]. 2. **Non-uniform Sampling**: Pixel density varies significantly across different regions, affecting resolution [7]. 3. **Boundary Continuity**: The separation of boundaries in 2D images can lead to learning continuity issues [7]. - The article outlines a cross-method comparison to clarify the applicability of different strategies to various tasks [9][15]. Task Toolbox - The article lists over 20 tasks categorized into four main areas: enhancement and assessment, understanding, multi-modal, and generation, along with representative methods and key papers for each task [12][15]. - It highlights the rapid emergence of new paradigms such as diffusion and generative models, particularly in text-to-image/video and novel view synthesis [15]. Future Directions - To transition from "usable" to "user-friendly," advancements must be made in three main areas: data, model paradigms, and downstream applications [16][21]. - Key challenges include: 1. **Data Bottlenecks**: Lack of large-scale, diverse, and high-quality 360° datasets limits general training and reproducible evaluation [21]. 2. **Model Paradigms**: The need for robust models that can adapt from perspective to panoramic vision while maintaining performance across various tasks [21]. 3. **Downstream Applications**: Applications in spatial intelligence, XR, 3D reconstruction, and various industry sectors require effective deployment and compliance [21][22].
凌云光现5笔大宗交易 总成交金额1919.09万元
Zheng Quan Shi Bao Wang· 2025-09-18 14:40
Core Viewpoint - Lingyun Guang conducted five block trades on September 18, with a total trading volume of 446,300 shares and a total transaction amount of 19.19 million yuan, reflecting a discount of 14.03% compared to the closing price of the day [2][4] Group 1: Trading Activity - The total transaction amount for the five block trades was 19.19 million yuan, with an average transaction price of 43.00 yuan per share [2] - In the last three months, Lingyun Guang has seen six block trades with a cumulative transaction amount of 24.11 million yuan [3] - The stock closed at 50.02 yuan on the same day, marking a 12.46% increase, with a daily turnover rate of 9.95% and a total trading volume of 2.23 billion yuan [3] Group 2: Institutional Participation - Institutional proprietary seats appeared in three of the block trades, with a total transaction amount of 12.04 million yuan and a net purchase of 12.04 million yuan [2] - The latest margin financing balance for the stock is 610 million yuan, having increased by 141 million yuan over the past five days, representing a growth of 29.93% [4]
凌云光: 凌云光技术股份有限公司前次募集资金使用情况鉴证报告
Zheng Quan Zhi Xing· 2025-08-29 17:47
Core Viewpoint - The report provides a detailed account of the fundraising activities and the utilization of the raised funds by Lingyun Optical Technology Co., Ltd. as of June 30, 2025, confirming compliance with regulatory guidelines and reflecting the company's financial management practices [1][2][3]. Fundraising and Storage - The company raised a total of RMB 1,973.70 million by issuing 90 million shares at RMB 21.93 per share, with net proceeds amounting to RMB 1,805.28 million after deducting underwriting and other fees [3]. - As of June 30, 2025, the company had a total of RMB 427.21 million in its fundraising accounts, with RMB 399.50 million invested in financial products [16]. Fund Utilization - The report indicates that the company has not changed the investment projects for the raised funds, and it has approved the use of funds for its wholly-owned subsidiary to implement specific projects [8][9]. - The company has allocated funds for various projects, including the Industrial Artificial Intelligence Taihu Industrial Base and the development of intelligent visual equipment, with a total investment of RMB 150 million planned for these initiatives [18]. Project Performance - The report highlights that the actual investment in projects has not deviated from the commitments made during the fundraising process, with no external transfers or replacements of the investment projects reported [11][12]. - The company has achieved a cumulative utilization rate of 28,083.06 for the Industrial Artificial Intelligence Taihu Industrial Base project, although it is still under construction and has not yet generated profits [20]. Cash Management - The company has been authorized to use up to RMB 170 million of idle funds for cash management, investing in safe and liquid financial products, with the aim of optimizing the use of funds [12][13]. - As of June 30, 2025, the company had not used any of the raised funds for share subscriptions, indicating a focus on project investment rather than equity financing [12][16].