3D视觉
Search documents
机器视觉行业深度研究报告(一):从二维识别到三维重构,3D视觉正从“可选配置”走向“刚需标配”
Huachuang Securities· 2026-03-30 14:28
Investment Rating - The report maintains a "strong buy" recommendation for the company Sikan Technology (688583.SH) with a projected EPS of 1.63 CNY for 2025, increasing to 2.36 CNY by 2027, and a PE ratio decreasing from 71.46 in 2025 to 49.24 in 2027 [2] Core Insights - The machine vision industry is transitioning from optional configurations to essential standards, particularly in 3D vision technology, which enhances applications in various sectors [6][8] - The report highlights two major trends in the machine vision industry: the expansion of application scenarios for 3D vision and the penetration of AI algorithms [7] Industry Overview - The industry comprises 637 listed companies with a total market capitalization of 67,631.52 billion CNY and a circulating market value of 56,045.07 billion CNY [3] - The absolute performance of the industry over the past 12 months is 27.3%, with a relative performance of 12.6% [4] 3D Vision Technology - 3D vision technology provides depth, shape, and pose information, enabling recognition, positioning, and scene reconstruction, which is a significant advancement over traditional 2D imaging [11][14] - The 3D vision industry chain consists of upstream hardware suppliers, midstream algorithm developers, and downstream application providers [17][24] Core Technology Paths - The main 3D imaging technologies include binocular vision, structured light, and time-of-flight (TOF) systems, each with distinct advantages and applications [27][28] - Binocular vision is cost-effective and suitable for long distances, while structured light offers high precision at close range, and TOF technology provides stable accuracy in indoor environments [39][40] Market Expansion - The 3D vision technology initially focused on industrial applications, such as high-precision measurements, is now expanding into consumer markets, driven by advancements in components and algorithms [45][50] - The global market for 3D vision products was valued at 12.29 billion CNY in 2022 and is expected to grow to 60.3 billion CNY by 2027, with a CAGR of 26.6% [51] Investment Recommendations - The report suggests focusing on companies like Orbbec, Sikan Technology, and Opto, which are well-positioned in the 3D vision market due to their comprehensive technology and product offerings [65][66]
破解在线长时序重建难题!纯视觉、单卡实时的公里级流式3D重建|CVPR'26
量子位· 2026-03-24 04:59
Core Viewpoint - The article discusses the challenges and advancements in 3D reconstruction for long sequences in real-time settings, highlighting the introduction of LongStream as a solution to these challenges [1][2]. Group 1: Challenges in Long Sequence 3D Reconstruction - Existing methods perform well in short sequences but struggle with real-time long video scenarios, leading to significant issues in accuracy and stability [2]. - Key problems include reliance on the first frame for pose anchoring, attention sink phenomena, and KV cache pollution, which degrade performance over time [5][6]. Group 2: Innovations of LongStream - LongStream introduces a Gauge-decoupled streaming visual geometry architecture that addresses the limitations of traditional methods by: 1. Eliminating first-frame anchoring, allowing for relative pose predictions that enhance robustness in long sequences [10]. 2. Implementing cache-consistent training to minimize the training-inference gap and reduce attention sink effects [11]. 3. Utilizing periodic cache refresh to mitigate memory saturation and geometric drift, maintaining reconstruction consistency [11]. Group 3: Experimental Results - LongStream demonstrates competitive performance across various benchmarks, including KITTI, Waymo, and TUM-RGBD, achieving stable reconstruction with low memory usage and maintaining 18 FPS streaming inference [12][16]. - In comparison to baseline methods, LongStream shows significantly lower Average Trajectory Error (ATE) across multiple datasets, indicating superior long-sequence stability and accuracy [17][18]. Group 4: Importance of LongStream - The significance of LongStream lies in its ability to support continuous online 3D world modeling, which is crucial for applications in robotics, autonomous driving, AR glasses, and embodied AI [19][21]. - This approach shifts the paradigm from offline reconstruction to real-time world maintenance, making it a vital development for future visual systems [22].
Teledyne e2v推出Perciva™ 5D相机:为工业、零售及机器人成像提供近距离无遮挡3D视觉解决方案
Globenewswire· 2026-03-09 00:00
Core Insights - Teledyne e2v has launched the Perciva™ 5D camera, a breakthrough imaging innovation designed for high-quality short-range 3D vision in an economical and reliable manner [1] - The camera addresses the growing demand for depth perception in close and ultra-close applications, utilizing unique angle-sensitive pixel technology and advanced onboard processing for real-time 2D and 3D image fusion [1][2] Group 1: Product Features - The Perciva 5D camera generates 2D and 3D data using a single CMOS sensor, allowing for synchronized output of time-aligned 2D frames and pixel-aligned 3D depth maps [2] - It operates in ambient light mode, making it suitable for both indoor and outdoor use without the need for external near-infrared light sources, thus minimizing overall system costs [2] - The camera is designed for harsh environments, featuring a robust IP6x-rated protective housing and industrial-grade M12 connectors, and supports plug-and-play integration via GenICam-compliant GigE Vision interface [2][3] Group 2: Technical Specifications - The Perciva 5D weighs only 230 grams and consumes less than 5W of power, making it ideal for applications in robotics, retail self-checkout systems, and industrial 3D process monitoring [3] - It supports user-adjustable frame rates or trigger acquisition modes and offers multiple power supply options, ensuring flexibility in various operational settings [3] - The camera seamlessly integrates with Teledyne's Spinnaker® 4 API and SpinView® for 2D/3D visualization processing, and is compatible with major machine vision software platforms [3] Group 3: Industry Context - Teledyne's vision solutions provide a vertically integrated portfolio of comprehensive industrial and scientific imaging technologies, leveraging expertise from various subsidiaries to offer a wide range of sensing and related technologies [4] - The company aims to provide global customer support and technical expertise to tackle challenging tasks, with tools and solutions designed to give customers a competitive edge [4]
研判2026!中国视觉检测系统行业产业链、市场规模及发展趋势分析:智能化趋势下,行业稳健发展[图]
Chan Ye Xin Xi Wang· 2026-02-01 02:28
Core Viewpoint - The visual inspection system is transforming traditional industrial production by integrating automation and intelligence, moving away from reliance on manual inspection. The market size for China's visual inspection system is projected to reach approximately 3.264 billion yuan in 2024, with a year-on-year growth of 9.71% [6]. Industry Overview - The visual inspection system is an automated detection solution based on computer vision technology, utilizing industrial cameras, light sources, image processing, and algorithm modules for non-contact data collection, analysis, and judgment. It is categorized into online and offline inspection systems, with online systems providing real-time, fully automated inspection on production lines, while offline systems offer flexibility and cost advantages [1][3]. Industry Chain - The upstream of the visual inspection system industry includes components such as light sources, industrial lenses, industrial cameras, image sensors, and AI platforms. The midstream involves the manufacturing and system integration of visual inspection systems, while the downstream applications span various sectors including electronics, automotive, semiconductors, and healthcare [3]. Market Size - The visual inspection system market in China is expected to reach approximately 3.264 billion yuan in 2024, reflecting a year-on-year increase of 9.71% [6]. Key Companies' Performance - Key players in the visual inspection system industry include: - **Tianzhun Technology**: Focuses on creating a leading visual equipment platform, with a significant drop in revenue for visual inspection equipment in the first half of 2025, amounting to 0.065 billion yuan, a decrease of 70.81% year-on-year [7]. - **Dahua Technology**: Utilizes AI technology to drive intelligent video perception systems, expanding its product offerings across various sectors [9]. - **Lingyun Optical**: Achieved a revenue of 2.127 billion yuan in the first three quarters of 2025, marking a year-on-year growth of 34.30% [9]. Industry Development Trends 1. **Technological Transformation**: The core technology of visual inspection is shifting from 2D to 3D vision combined with AI, enabling the detection of complex defects and enhancing analysis efficiency [10]. 2. **Application Expansion**: The application of visual inspection technology is broadening from standardized manufacturing to flexible and diverse scenarios, including healthcare and logistics [11]. 3. **Ecosystem Development**: The focus is moving towards high-end breakthroughs and collaborative ecosystem building, emphasizing domestic innovation and reducing reliance on imports [12].
三大“碰一下”龙头股价齐创新高 NFC热潮助推A股科技股
Zhong Guo Ji Jin Bao· 2026-01-12 08:30
Core Viewpoint - The A-share market experienced a significant surge on January 12, 2026, driven by the NFC (Near Field Communication) industry chain, particularly highlighted by Alipay's "Tap" feature, which has transformed a dormant mobile function into a vital connection between the physical and digital worlds, reshaping the value of the entire NFC industry chain [1] Group 1: Company Performance - Lens Technology (300433.SZ) saw its stock price rise by 10% to 42.66 yuan, with a trading volume of 12 billion yuan, indicating high market activity [2] - Lens Technology is a key supplier for Alipay's "Tap" feature, with its stock increasing by 147% since the feature's announcement on July 8, 2024 [2] - The expansion of the "Tap" feature into various high-frequency applications has opened a "second growth curve" for Lens Technology beyond consumer electronics [3] Group 2: Chip Industry Insights - Fudan Microelectronics (688385.SH) is positioned as a leading domestic chip design company, providing essential NFC and security chips for the "Tap" feature, which contributed to its stock price increasing by 9.84% to 98 yuan [4] - Since the announcement of Alipay's "Tap," Fudan Microelectronics has seen its stock rise by over 220%, highlighting the critical role of NFC chips in the user experience [5] - Institutional investors are actively investing in Fudan Microelectronics, reflecting confidence in the company's value within the NFC ecosystem amid a focus on technological self-sufficiency and supply chain security [5] Group 3: 3D Vision Technology - Orbbec (688322.SH) represents the 3D vision sector, with its long-term stock performance reflecting market optimism about future interaction methods [6] - The "Tap" feature signifies a near-field interaction solution, while 3D vision technology is seen as central to spatial interaction, suggesting a convergence of various interaction modalities in future smart devices [6] - The market is positioning companies like Orbbec as integral to the upcoming AI hardware ecosystem, with applications in robotics, the metaverse, and AIoT [7]
奥比中光将携“端侧AI之眼”亮相CES 2026,3D视觉赋能具身智能新生态
Xin Lang Cai Jing· 2026-01-05 04:09
Core Viewpoint - The company, Orbbec (688322.SH), will showcase multiple 3D vision products and its robotic manufacturing capabilities at CES 2026, emphasizing its role as a leader in the robotics and AI vision sectors, focusing on the development of "edge AI eyes" to support embodied intelligence and various AI edge devices [1][6]. Product Matrix and Full Chain Layout - Orbbec will launch several new 3D cameras aimed at humanoid robots and outdoor autonomous mobile robots (AMR) during the exhibition, addressing key needs for precise operation perception, complex environment adaptation, and system collaboration [1][3]. - The company will highlight its collaboration with NVIDIA's Jetson Thor platform, which enhances system integration efficiency for robot manufacturers, facilitating faster product deployment from R&D to market [2][7]. - Orbbec's manufacturing capabilities include providing OEM services for various intelligent hardware, significantly reducing product launch cycles and production costs for clients [2][7]. Industry Positioning and Market Opportunities - The company is strategically positioned in the booming sector of embodied intelligence, with humanoid robots and outdoor AMRs gaining traction in various industries, including ports and mining [3][8]. - The global market for 3D vision devices in humanoid robots is projected to reach 160 billion yuan by 2030, driven by the increasing adoption of 3D sensors among leading manufacturers [3][8]. - Orbbec holds approximately 70% market share in China's 3D vision sensor market and 72% in South Korea's commercial and industrial mobile robot market, demonstrating its strong market penetration and commercial viability [4][9]. Historical Development and Achievements - Since its debut at CES in 2014, Orbbec has evolved from showcasing 3D camera products to developing comprehensive solutions, establishing a robust capability matrix that includes core technology, standard products, scene solutions, and manufacturing services [5][10]. - The company has filed nearly 2,000 patents in the 3D perception field, maintaining a leading position in intellectual property reserves globally [10]. - In the first three quarters of 2025, Orbbec reported revenues of 714 million yuan, a year-on-year increase of 103.5%, and a net profit of 108 million yuan, marking a significant turnaround towards high-quality development [6][10].
厘米级精度的三维场景实时重构!这款激光扫描仪太好用了~
自动驾驶之心· 2025-12-17 00:03
Core Viewpoint - The article introduces the GeoScan S1, a highly cost-effective handheld 3D laser scanner designed for various applications, emphasizing its advanced features and capabilities in real-time 3D mapping and data collection [3][11]. Group 1: Product Features - GeoScan S1 offers a lightweight design with a one-button startup, enabling efficient and practical 3D solutions [3][6]. - It utilizes a multi-modal sensor fusion algorithm to achieve centimeter-level precision in real-time 3D scene reconstruction, capable of generating 200,000 points per second and covering a measurement distance of up to 70 meters [3][31]. - The device supports scanning areas over 200,000 square meters and can be equipped with a 3D Gaussian data collection module for high-fidelity scene restoration [3][53]. Group 2: Technical Specifications - The GeoScan S1 operates on a hand-held Ubuntu system and integrates various sensor devices, including RTK, IMU, and dual wide-angle cameras, ensuring high precision and data synchronization [5][36]. - It features a relative accuracy of better than 3 cm and an absolute accuracy of better than 5 cm, with a battery life of approximately 3 to 4 hours [24][25]. - The device dimensions are 14.2 cm x 9.5 cm x 45 cm, weighing 1.3 kg without the battery and 1.9 kg with the battery [24]. Group 3: Market Position and Pricing - The GeoScan S1 is positioned as the most cost-effective handheld 3D laser scanner in the market, with a starting price of 19,800 yuan [11][60]. - Various versions are available, including a basic version, a depth camera version, and online/offline 3DGS versions, catering to different user needs and budgets [60][61]. Group 4: Application Scenarios - The GeoScan S1 is suitable for a wide range of environments, including office buildings, parking lots, industrial parks, tunnels, forests, and mines, effectively completing 3D scene mapping [40][49]. - It supports cross-platform integration, making it compatible with drones, unmanned vehicles, and robotic systems for automated operations [47].
华为Mate80全系支持3D人脸识别,产业链需求激增
Xuan Gu Bao· 2025-11-25 15:03
Group 1 - Huawei officially launched the Mate 80 series smartphones, which support 3D facial recognition across the entire series [1] - The Mate 80 series features 3D ToF technology, ensuring financial-grade payment security and supporting over 150 mainstream applications for 3D facial login or payment [1] - Dongwu Securities predicts that 2024 will be the year of explosion for the 3D visual industry, with expanding application scenarios and increasing demand for high-precision perception and autonomous operation [1] Group 2 - Orbbec has applied its 3D visual sensors in various payment scenarios, including offline retail, self-service kiosks, dining, healthcare, and transportation [2] - OFILM leverages its optical technology and automated manufacturing capabilities to expand into new fields such as smart locks, VR/AR, machine vision, and action cameras [2]
这台3D扫描仪,重建了整个隧道和公园~
自动驾驶之心· 2025-11-25 00:03
Core Viewpoint - The article introduces the GeoScan S1, a highly cost-effective handheld 3D laser scanner designed for various applications, emphasizing its advanced features and capabilities in real-time 3D mapping and data collection [3][11]. Group 1: Product Features - GeoScan S1 offers a lightweight design with a one-button start for efficient 3D scanning solutions, achieving centimeter-level precision in real-time scene reconstruction [3][6]. - The device can generate point clouds at a rate of 200,000 points per second, with a maximum measurement distance of 70 meters and 360° coverage, suitable for large scenes over 200,000 square meters [3][31]. - It integrates multiple sensors, including RTK, IMU, and high-resolution cameras, enabling high-precision mapping and data collection in complex environments [24][36]. Group 2: Technical Specifications - The GeoScan S1 operates on Ubuntu 20.04 and supports various data export formats such as PCD, LAS, and PLY, with relative accuracy better than 3 cm and absolute accuracy better than 5 cm [24][29]. - The device dimensions are 14.2 cm x 9.5 cm x 45 cm, weighing 1.3 kg without the battery and 1.9 kg with the battery, and it has a battery capacity of 88.8 Wh, providing approximately 3 to 4 hours of operation [24][25]. - It features a 5.5-inch touchscreen and supports wireless connectivity via Wi-Fi and Bluetooth, along with multiple external expansion options [25][24]. Group 3: Applications and Market Position - GeoScan S1 is suitable for various applications, including urban planning, construction monitoring, and environmental surveying, capable of operating in diverse settings such as office buildings, industrial parks, tunnels, and forests [40][49]. - The product is positioned as the most cost-effective option in the market, with a starting price of 19,800 yuan for the basic version, catering to a wide range of user needs [11][60]. - The device supports cross-platform integration, making it compatible with drones, unmanned vehicles, and robotic systems for automated operations [47][49].
3D视觉被过度设计?字节Depth Anything 3来了,谢赛宁点赞
具身智能之心· 2025-11-17 00:47
Core Insights - The article discusses the release of Depth Anything 3 (DA3) by a team from ByteDance, which enhances monocular depth estimation across various perspectives, achieving human-like spatial perception [5][12]. - DA3 simplifies 3D modeling by utilizing a standard Transformer architecture, demonstrating significant improvements in pose estimation (44% increase) and geometric estimation (25% increase) compared to state-of-the-art methods [7][12]. Group 1: Model Features and Innovations - DA3 is capable of predicting spatially consistent geometric shapes from any number of visual inputs, regardless of known camera poses [12]. - The model employs a simple Transformer backbone and a single depth ray prediction target, avoiding the complexities of multi-task learning [12]. - A key improvement is the input-adaptive cross-view self-attention mechanism, which allows efficient information exchange across views [13]. Group 2: Training and Evaluation - The training process utilizes a teacher-student paradigm to unify various training data formats, including real-world depth camera captures and synthetic data [14]. - A new visual geometry benchmark has been established, with DA3 achieving state-of-the-art results across 10 tasks, improving camera pose accuracy by 35.7% and geometric accuracy by 23.6% [15]. Group 3: Applications and Potential - DA3 demonstrates capabilities in video reconstruction, large-scale SLAM, and multi-camera spatial perception, enhancing understanding in autonomous driving and robotics [18][20][24]. - The model's design has attracted interest from developers looking to integrate this efficient approach into their projects, indicating its practical applicability [26].