Computer Vision
Why Doesn't Tesla Use Lidar?
半导体行业观察· 2026-02-16 01:58
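In the high-stakes race to solve autonomous driving, a deep philosophical and technical divide has emerged over the years. On one side is nearly the entire automotive and technology industry, which advocates sensor fusion: a belt-and-braces approach that combines cameras, radar, and lidar to build a redundant, multi-layered view of the world. On the other side, Tesla stands alone, having made a bold and controversial bet on a single modality: pure camera-based vision. Tesla's decision to actively remove and disable hardware such as radar on its vehicles has drawn widespread skepticism, but the move stems from its deep understanding of, and fundamental belief about, the nature of artificial and natural intelligence. To understand why Tesla made this choice, one must first understand exactly what Tesla abandoned.

What is sensor fusion?

The concept of sensor fusion is fairly simple. It aims to exploit the distinct strengths of different sensor types to build a unified, highly robust model of the vehicle's surroundings. Each sensor has its own strengths and weaknesses, and in theory fusing them compensates for each sensor's shortcomings.

Cameras provide the richest, highest-resolution data, perceiving the colors and textures of the world much as the human eye does. They can read the text on road signs, recognize the color of traffic lights, and understand complex visual scenes. Their main drawbacks are that they are vulnerable to bad weather and low light, and that they struggle to measure relative velocity.

Radar excels at measuring the distance and speed of objects and keeps working even in bad weather. It can easily penetrate rain, ...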
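The idea that fusing sensors can offset each one's weaknesses can be illustrated with a minimal sketch. It assumes two independent distance estimates of the same object and combines them by inverse-variance weighting, a textbook technique chosen here for illustration; the article does not say which fusion method any manufacturer uses. The `Estimate` class, the `fuse` function, and all numbers are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Estimate:
    """A range measurement with its uncertainty (hypothetical structure)."""
    distance_m: float  # measured distance to the object, in meters
    variance: float    # measurement variance, in square meters


def fuse(estimates: list[Estimate]) -> Estimate:
    """Combine independent estimates by inverse-variance weighting."""
    # A more certain sensor (smaller variance) gets a larger weight.
    weights = [1.0 / e.variance for e in estimates]
    total = sum(weights)
    fused_distance = sum(w * e.distance_m for w, e in zip(weights, estimates)) / total
    # The fused variance is smaller than that of any single input.
    return Estimate(distance_m=fused_distance, variance=1.0 / total)


# Illustrative numbers only: a noisy camera depth estimate and a tighter radar range.
camera = Estimate(distance_m=52.0, variance=9.0)
radar = Estimate(distance_m=50.0, variance=1.0)
fused = fuse([camera, radar])
print(f"fused distance: {fused.distance_m:.2f} m, variance: {fused.variance:.2f}")
```

With these made-up inputs the fused estimate sits close to the radar reading, and its variance is lower than either sensor's alone, which is the redundancy argument made by proponents of fusion.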
CVPR 2026 Workshop Call for Papers | From Perception to Reasoning, ViSCALE 2.0 Invites You to Reshape Computer Vision's System 2
机器之心· 2026-02-13 04:19
Core Insights
- The article discusses the evolution of computer vision towards a new paradigm, emphasizing the transition from basic pixel perception to complex spatial reasoning and world modeling, facilitated by Test-time Scaling (TTS) [2][5]
- The upcoming ViSCALE 2.0 workshop at CVPR 2026 aims to gather leading scholars to explore breakthroughs in visual models through computational expansion, focusing on deep reasoning rather than mere static outputs [4][5]

Group 1: Workshop Highlights
- The workshop will feature discussions on spatial intelligence and world models, with contributions from top scholars including Sergey Levine, Manling Li, and Ziwei Liu [5]
- The workshop encourages innovative research submissions that challenge existing visual model limitations, providing a platform for both theoretical and application-focused studies [7]

Group 2: Key Topics of Discussion
- The workshop will cover various topics, including:
  - Enhancing video generation's physical consistency and long-term causal reasoning through TTS [6]
  - Breaking 2D limitations to enable models to navigate and operate in 3D spaces like humans [6]
  - Developing visual reasoning chains that allow models to self-correct and engage in multi-step reasoning [6]
  - Exploring scaling laws that relate test-time compute to visual reasoning performance [6]

Group 3: Submission Details
- The workshop invites submissions in two tracks: Full Papers (8 pages) and Extended Abstracts (up to 4 pages), with specific formatting requirements [9]
- Important deadlines include submission by March 10, 2026, and notification of acceptance by March 18, 2026 [9]
Shandong to Launch Open-Competition ("Jiebang Guashuai") Corpus Projects in High-End Equipment and Other Sectors
Da Zhong Ri Bao· 2026-02-06 01:06
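Corpus must contain at least 100,000 entries at project acceptance

This reporter learned from the Shandong Provincial Department of Industry and Information Technology that Shandong will open applications for open-competition ("jiebang guashuai") corpus projects covering high-end equipment; tobacco products; agricultural and sideline food processing; furniture manufacturing; wood processing; leather, fur, feather and related products and footwear; instrument and meter manufacturing; and comprehensive utilization of waste resources. The projects will focus on tackling key industry data technologies, developing industry corpus standards, building high-quality industry corpora, and putting corpus application scenarios into practice.

The key-industry corpus projects focus on gathering knowledge corpora for key stages and specific scenarios in major industrial manufacturing sectors, such as basic theoretical research, product R&D and design, production management and operation, and in-process quality inspection. Drawing on structured, unstructured, and semi-structured data that has been cleaned, denoised, and converted to a unified format, the projects are meant to produce high-quality corpora that support tasks such as natural language processing, computer vision, machine learning, and deep learning, and that meet the needs of developing, training, and fine-tuning industry-level or scenario-level large models. At project acceptance, the industry corpus must contain at least 100,000 entries; show high data quality, domain coverage, potential value, and application results; and pass a third-party evaluation. Shandong also encourages corpus projects in every industry to speed up the consolidation of corpus resources and to actively open up public corpora. (Reporter: Fu Yuting) ...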
Umbrella: "I Can Fly!" Human: "I'm Soaked!" This Hardcore Invention Is All About Companionship
机器人大讲堂· 2026-01-31 04:07
Group 1
- The article discusses the development of a flying umbrella by Canadian engineer John Tse, which autonomously hovers above the user to provide rain protection [1][5]
- The umbrella utilizes drone technology, featuring four propellers for lift and a depth camera for tracking the user's head position, allowing it to maintain a stable hover [7][9]
- Despite its innovative design, the umbrella's practicality is questioned, as it must hover several meters above the user, resulting in limited rain coverage [5][20]

Group 2
- The second-generation flying umbrella improves upon the first by eliminating the need for manual remote control, which was criticized for being cumbersome [7]
- The depth camera used in the umbrella can function effectively in low-light conditions, enhancing its tracking capabilities compared to standard cameras [9][12]
- The umbrella's design includes a foldable mechanical arm structure, allowing it to be compact and portable when not in use [11][14]

Group 3
- The development process involved numerous challenges, including hardware failures and software issues, extending the project timeline to nearly a year [18]
- The umbrella's current battery life is limited to 10-15 minutes, similar to small consumer drones, raising concerns about its usability in crowded areas [20]
- The project highlights the potential of personal creators to leverage existing drone technology and open-source hardware to create innovative solutions, even if they are not immediately practical [20]
BOE Obtains Patent for Computer Vision-Based Group Identification Technology
Sou Hu Cai Jing· 2026-01-24 03:34
Group 1
- BOE Technology Group Co., Ltd. has obtained a patent for a "computer vision-based group identification method and device," with authorization announcement number CN116597382B; the application was filed in May 2023 [1]
- BOE Technology Group, established in 1993 and located in Beijing, primarily engages in the manufacturing of computers, communications, and other electronic devices, with a registered capital of 37,413.880464 million RMB [1]
- The company has invested in 73 enterprises, participated in 303 bidding projects, and holds 775 trademark records and 5,000 patent records, along with 47 administrative licenses [1]

Group 2
- Beijing BOE Technology Development Co., Ltd., established in 2016 and also located in Beijing, focuses on the manufacturing of computers, communications, and other electronic devices, with a registered capital of 38 million RMB [1]
- This subsidiary has invested in 1 enterprise, participated in 92 bidding projects, and holds 3,871 patent records, along with 4 administrative licenses [1]
AI Vision Provider Jishi Jiao (极视角) Files with the Hong Kong Stock Exchange; Operating Cash Flow Is Negative
Mei Ri Jing Ji Xin Wen· 2026-01-22 14:47
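According to the prospectus, Jishi Jiao focuses on providing enterprises with AI computer vision solutions and large model solutions. Its AI computer vision solutions comprise standard AI computer vision solutions, customized AI computer vision solutions, and software-defined integrated AI solutions.

By business line, in 2022, 2023, 2024, and the first three quarters of 2025 (hereinafter the "reporting period"), Jishi Jiao's revenue came mainly from AI computer vision solutions, which accounted for 100%, 100%, 75.9%, and 81.8% of revenue respectively. Over the same period, the revenue share of large model solutions rose from 0% to 19.2%.

A Mei Ri Jing Ji Xin Wen reporter noted that Jishi Jiao provides solutions to customers in vertical sectors such as industry, energy, retail, and transportation. By customer type, its customers during the reporting period were mainly private enterprises, which contributed 94.7%, 36.7%, 58%, and 69.6% of revenue respectively, a highly volatile pattern.

The geographic distribution of Jishi Jiao's revenue has swung like a roller coaster, suggesting that the business may depend too heavily on a few large projects in particular regions rather than on a broad, stable nationwide sales network. For example, East China was still the company's core region in 2023, accounting for 65% of revenue, but by the first three quarters of 2025 its share had plunged to 32.1%. Over the same period, South China's share of revenue rose from 18.6% to 56.0%.

Receivables ...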
Call for Papers Opens for the Second CV4CHL Workshop at CVPR 2026: Safeguarding Children's Future with Large AI Models
机器之心· 2026-01-22 03:13
Core Insights
- The article discusses the rapid development of multimodal large language models and embodied AI, highlighting that AI and computer vision technologies focused on children's development, health, and education are still in their infancy [2]
- The CV4CHL workshop aims to bridge interdisciplinary perspectives on pediatric AI and computer vision solutions, addressing critical gaps in the field [2]

Event Details
- The CV4CHL workshop is organized by PediaMed AI in collaboration with several prestigious institutions, including the University of Illinois Urbana-Champaign, Hong Kong University of Science and Technology (Guangzhou), ETH Zurich, and Shenzhen Children's Hospital [2]
- The workshop will take place during CVPR 2026, scheduled for June 3-7, 2026, in Denver, Colorado, USA [7][6]

Key Topics
- The workshop will cover various themes, including:
  - Foundation models inspired by human children's learning and cognitive abilities, and cutting-edge research on multimodal large language models [6]
  - Brain-computer interface technologies for children [6]
  - Frontiers in human-computer interaction with augmented reality glasses and smart glasses for children [6]
  - Applications of embodied AI in pediatrics [6]
  - Computer vision and foundation models related to children's cognitive development, such as gaze and gesture analysis [6]
  - Pediatric smart healthcare, including early disease screening and medical imaging and video analysis [6]
  - AI-enabled education, including smart educational tools and assistive technologies for children with special needs [6]
  - AI support for children's and adolescents' mental health [6]
  - Ethical and social implications of children's AI technologies, including privacy protection and human-robot interaction [6]

Submission Information
- The submission deadline for the workshop is March 31, 2026, with notification of review results by April 8, 2026 [6]
- The workshop will feature both proceeding and non-proceeding submission tracks, with specific page limits for each [8]
IPO News | Jishi Jiao's Hong Kong IPO and "Full Circulation" of Unlisted Domestic Shares Receive CSRC Filing
智通财经网· 2026-01-21 11:09
Group 1
- The China Securities Regulatory Commission has issued a notice regarding Shandong Jishi Jiao Technology Co., Ltd.'s plan to issue up to 20.0634 million overseas listed ordinary shares and list them on the Hong Kong Stock Exchange [1]
- The company aims to convert a total of 99,872,436 shares held by 31 shareholders from unlisted domestic shares to overseas listed shares for circulation on the Hong Kong Stock Exchange [1]

Group 2
- Jishi Jiao is a provider of AI computer vision solutions in China, offering end-to-end solution development, deployment, and management services across various industries [3]
- According to Frost & Sullivan, the company ranks eighth in the emerging enterprise-level computer vision solutions market in China based on projected revenue for 2024 [3]

Group 3
- A detailed list of shareholders applying for the conversion of shares includes notable entities such as Chen Zhenjie with 16,114,821 shares and Qualcomm (China) Holdings Limited with 4,990,208 shares [4][5]
- The total number of shares being converted by all shareholders amounts to 99,872,436 shares [5]
Jishi Jiao Files with the Hong Kong Stock Exchange; CITIC Securities Acts as Sole Sponsor
Group 1
- The core viewpoint of the article is that Jishi Jiao has submitted an application for listing on the Hong Kong Stock Exchange, with CITIC Securities acting as the sole sponsor [1]
- According to Frost & Sullivan's report, Jishi Jiao ranks eighth in the emerging enterprise-level computer vision solutions market in China, with a market share of 1.6% based on projected revenue for 2024 [1]
- The company is the third largest software-centric provider in the same market segment, focusing on AI computer vision solutions, including standard, customized, and software-defined integrated AI solutions, as well as large model solutions [1]

Group 2
- Jishi Jiao serves clients across various verticals, including industrial, energy, retail, and transportation, primarily in China, catering to enterprises, government, and universities [1]
- The market size for enterprise-level computer vision solutions in China is expected to grow from RMB 36.8 billion in 2024 to RMB 182.4 billion by 2029, with a compound annual growth rate (CAGR) of 37.7% [1]
- The emerging enterprise-level computer vision solutions market in China is projected to increase from RMB 11.1 billion in 2024 to RMB 97 billion by 2029, with a CAGR of 54.3%, and its penetration rate in the overall market is expected to rise to 53.2% [1]
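The growth figures quoted above can be cross-checked with a few lines of arithmetic. The sketch below assumes the standard CAGR definition, (end / start) ** (1 / years) - 1, applied over the five years from 2024 to 2029, and assumes the penetration rate is the emerging segment's 2029 size divided by the overall 2029 market size. The function and variable names are illustrative only.

```python
def cagr(start_value: float, end_value: float, years: int) -> float:
    """Compound annual growth rate between two values over a number of years."""
    return (end_value / start_value) ** (1.0 / years) - 1.0


# Market sizes in RMB billion, taken from the summary above.
overall = cagr(36.8, 182.4, years=5)   # overall enterprise-level CV solutions market
emerging = cagr(11.1, 97.0, years=5)   # emerging enterprise-level CV solutions segment
penetration_2029 = 97.0 / 182.4        # assumed definition: emerging share of overall market

print(f"overall CAGR:     {overall:.1%}")           # 37.7%
print(f"emerging CAGR:    {emerging:.1%}")          # 54.3%
print(f"2029 penetration: {penetration_2029:.1%}")  # 53.2%
```

Under these assumptions the computed values match the 37.7%, 54.3%, and 53.2% reported in the summary.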
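[Global Call] Awakening a Millennia-Old Civilization with AI! Tanyuan Plan (探元计划) NextGen Digital-Intelligence Revitalization Track: Five Cultural Scenarios Await Your Bid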
腾讯研究院· 2026-01-20 09:53
Core Viewpoint
- The article emphasizes the integration of advanced technologies like AI to revitalize cultural heritage and enhance public engagement with historical narratives and experiences [2][56].

Group 1: Cultural Revitalization through Technology
- The initiative aims to create immersive experiences that allow users to interact with cultural heritage, such as AI-generated historical narratives and personalized experiences [2][5].
- The "NextGen" plan by Tencent focuses on leveraging cutting-edge technologies to address the challenges of cultural heritage revitalization, aiming to create new forms of expression and engagement [5][56].

Group 2: Specific Topics and Challenges
- The program identifies three main topics for innovation:
  1. Development of multi-modal intelligent agents for cultural content generation [5].
  2. Creation of immersive interactive experiences that combine sensory data and emotional computing [6].
  3. Human-machine collaboration for the transmission and development of traditional crafts through digital means [7].

Group 3: Specific Cultural Scenarios
- Five specific cultural scenarios have been outlined for technological application:
  1. "Cloud Residence Intelligent Companion" for enhancing public understanding of historical texts [8][9].
  2. "Hangzhou West Lake Experience" focusing on personalized immersive tourism experiences [15][16].
  3. "Dawenkou Culture Interactive Experience" to facilitate understanding of ancient pottery techniques [19].
  4. "Bridge Wisdom Transmission" for teaching traditional wooden bridge construction techniques [29].
  5. "Cantonese Lion Dance Digital Activation" to enhance interaction and experience in traditional performances [36].

Group 4: Collaboration and Support
- The initiative invites global technology teams to collaborate with cultural institutions to propose innovative solutions, with funding and resources available for selected projects [43][52].
- The project will undergo a structured process from proposal submission to implementation, ensuring thorough evaluation and support [48][50].