计算机视觉
Search documents
AI商品替换技术:智能图像处理在电商领域的创新实践
Sou Hu Cai Jing· 2026-01-16 15:31
Core Insights - The emergence of AI-based product substitution tools is driven by the explosive growth of visual content demand in the e-commerce industry, transitioning from traditional manual editing to automated solutions [1][2] - These tools utilize advanced technologies such as semantic segmentation, Generative Adversarial Networks (GAN), and image synthesis to achieve seamless element replacement while maintaining image quality [1][2] - The ongoing iteration of AI technology is expected to further enhance the accuracy and naturalness of product substitution, providing more efficient visual solutions for the e-commerce sector [2] Group 1: Technology and Functionality - AI product substitution tools can automatically identify and replace specific elements in product images, significantly improving processing efficiency [1][4] - Keevx's intelligent product substitution system employs a multi-stage neural network architecture for pixel-level precision while preserving original image quality, optimized for cross-border e-commerce [1] - Mokker AI focuses on background replacement with a vast scene library and intelligent light-matching algorithms, catering to small and medium-sized businesses [2] Group 2: Market Impact and Efficiency - The implementation of intelligent product substitution systems can enhance material production efficiency by 45% [4] - The use of knowledge graph technology in SeeAny AI's engine allows for automatic recommendations of suitable replacement elements, reducing user operational barriers [2] - The transition to AI-driven visual content production addresses the high costs and long cycles associated with traditional product image creation [2]
东风汽车申请小目标检测方法专利,提高小目标在边缘设备上的检测精度和速度
Jin Rong Jie· 2026-01-12 00:26
国家知识产权局信息显示,东风汽车集团股份有限公司申请一项名为"小目标检测方法、系统、电子设 备及计算机可读介质"的专利,公开号CN121305026A,申请日期为2025年9月。 专利摘要显示,本发明提供了一种小目标检测方法、系统、电子设备及计算机可读介质,属于计算机视 觉技术领域,该方法包括获取实时图片;将实时图片输入小目标检测模型的骨干网络进行特征提取,得 到多尺度特征图;骨干网络包括重复可分组视觉变换器网络;将多尺度特征图输入小目标检测模型的特 征融合网络,得到融合后的特征图;将融合后的特征图输入小目标检测模型的检测头网络,得到目标检 测结果。本发明采用重复可分组视觉变换器网络对实时图片提取不同尺度大小的特征图,在训练阶段增 加网络深度和宽度增强模型的特征提取能力,在推理阶段通过网络结构重参数化简化模型结构,加快推 理速度,实现轻量化小目标检测模型,提高小目标在边缘设备上的检测精度和速度。 声明:市场有风险,投资需谨慎。本文为AI基于第三方数据生成,仅供参考,不构成个人投资建议。 财经频道更多独家策划、专家专栏,免费查阅>> 天眼查资料显示,东风汽车集团股份有限公司,成立于2001年,位于武汉市,是一家 ...
这项技术,颠覆芯片堆叠
半导体行业观察· 2026-01-09 01:53
Core Insights - MIT researchers have developed a new solution to address energy consumption issues in data transfer between logic circuits and memory, proposing a stacked structure that integrates logic and memory transistors in the backend of traditional CMOS chips [1][2][8] Group 1: Research Findings - The new architecture involves adding active device layers in the backend of the chip, allowing for a compact vertical stack that reduces energy and time consumption during data transfer [1][2] - The key device in this stack is a BEOL transistor with an amorphous indium oxide channel layer, which can be "grown" at approximately 150°C, preventing damage to underlying circuits [2][10] - The integration of ferroelectric hafnium zirconium oxide (HZO) layers has resulted in BEOL transistors with a switching speed of 10 nanoseconds and a size of about 20 nanometers, achieving low operating voltage compared to similar devices [4][11] Group 2: Manufacturing Process - The manufacturing process focuses on controlling defects in the indium oxide layer, which is only about 2 nanometers thick, optimizing it to ensure fast and clean switching of transistors [4][11] - The new method allows for the stacking of active components without the high temperatures typically required in front-end processes, thus preserving existing components [2][10] Group 3: Applications and Future Directions - This technology is expected to significantly benefit workloads dominated by memory traffic, such as AI inference and deep learning, by reducing energy consumption in data-centric computing [6][9] - Future plans include integrating backend storage transistors into single circuits and further optimizing the control of ferroelectric layer properties [12]
理想多次合作的年轻学者之浙大彭思达
理想TOP2· 2026-01-08 15:59
Core Insights - The article discusses the academic contributions and achievements of Sida Peng, highlighting his research in computer vision and graphics, particularly in depth estimation and urban scene modeling [1][2][3]. Research Contributions - Sida Peng has authored multiple influential papers, including "InfiniDepth," "StreetCrafter," and "Street Gaussians," focusing on advanced techniques in depth estimation and urban scene synthesis [1][2]. - His work has led to significant advancements in video generation and dynamic urban modeling, utilizing precise laser radar data for improved accuracy [2]. Citation Metrics - Since 2021, Sida Peng's citation count has increased significantly, reaching 7,470 citations across 36 papers, with 58 papers cited at least 10 times [3][4]. - His h-index stands at 36, indicating a strong impact in his field, with notable papers such as "PVNet" and "Implicit Neural Representations" receiving high citation counts [4]. GitHub and Open Source Contributions - Sida Peng has a substantial following on GitHub, with 2.3K followers and numerous high-citation articles that have open-sourced their code, contributing to the academic community [5][6]. - His repositories include projects related to neural representations and real-time instance segmentation, showcasing his commitment to sharing knowledge and resources [5]. Academic Guidance and Mentorship - The article emphasizes the importance of mentorship in academic growth, detailing how Sida Peng's experiences under Professor Zhou Xiaowei have shaped his research capabilities and achievements [11]. - It highlights the supportive environment provided by mentors, which is crucial for developing research skills and achieving significant academic milestones [11].
地缘技术|“武士精神”正在重回硅谷
Xin Lang Cai Jing· 2026-01-06 23:41
Core Insights - Palantir Technologies, founded by Peter Thiel, has established itself as a significant player in the U.S. military-industrial complex, surpassing traditional defense contractors like Raytheon [3] - The company signed a $10 billion long-term contract with the U.S. Army in August 2025 to provide data support and software services over the next decade, marking the largest digital military contract awarded to a tech company [3] - The relationship between Silicon Valley and U.S. military security strategy has deep historical roots, with technological advancements often driven by military needs [5][6] Group 1: Company Overview - Palantir is a software and big data technology company based in Silicon Valley, co-founded by Peter Thiel, who is also known for his book "Zero to One" [1] - The CEO, Alexander Karp, is a notable figure with a Ph.D. from Goethe University and has been actively involved in public debates on technology and ethics [3] Group 2: Military Contracts and Collaborations - The $10 billion contract with the U.S. Army is a significant milestone for Palantir, emphasizing its role in modern military operations [3] - The U.S. Department of Defense has increasingly relied on Silicon Valley for technological advancements, with companies like Palantir benefiting from military contracts [6][7] Group 3: Technological Innovation and Defense Strategy - The emergence of new technologies such as artificial intelligence and quantum computing is central to U.S. military strategy, particularly in response to China's advancements [8][9] - The U.S. military is focusing on creating a "data-driven, AI-enabled army" to counter challenges posed by China, with initiatives like the "Replicator" project aimed at deploying low-cost autonomous systems [8][9] Group 4: Impact of Global Conflicts - The ongoing Russia-Ukraine conflict has accelerated the adoption of emerging technologies in military operations, benefiting companies like Palantir and Anduril [10] - The U.S. Department of Defense's Project Maven aims to leverage AI for military applications, with Palantir playing a crucial role in developing the necessary data integration platforms [12] Group 5: Geopolitical Context - The U.S. government's strategic focus on countering China's military capabilities has led to increased investments in defense technology, with Palantir positioned to capitalize on this trend [14] - The narrative of a "China threat" is being utilized by Silicon Valley companies to secure funding and contracts, with Palantir's software being used to monitor Chinese activities [14]
北大90后副教授董豪出任上纬启元首席科学家, 研究方向聚焦具身智能
Xin Lang Cai Jing· 2026-01-06 13:28
1月6日,据第一财经,北京大学90后长聘副教授董豪加入上纬新材的个人机器人业务品牌上纬启元,并 出任上纬启元的首席科学家。上纬启元表示,董豪将聚焦具身智能模型领域的技术研发与战略布局。 公开资料显示,董豪现任北京大学计算机学院前沿计算研究中心的长聘副教授,是科技创新2030国家重 大项目负责人(首席科学家),入选国家级高层次青年人才计划。研究方向聚焦于具身智能、大模型、 强化学习、计算机视觉以及相应的开源系统。核心目标是探索并构建兼具成本效益与通用性的机器人系 统。长期以来,始终深耕开源AI系统领域,牵头负责了多个开源项目。 出任上纬启元的首席科学家相关信息,已在董豪个人学术主页中更新确认。 业内分析认为,董豪教授在具身智能领域的深厚积淀,与上纬启元的技术研发需求高度契合,其加入将 进一步强化企业在核心算法、通用机器人系统等关键领域的竞争力。此次引入首席科学家,也表明上纬 新材正在强化其在消费级人形机器人的顶层科研布局。 今年7月8日,上纬新材公告,智元机器人拟通过其与核心团队共同出资设立的持股平台,以"协议转让 +要约收购"组合方式获取公司控制权。11月25日晚,上纬新材公告,公司第四届董事会第一次会议,全 ...
商汤Kapi相机跃居中国区App Store「摄影与录像」榜首,此前在海外多国热度领先
Xin Lang Cai Jing· 2025-12-31 16:01
Core Insights - SenseTime's Kapi camera has topped the Apple App Store's "Photography & Video" category in China as of December 31, showcasing its strong market performance and innovative features [2][7] - The Kapi camera, launched on December 20, is positioned as the first true "AI photography assistant," breaking traditional app limitations and offering advanced features like scene recognition and filter recommendations [2][3][8] - The success of Kapi camera indicates a significant shift in consumer-level AI applications, moving from experimental technology to mainstream usage, thus enhancing confidence in AI commercialization [5][10] Product Performance - Kapi camera achieved the number one spot in the Philippines App Store free app category on December 8 and has maintained a strong presence in the "Photos & Videos" category across multiple countries, including the UK, Italy, France, and Germany [3][8] - The app's technology includes a professional image processing workflow that replicates the Apple Log curve, allowing users to capture images with cinematic quality and rich detail without post-processing [3][8] Technological Advancements - SenseTime's SenseNova V6.5 Pro ranked first in the domestic evaluation by SuperCLUE, scoring 75.35 and achieving the highest score in visual reasoning among domestic models [4][9] - The report highlights that SenseTime's models are approaching the average level of leading models in basic cognitive dimensions, with SenseNova V6.5 exceeding the average in visual reasoning, indicating a competitive edge in the industry [9][10]
中国关税新增两类机器人税目
第一财经· 2025-12-30 07:55
Core Viewpoint - The article discusses the recent changes in China's tariff schedule for 2026, particularly the introduction of new tariff categories for robots, including intelligent bionic robots and cleaning robots, aimed at supporting technological development and the circular economy [3][5]. Group 1: Tariff Adjustments - The 2026 tariff schedule includes new categories for intelligent bionic robots with a most-favored-nation (MFN) rate of 0% and a general rate of 30% [3][5]. - Two new categories for cleaning robots have been added, with MFN rates of 8% and 0%, and general rates of 130% and 30% respectively [3][5]. Group 2: Definition and Features of Intelligent Bionic Robots - Intelligent bionic robots are defined as autonomous or semi-autonomous robots that closely mimic the appearance, structure, or functions of humans or animals [4]. - These robots are equipped with various sensors and technologies, enabling them to perform tasks such as environmental perception, autonomous path planning, and human interaction through natural language processing [4]. Group 3: Industry Impact and Future Outlook - The addition of these tariff categories is expected to help industries and companies better understand trade data and assess overseas market trends [5]. - The tariff adjustments also include a temporary import tax rate below the MFN rate for 935 items, aimed at promoting high-level technological self-reliance and modern industrial system construction [5].
中国关税新增两类机器人税目
Di Yi Cai Jing· 2025-12-30 06:40
Core Insights - The Chinese government has introduced new tariff categories for intelligent bionic robots and cleaning robots as part of the 2026 tariff adjustment plan, with the most favored nation (MFN) rates set at 0% for intelligent bionic robots and varying rates for cleaning robots [1][3] Group 1: New Tariff Categories - The 2026 tariff adjustment plan includes the addition of intelligent bionic robot tariff categories with an MFN rate of 0% and a general rate of 30%, along with two cleaning robot categories with MFN rates of 8% and 0%, and general rates of 130% and 30% respectively [1][3] - Intelligent bionic robots are defined as autonomous or semi-autonomous robots that closely mimic the appearance, structure, or functions of humans or animals, equipped with advanced technologies for environmental perception and interaction [2] Group 2: Rationale for Changes - The adjustments aim to support technological advancement, circular economy, and the development of the forest economy, with the total number of tariff categories now reaching 8,972 [3] - The introduction of these new tariff categories is expected to help industries and companies accurately grasp trade data and assess overseas market trends [3]
人工智能专家凌海滨全职加入西湖大学,创立智能计算与应用实验室
生物世界· 2025-12-30 00:18
Core Viewpoint - Westlake University has announced the full-time appointment of Haibin Ling, an Empire Innovation Professor from Stony Brook University, to lead the establishment of the Intelligent Computing and Applications Laboratory, focusing on artificial intelligence and interdisciplinary research [2]. Group 1: Appointment and Research Focus - Haibin Ling will serve as a chair professor at Westlake University, leading research in areas such as computer vision, multimodal AI, augmented reality, AI for Science, and quantum information [2]. - The newly established laboratory aims to advance research and applications in artificial intelligence [2]. Group 2: Academic Background and Career - Haibin Ling, born in 1974 in Anshun, Guizhou, holds a Bachelor's and Master's degree from Peking University and a Ph.D. from the University of Maryland [4][5]. - His career includes positions at Microsoft Research Asia, UCLA, Siemens Research, and Temple University, before joining Stony Brook University in 2019 [5]. - Ling's research has significantly impacted the field of computer vision, particularly in dynamic object tracking, which is crucial for various applications including security monitoring and medical imaging [5]. Group 3: Achievements and Contributions - Ling has received multiple awards, including the ACM UIST Best Student Paper Award (2003), the NSF CAREER Award (2014), and the IEEE VR Best Journal Paper Award (2021) [6]. - He has served on editorial boards for several prestigious journals and has been involved in leading roles at top AI conferences [6].