3D重建

Search documents
刚刚,CVPR 2025奖项出炉:牛津&Meta博士生王建元获最佳论文,谢赛宁摘年轻研究者奖
机器之心· 2025-06-13 15:45
机器之心报道 机器之心编辑部 刚刚,在美国田纳西州纳什维尔举办的 CVPR 2025 公布了最佳论文等奖项。 今年共有 14 篇论文入围最佳论文评选,最终 5 篇论文摘得奖项 ,包括 1 篇最佳论文 、 4 篇最佳论文荣誉提名 。此外,大会还颁发了 1 篇最佳学生论文 、 1 篇最 佳学生论文荣誉提名 。 根据会方统计,今年大会共收到 4 万多名作者提交的 13008 份论文。相比去年(11532),今年的投稿数量增长了 13%,最终有 2872 篇论文被接收,整体接收率 约为 22.1%。在接收论文中,Oral 的数量是 96(3.3%),Highlights 的数量是 387(13.7%)。 计算机视觉技术的火热给大会审稿带来了空前的压力。本届投稿作者数量、论文评审者和领域主席(AC)数量均创下新高。 今年前来现场参会的学者也超过 9000 人,他们来自 70 余个国家和地区。 CVPR 官方公布了各个细分领域的论文接收情况,如下图所示。可以看到,图像与视频生成领域今年度的论文接收数量最多,而接收率最高的领域则是基于多视角 和传感器的 3D 以及基于单图像的 3D。 此次,最佳论文奖委员会成员中有 AI ...
美图公司AI视觉领域竞争力升级:七项图像编辑成果出炉
Zheng Quan Ri Bao· 2025-04-09 08:40
Core Insights - Meitu's MT Lab has achieved significant recognition with five research outcomes selected for the prestigious CVPR 2025 conference, which received over 13,000 submissions and has a low acceptance rate of 22.1% [2] - The lab also had two projects accepted at the AAAI 2025 conference, which had an acceptance rate of 23.4% from 12,957 submissions [2] - The seven research outcomes focus on image editing, including three generative AI technologies, three segmentation technologies, and one 3D reconstruction technology [2] Generative AI Technologies - GlyphMastero has been implemented in Meitu's app Meitu Xiuxiu, providing users with a seamless text modification experience [3] - MTADiffusion is integrated into Meitu's AI material generator WHEE, allowing for efficient image editing with simple commands [3] - StyO is utilized in Meitu Xiuxiu's AI creative and beauty camera features, enabling users to explore different dimensions easily [4] Segmentation and 3D Reconstruction Technologies - The segmentation breakthroughs include interactive segmentation and cutout technologies, which are applied in e-commerce design, image editing, and portrait beautification [4] - EVPGS represents advancements in 3D reconstruction, with increasing demand in new perspective generation, augmented reality (AR), 3D content generation, and virtual digital humans [4] Industry Position and Future Potential - Meitu's long-term investment in AI capabilities has allowed the company to integrate cutting-edge technologies into practical applications, enhancing its competitive edge in the core visual field [4] - The continuous iteration of product capabilities has led to increased user engagement and willingness to pay, indicating promising growth potential and expansion opportunities for the company [4]
深度|具身合成数据的路线之争,谁将率先走出困境?
Z Potentials· 2025-04-08 12:30
" 没有数据,就创造数据。 "NVIDIA Cosmos World Foundation Models, CES 2025 NVIDIA Cosmos World Foundation Models, CES 2025 摘要 本文主要描述了具身合成数据两条主要技术路线之争: " 视频合成 +3D 重建 " or " 端到端 3D 生成 " 。参考自动驾驶的成功经验,前者模态转换链路过长 导致误差累积, ' 直接合成 3D 数据 ' 理论上有信息效率优势,但需要克服 " 常识欠缺 " 等挑战。 眼下,机器人流行视频中高难度动作(空翻、跳舞、格斗等)主要依靠 遥控 / 预设编程完成的。 机器人 逐渐完善了 自身运动控制能力 ,然而对外环境感 知、推理能力有待完善。 数据是 AI 时代的石油。具身智能的突破高度依赖于数据驱动的训练。由于现实数据采集成本高,合成数据被推上了前台。它不只是 " 虚拟的替代品 " ,更 可能是具身智能迈向通用能力的关键推动力。英伟达在 CES 2025 指出 " 尚无互联网规模的机器人数据 " ,自动驾驶已具备城市级仿真,但家庭等复杂室内 环境缺乏 3D 合成平台。 为解决 " 常识欠 ...