计算机

Search documents
实验室10篇论文被ICCV 2025录用
自动驾驶之心· 2025-07-02 13:54
Core Insights - The article discusses the acceptance of 10 papers from a laboratory at the 20th ICCV International Conference on Computer Vision, highlighting advancements in 3D vision and related technologies [25]. Paper Summaries Paper 1: Domain-aware Category-level Geometry Learning Segmentation for 3D Point Clouds - This paper addresses domain generalization in 3D scene segmentation, proposing a framework that couples geometric embedding with semantic learning to enhance model generalization [1]. Paper 2: Hierarchical Variational Test-Time Prompt Generation for Zero-Shot Generalization - The authors introduce a hierarchical variational method for dynamic prompt generation during inference, significantly improving the zero-shot generalization capabilities of visual language models [3]. Paper 3: Knowledge-Guided Part Segmentation - A new framework is proposed that utilizes structural knowledge to enhance the segmentation of fine-grained object parts, improving understanding of complex structures [5][6]. Paper 4: TopicGeo: An Efficient Unified Framework for Geolocation - TopicGeo presents a unified framework for geolocation that improves computational efficiency and accuracy by directly matching query images with reference images [9]. Paper 5: Vision-Language Interactive Relation Mining for Open-Vocabulary Scene Graph Generation - This paper explores a model that enhances the understanding of relationships in open-vocabulary scene graph generation through multimodal interaction learning [11]. Paper 6: VGMamba: Attribute-to-Location Clue Reasoning for Quantity-Agnostic 3D Visual Grounding - The authors propose a mechanism that combines attribute and spatial information to improve the accuracy of 3D visual grounding tasks [13]. Paper 7: Meta-Learning Dynamic Center Distance: Hard Sample Mining for Learning with Noisy Labels - A new metric called Dynamic Center Distance is introduced to enhance the learning process in the presence of noisy labels by focusing on hard samples [15]. Paper 8: Learning Separable Fine-Grained Representation via Dendrogram Construction from Coarse Labels for Fine-grained Visual Recognition - The paper presents a method for learning fine-grained representations from coarse labels without predefined category numbers, enhancing adaptability to dynamic semantic structures [17]. Paper 9: Category-Specific Selective Feature Enhancement for Long-Tailed Multi-Label Image Classification - This research addresses the issue of label imbalance in multi-label image classification by enhancing feature sensitivity for underrepresented categories [19]. Paper 10: Partially Matching Submap Helps: Uncertainty Modeling and Propagation for Text to Point Cloud Localization - The authors redefine the task of text to point cloud localization by allowing partial spatial matches, improving the model's ability to handle real-world ambiguities [21].
极智嘉 全栈技术筑壁垒掘金仓储自动化黄金赛道
Sou Hu Cai Jing· 2025-07-02 09:30
Company Overview - Beijing Geek+ Technology Co., Ltd. (referred to as "Geek+") is launching its IPO from today until July 4, 2025, with plans to list on the Hong Kong Stock Exchange on July 9, 2025 [2] - The company plans to issue 140,353,000 H-shares, raising approximately HKD 2.358 billion at an issue price of HKD 16.80 per share [2] - Geek+ has attracted four cornerstone investors, collectively subscribing USD 91.3 million (approximately HKD 716.7 million) [2] Technology and Innovation - Geek+ has developed a comprehensive technology stack covering hardware, software, and algorithms, creating a significant technological moat [3] - The company introduced laser-vision fusion SLAM technology, achieving an average positioning accuracy of less than ±10mm, leading the industry [4] - The Hyper+ core algorithm platform is one of the most advanced in the AMR market, optimizing resource allocation and maximizing cost efficiency [5] - Geek+ has created the world's first universal robot technology platform, Robot Matrix, enhancing R&D efficiency by over 30% [6][7] - The company has filed over 2,000 patents by 2024, with its PopPick solution leading globally in compatibility and throughput efficiency [8] Market Landscape - The global AMR market is projected to grow from CNY 38.7 billion in 2024 to CNY 162.1 billion by 2029, with a CAGR of 33.1% [10] - The penetration rate of AMR in warehouse automation is expected to rise from 4.4% in 2020 to 20.2% in 2029 [10] - Key growth drivers include the booming e-commerce sector, increasing demand for logistics automation, and the need for manufacturing efficiency [13] - AMR robots have diverse applications across various industries, including logistics, manufacturing, healthcare, and food service [14] Competitive Advantages - Geek+ has established a global service network and collaborates with partners like Bosch Rexroth and Mujin, creating a complete ecosystem from hardware to systems [18] - The company has received strategic investments from firms like Warburg Pincus, Ant Group, and Intel, with net proceeds of approximately HKD 2.206 billion allocated for R&D and market expansion [19] - Geek+ maintains a leading market share in the AMR sector, with a revenue increase from CNY 790 million in 2021 to CNY 2.41 billion in 2024, reflecting a CAGR of 45% [23] - The company has a customer repurchase rate of 74.6%, indicating strong client retention and satisfaction [24] Industry Outlook - The intelligent logistics automation industry is experiencing rapid growth, with favorable policies supporting technological innovation and application promotion [15] - Advances in AI, machine learning, computer vision, and IoT are enhancing AMR robot performance and functionality [16] - The global labor shortage and the decline of China's demographic dividend are driving the shift towards automation, with Geek+ solutions reducing labor needs by 65% [17]
在中国,Model Y的好日子到头了
3 6 Ke· 2025-07-02 02:12
上周,小米用一场暴风骤雨,震撼了整个中国汽车行业。25.35万元起售的YU7,3分钟大定20万辆,1小时大定28.9万辆,18小时锁单24万辆…… 夸张,实在是太夸张。 殊不知,刚刚过去周末,前往了位于上海长宁区荟聚商场的小米门店,前来了解YU7的人流量只能用恐怖形容。就这么说,你见过哪个汽车品牌,有排队 叫号等待试驾的场景吗? 至于背后的原因,肯定是多维度的。 而我,也与几位消费者进行了交流,发现他们的身份五花八门,有大家口中所谓的"年轻米粉",有准备入手一辆纯电SUV被其性价比吸引的潜客,更有看 起来至少50多岁从未接触过新能源产品的传统燃油车车主。 总之,YU7肯定是破圈了。 不吹不黑,目前阻止它进一步位于大盘收割的障碍,看似只剩产能受限。打开小米汽车官方APP能够发现,即刻下定终端选择比例最多的"标准版",交付 周期已经达到56-59周,并且还有继续延长的趋势。换言之,需要足足等待一年多。 如此之长的时间,肯定会劝退部分用户。 但无论怎样,随着制造端的不断提速,铁定爆款的YU7都正在以一己之力,改变着中型纯电SUV市场长久以来略显固化的格局。身处其中的所有选手,必 然都会受到一定程度的影响。 尤其是今 ...
中国首个脑机接口产业集聚区启动 智能人机交互加速落地应用
Shang Hai Zheng Quan Bao· 2025-07-01 23:46
7月1日,脑机接口概念延续强势,创新医疗2连板,塞力医疗涨停,翔宇医疗涨超5%,爱朋医疗、三博 脑科、伟思医疗等跟涨。 消息面上,中国首个脑机接口未来产业集聚区"脑智天地"在上海启动建设。另外,马斯克在脑机接口 Neuralink团队的发布会上表示,目前全球已经有七人植入了设备,通过"心灵感应"产品,他们重获跟物 理世界交互的能力,可以用大脑玩马里奥赛车、使命召唤,甚至可以控制机械臂写字。预计2026年让盲 人重获光明,2028年计划实现更广泛的人机融合应用。 国家药监局于6月20日审议通过《关于优化全生命周期监管支持高端医疗器械创新发展的举措》。其中 提到,加快发布医用外骨骼机器人、放射性核素成像设备等相关标准;积极筹建医用机器人、人工智能 医疗器械标准化技术委员会;加强增材制造用医用材料、脑机接口柔性电极、基因工程合成生物材料等 新型生物材料标准化研究。 东吴证券表示,侵入式技术的持续突破将推动消费和医疗康复市场提升对脑机接口技术的认知,非侵入 式脑机接口产品的商业化落地也有望加速。此外,前沿探索如生物计算机等领域的发展,也为脑机接口 技术的未来发展提供了更多可能性。浙商证券表示,脑科学如能取得重大进展, ...
重磅直播!清华&博世开源SOTA性能纯血VLA:Impromptu-VLA告别双系统~
自动驾驶之心· 2025-07-01 12:58
论文链接:https://arxiv.org/abs/2505.23757v1 对于想入门的同学,建议扎实深度学习和计算机视觉基础,逐步了解自动驾驶各模块。多阅读前沿论文,并通过 开源项目动手实践,熟悉数据处理和模型训练流程。希望能为大家带来启发,期待与大家交流。 数据集pipeline: >>直播和内容获取转到 → 自动驾驶之心知识星球 项目主页:https://github.com/ahydchh/Impromptu-VLA 当前自动驾驶系统在城市和高速公路等结构化环境中取得了显著进展,但面对乡村小路、临时施工区、非标准交 通规则以及恶劣路况等"非结构化场景"时,其鲁棒性和安全性仍面临严峻挑战。现有大规模自动驾驶数据集主要 侧重于常规交通状况 ,导致在这些复杂多变的非结构化环境中缺乏专门的、大规模且精细标注的数据。为了弥 补这一关键空白,清华AIR联合博世中央研究院 提出并构建了 Impromptu VLA 框架,旨在提供一个开放权重和 开放数据的驾驶视觉-语言-动作模型。Impromptu VLA 是一个完全端到端、无中间感知表征的"纯血VLA"系统, 其从驾驶视频片段中直接提取多模态特征,并生成自然语 ...
暑假打打比赛!PRCV 2025空间智能与具身智能视觉感知挑战赛正式启动~
自动驾驶之心· 2025-06-30 12:51
空间智能与具身智能视觉感知挑战赛 竞赛目的与意义 视觉感知是实现空间智能与具身智能的关键支撑技术,近年来在自动驾驶、智慧城市、机器人等场景中展现出 广泛应用前景。特别是强化学习等技术在智能体感知与决策中的深度融合,正在成为推动该领域突破的重要力 量。 • 推动高效、高质量的空间智能和具身智能技术的研究。 • 探索强化学习、计算机视觉、图形学等前沿方法的创新。 • 促进神经渲染、场景优化和机器人抓取等方向的应用。 竞赛组织方 组织者 :彭君然、陈磊、唐彦嵩、刘健、许修为、尹航、孙浩文、卫浩宇、刘旭阳、赵鑫 指导专家 :张兆翔、鲁继文、殷绪成 组织单位 :北京科技大学、清华大学、中国科学院自动化研究所、北京九章云极科技有限公司、塞弗卓盈 (上海)科技有限公司 赞助商及技术支持单位 :北京九章云极科技有限公司 媒体支持单位 :塞弗卓盈(上海)科技有限公司 联系电话 :13051937326 联系邮箱 : prcvcompetition@126.com 微信交流群 :报名邮件回复确定 参赛者要求 : 按自愿报名的原则,参赛团队和成员的组成可以为: 报名方式 以个人或团队方式均可通过邮件方式报名参赛,每个参赛队伍人员不 ...
无需训练,即插即用,2倍GPU端到端推理加速——视频扩散模型加速方法DraftAttention
机器之心· 2025-06-28 04:35
本文第一作者为美国东北大学博士生沈轩,研究方向为高效人工智能,致力于在 GPU、移动端、FPGA 和 ASIC 等多种硬件平台上实现大模型的高效部署与加 速。第二作者为香港中文大学的韩晨夏,研究方向聚焦于计算机体系结构与 AI 系统的高效化设计。 在高质量视频生成任务中,扩散模型(Diffusion Models)已经成为主流。然而,随着视频长度和分辨率的提升,Diffusion Transformer(DiT)模型中的注意力机制 计算量急剧增加,成为推理效率的最大瓶颈。这是因为在视频生成中,DiT 通常使用 3D 全局注意力来建模时空一致性, 虽然效果出色,但计算量会随着 token 数 量呈平方增长 ,带来了巨大的计算负担。在 HunyuanVideo 等视频生成模型中,注意力模块计算时间占比超过 80%,生成仅 8 秒的 720p 视频甚至需要接近一小时 的时间。因此,提升视频生成模型的生成速度成为了迫切的需求。 现有视频生成加速方法,如 Sparse VideoGen(https://arxiv.org/abs/2502.01776)和 AdaSpa(https://arxiv.org/abs/250 ...
Z世代就业市场极度内卷,摩根大通CEO指点迷津
财富FORTUNE· 2025-06-27 11:53
Core Viewpoint - The job market for Generation Z is filled with contradictory signals, with entry-level positions disappearing while CEOs complain about talent shortages. The key to job security lies in acquiring the right skills [1][2]. Skills Gap and Workforce Needs - Companies are facing a skills gap in specific areas and urgently need young talent to fill these roles [2]. - Essential skills include networking, coding, programming, financial management, and project management [3][4]. Education and Training - Many schools are failing to provide adequate training in these critical areas, which hampers the development of the next generation of programmers and project managers [5]. - Education should focus on whether students can secure jobs after graduation, rather than solely on college graduation rates [6]. Importance of Computer Science Education - There is a strong belief in the necessity of students learning programming, especially in light of advancements in generative AI technologies [8]. - Over 250 CEOs, including leaders from Microsoft and Airbnb, signed a letter advocating for all students to receive education in computer science and artificial intelligence [8][9]. - A study from the University of Maryland found that students who take computer science courses in high school earn an average of 8% more in their first job [10]. Soft Skills and Character - Generation Z often struggles with workplace readiness, particularly in areas like professionalism, organizational skills, and communication [11]. - Companies prioritize character over technical expertise when hiring, emphasizing the importance of being smart, ethical, and having good character [13][14].
AI研究必备!施普林格·自然AI资源与服务指南
机器人大讲堂· 2025-06-26 08:32
过去几年来,人工智能( AI)风潮席卷全球,以ChatGPT为代表的生成式AI(GenAI)激发了各行各业对 这个领域的极大热情,并对众多行业产生了深远影响, 科学界的研究人员也在积极探索各种 AI工具来提高 自己的科研效率 。 与此同时,相关领域的科研成果大幅增加, 对 AI内容的需求也越来越大 。 作为全球领先的科研出版机构,施普林格 ·自然也积极拥抱新技术和新趋势, 出版 AI领域扎实和有深刻见 解的科研成果 ,推动思想和信息的全球交流。同时,其以科技赋能科研为导向, 探索多种方法开发 AI工 具和服务 ,以可持续且符合伦理的方式发掘并发挥 AI的潜力,服务广泛的学术界和产业界。 施普林格 ·自然拥有哪些AI资源和服务呢?本篇文章将为大家进行汇总,快来一睹为快吧! ▍ Springer 人工智能图书合集 该合集为施普林格・自然于 2025年推出的全新图书合集,内容涵盖从预测人类行为的机器学习算法,到深 远影响医疗保健等行业的神经网络研究。合集收录了相关领域丰富的资源且图书类型多样。专著和精选会 议论文集提供大量有关人工智能及其应用的主要研究成果,教科书和手册则可作为学生的入门读物,或为 希望全面了解这一 ...
科技赛道强势爆发,计算机ETF(159998),云计算ETF沪港深(517390)实现三连涨,本周均大涨约7%
Mei Ri Jing Ji Xin Wen· 2025-06-26 06:55
Group 1 - The technology sector is experiencing a strong surge, with cloud computing and computer stocks seeing significant increases, including companies like Zhina Compass, Changliang Technology, and Hengsheng Electronics, which have all risen over 25% in the past 10 days [1] - The Computer ETF (159998) has achieved three consecutive days of gains, with a cumulative increase of over 7% this week and five consecutive days of net inflows, including a net subscription of 172 million shares yesterday [1] - The Cloud Computing ETF (517390) saw an intraday increase of approximately 2%, with a weekly cumulative increase nearing 7%, driven by significant breakthroughs in industrial 5G terminal equipment construction and optimistic capital expenditure outlooks from leading cloud vendors [1] Group 2 - As internet giants increasingly invest in cloud computing and traditional financial institutions undergo a new round of IT reforms, the application of cloud computing in the financial sector is accelerating [2] - According to IDC, the Chinese financial cloud market is projected to reach $5.23 billion in the second half of 2024, reflecting a year-on-year growth of 11.0%, indicating a diversified market development [2] - Starting in 2025, the entry of open-source models like DeepSeek is expected to introduce new changes and opportunities in the competitive landscape of the financial cloud market, leading to a potential dual growth of "infrastructure reconstruction + intelligent application explosion" [2]