ResNet

Search documents
任少卿加入中科大......
自动驾驶之心· 2025-09-20 05:35
参考 | 量子位 点击下方 卡片 ,关注" 自动驾驶之心 "公众号 戳我-> 领取 自动驾驶近30个 方向 学习 路线 >>自动驾驶前沿信息获取 → 自动驾驶之心知识星球 本文只做学术分享,如有侵权,联系删文 任少卿去中科大了! AI大神任少卿开始在母校中国科学技术大学,开班招生了。 任少卿,曾任Momenta联合创始人、蔚来汽车副总裁,07级中科大本硕博(微软亚洲研究院联合培养),ResNet和Faster R-CNN作者。学术高被引超44 万,是全球中国籍学者高被引第一。ResNet也是21世纪全球最高被引论文。获未来科学大奖-数学与计算机科学奖。 招生方向为AGI、世界模型、具身智能、AI4S等。 硕士、博士生都在招。有推免资格的学生,下周一(22日)开启紧急面试。 更多内容 自动驾驶产业和学术最新咨询,欢迎加入自动驾驶之心知识星球,国内最大的自驾社区平台。 ...
任少卿在中科大招生了!硕博都可,推免学生下周一紧急面试
量子位· 2025-09-20 05:12
Core Viewpoint - Ren Shaoqing, a prominent figure in AI and computer vision, is starting a recruitment program at his alma mater, the University of Science and Technology of China, focusing on advanced topics in AI such as AGI, world models, embodied intelligence, and AI for Science [1][2]. Group 1: Recruitment Details - The recruitment is open for both master's and doctoral students, with emergency interviews starting on the upcoming Monday for students with recommendation qualifications [3]. - Interested students can send their resumes to Ren Shaoqing's email for inquiries regarding the application process and interview details [16]. Group 2: Background of Ren Shaoqing - Ren Shaoqing is an expert in computer vision and autonomous driving, having graduated from the University of Science and Technology of China and obtained a joint PhD with Microsoft Research Asia [4][5]. - He has been recognized as one of the most influential scholars in AI, ranking 10th in the AI 2000 list, and received the Future Science Prize in Mathematics and Computer Science in 2023 [6]. Group 3: Contributions to AI - Ren is a co-author of ResNet, a groundbreaking work in deep learning that addresses the vanishing gradient problem, significantly impacting fields requiring high perception capabilities like computer vision and autonomous driving [7]. - ResNet has received over 290,000 citations and won the Best Paper Award at CVPR 2016 [8]. - He also contributed to Faster R-CNN, an efficient two-stage object detection algorithm that balances speed and accuracy [10]. Group 4: Role in NIO - After completing his PhD, Ren co-founded Momenta and later joined NIO, where he played a key role in developing autonomous driving algorithms and leading the smart driving R&D team [13]. - At NIO, he developed the NIO World Model (NWM), which integrates spatiotemporal cognition and generative capabilities, allowing for high-fidelity scene reconstruction and long-term scenario simulation [14][15].
科学界论文高引第一人易主,Hinton、何恺明进总榜前五!
机器人圈· 2025-08-27 09:41
Core Insights - Yoshua Bengio has become the most cited scientist in history with a total citation count of 973,655 and 698,008 citations in the last five years [1] - The ranking is based on total citation counts and recent citation indices from AD Scientific Index, which evaluates scientists across various disciplines [1] - Bengio's work on Generative Adversarial Networks (GANs) has surpassed 100,000 citations, indicating significant impact in the AI field [1] Group 1 - The second-ranked scientist is Geoffrey Hinton, with over 950,000 total citations and more than 570,000 citations in the last five years [3] - Hinton's collaboration on the AlexNet paper has received over 180,000 citations, marking a pivotal moment in deep learning for computer vision [3] - The third and fourth positions in the citation rankings are held by researchers in the medical field, highlighting the interdisciplinary nature of high-impact research [6] Group 2 - Kaiming He ranks fifth, with his paper on Deep Residual Learning for Image Recognition cited over 290,000 times, establishing a foundation for modern deep learning [6] - The paper by He is recognized as the most cited paper of the 21st century according to Nature, emphasizing its lasting influence [9] - Ilya Sutskever, another prominent figure in AI, ranks seventh with over 670,000 total citations, showcasing the strong presence of AI researchers in citation rankings [10]
超97万:Yoshua Bengio成历史被引用最高学者,何恺明进总榜前五
机器之心· 2025-08-25 06:08
Core Insights - The article highlights the prominence of AI as the hottest research direction globally, with Yoshua Bengio being the most cited scientist ever, accumulating a total citation count of 973,655 and 698,008 citations in the last five years [1][3]. Group 1: Citation Rankings - The AD Scientific Index ranks 2,626,749 scientists from 221 countries and 24,576 institutions based on total citation counts and recent citation indices [3]. - Yoshua Bengio's work on Generative Adversarial Networks (GANs) has surpassed 100,000 citations, outpacing his co-authored paper "Deep Learning," which also exceeds 100,000 citations [3][4]. - Geoffrey Hinton, a pioneer in AI, ranks second with over 950,000 total citations and more than 570,000 citations in the last five years [4][5]. Group 2: Notable Papers and Their Impact - The paper "AlexNet," co-authored by Hinton, Krizhevsky, and Sutskever, has received over 180,000 citations, marking a significant breakthrough in deep learning for computer vision [5][6]. - Kaiming He’s paper "Deep Residual Learning for Image Recognition" has over 290,000 citations, establishing ResNet as a foundational model in modern deep learning [10][11]. - The article notes that ResNet is recognized as the most cited paper of the 21st century, with citation counts ranging from 103,756 to 254,074 across various databases [11]. Group 3: Broader Implications - The high citation counts of these influential papers indicate their lasting impact on the academic community and their role in shaping future research directions in AI and related fields [17].
性能暴涨4%!CBDES MoE:MoE焕发BEV第二春,性能直接SOTA(清华&帝国理工)
自动驾驶之心· 2025-08-18 23:32
Core Viewpoint - The article discusses the CBDES MoE framework, a novel modular expert mixture architecture designed for BEV perception in autonomous driving, addressing challenges in adaptability, modeling capacity, and generalization in existing methods [2][5][48]. Group 1: Introduction and Background - The rapid development of autonomous driving technology has made 3D perception essential for building safe and reliable driving systems [5]. - Existing solutions often use fixed single backbone feature extractors, limiting adaptability to diverse driving environments [5][6]. - The MoE paradigm offers a new solution by enabling dynamic expert selection based on learned routing mechanisms, balancing computational efficiency and representational richness [6][9]. Group 2: CBDES MoE Framework - CBDES MoE integrates multiple structurally heterogeneous expert networks and employs a lightweight self-attention router (SAR) for dynamic expert path selection [3][12]. - The framework includes a multi-stage heterogeneous backbone design pool, enhancing scene adaptability and feature representation [14][17]. - The architecture allows for efficient, adaptive, and scalable 3D perception, outperforming strong single backbone baseline models in complex driving scenarios [12][14]. Group 3: Experimental Results - In experiments on the nuScenes dataset, CBDES MoE achieved a mean Average Precision (mAP) of 65.6 and a NuScenes Detection Score (NDS) of 69.8, surpassing all single expert baselines [37][39]. - The model demonstrated faster convergence and lower loss throughout training, indicating higher optimization stability and learning efficiency [39][40]. - The introduction of load balancing regularization significantly improved performance, with the mAP increasing from 63.4 to 65.6 when applied [42][46]. Group 4: Future Work and Limitations - Future research may explore patch-wise or region-aware routing for finer granularity in adaptability, as well as extending the method to multi-task scenarios [48]. - The current routing mechanism operates at the image level, which may limit its effectiveness in more complex environments [48].
刚刚,何恺明官宣新动向~
自动驾驶之心· 2025-06-26 10:41
Core Viewpoint - The article highlights the significant impact of Kaiming He joining Google DeepMind as a distinguished scientist, emphasizing his dual role in academia and industry, which is expected to accelerate the development of Artificial General Intelligence (AGI) at DeepMind [1][5][8]. Group 1: Kaiming He's Background and Achievements - Kaiming He is renowned for his contributions to computer vision and deep learning, particularly for introducing ResNet, which has fundamentally transformed deep learning [4][18]. - He has held prestigious positions, including being a research scientist at Microsoft Research Asia and Meta's FAIR, focusing on deep learning and computer vision [12][32]. - His academic credentials include a tenure as a lifelong associate professor at MIT, where he has published influential papers with over 713,370 citations [18][19]. Group 2: Impact on Google DeepMind - Kaiming He's expertise in computer vision and deep learning is expected to enhance DeepMind's capabilities, particularly in achieving AGI within the next 5-10 years, as stated by Demis Hassabis [7][8]. - His arrival is seen as a significant boost for DeepMind, potentially accelerating the development of advanced AI models [5][39]. Group 3: Research Contributions - Kaiming He has published several highly cited papers, including works on Faster R-CNN and Mask R-CNN, which are among the most referenced in their fields [21][24]. - His recent research includes innovative concepts such as fractal generative models and efficient one-step generative modeling frameworks, showcasing his continuous contribution to advancing AI technology [36][38].
刚刚,何恺明官宣入职谷歌DeepMind!
猿大侠· 2025-06-26 03:20
Core Viewpoint - Kaiming He, a prominent figure in AI and computer vision, has officially joined Google DeepMind as a distinguished scientist while retaining his position as a tenured associate professor at MIT, marking a significant boost for DeepMind's ambitions in artificial general intelligence (AGI) [2][5][6]. Group 1: Kaiming He's Background and Achievements - Kaiming He is renowned for his contributions to deep learning, particularly for developing ResNet, which has fundamentally transformed the trajectory of deep learning and serves as a cornerstone for modern AI models [5][17]. - His academic influence is substantial, with over 713,370 citations for his papers, showcasing his impact in the fields of computer vision and deep learning [17][18]. - He has received numerous prestigious awards, including the best paper awards at major conferences such as CVPR and ICCV, highlighting his significant contributions to the field [23][26]. Group 2: Implications of His Joining DeepMind - Kaiming He's expertise in computer vision and deep learning is expected to accelerate DeepMind's efforts towards achieving AGI, a goal that Demis Hassabis has indicated could be realized within the next 5-10 years [8][9]. - His recent research focuses on developing models that learn representations from complex environments, aiming to enhance human intelligence through more capable AI systems [16][17]. - The addition of Kaiming He to DeepMind is seen as a strategic advantage, potentially leading to innovative breakthroughs in AI model development [6][37].
何恺明新身份:谷歌DeepMind杰出科学家
机器之心· 2025-06-26 00:30
Core Viewpoint - The article discusses the recent news of Kaiming He joining Google as a part-time Distinguished Scientist at DeepMind, highlighting his significant contributions to the field of AI and computer vision [2][4][24]. Group 1: Kaiming He's Background and Achievements - Kaiming He achieved the highest score in the 2003 Guangdong Province college entrance examination and was admitted to Tsinghua University [8]. - He completed his PhD at the Chinese University of Hong Kong under the supervision of Xiaoguo Tang and has held positions at Microsoft Research Asia, Facebook AI Research, and MIT [9]. - His research has received multiple awards, including the best paper award at CVPR in 2009 and 2016, and he has over 710,000 citations according to Google Scholar [10][12]. Group 2: Research Contributions - Kaiming He's most notable work includes the ResNet paper published in 2016, which has been cited over 280,000 times and is considered one of the most cited papers of the 21st century [15][18]. - His research addresses the gradient propagation problem in deep networks, establishing fundamental components for modern deep learning models [18][19]. - He has also contributed to the development of the Masked Autoencoders model, which has gained popularity in the computer vision community [20]. Group 3: Future Prospects at Google - The article expresses anticipation for Kaiming He's potential contributions at Google, particularly in the area of generative modeling, as suggested by his recent research [6][24].
专访张祥雨:多模态推理和自主学习是未来的 2 个 「GPT-4」 时刻
海外独角兽· 2025-06-08 04:51
本期内容是拾象 CEO 李广密对大模型公司阶跃星辰首席科学家张祥雨的访谈。 张祥雨专注于多模态领域,他提出了 DreamLLM 多模态大模型框架,这是业内最早的图文生成理解 一体化的多模态大模型架构之一,基于这个框架,阶跃星辰发布了中国首个千亿参数原生多模态大 模型 Step-1V。此外,他的学术影响力相当突出,论文总引用量已经超过了 37 万次。 一直以来,业界都相当期待一个理解、生成一体化的多模态,但直到今天这个模型还没出现,如何 才能达到多模态领域的 GPT-4 时刻?这一期对谈中,祥雨结合自己在多模态领域的研究和实践历 程,从纯粹的技术视角下分享了自己对多模态领域关键问题的全新思考,在他看来,虽然语言模型 领域的进步极快,但多模态生成和理解的难度被低估了: • 接下来 2-3 年,多模态领域会有两个 GPT-4 时刻:多模态推理和自主学习; • o1 范式的技术本质在于激发出 Meta CoT 思维链:允许模型在关键节点反悔、重试、选择不同分 支,使推理过程从单线变为图状结构。 目录 01 研究主线: 重新回归大模型 • 多模态生成理解一体化难以实现的原因在于,语言对视觉的控制能力弱,图文对齐不精确, ...
亚裔 AI 人才的硅谷晋升之路,被一张绿卡阻断了?
3 6 Ke· 2025-04-28 11:23
Core Viewpoint - The article highlights the precarious situation faced by skilled immigrants in the U.S. tech industry, particularly in light of tightening immigration policies, as exemplified by the case of Kai Chen, a prominent AI researcher who was forced to leave the U.S. after her green card application was denied [2][4][5]. Group 1: Impact of Immigration Policies - The tightening of immigration policies under the Trump administration has created a new barrier for skilled workers in the tech industry, particularly affecting those on H1B visas [2][6][18]. - Kai Chen's experience reflects a broader trend where even highly qualified individuals with significant contributions to their companies can suddenly find themselves at risk of deportation [4][5][6]. - The article notes that over 1,000 international students have had their visas revoked, illustrating the widespread impact of these policies across various sectors [16][18]. Group 2: Demographics and Contributions of Asian Talent - Asian representation in major U.S. AI companies is significant, with Asians making up 45.7% of Google's workforce, surpassing the percentage of white employees [7][9]. - The article emphasizes that while Asian talent, particularly of Indian and Chinese descent, has been rising in the tech industry, they often face challenges in career advancement due to office politics [9][10]. - Despite these challenges, the AI sector has provided new opportunities for Asian professionals, allowing them to leverage their technical skills for career growth [10][11]. Group 3: Future Prospects for Talent - The article discusses the potential for skilled workers like Kai Chen to seek opportunities outside the U.S., as companies in Europe and China are actively recruiting top talent [19][20]. - Major Chinese tech firms are launching initiatives to attract high-end talent, indicating a shift in where skilled professionals may choose to work in the future [20][21]. - The narrative suggests that while the U.S. has historically been a magnet for talent, the current political climate may lead to a redistribution of skilled workers globally [19][22].