ViT

Search documents
「CV 铁三角」落定Meta,视觉 AI 如何向多模态演进?
机器之心· 2025-07-19 05:49
Group 1 - The core viewpoint of the article discusses the strategic hiring by Meta, focusing on the "CV Triangle" and its implications for the evolution of visual AI towards multimodal capabilities [4][5][6] - The "CV Triangle" consists of three key researchers from OpenAI Zurich, previously from GoogleBrain, whose work has significantly influenced the development of modern multimodal AI frameworks [5][6] - The article outlines five representative works led by the "CV Triangle," including S4L, BiT, ViT, MLP-Mixer, and PALI, which collectively contribute to the advancement of visual AI and its integration with other modalities [5][6][7] Group 2 - The article highlights the milestones necessary for the transition from visual AI to multimodal AI, emphasizing the importance of continuous research and development in this field [8]
刚刚,OpenAI苏黎世办公室被Meta一锅端,三名ViT作者被挖走
机器之心· 2025-06-26 04:35
Core Viewpoint - Meta has aggressively recruited top AI researchers from OpenAI, indicating a strategic move to regain its competitive edge in the AI sector [3][6][9]. Group 1: Recruitment and Strategy - Meta CEO Mark Zuckerberg has successfully poached three researchers, Lucas Beyer, Alexander Kolesnikov, and Xiaohua Zhai, from OpenAI's Zurich office [4][5]. - The recruitment is part of a broader strategy by Zuckerberg, who is personally reaching out to hundreds of top talents in the AI field, offering lucrative compensation packages, including offers worth up to $100 million [6][7]. - Meta's recent investment of $14 billion in AI startup Scale and the hiring of its CEO, Alexandr Wang, to lead a new superintelligence team further emphasizes its commitment to AI development [7]. Group 2: Responses from OpenAI - OpenAI CEO Sam Altman has downplayed concerns regarding the talent exodus, suggesting that the best talents are not leaving for Meta [9]. - In response to the recruitment efforts by Meta, OpenAI is also increasing funding and development opportunities for its researchers to retain talent [9]. Group 3: Background of Key Researchers - Xiaohua Zhai has a strong academic background, holding a PhD in Computer Science from Peking University and has been a significant contributor to multimodal research at Google DeepMind before joining OpenAI [12][14][15]. - Lucas Beyer, who has also been influential in AI research, completed his studies at RWTH Aachen University and has worked at Google Brain and DeepMind [18][20]. - Alexander Kolesnikov, with a PhD in machine learning and computer vision, has a notable research history at Google Brain and DeepMind before joining OpenAI [24][26].
对话香港大学马毅:“如果相信只靠 Scaling Laws 就能实现 AGI,你该改行了”
晚点LatePost· 2024-06-04 10:05
文丨程曼祺 编辑丨宋玮 黄俊杰 当大部分人都相信一件事或趋势时,不同意的人可以选择沉默,也可以大声说出来。前者是少数派中的多数派,后者少数派中的少数派。 马毅就是一个少数派中的少数派。 自 2000 年从伯克利大学博士毕业以来,马毅先后任职于伊利诺伊大学香槟分校(UIUC)、微软亚研院、上海科技大学、伯克利大学和香港大 学,现担任香港大学计算机系主任和数据科学研究院院长。 他最早将 "压缩感知" 技术应用于计算机视觉领域,在人脸识别、物体分类等任务上产生了巨大影响。 知名 AI 学者李飞飞是马毅在 UIUC 时参与招聘的第一个华人助理教授,ResNet 一作何恺明是马毅在微软亚研院负责视觉组时招的第一个新员 工。 少数派中的少数派。 马毅公开表达时直言不讳。AI 业界惊叹于 GPT 等大模型的威力,担心 AI 可能毁灭人类,如图灵奖得主杰弗里·辛顿(Geoffrey Hinton) 和 OpenAI 发起者之一伊隆·马斯克(Elon Musk)就多次将 AI 类比为原子弹,呼吁监管。 "说现在的 AI 危险的人,要么是无知,要么是别有目的。" 马毅在 twitter 上回应 AI 威胁论。 强烈的观点来自他对 ...