OpenAI大佬爆料:本科生靠一篇博客杀进OpenAI,没博士,0篇论文
3 6 Ke·2026-02-25 11:14

Core Insights - The article highlights the unconventional path of Keller Jordan, who secured a position at OpenAI without a PhD or traditional research background, emphasizing the importance of open-source projects and practical contributions in the AI field [1][3]. Group 1: Keller Jordan's Journey - Keller Jordan graduated from UCSD in 2020 with dual degrees in mathematics and computer science, without having published any papers [5]. - His first job was at an AI content moderation startup, where he began to explore improvements in existing research [5]. - After reaching out to Google researcher Behnam for guidance, he collaborated on a project that led to a paper presented at ICLR [8]. Group 2: Contributions to AI Research - Keller's work on "NanoGPT speed run" significantly improved the training efficiency of Transformer models, achieving a 3.8 times increase in token efficiency, reducing the required tokens from approximately 10 billion to 2.7 billion [9][10]. - The design of the speed run was innovative, allowing for low-cost experimentation, with a single attempt costing as little as $8, making it accessible for individual researchers [12][13]. Group 3: Development of Muon - Keller developed an optimizer named Muon, which optimized the hidden layers of neural networks, achieving record training speeds for NanoGPT and CIFAR-10 [14][19]. - Muon demonstrated superior performance compared to the widely used AdamW optimizer, particularly as model sizes increase, indicating a potential breakthrough in AI model training [19]. Group 4: Entry into OpenAI - Keller officially joined OpenAI in December 2024, following the success of Muon in the developer community [20]. - He expressed a preference for continuing his research over publishing a paper, criticizing the prevalence of low-quality optimization papers in the field [21]. Group 5: Other Success Stories - The article also mentions other individuals who have successfully transitioned into major AI companies without traditional academic credentials, such as Sholto Douglas at Google DeepMind and Andy Jones at Anthropic, highlighting a trend of talent recognition based on practical contributions rather than formal publications [23][25][28].

OpenAI大佬爆料:本科生靠一篇博客杀进OpenAI,没博士,0篇论文 - Reportify