强化学习(Reinforcement Learning)

Search documents
小米开源首个原生端到端语音大模型;谷歌将Gemini AI引入Chrome浏览器丨AIGC日报
创业邦· 2025-09-20 04:39
Group 1 - Xiaomi has officially open-sourced its first native end-to-end voice model, Xiaomi-MiMo-Audio, which is based on an innovative pre-training architecture and over a billion hours of training data, achieving few-shot generalization based on ICL in the voice domain and observing significant "emergent" behavior during pre-training [2] - Ashish Kumar, head of Tesla's Optimus AI team, has announced his departure to join Meta as a research scientist. During his tenure at Tesla, he focused on scalable AI methods and enhanced robot dexterity through reinforcement learning [2] - Google is integrating its Gemini AI model into the Chrome browser, allowing users to request explanations of web pages, consolidate information from multiple tabs, and restore previously closed sites. This integration follows a court ruling that Google does not need to divest Chrome [2] - Tencent has launched a one-stop work platform called "混元3D Studio," aimed at 3D designers and game developers. The platform utilizes AI technology to streamline the entire 3D production process, reducing production cycles from days to minutes [2]
特斯拉Optimus AI团队负责人Ashish Kumar离职加盟Meta
Xin Lang Cai Jing· 2025-09-19 02:47
特斯拉Optimus AI团队的负责人Ashish Kumar宣布离职,加入Meta担任研究科学家一职。Ashish Kumar 于2023年7月加入特斯拉,负责Optimus项目的AI开发工作,直至2025年9月离职。在特斯拉的两年多时 间里,他领导团队专注于可扩展AI方法的研究,将传统机器人控制栈替换为强化学习(Reinforcement Learning)技术,并通过视频学习提升机器人的灵巧度(dexterity)。特斯拉Optimus人形机器人项目的负责 人为阿肖克·埃卢斯瓦米(AshokElluswamy),他自2025年6月起接任该项目负责人。 ...