DeepMind
Search documents
X @TechCrunch
TechCrunch· 2025-08-05 14:16
DeepMind reveals Genie 3, a world model that could be the key to reaching AGI | TechCrunch https://t.co/i2UDsxXFqv ...
Google Genie 3 - The Most Advanced World Simulator Ever...
Matthew Berman· 2025-08-05 14:02
Model Overview - Google announced Genie 3, a general-purpose world model for generating diverse interactive environments [1][8] - Genie 3 allows real-time interaction with improved consistency and realism compared to Genie 2 [12] - The model generates 720p high-quality environments [3] Technical Aspects - Genie 3 considers the entire previously generated trajectory, not just the previous frame, for autoregressive generation [15] - Consistency in Genie 3 is an emergent capability resulting from training scale, not pre-programming [19] - Genie 3 generates dynamic and rich worlds frame by frame based on world description and user actions, unlike methods relying on explicit 3D representation [20] Potential Applications - World models like Genie 3 can be used for training robots and agents [9] - The technology has potential applications in creating video games, movies, and television shows [9] - Google positions world models as a key step towards AGI by providing AI agents with unlimited simulation environments for training [9][10] Comparison with Previous Models - Genie 3 demonstrates significant improvements in consistency, detail, and generation length compared to Genie 2 [22][23] - Genie 3 allows for deeper world exploration than Genie 2 [23] Interactive Features - Users can prompt events in real-time, adding elements to the scene [21] - The model demonstrates realistic interactions, such as light moving out of the way of a jet ski and reflections in mirrors [6] - The model can simulate actions like painting, with paint only being applied when the brush touches the wall [29][30]
智源大会盛况:AI领域精英共绘科技蓝图,探索智能未来新方向
Sou Hu Cai Jing· 2025-08-04 19:16
Group 1 - The Beijing Zhiyuan Conference, held in June 2025, has become a significant event in the AI field, attracting global elites and showcasing the latest academic achievements [1] - The conference featured four Turing Award winners, enhancing its academic atmosphere, and included representatives from major tech companies like Google, DeepMind, and domestic giants such as Huawei and Baidu [1] - The event serves as a bridge between theory and practice, connecting laboratories with the market [1] Group 2 - The two-day conference included nearly 20 thematic forums discussing foundational theories, application exploration, industrial innovation, and sustainable development in AI [2] - Multimodal technology and deep reasoning emerged as focal points, aiming to enhance AI's ability to process various data types and improve logical reasoning and decision-making [2] - Experts shared applications of multimodal technology in image recognition, speech recognition, and natural language processing, highlighting new possibilities for AI in sectors like intelligent customer service and healthcare [2] Group 3 - Innovative companies, such as Beijing Hongyixin Technology Development Co., actively participated in the conference, showcasing their focus on software and information services [4] - The company utilizes advanced technologies like big data, AI, and cloud computing to provide data governance solutions [4] - Researchers from Hongyixin engaged in discussions with industry elites, integrating cutting-edge ideas into their applications and solutions, thereby invigorating the company's future development [4]
专栏丨科技转型,欧洲为何“大象转身难”
Xin Hua She· 2025-08-04 14:17
Group 1 - The article highlights Europe's struggle with technological transformation, particularly in AI and electric vehicles, where it lags behind the US and Asia despite its early start in AI research [1][2] - The inertia of established industries, particularly in the automotive sector, is identified as a significant barrier to transformation, as traditional supply chains and manufacturing systems hinder the shift to electric and smart vehicles [1] - A conservative social mindset and business culture in Europe stifle innovation, with only 29% of UK companies encouraging the use of AI tools, leading to a preference for stable projects over high-risk innovations [2] Group 2 - Political instability and policy fluctuations, such as the inconsistent support for electric vehicle subsidies in the UK, create uncertainty in the market and hinder long-term planning and infrastructure development [3] - The EU plans to invest €1.3 billion in key technologies like AI from 2025 to 2027, while the UK government aims to invest £1 billion to enhance national computing power, indicating a push to accelerate technological advancement [3] - To successfully navigate the transformation, Europe needs to streamline policy mechanisms, guide capital flows, and alleviate social anxieties, while also fostering global collaboration [3]
深度|Perplexity CEO:为什么决定做Comet浏览器?我们需要自己的客户端,并控制我们自己的命运
Z Potentials· 2025-08-04 05:51
Core Viewpoint - Perplexity AI has developed the Comet browser to compete with Google and enhance user experience by integrating AI capabilities directly into the browsing process, aiming to provide a more personalized and efficient online experience [3][10][20]. Group 1: Development Motivation - The decision to create the Comet browser stemmed from the observation that most user queries are conducted through browser search bars, with Google handling approximately 15 billion queries daily, a significant portion of which comes from Chrome and Safari [3][4]. - The historical context of Google's rise, particularly through the Google Toolbar, illustrates the importance of controlling the browsing experience to avoid dependency on other platforms like Microsoft [4][7]. - The need for a dedicated client was emphasized due to challenges faced with existing browser extensions and the desire to maintain control over user data and experience [7][15]. Group 2: AI Integration and User Experience - Comet aims to leverage AI agents to perform complex tasks that traditionally require significant time and effort, such as conducting research across multiple platforms and managing schedules [8][9]. - The browser is designed to enhance user productivity by allowing AI to assist in everyday tasks, thereby providing a competitive edge for small business owners and individuals [9][19]. - The integration of AI within the browser is seen as a necessary evolution to keep pace with advancements in AI technology and user expectations [10][19]. Group 3: Competitive Landscape - Perplexity AI positions itself against Google by challenging the sustainability of Google's advertising model, suggesting that AI agents could disrupt traditional ad spending by providing more efficient alternatives for users [20][21]. - The company believes that Google's reliance on advertising revenue could be undermined as users increasingly turn to AI for decision-making, potentially reducing the effectiveness of Google AdWords [21][36]. - Perplexity AI's strategy focuses on creating a subscription-based model for AI services, which could provide a more stable revenue stream compared to ad-based models [36][39]. Group 4: Privacy and Data Management - Comet emphasizes user privacy by ensuring that data remains on the client side, with no sensitive information like passwords or credit card details being stored on external servers [15][40]. - The company advocates for a zero-retention policy, allowing users to control their data and delete any information they do not wish to retain [40][41]. - This approach contrasts with other AI services that may store user data on their servers, highlighting a commitment to user security and privacy [15][40]. Group 5: Future Vision and AI's Societal Impact - The long-term vision for Comet includes the potential for AI to filter out low-quality content and enhance the browsing experience by providing valuable insights and summaries [22][23]. - The company acknowledges the societal implications of AI, particularly the potential for job displacement, while also emphasizing the need for individuals to adapt and leverage AI to remain competitive in the workforce [43][44]. - Perplexity AI aims to empower users by providing tools that enhance productivity and free up time for personal pursuits, thereby redefining the relationship between humans and technology [43][44].
思辨会 | 思辨八方,智启未来——2025世界人工智能大会思辨会综述
Guan Cha Zhe Wang· 2025-08-03 13:30
Group 1: AI Development and Trends - The 2025 World Artificial Intelligence Conference (WAIC 2025) showcased a variety of discussions on the future of AI, emphasizing a shift from traditional conference formats to a "question-driven, deep dialogue" approach [1] - AI is breaking down traditional disciplinary barriers, particularly in fields like quantum physics, materials science, and biomedicine, leading to new research paradigms [3][4] - The integration of embodied intelligence and reinforcement learning is creating a new form of AI that closely resembles human intelligence, enabling real-world applications such as autonomous robots and self-driving cars [7][8] Group 2: AI in Life Sciences - AI is transforming life sciences by covering the entire research process, from pathology studies to molecular analysis, exemplified by systems like DeepMind's GNoME [5] - The development of digital twin brains is reshaping the understanding of the human brain, allowing for simulations of brain activity and predictions of neurological diseases [6] Group 3: AI Safety and Ethical Considerations - The rise of intelligent agents raises security concerns, with experts highlighting the need for a comprehensive protection system from design to deployment to ensure these agents are reliable partners [2] - Ethical considerations are paramount as technologies like digital twin brains challenge the boundaries of "thought privacy" and human consciousness [6][9]
AI教父Hinton,重新能坐下了
Hu Xiu· 2025-08-03 04:53
Group 1 - Geoffrey Hinton, the AI pioneer, recently sat down comfortably in Shanghai, marking a significant moment in his life after nearly 18 years of discomfort that prevented him from sitting for extended periods [1][6][30] - Hinton's journey in AI began in 1972 when he chose to pursue neural networks, a path that was largely dismissed by his peers at the time [12][20] - His persistence in the field led to breakthroughs in deep learning, particularly during the ImageNet competition in 2012, where his team achieved a remarkable error rate of 15.3% [30][31][32] Group 2 - Hinton's contributions to AI were recognized with the Turing Award in 2019, which he received while standing, reflecting his long-standing discomfort with sitting [59][63] - Following his resignation from Google in May 2023, Hinton expressed concerns about the risks associated with AI, stating that he regretted his role in its development [67][68] - In recent interviews, Hinton has been able to sit for longer periods, indicating a potential improvement in his health, and he has been vocal about the dangers of AI, suggesting a 10%-20% chance of human extinction due to AI in the next 30 years [70][76]
X @Demis Hassabis
Demis Hassabis· 2025-08-02 00:28
Model Capabilities - Gemini 2.5 Deep Think demonstrates advanced capabilities in fusing ideas across research papers, exceeding previous levels [1] - The model's capabilities necessitate careful evaluation [1]
X @Demis Hassabis
Demis Hassabis· 2025-08-01 17:07
😂 - very much looking forward to seeing what my mathematician friends will do with the fuller version too!Sundar Pichai (@sundarpichai):We’re bringing a version of Deep Think that achieved gold-medal status at IMO to Ultra subscribers in the @Geminiapp (+ the official version is now in the hands of mathematicians).Toggle it on when reasoning through complex scientific literature, tackling a coding problem that https://t.co/OyFSGsQSgJ ...
MuJoCo教程来啦!从0基础到强化学习,再到sim2real
具身智能之心· 2025-08-01 16:02
Core Viewpoint - The article discusses the unprecedented advancements in AI, particularly in embodied intelligence, which is transforming the relationship between humans and machines. This technology is poised to revolutionize various industries, including manufacturing, healthcare, and space exploration [1][3]. Group 1: Embodied Intelligence - Embodied intelligence is characterized by machines that can understand language commands, navigate complex environments, and make intelligent decisions in real-time. This technology is no longer a concept from science fiction but is rapidly becoming a reality [1]. - Major tech companies like Tesla, Boston Dynamics, OpenAI, and Google are competing in the field of embodied intelligence, focusing on creating systems that not only have a "brain" but also a "body" capable of interacting with the physical world [1][3]. Group 2: Technical Challenges - Achieving true embodied intelligence presents significant technical challenges, including the need for advanced algorithms and a deep understanding of physical simulation, robot control, and perception fusion [3][4]. - MuJoCo (Multi-Joint dynamics with Contact) is highlighted as a critical technology in this field, serving as a high-fidelity simulation engine that bridges the virtual and real worlds [4][6]. Group 3: Advantages of MuJoCo - MuJoCo allows researchers to create realistic virtual robots and environments, enabling millions of trials and learning experiences without risking expensive hardware. This significantly accelerates the learning process, as simulations can run hundreds of times faster than real-time [6][8]. - The technology supports high parallelism, allowing thousands of simulation instances to run simultaneously, and provides a variety of sensor models, ensuring robust and precise simulations [6][8]. Group 4: Educational Opportunities - A comprehensive MuJoCo development course has been developed, focusing on practical applications and theoretical foundations, covering topics from physical simulation principles to deep reinforcement learning [9][11]. - The course is structured into six modules, each with specific learning objectives and practical projects, ensuring a solid grasp of embodied intelligence technologies [15][17]. Group 5: Project-Based Learning - The course includes six progressively challenging projects, such as building a smart robotic arm, implementing vision-guided grasping systems, and developing multi-robot collaboration systems, which are designed to provide hands-on experience [19][27]. - Each project is accompanied by detailed documentation and code references, facilitating a deep understanding of the underlying technologies and their applications in real-world scenarios [30][32]. Group 6: Target Audience and Outcomes - The course is suitable for individuals with programming or algorithm backgrounds looking to enter the field of embodied robotics, as well as students and professionals interested in enhancing their practical skills [32][33]. - Upon completion, participants will possess a complete skill set in embodied intelligence, including technical, engineering, and innovative capabilities, making them well-equipped for roles in this rapidly evolving industry [32][33].