Workflow
DeepMind
icon
Search documents
从“内部世界”到虚拟造物:世界模型的前世今生
Jing Ji Guan Cha Bao· 2025-08-21 08:25
Group 1 - Google DeepMind released a new model called Genie 3, which can generate interactive 3D virtual environments based on user prompts, showcasing enhanced real-time interaction capabilities compared to previous AI models [2] - Genie 3 introduces a feature called "Promptable World Events," allowing users to dynamically alter the generated environment through text commands, significantly expanding user interaction possibilities [2] - The performance of Genie 3 has sparked discussions about "World Models," which represent a potential pathway towards achieving Artificial General Intelligence (AGI) [2] Group 2 - The concept of "World Models" is inspired by the human brain's ability to create and utilize an "inner world" for predictive capabilities, allowing individuals to simulate future scenarios based on current inputs [4][5] - Historical attempts to replicate this capability in AI include early models that used feedback control theories and symbolic reasoning, evolving through the integration of statistical learning methods [6][7] - The term "World Model" was coined by Jürgen Schmidhuber in 1990, emphasizing the need for AI to understand and simulate the real world comprehensively [7] Group 3 - The implementation of World Models involves several key stages: representation learning, dynamic modeling, control and planning, and result output, each contributing to the AI's ability to simulate and interact with the environment [11][12][13][14] - World Models can significantly enhance various fields, including embodied intelligence, digital twins, education, and gaming, by allowing AI to actively engage and learn from simulated environments [15][16][17] Group 4 - The emergence of World Models has raised ethical and governance concerns, particularly regarding the potential blurring of lines between reality and virtuality, as well as the implications for user behavior and societal norms [18][19][20] - Experts in the AI field are divided on the necessity of World Models for achieving AGI, with some advocating for their importance while others suggest alternative approaches may suffice [21][22][23][24] Group 5 - The exploration of World Models represents a significant challenge to understanding cognition and the mechanisms of reality, positioning AI as a participant in the age-old quest to comprehend the workings of the world [25]
中英青年学者共探AI与科技成果转化新路径
Huan Qiu Wang· 2025-08-21 01:49
【环球网科技综合报道】"2010年,三个英国年轻人创立了人工智能研究机构DeepMind,成为人工智能 时代无法忽略的一家公司。这也是英国科研和创新实力的一个缩影。"英国皇家工程院院士、英国杰出 青年人才协会创始人顾赛表示,"此番来到上海,看到国内创新、创业的氛围,我们感到十分激动。" 同时,米磊还介绍了中科创星(上海)高质量孵化器正在实践的"超前孵化"和"深度孵化"模式。其 中,"超前孵化"从原理和论文阶段介入,支持科学家开展原理设计与概念验证;"深度孵化"则参与团队 组建、产品研发和商业运营,实现从PI(学科带头人)-IDEA-IP-IPO的全过程孵化。 8月20日,"好望角科学沙龙"中英科技成果转化专场活动在上海举行,来自牛津大学、帝国理工学院、 伦敦政治经济学院等英国高校20余名青年学者,与香港科技大学、中国科学院深圳先进技术研究院、同 济大学的学者,以及科技投资和成果转化界10余位代表参与交流,探讨中英两国科技成果转化经验。 据悉,"好望角科学沙龙"由中科创星发起,由中科创星、东壁科技数据、上海市研发公共服务平台管理 中心共同主办,致力于打造具有广泛影响力的科创融合与跨界交流平台。"好望角科学沙龙" ...
X @Avi Chawla
Avi Chawla· 2025-08-20 19:16
RT Avi Chawla (@_avichawla)DeepMind built a simple RAG technique that:- reduces hallucinations by 40%- improves answer relevancy by 50%Let's understand how to use it in RAG systems (with code): ...
空间机器人航天领域有妙用 学者沪上科学沙龙“论AI”
Zhong Guo Xin Wen Wang· 2025-08-20 14:33
Group 1 - The event "Cape of Good Hope Science Salon" focused on the application of artificial intelligence and robotics in space exploration, highlighting potential uses such as space debris collection and satellite maintenance [1][3] - The event attracted over 20 young scholars from prestigious UK universities and representatives from Chinese research institutions, emphasizing international collaboration in technology transfer [1][3] - The development of artificial intelligence is accelerating the "AI-ization" of various sectors, with expectations of driving economic growth and the need for patient capital in China's tech investment landscape [3] Group 2 - Shanghai is positioning itself as a hub for artificial intelligence development and is actively promoting patient capital through initiatives like the launch of three major guiding industry mother funds totaling 100 billion yuan [3] - The Future Industry Fund in Shanghai, announced earlier this year, aims to support the transformation of technological achievements from "0 to 1" and is designed as a counter-cyclical patient capital with a 15-year investment period [3]
X @Avi Chawla
Avi Chawla· 2025-08-20 06:31
RAG技术提升 - DeepMind 开发了一种简单的 RAG 技术,将幻觉减少 40% [1] - 该技术将答案相关性提高了 50% [1] RAG系统应用 - 行业正在探索如何在 RAG 系统中使用该技术(附带代码)[1]
X @Avi Chawla
Avi Chawla· 2025-08-20 06:30
DeepMind built a simple RAG technique that:- reduces hallucinations by 40%- improves answer relevancy by 50%Let's understand how to use it in RAG systems (with code): ...
这些硅谷AI精英“疯了”:花光积蓄囤装备逃命,开末日狂欢派对
Hu Xiu· 2025-08-19 10:26
本文来自微信公众号:APPSO (ID:appsolution),作者:APPSO,原文标题:《这些硅谷 AI 精 英"疯了":花光积蓄囤装备逃命,开末日狂欢派对,还给自己挖坑》,题图来自:AI生成 事实也是如此,纽约时报曾报道了身处AI末日论中心的Anthropic,这是一家由前OpenAI员工创立、号 称更注重安全的公司。 报道中提到Anthropic总部的氛围,是一种无处不在的紧张。员工们不担心程序代码出问题,担心的是 自己亲手打造的AI可能会被用来做可怕的、且具有毁灭性的事情。 其中一位员工透露自己常常因为过度忧虑AI而彻夜难眠,另一位员工则是平静地表示,未来十年内, 失控的AI毁灭人类的概率有20%。 在硅谷,有一个越来越常见的社交问题,你的p(doom)是多少? P(doom)指的是AI导致人类世界末日的概率,这个数字越高,意味着AI带来世界末日的时间 越近。 听起来有点难以相信,但它确实已经成了技术圈最热门的谈资。 Anthropic的CEO Dario Amodei给出的答案是10%到25%之间。 之前担任了五分钟的OpenAI CEO Emmett Shear估计是5%~50%。 深度学习教 ...
xAI 联创大神离职,去寻找下一个马斯克
3 6 Ke· 2025-08-19 00:47
Core Insights - Igor Babuschkin, a key figure at xAI, has left the company to start his own venture capital firm, Babuschkin Ventures, focusing on AI safety research and investing in startups that aim to advance humanity and unlock the mysteries of the universe [1][3][30] - Babuschkin's departure highlights a trend of top AI talent moving from research roles to venture capital, a shift that is relatively rare in the industry, especially at such a young age [3][30][36] Group 1: Igor Babuschkin's Role and Contributions - Igor played a crucial role in the development of xAI, leading the team through multiple iterations of the Grok AI model and overseeing the construction of the Colossus supercomputing cluster in Memphis [1][16] - His background includes significant achievements at DeepMind, where he led projects like AlphaStar and contributed to the development of Codex and GPT-4 during his time at OpenAI [9][11][14] - Babuschkin's departure was marked by a heartfelt farewell message, emphasizing his contributions to xAI and the impact he had on the company's growth [4][6][29] Group 2: Industry Trends and Implications - The AI industry has seen a notable trend of talent moving to venture capital, with many former researchers opting to start their own companies or join existing ones rather than transitioning to investment roles [30][31] - The venture capital landscape in AI is booming, with significant funding opportunities, as evidenced by the over $35 billion raised in Silicon Valley alone last year [36] - Babuschkin's move reflects a broader urgency among AI professionals regarding the development of AGI (Artificial General Intelligence) and the need for responsible investment in AI technologies [30][38]
核心模型被曝蒸馏DeepSeek?前女友一纸控诉,曝出欧版OpenAI塌房真相
3 6 Ke· 2025-08-18 12:12
Core Viewpoint - Mistral AI, once hailed as "Europe's OpenAI," is embroiled in a scandal involving allegations of plagiarism, specifically that its core technology is derived from DeepSeek, misleadingly presented as an original RL achievement [1][3][21]. Group 1: Allegations and Scandal - A former female employee of Mistral revealed in a personal letter that the company distilled DeepSeek's technology and misrepresented it as their own, using OpenAI's data while distorting benchmark results [3][4][21]. - The scandal gained traction online, with notable figures in the AI community, such as DeepMind researcher Susan Zhang, publicly condemning Mistral's unethical practices [4][21]. - The former employee expressed her frustrations about being sidelined and ignored when she raised concerns about the company's practices, leading to her eventual dismissal [6][7]. Group 2: Technical Comparisons - An industry insider, Sam Paech, had previously noted similarities between Mistral's Small 3.2 model and DeepSeek, suggesting that Mistral's outputs closely mirrored those of DeepSeek [9][10]. - Further analysis revealed that Mistral-small-3.2 and DeepSeek-v3 exhibited strikingly similar characteristics, indicating a lack of originality in Mistral's model [12][21]. Group 3: Historical Context and Achievements - Mistral AI was once celebrated for its rapid rise, achieving a valuation of $6.2 billion within just over a year of its establishment, positioning itself as a significant player in the European AI landscape [24][34]. - The company had previously launched successful products, including the Le Chat application, which topped the charts in France, and was supported by French President Macron as a key player in the national AI strategy [26][28][34].
腾讯研究院AI速递 20250818
腾讯研究院· 2025-08-17 16:01
Group 1 - Google has released the lightweight model Gemma 3 270M, which has 270 million parameters and a download size of only 241MB, designed specifically for terminal use [1] - The model is energy-efficient, consuming only 0.75% of battery power after 25 conversations on the Pixel 9 Pro, and can run efficiently on resource-constrained devices after INT4 quantization [1] - Gemma 3 270M outperforms the Qwen 2.5 model in the IFEval benchmark test and has surpassed 200 million downloads, tailored for specific task fine-tuning [1] Group 2 - Meta has open-sourced the DINOv3 visual foundation model, which surpasses weakly supervised models in multiple dense prediction tasks using self-supervised learning [2] - The model features innovative Gram Anchoring strategy and RoPE, with a parameter scale of 7 billion and training data expanded to 1.7 billion images [2] - DINOv3 is commercially licensed and offers various model sizes, including ViT-B and ViT-L, with specialized training for satellite image backbone networks, already applied in environmental monitoring [2] Group 3 - Tencent has launched the Lite version of its 3D world model, reducing memory requirements to below 17GB, allowing efficient operation on consumer-grade graphics cards with a 35% reduction in memory usage [3] - Technical breakthroughs include dynamic FP8 quantization, SageAttention quantization technology, and cache algorithms that enhance inference speed by over 3 times with less than 1% accuracy loss [3] - Users can generate a complete navigable 3D world by inputting a sentence or uploading an image, supporting 360-degree panoramic generation and Mesh file export for seamless integration with games and physics engines [3] Group 4 - Kunlun Wanwei has released six models from August 11 to 15, covering popular fields such as video generation, world models, unified multimodal, agents, and AI music creation [4] - The latest music model Mureka V7.5 significantly enhances the tonal quality and articulation of Chinese songs, improving voice authenticity and emotional depth through optimized ASR technology, surpassing top foreign music models [4] - A MoE-based character description voice synthesis framework, MoE-TTS, was also released, allowing users to precisely control voice features and styles through natural language, outperforming closed-source commercial products under open data conditions [4] Group 5 - OpenAI has released a programming prompt guide for GPT-5, emphasizing the importance of clear and non-conflicting instructions to avoid confusion [5][6] - It suggests using appropriate reasoning intensity and structured rules similar to XML for complex tasks, while planning self-reflection before execution for zero-to-one tasks [6] Group 6 - The first humanoid robot sports event showcased various competitions, including running, soccer, boxing, dance, and martial arts, with the Yushu robot winning the 1500m race [7] - The soccer 5V5 group matches demonstrated real-time computation and collaboration capabilities of robot players, with standout performances from specific players [7] - The event featured commentary focusing on AI knowledge, with humorous moments such as robots colliding and falling over during gameplay [7] Group 7 - DeepMind's Genie 3 model can generate 24 frames of 720p HD visuals per second and create interactive worlds with a single sentence, showcasing advanced memory capabilities [8] - The model's physical law representation improves as training data scale and depth increase, marking a significant step towards AGI [8] - Future developments will focus on realism and interactivity, potentially providing unlimited training scenarios for robots to overcome data limitations [8] Group 8 - OpenAI's CEO hinted at plans to invest trillions in building data centers and suggested that an AI might become the CEO in three years [9] - He confirmed the development of AI devices in collaboration with Jony Ive and acknowledged the increasing value of human-created content [9] - The CEO believes the current "AI bubble" is similar to the internet bubble but emphasizes that AI is a crucial long-term technological revolution [9] Group 9 - OpenAI's chief scientist discussed the evolution of AGI definitions from abstract concepts to multidimensional capabilities, highlighting the need for practical application value assessments [10] - The researchers noted that AI developments have exceeded expectations, with models excelling in competitions, demonstrating strong reasoning and creative thinking [10] - Experts recommend not abandoning programming education but rather viewing AI as a supportive tool, emphasizing the importance of structured and critical thinking [11] Group 10 - Sierra AI's founder predicts the AI market will split into three main tracks: frontier foundational models, AI toolchains, and application-type agents, with the latter presenting the greatest opportunities [12] - Agents can significantly enhance productivity, shifting from "software enhancing human efficiency" to "software completing tasks independently," akin to early computer impacts [12] - The future will see many long-tail agent companies emerging, similar to the evolution of the software market, with pricing based on business outcomes rather than technical details [12]