持续学习
Search documents
外滩大会再证蚂蚁的底色:金融科技公司
Mei Ri Shang Bao· 2025-09-11 23:04
Group 1: Conference Overview - The 2025 Inclusion·Bund Conference opened in Shanghai with the theme "Reshaping Innovative Growth," featuring 550 guests from 16 countries and regions, including notable figures like Richard Sutton and Yuval Noah Harari [1] - The conference focused on five main topics: "Financial Technology," "Artificial Intelligence and Industry," "Innovation and Investment Ecology," "Global Dialogue and Cooperation," and "Responsible Innovation and Inclusive Future," comprising one main forum and 44 insight forums [1] - The event is recognized as one of Asia's three major financial technology conferences, attracting global attention for its openness, diversity, and forward-looking nature [1] Group 2: Insights from Richard Sutton - Richard Sutton, the 2024 Turing Award winner, emphasized that artificial intelligence is entering an "experience era," where the potential for AI exceeds previous capabilities [2] - He noted that current machine learning methods are reaching the limits of human data, and there is a need for new data sources generated through direct interaction between intelligent agents and the world [2] - Sutton defined "experience" as the interaction of observation, action, and reward, which is essential for learning and intelligence [2][3] Group 3: Insights from Wang Xingxing - Wang Xingxing, CEO of Yushutech, expressed regret for not pursuing AI earlier, highlighting the rapid development of large models that now allow for the integration of AI with robotics [4] - He discussed the emergence of a new embodied intelligence industry, where robots can possess AGI capabilities, enabling them to perceive, plan, and act autonomously [4] - Wang is optimistic about the future of innovation and entrepreneurship, stating that the barriers to entry have significantly lowered, creating a favorable environment for young innovators [4] Group 4: Ant Group's Technological Advancements - Ant Group is recognized as a leading technology financial company, with significant investments in AI and various sectors [5][6] - The conference showcased Ant Group's new AI assistant "Xiao Zheng," which integrates multiple large models to streamline government services [6] - Ant Group's CTO announced the launch of the "Agentic Contract," which will be natively deployed on their new Layer2 blockchain, Jovay [6]
对AI的恐惧被夸大了,“强化学习之父”萨顿外滩演讲:四条原则预言AI未来
3 6 Ke· 2025-09-11 08:34
Group 1 - The core idea presented is that the human data dividend is nearing its limit, and artificial intelligence (AI) is entering an "experience era" centered on continuous learning, which has the potential to exceed previous capabilities [1][9][44] - AI's current training methods are primarily focused on transferring existing human knowledge to static models without autonomous learning capabilities, leading to a recognition of the limitations of this approach [10][14] - The future of AI relies on the development of two currently immature technologies: continual learning and meta-learning, which are essential for unlocking the full potential of experience-based learning [16][14] Group 2 - AI has become a highly politicized issue, with public fears about bias, unemployment, and even human extinction being exaggerated and fueled by certain organizations and individuals [16][18][25] - The call for regulation and control of AI reflects a broader societal tendency to fear the unknown, which can hinder collaborative efforts necessary for progress [24][28] - The concept of decentralized collaboration is emphasized as a superior alternative to centralized control, allowing for coexistence among diverse intelligent agents with different goals [20][26][21] Group 3 - Four principles are proposed to predict the future of AI: the absence of a unified global opinion on how the world should operate, the eventual understanding and creation of intelligence by humans, the inevitable surpassing of current human intelligence by superintelligent entities, and the flow of power and resources towards the most intelligent agents [35][36][37] - The inevitability of AI's replacement of human roles is acknowledged, framing it as a natural progression in the evolution of intelligence [38][44] - The role of humans as catalysts and pioneers in the "design era" is highlighted, emphasizing the unique ability to push design to its limits through AI [42][43]
图灵奖得主理查德·萨顿:人类将开启“宇宙第四大时代”
2 1 Shi Ji Jing Ji Bao Dao· 2025-09-11 05:45
Core Insights - Richard Sutton, the 2024 Turing Award winner, emphasizes the inevitability of AI replacing human roles in the development process of humanity [1][2] - Sutton introduces four realistic "predictive principles" regarding the future of AI, highlighting the need for decentralized collaboration and the importance of experience in learning [2][3] Group 1: AI and Learning - Sutton argues that current machine learning primarily focuses on transferring existing human knowledge to static AI, which lacks autonomous learning capabilities [1][2] - He identifies the need for a new data source generated through direct interaction between intelligent agents and the world, marking the transition into an "experience era" [1][2] - The core of intelligence lies in the ability to predict and control input signals based on experience, which is essential for the development of AI [2] Group 2: Future of AI - Sutton's four predictive principles include the lack of consensus on how the world operates, the potential for humans to understand and create intelligence through technology, the likelihood of superintelligent AI surpassing human intelligence, and the concentration of power and resources among the most intelligent agents [2][3] - He posits that humanity is currently in the "replicator era" and is on the verge of entering the "design era," where AI will play a crucial role [3][4] - Sutton encourages embracing AI as a necessary step in the evolution of the universe, advocating for courage and a spirit of adventure in facing its challenges [4]
图灵奖得主理查德·萨顿:人工智能进入“经验时代”,潜力超以往
Bei Ke Cai Jing· 2025-09-11 04:47
Core Insights - Richard Sutton, the 2024 Turing Award winner, emphasized that the human data dividend is nearing its limit, and artificial intelligence is entering an "experience era" centered on continuous learning, which has the potential to exceed previous capabilities [1][2] Group 1: AI and Learning - Sutton stated that most current machine learning aims to transfer existing human knowledge to static AI, which lacks autonomous learning capabilities. He believes we are reaching the limits of human data, and existing methods cannot generate new knowledge, making continuous learning essential for intelligence [2] - He defined "experience" as the interaction of observation, action, and reward, which is crucial for an intelligent agent's ability to predict and control its input signals. Experience is the core of all intelligence [2] Group 2: Collaboration and Future Predictions - Addressing fears about AI causing bias, unemployment, or even human extinction, Sutton argued that such fears are exaggerated and often fueled by those who profit from them. He highlighted that economic systems function best when individuals have different goals and abilities, similar to how decentralized collaboration among intelligent agents can lead to win-win outcomes [3] - Sutton proposed four predictive principles for the future of AI: 1. There is no consensus on how the world should operate, and no single view can dominate [3] 2. Humanity will truly understand intelligence and create it through technology [3] 3. Current human intelligence will soon be surpassed by superintelligent AI or enhanced humans [3] 4. Power and resources will flow to the most intelligent agents [3] Group 3: Historical Context and Future Outlook - Sutton categorized the history of the universe into four eras: the particle era, the star era, the replicator era, and the design era. He believes humanity's uniqueness lies in pushing design to its limits, which is the goal pursued through AI today [4] - He described AI as the inevitable next step in the evolution of the universe, urging society to embrace it with courage, pride, and a spirit of adventure [4] Group 4: Event Overview - The 2025 Inclusion Bund Conference, themed "Reshaping Innovative Growth," took place in Shanghai from September 10 to 13, featuring a main forum, over 40 open insight forums, global theme days, innovation stages, a technology exhibition, and various networking opportunities [4]
图灵奖得主理查德·萨顿2025外滩大会演讲:经验是一切智能的核心与基础
Yang Guang Wang· 2025-09-11 04:06
Core Insights - The 2025 Inclusion Bund Conference opened in Shanghai, featuring a keynote speech by Richard Sutton, the 2024 Turing Award winner and a pioneer in reinforcement learning [1] Group 1: Machine Learning and AI - Sutton emphasized that current machine learning primarily focuses on transferring existing human knowledge to static, non-autonomous AI, reaching the limits of human data [2] - He introduced the concept of the "experience era," advocating for new data sources generated through direct interaction between intelligent agents and the world [2] - Sutton defined "experience" as the interplay of observation, action, and reward, asserting that knowledge is derived from experience, which is fundamental to intelligence [2] Group 2: Future of AI - Sutton proposed four predictive principles regarding the future of AI: 1. There is no consensus on how the world operates, and no single perspective can dominate [3] 2. Humanity will truly understand intelligence and create it through technology [3] 3. Current human intelligence will soon be surpassed by superintelligent AI or enhanced humans [3] 4. Power and resources will gravitate towards the most intelligent agents [3] - He categorized the history of the universe into four eras: particle, star, replicator, and design, asserting that humanity's unique ability to push design to its limits is crucial in the pursuit of AI [3] Group 3: Embracing AI - Sutton stated that artificial intelligence is the inevitable next step in the evolution of the universe, and it should be embraced with courage, pride, and a spirit of adventure [4]
AI跨步进入“经验时代”
Hua Er Jie Jian Wen· 2025-09-11 03:50
Group 1 - The AI industry is transitioning into an "experience era," where continuous learning is essential for intelligence, moving beyond the limitations of human data [2] - Richard Sutton emphasizes that knowledge is derived from experience, which involves observation, action, and reward, and that the intelligence of an agent depends on its ability to predict and control input signals [2] - Two technologies, continual learning and meta-learning, are necessary to unlock the full potential of AI in this new experience era [2] Group 2 - Concerns about AI leading to bias, unemployment, or even human extinction are exaggerated and fueled by certain organizations and individuals profiting from such fears [3] - Sutton argues that decentralized collaboration among agents with different goals can lead to mutual benefits, highlighting human cooperation as a unique strength [3] - He presents four predictive principles regarding the future of AI, including the lack of consensus on how the world should operate and the potential for superintelligent AI to surpass human intelligence [3] Group 3 - Sutton categorizes the history of the universe into four eras: particle, star, replicator, and design, asserting that humanity's unique ability to push design to its limits is crucial in the current pursuit of AI [4] - He believes that AI is an inevitable next step in the evolution of the universe, advocating for a courageous and adventurous approach to its development [5]
“强化学习之父” 理查德·萨顿:人类数据红利逼近极限,AI正进入以持续学习为核心的“经验时代”
Zheng Quan Shi Bao· 2025-09-11 03:50
Core Insights - Richard Sutton, the 2024 Turing Award winner, emphasizes that the human data dividend is nearing its limit, and artificial intelligence is entering an "experience era" centered on continuous learning, which has the potential to exceed previous capabilities [1][2] Group 1: Experience Era - Sutton defines "experience" as the signals of observation, action, and reward that are exchanged between agents and the world, asserting that knowledge derives from experience and that the intelligence of an agent depends on its ability to predict and control its input signals [2] - The transition to the experience era is driven by reinforcement learning, but to fully unlock its potential, two currently immature technologies—continual learning and meta-learning—are required [2] Group 2: Collaboration and AI - Addressing concerns about AI leading to bias, unemployment, or even human extinction, Sutton argues that fears surrounding artificial intelligence are exaggerated, and that decentralized collaboration among different agents can lead to mutually beneficial outcomes [2] - He highlights that humanity's greatest strength lies in collaboration, which has been the foundation of economic, market, and governmental successes [2] Group 3: Future of AI - Sutton posits that the replacement of human roles by AI is inevitable, with humans acting as catalysts and pioneers for the "design era," which he categorizes as the fourth era in the evolution of the universe, following the particle, star, and replicator eras [2][3] - He encourages embracing the evolution of artificial intelligence with courage, pride, and a spirit of adventure [3]
强化学习之父” 理查德·萨顿:人类数据红利逼近极限,AI正进入以持续学习为核心的“经验时代
Zheng Quan Shi Bao Wang· 2025-09-11 03:26
Core Insights - Richard Sutton, the 2024 Turing Award winner, emphasizes that the human data dividend is nearing its limits, and artificial intelligence is entering an "experience era" centered on continuous learning, which has the potential to exceed previous capabilities [1][2] Group 1: Experience Era - Sutton defines "experience" as the interaction of observation, action, and reward, which are signals exchanged between agents and the world [2] - The current machine learning methods are reaching their limits in generating new knowledge, making them unsuitable for continuous learning, which is crucial for intelligence [1][2] Group 2: Technological Advancements - To fully unlock the potential of AI in the experience era, two currently immature technologies are needed: continual learning and meta-learning [2] - Sutton believes that the collaboration between decentralized agents can lead to win-win outcomes, countering fears about AI causing bias, unemployment, or even human extinction [2] Group 3: Human-AI Collaboration - Sutton argues that human collaboration is the greatest success, and AI's role will be to enhance this collaboration, which is fundamental to economic, market, and governmental successes [2] - He posits that AI's replacement of human roles is inevitable, with humans acting as catalysts in ushering in a new "design era" in the evolution of the universe [2] Group 4: Future Perspective - Sutton views artificial intelligence as a necessary next step in the evolution of the universe, advocating for a courageous and adventurous approach to its development [3]
Anthropic CEO 万字访谈:亲述丧父之痛、炮轰黄仁勋、揭秘指数定律与 AI 未来!
AI科技大本营· 2025-08-01 09:27
Core Viewpoint - Dario Amodei, CEO of Anthropic, is a pivotal figure in AI development, advocating for responsible AI while simultaneously pushing technological advancements. His dual role as a developer and a cautionary voice highlights the urgent need for safety in AI as its capabilities rapidly evolve [2][5][12]. Group 1: AI Development and Risks - Amodei emphasizes the exponential growth of AI capabilities, comparing current models to intelligent university students, and warns that the implications of AI on national security and the economy are becoming increasingly urgent [10][12]. - He believes that the real competition lies in fostering a responsible culture that attracts top talent, rather than merely focusing on model performance [5][12]. - Amodei expresses frustration at being labeled a "doomsayer," arguing that his warnings stem from a deep understanding of the technology's potential and risks, particularly influenced by personal experiences with healthcare [5][41]. Group 2: Exponential Growth and Market Dynamics - The company has experienced significant revenue growth, with projections indicating a potential increase to hundreds of billions if the current exponential growth trend continues [18][32]. - Amodei argues against the notion of diminishing returns in AI scaling, citing rapid advancements in code capabilities and market adoption as evidence of ongoing progress [19][21]. - He highlights the importance of capital efficiency, suggesting that Anthropic can achieve more with less funding compared to larger tech companies, thus making it an attractive investment opportunity [31][32]. Group 3: Company Culture and Talent Acquisition - Anthropic has successfully maintained a strong company culture, with employees showing loyalty despite competitive offers from larger firms, indicating a commitment to the company's mission [28][29]. - The company has raised nearly $20 billion, positioning itself competitively in the AI landscape, and is building data centers to match the scale of its competitors [27][30]. - Amodei stresses that the culture of a company is crucial for attracting top talent, suggesting that mission alignment is more valuable than financial incentives alone [29][37]. Group 4: Business Focus and Applications - Anthropic is focusing on enterprise-level AI applications, believing that the potential for business applications is at least equal to, if not greater than, consumer applications [33][34]. - The company aims to improve its models continuously, particularly in coding, which has shown rapid market adoption and significant utility for professionals [36][34]. - Amodei argues that enhancing model capabilities can lead to substantial value creation in various sectors, including healthcare and finance, thus driving business growth [34][35].
具身领域LLM结合强化学习与世界模型工作汇总
具身智能之心· 2025-07-29 06:15
Core Viewpoint - The article discusses recent advancements in the field of embodied intelligence, particularly focusing on the integration of large language models (LLMs) with reinforcement learning and world models, highlighting several notable research papers from 2024 [2][3]. Group 1: UniSim - UniSim aims to learn general real-world interactive simulators through generative modeling, revealing that natural datasets can provide diverse advantages for learning simulators [3]. - The research demonstrates that integrating various datasets allows for the simulation of high-level commands and low-level controls, enabling zero-shot application in real-world scenarios [3]. Group 2: Robust Agents - The study from Google DeepMind asserts that causal reasoning is essential for robust and general AI, concluding that agents capable of satisfying regret bounds must learn approximate causal models [5]. - This finding has significant implications for transfer learning and causal inference [5]. Group 3: MAMBA - MAMBA introduces an efficient world model approach for meta-reinforcement learning, addressing sample efficiency issues prevalent in current methods [8]. - The framework shows a remarkable improvement in sample efficiency, achieving up to 15 times better performance in high-dimensional tasks [8]. Group 4: EMMA - EMMA leverages LLMs trained in text-based worlds to guide the training of visual world agents, enhancing their ability to interact with dynamic environments [10]. - The approach results in a significant success rate improvement of 20%-70% in diverse tasks compared to existing VLM agents [10]. Group 5: Text2Reward - The Text2Reward framework automates the generation of dense reward functions using LLMs, addressing the challenges of reward function design in reinforcement learning [13][14]. - The method demonstrates superior performance in 13 out of 17 tasks, achieving over 94% success in new motion behaviors [14]. Group 6: Online Continual Learning - The research proposes two frameworks for continuous learning in interactive instruction-following agents, emphasizing the need for agents to learn incrementally as they explore their environments [17][18]. - A confidence-aware moving average mechanism is introduced to update parameters without relying on task boundary information [18]. Group 7: AMAGO - AMAGO is a scalable contextual reinforcement learning framework that addresses challenges in generalization, long-term memory, and meta-learning [21]. - The framework allows for parallel training of long-sequence transformers, enhancing scalability and performance in complex tasks [21]. Group 8: PDDL-based Planning - The study presents a novel paradigm for task planning using pre-trained LLMs, focusing on building explicit world models through PDDL [22][23]. - The framework significantly reduces the need for human intervention by allowing LLMs to convert between PDDL and natural language, facilitating efficient model correction [23].