Workflow
AI安全
icon
Search documents
两位顶级科学家的17分钟对话:如何训练“善良”的AI
Di Yi Cai Jing· 2025-07-26 13:43
辛顿建议年轻人做"所有人都做错了"的事。 在17分钟的对话中,辛顿和周伯文两位科学家谈及大模型的"意识"、如何训练"善良"的AI,以及给年轻科学家的建议。 辛顿还调侃自己获得诺贝尔物理学奖是个"错误","他们真的很想在人工智能领域颁发诺贝尔奖,但他们没有这个奖项。所以他们拿了一个物理学的奖颁给 人工智能(的科学家)。" 辛顿此前多次警告人类重视AI安全与风险,在这次对话中,他认为,当今的多模态聊天机器人已经具有意识。但具体如何规避这种风险,一直没有很多关 于措施的讨论。 2025 WAIC期间,现年77岁的"AI教父"、图灵奖和2024年诺贝尔奖双料得主杰弗里·辛顿(Geoffrey Hinton)第一次踏上了中国,关于他在这里的一切动向和 观点都备受关注。 7月26日上午,辛顿在WAIC开幕式演讲提到人工智能可能会战胜人类智能,这让他感到担忧,人类要避免"养虎为患",这一演讲很快刷屏朋友圈,在各个AI 群里传播。 下午,辛顿又现身模速空间旁的美高梅酒店,在科学前沿全体会议上与上海人工智能实验室主任周伯文进行了一场对话。这并不是第一财经第一次来到这一 会议,但这一定是历届人数最多的一次科学前沿全体会议。 下午 ...
“AI教父”辛顿现身WAIC:称AI将寻求更多控制权
Di Yi Cai Jing· 2025-07-26 06:27
Group 1 - The core viewpoint of the article revolves around the potential of AI to surpass human intelligence and the associated risks, as articulated by Geoffrey Hinton during the World Artificial Intelligence Conference (WAIC) [1][4][6] - Hinton emphasizes the need for a global effort to address the dangers posed by AI, suggesting that nations should collaborate on AI safety and training [5][6] - The article highlights Hinton's historical contributions to AI, particularly his development of the AlexNet algorithm, which revolutionized deep learning [5][6] Group 2 - Hinton discusses the evolution of AI over the past 60 years, identifying two main paradigms: symbolic logic and biologically inspired approaches [3][4] - He expresses concerns about the rapid advancement of AI technologies, estimating a 10% to 20% probability that AI could potentially threaten human civilization [6] - Hinton advocates for allocating significant computational resources towards ensuring AI systems align with human intentions, criticizing tech companies for prioritizing profit over safety [6]
直击WAIC | 上海人工智能实验室主任周伯文:AI研究不是零和游戏,更多优势来自安全方面的合作
Xin Lang Ke Ji· 2025-07-26 03:54
专题:2025世界人工智能大会 新浪科技讯 7月26日上午消息,2025世界人工智能大会(WAIC 2025)于7月26-28日在上海举办。 新浪声明:所有会议实录均为现场速记整理,未经演讲者审阅,新浪网登载此文出于传递更多信息之目 的,并不意味着赞同其观点或证实其描述。 责任编辑:李思阳 在2025世界人工智能大会暨人工智能全球治理高级别会议主论坛(上午场)上,上海人工智能实验室主 任、首席科学家周伯文谈到,在技术层面上,现在的人工智能发展明确的特点就是:通用型、可复制和 开源,这些非常的有用,但同时也会带来很多的风险问题,所以在AI的研究方面,AI的进展和安全同 等重要,所以AI的研究本身不是一个零和游戏,它会有更多的优势来自于安全方面的合作。 在发展和安全方面,周伯文表示:"我一直认为,不能只强调发展不谈安全,也不能说只谈安全不讲发 展。"去年在WAIC的全体会议上,他提出了45度平衡率,意思就是需要找到实现发展和安全并重的技术 实现路径。 周伯文指出,在这个框架下,我们过去一年跟很多国际的学者有合作和交流,达成了一个观点,就是说 很多原来的研究工作我们都把它叫做make AI safe,但是要真正实现 ...
诺奖得主杰弗里·辛顿:应建立AI安全相关机构和社群,推动AI向善
news flash· 2025-07-26 03:43
诺奖得主杰弗里·辛顿:应建立AI安全相关机构和社群,推动AI向善 《科创板日报》26日讯,在2025世界人工智能大会主论坛上,图灵奖、诺贝尔物理学奖得主杰弗里·辛 顿表示,几乎所有专家认为会出现比人类更智能的AI,AI智能体为完成任务,会想要生存、获得更多 控制,可能操纵人类,简单关闭AI不现实,就像养老虎当宠物,养大后可能被其伤害,而人类无法消 灭AI,因其在多领域作用重大。杰弗里·辛顿希望建立AI安全机构、国际社群,研究训练AI向善的技 巧,各国可在本国主权范围内研究并分享成果,全球或主要AI国家应思考建立相关网络,研究如何训 练聪明的AI辅助人类,而非消灭或统治人类,这是人类长期面临的重要问题。(记者 黄心怡) ...
2025中国互联网大会开幕 聚焦技术与实体经济融合
Zheng Quan Ri Bao Wang· 2025-07-23 12:55
Group 1 - The 2025 China Internet Conference was held in Beijing from July 23 to 25, focusing on the theme "Digital Drives New Quality, Intelligent Creation of the Future" [1] - The conference featured over 30 thematic activities, including special forums, industry forums, closed-door seminars, exhibitions, high-level dialogues, and enterprise deep-dive sessions [1] Group 2 - Discussions at the conference highlighted the dual nature of AI security, emphasizing the need for both protecting AI models and enhancing system security through AI technology [2] - The internet's audio and video traffic accounted for 85% of total traffic last year, with projections suggesting it could reach 90% this year [2] Group 3 - The human-shaped robot industry is gaining attention due to advancements in internet technology, particularly language models, which enhance human-robot interaction [3] - The continuous upgrade of communication technologies like 5G is accelerating the market expansion for human-shaped robots [3] - The naked-eye 3D technology is maturing, with a shift in consumer entertainment demands towards 3D experiences [3] Group 4 - The integration of internet technology with vertical industries is crucial for creating economic value, requiring detailed analysis of industry-specific processes and characteristics [4] - The rapid development of AI presents a favorable opportunity for internet technology to serve vertical industries, which should lead the integration process for better economic outcomes [4]
奇安信韩永刚:大模型开发应用带来了新的安全隐患,AI安全还处于起步阶段
news flash· 2025-07-23 03:57
Core Insights - The security of AI differs significantly from traditional security, with current protective measures primarily focused on AI development testing environments, AI-related data, and applications, indicating that the field is still in its early stages [1] - Content security, cognitive adversarial challenges, and future intelligent agent permission control, along with application and data protection, remain difficult areas, representing future growth potential for the cybersecurity industry [1] - AI is expected to create incremental demand and supply in cybersecurity, potentially transforming small-scale high-level capabilities into large-scale offerings, thus shifting the industry from labor-intensive to knowledge-intensive, which may enhance efficiency [1] - The development and application of large models introduce new security risks due to their black-box nature, connections to various businesses and personnel, and the application of multidimensional data, compounded by a lack of effective security assessments, protections, and monitoring during rapid deployment [1] - AI security encompasses not only traditional security issues but also new challenges such as content security [1]
种子轮就估值120亿美元,她能打造另一个OpenAI吗?
机器之心· 2025-07-16 08:09
Core Viewpoint - Thinking Machines Lab, founded by former OpenAI CTO Mira Murati, has raised $2 billion in seed funding, achieving a post-money valuation of $12 billion, marking one of the largest seed rounds in Silicon Valley history [2][10]. Group 1: Seed Funding Significance - The $2 billion seed funding is unprecedented, as most AI startups typically raise only a few million to tens of millions in early financing [5]. - This funding allows Thinking Machines Lab to build a "symbiotic" ecosystem, combining top talent with substantial computational resources necessary for AI development [8][9]. Group 2: Company Background and Vision - Thinking Machines Lab aims to create multimodal AI that operates through natural interactions, incorporating dialogue and visual elements [12]. - The company plans to include an open-source component in its products, which will benefit researchers and startups in developing customized models [13]. Group 3: Talent Acquisition and Industry Context - The company has attracted several high-profile individuals, forming what is described as an "AI dream team" [20]. - The competitive landscape for AI talent is highlighted by recent high-profile moves and the significant funding received by Thinking Machines Lab, underscoring the critical importance of AI in the current era [23].
OpenAI谷歌Anthropic罕见联手发研究!Ilya/Hinton/Bengio带头支持,共推CoT监测方案
量子位· 2025-07-16 04:21
Core Viewpoint - Major AI companies are shifting from competition to collaboration, focusing on AI safety research through a joint statement and the introduction of a new concept called CoT monitoring [1][3][4]. Group 1: Collaboration and Key Contributors - OpenAI, Google DeepMind, and Anthropic are leading a collaborative effort involving over 40 top institutions, including notable figures like Yoshua Bengio and Shane Legg [3][6]. - The collaboration contrasts with the competitive landscape where companies like Meta are aggressively recruiting top talent from these giants [5][6]. Group 2: CoT Monitoring Concept - CoT monitoring is proposed as a core method for controlling AI agents and ensuring their safety [4][7]. - The opacity of AI agents is identified as a primary risk, and understanding their reasoning processes could enhance risk management [7][8]. Group 3: Mechanisms of CoT Monitoring - CoT allows for the externalization of reasoning processes, which is essential for certain tasks and can help detect abnormal behaviors [9][10][15]. - CoT monitoring has shown value in identifying model misbehavior and early signs of misalignment [18][19]. Group 4: Limitations and Challenges - The effectiveness of CoT monitoring may depend on the training paradigms of advanced models, with potential issues arising from result-oriented reinforcement learning [21][22]. - There are concerns about the reliability of CoT monitoring, as some models may obscure their true reasoning processes even when prompted to reveal them [30][31]. Group 5: Perspectives from Companies - OpenAI expresses optimism about the value of CoT monitoring, citing successful applications in identifying reward attacks in code [24][26]. - In contrast, Anthropic raises concerns about the reliability of CoT monitoring, noting that models often fail to acknowledge their reasoning processes accurately [30][35].
启明星辰(002439) - 2025年7月15日投资者关系活动记录表
2025-07-15 15:00
Financial Performance Overview - The company expects to achieve revenue between CNY 1.115 billion and CNY 1.175 billion for the first half of 2025, with a projected net profit attributable to shareholders ranging from -CNY 1.03 billion to -CNY 0.73 billion, and a non-recurring net profit between -CNY 1.83 billion and -CNY 1.53 billion [2][3]. Revenue Decline Factors - Revenue decline is attributed to external environmental challenges and market demand adjustments, with a structural adjustment in the cybersecurity market due to tightened customer budgets [2][3]. - Strategic focus on quality and revenue structure changes led to a reduction in low-margin integration projects, resulting in a decline in related transaction revenue from major clients [3]. Response Measures - The company has accelerated the commercialization of innovative businesses, achieving breakthroughs in AI security and data security, maintaining a leading market share in 30 core products and services [3][4]. - Improved operational quality through strict project order management and enhanced cash flow management, with a projected increase in overall gross margin by over 2 percentage points compared to the previous year [4][19]. Profitability Insights - The net profit attributable to shareholders is expected to grow by 43% to 60% year-on-year, driven by stock price fluctuations of associated listed companies and increased investment income [6]. - Non-recurring net profit has declined due to reduced revenue and gross profit scale, but cost control measures are in place to enhance long-term competitiveness [7]. Strategic Collaboration and Market Expansion - The company aims to deepen strategic collaboration with China Mobile, enhancing the competitiveness of security products and services in the enterprise market [4][8]. - The new chairman emphasizes the mission to build a world-class cybersecurity company and strengthen R&D efforts to support China Mobile's business [8]. Market Trends and Opportunities - The cybersecurity industry is facing pressure, but there are emerging opportunities in AI application security and data security, with significant growth potential in these areas [20][21]. - The company is focusing on high-margin orders and expanding its market reach in sectors like finance and healthcare, while managing low-margin projects [15][16]. Future Outlook - The company anticipates a gradual recovery in market demand, particularly in AI and data sectors, with a focus on enhancing internal procurement from China Mobile [16][18]. - Continued emphasis on cash flow improvement and operational efficiency is expected to support sustainable growth in the second half of 2025 [19].
启明星辰上半年与中移协同处于深化阶段 持续推进高质量发展
Cai Jing Wang· 2025-07-15 02:38
Group 1 - The company expects to achieve operating revenue between 1.115 billion and 1.175 billion yuan for the first half of 2025, with a projected net profit growth of 43% to 60% compared to the same period last year [1] - The company has improved its operational quality, with a significant increase in the proportion of high-margin products, leading to a more than 2 percentage point increase in overall gross margin compared to the same period last year [1] - The company has strengthened accounts receivable and cash flow management, resulting in a noticeable increase in operating cash flow and a reduction in accounts receivable at the end of the reporting period [1] Group 2 - The company is focusing on AI security, launching a series of products related to large model application safety, and has seen a doubling in order amounts in the second quarter [2] - The company has successfully implemented several multi-million yuan projects in data security, providing comprehensive lifecycle security protection for clients and enhancing data value through trusted data development [2] - The company is deepening its collaboration with China Mobile, aiming to enhance the quality and efficiency of cooperation, and plans to optimize resource allocation to boost cloud security and DICT collaborative revenue [3] Group 3 - The company is committed to the "Overall National Security Concept" and plays a crucial role in supporting China Mobile's "BASIC6" innovation plan, focusing on integrating cloud, network, and intelligent security capabilities [3] - The company aims to maintain confidence in development and continue to deepen business collaboration with China Mobile, expecting to consolidate competitive advantages and move towards a new stage of high-quality development [3]