Workflow
智能体
icon
Search documents
ACL 2025 | 让小说角色 「活」起来!复旦BookWorld打造沉浸式小说世界模拟系统
机器之心· 2025-06-24 06:46
BookWorld由复旦大学冉一婷、王鑫涛主导完成,由阳德青老师、肖仰华老师共同指导。复旦大学知识工场实验室长期关注大语言模型的人格化、角色扮演 研究,在该领域发表多篇顶会论文和首篇综述。 想象为《红楼梦》或《权力的游戏》创造一个AI的世界。书中的角色们变成AI,活在BookWorld当中。每天,他/她们醒来,思考,彼此对话、互动,建立 感情和关系。 如果他们能活出自己的生活,不再由笔者操控,故事是否会不一样?会不会有一个平行时空里,宝玉和黛玉有了一段美好的爱情? 今天要介绍的这篇 ACL 2025 论文 ——《BookWorld: From Novels to Interactive Agent Societies for Creative Story Generation》,聚焦于如何让 小说中的角色真正 "活" 起来,打造一个沉浸式的虚拟世界。 在BookWorld中,作者们提出了一个"小说->AI世界->故事创作"的系统。BookWorld能从小说中提取角色和世界观的数据,构建一个AI世界,让角色AI在 世界中进行长期的交互,自己创造自己的故事。为了实现流畅自然的长期交互,BookWorld建模了角色 ...
超薄机身搭载满血骁龙8至尊版芯片,荣耀Magic V5破局折叠屏“不可能”
Huan Qiu Wang· 2025-06-24 06:16
Core Viewpoint - The upcoming Honor Magic V5 aims to balance thin design and flagship performance, positioning itself as a groundbreaking product in the foldable smartphone market [1][10]. Group 1: Product Specifications - The Honor Magic V5 is claimed to be the world's thinnest foldable flagship smartphone, measuring only 8.8mm in thickness and weighing 217g, setting a new record in the foldable screen category [1]. - It features the Snapdragon 8 Gen 2 chip, ensuring no performance compromise despite its lightweight design, marking it as the only foldable device with this high-end chip [1][3]. Group 2: Battery and Imaging Technology - The device incorporates a 6100mAh battery, utilizing advanced materials and AI manufacturing to enhance capacity while maintaining a slim profile, alleviating battery anxiety for users [5]. - It also includes a flagship-level periscope telephoto lens, addressing previous limitations in imaging capabilities of foldable devices, thus improving overall photographic performance [5]. Group 3: AI Innovations - The Honor Magic V5 introduces significant AI advancements, including a full-stack personal knowledge base, multi-agent collaboration, and cross-device connectivity, transforming AI from a passive tool to an active service partner for users [10]. - The focus on lightweight design, hardware configuration, and software adaptation aligns with consumer preferences, with "lightweight" being the top priority among users when selecting foldable smartphones [10]. Group 4: Industry Impact - The Honor Magic V5 represents a response to industry trends, demonstrating that lightweight and powerful features can coexist, challenging the traditional trade-off in the foldable smartphone market [10]. - As AI evolves from a functional tool to a user companion, companies like Honor are paving the way for new possibilities in the global smart terminal industry [10].
穿越微笑曲线,影音图文等数字内容体验更智能、更便捷!
Sou Hu Cai Jing· 2025-06-23 15:52
(中国,东莞2025年6月20日)2025年华为开发者大会(HDC)正式开幕。大会宣布,HarmonyOS 6开发者Beta正式启动,并发布全新的互联架构、鸿蒙智 能体框架HMAF等,激发更多应用创新,为鸿蒙应用体验带来了更多可能。 华为通过更自然的交互方式、更强大的应用智能化自主能力、更高效的连续服 务闭环、更协同的多智能体合作,打造无处不在的智能体验。 作为鸿蒙系统级AI Agent,小艺和华为视频、华为音乐、华为阅读、华为浏览器和华为天气等鸿蒙应用相结合,更精准地捕捉用户意图,深度理解用户偏 好,为用户提供更加符合实际场景和需求的智能个性化影音内容资讯等数字内容体验创新。在日常生活中,从影音娱乐到读书看资讯,再到出行时看天气, 鸿蒙智能体都带来了大有不同的内容体验。 用户可以对小艺说某句影视台词或情节描述,如"我想看李白骑着白鹤飞上天",或者让小艺推荐近期火热的《长安的荔枝》相关的歌单或唐朝探案类型小说 书单,或者向小艺提问"高考填报志愿"的相关内容、从深圳出差到哈尔滨的穿衣指南等,华为视频AI影视全能搜能快速精准搜索全网影视内容,并弹出卡片 让用户直接打开华为视频观看;华为音乐和华为阅读能够基于用户个 ...
国泰海通|策略:数字货币:打开跨境结算与融资新路径
报告导读: 主题交易热度维持平稳,结构切换加速,稳定币与 PCB 主题大涨,新消费 / 创新药和稀土永磁主题回调,看好科技类主题再次布局机会。推荐:数字货币 /AI 智能体 / 内需消费 / 并购重组。 主题温度计:热点主题切换加速,稳定币与 PCB 主题领涨。 6 月 16 日 -20 日热点主题日均成交额平 均 4.32 亿元,日均换手率 3.35% , 6 月以来主题交易热度整体维持平稳。热点主题结构上切换加速, 新消费 / 创新药 / 稀土相关主题回调,中东局势动荡下油气开发相关主题持续活跃,而稳定币主题在密 集催化下领涨市场,人工智能主线中 PCB/ 光模块等细分板块率先走强,逐步提振科技类主题热度。股市 预期和微观流动性均处于上升趋势,但需关注主题叙事环境的阶段性变化,结构切换中看好科技类主题的 再次布局机会。 主题一:数字货币。 2025 陆家嘴论坛提出设立数字人民币国际运营中心;跨境支付通的上线有望便利陆 港经贸往来推动人民币国际化。中国香港、美国通过稳定币相关法律规范有望加速数字货币产业发展,稳 定币打通加密资产与传统金融体系的联系,适用于跨境支付场景,并将成为大型企业跨境结算和资金调配 过 ...
纳米 AI 梁志辉:超级搜索智能体是 AI 时代的真正入口
Founder Park· 2025-06-23 12:00
AI 正在重构搜索本身的体验。 从早期的 AI Summary,到现在各家产品产品推出的 Deep Research,搜索的广度和深度,一直在被拓展。 Agent 时代,又从智能体的角度对搜索进行了能力的升维。 纳米 AI 推出的「超级搜索智能体」就是基于这样的思路而诞生的一款产品,基于搜索但不止于搜索。 结合了今天的多 Agent 协作、多模型协作、MCP、AI 浏览器等能力,纳米 AI 超级搜索智能体是在当前的模型技术架构下,国内公司对于 AI 搜索和 AI 智能体的一个新的解决方案的尝试。 在 AGI Playground 2025 上,360 集团副总裁、纳米 AI 负责人梁志辉详细分享了 360 对于 AI 时代的搜索、智能体搭建以及 AI 浏览器的看法,和 360 在这 些产品上的探索与经验。 以下内容基于演讲内容,由 Founder Park 整理。 不仅仅是简单搜索,复杂问题乃至千字问题、研究任务、复杂的商品购买需求、投资任务,甚至直接解决用户的内容创作难题,比如快速生成几分钟的视 频或者一份需要花很长时间才能完成的 PPT。 搜索意味着用户有问题要解决,Agent 智能体,可能是在今天解决搜 ...
2025年AI在多个方面持续取得显著进展和突破
Sou Hu Cai Jing· 2025-06-23 07:19
Group 1 - In 2025, multimodal AI is a key trend, capable of processing and integrating various forms of input such as text, images, audio, and video, exemplified by OpenAI's GPT-4 and Google's Gemini model [1] - AI agents are evolving from simple chatbots to more intelligent assistants with contextual awareness, transforming customer service and user interaction across platforms [3] - The rapid development and adoption of small language models (SLMs) in 2025 offer significant advantages over large language models (LLMs), including lower development costs and improved user experience [3] Group 2 - AI for Science (AI4S) is becoming a crucial force in transforming scientific research paradigms, with multimodal large models aiding in the analysis of complex multidimensional data [4] - The rapid advancement of AI brings new risks related to security, governance, copyright, and ethics, prompting global efforts to strengthen AI governance through policy and technical standards [4] - 2025 is anticipated to be the "year of embodied intelligence," with significant developments in the industry and technology, including the potential mass production of humanoid robots like Tesla's Optimus [4]
AI月报:当AI包办一切,未来不是拼效率,而是拼“品味”
3 6 Ke· 2025-06-23 03:47
Industry Overview - The AI industry is transitioning from a phase of model competition to productization and ecosystem integration, focusing on user entry points, agent standards, and terminal capabilities [1][2] - The key terms in AI have shifted from "larger models" and "faster inference" to "agents," "autonomous execution," and "delegated programming" [2] Model Development - New generation foundational models like GPT-4.5 and Gemini 2.5 Pro represent a significant shift in AI's cognitive capabilities, moving from passive responders to models that engage in self-reflection and multi-step reasoning [4][5] - These advanced models can now decompose complex questions, reason through multiple paths, and select optimal solutions, resembling human-like thought processes [4][5] AI Agents - AI agents are evolving from simple tools to autonomous entities capable of executing complex tasks, marking a new stage in AI applications [7][8] - They can perceive their environment, autonomously plan, utilize tools, connect data, and complete multi-step tasks, fundamentally changing human-software interaction [10][12] AI Programming - The programming landscape is shifting from AI as an assistant to AI taking on full task delegation, significantly enhancing developer productivity [14][16] - AI agents can now accept natural language programming tasks, generate code, conduct testing, and manage deployment processes, allowing developers to focus on higher-level design and strategy [15][17] Business Model Evolution - The industry consensus is moving from "Model as a Service" (MaaS) to "Results as a Service" (RaaS), emphasizing the delivery of measurable outcomes rather than just tools [20][21] - This shift requires AI companies to focus on quantifiable business metrics such as GMV growth and customer satisfaction, transforming AI from a cost center into a profit engine [21][22] Workforce Impact - As AI capabilities expand, the unique human skills of taste, judgment, and direction become increasingly valuable, positioning humans as collaborators rather than competitors to AI [24][25] - Future roles will emphasize strategic thinking and problem definition over technical execution, with engineers and product managers acting more as architects and visionaries [26][27]
6月23日|财经简报 充电宝安全危机 伊朗宣布关闭霍尔木兹海峡
Sou Hu Cai Jing· 2025-06-23 03:36
Market Dynamics and Sentiment - A-shares experienced a downward adjustment, with the ChiNext Index leading the decline by 0.84%, the Shenzhen Component Index down 0.47%, and the Shanghai Composite Index slightly down by 0.07%. The market turnover decreased to 1.07 trillion yuan, indicating a strong wait-and-see sentiment [2] - The major indices in the US showed mixed performance, with technology stocks generally declining, including a nearly 4% drop in Google, while Apple rose over 2%. Chinese concept stocks displayed a mixed performance, with the Nasdaq Golden Dragon China Index down 0.92% [2] Policy and Major Events - The US imposed tariffs on steel household appliances starting June 23, leading to a collective adjustment in the Asia-Pacific stock market. Concerns arose regarding the profit pressure on appliance exporters, particularly those reliant on the North American market, prompting some companies to consider relocating production to Southeast Asia or switching to aluminum [3] - The Federal Reserve maintained interest rates during the June meeting, but the "dot plot" indicated a reduction in the expected rate cuts for 2025 from two to one, signaling a more hawkish stance. Trump continued to pressure for significant rate cuts, while Powell emphasized the independence of policy, leading to increased market divergence regarding the rate cut path [3] Industry Sectors and Hotspots - The PCB industry is experiencing a surge due to increased demand driven by AI servers and electric vehicles, with leading companies like Shenghong Technology having orders extending into 2026 [5] - In the consumer electronics sector, multiple factors are driving investment opportunities in the third quarter, with a focus on concepts like HarmonyOS and solid-state batteries [6] - The charging treasure safety crisis has emerged, with multiple brands facing suspension of 3C certification, and battery supplier Amperis under regulatory investigation, exposing credit risks in the industry [7] - The extension of the cobalt raw material ban in the Democratic Republic of Congo for an additional three months may elevate cobalt prices, benefiting companies like Huayou Cobalt and Tengyuan Cobalt [8] - Iran's announcement to close the Strait of Hormuz has drawn attention to the nuclear pollution prevention and oil and gas shipping concepts, with companies like Guangguang Co. and Ningbo Shipping being highlighted [9] - The National Medical Products Administration supports full lifecycle supervision of high-end medical devices, which is favorable for companies like Mindray Medical and United Imaging Medical [10] - The commercial use of humanoid robots in China is expected to reach 60,000 units by 2030, with a compound annual growth rate of 95.3%, benefiting companies like Tongda Power and Zhengye Technology [11] Company Dynamics and Capital Operations - China Railway Construction's 3.856 billion shares of restricted stock were unlocked on June 23, accounting for 72.29% of the total share capital, which may exert pressure on the stock price [12] - Guangting Information's 48.547 million shares of restricted stock were unlocked, representing 52.41% of the total share capital, involving four shareholders [13] - Beijing Junzheng's stock registration date is June 23, with a proposed cash dividend of approximately 48.16 million yuan [14] - Jianfa Real Estate's 670 million yuan bond was fully redeemed on June 23 [15] - Yihua Co. held an extraordinary shareholders' meeting to review the repurchase and cancellation of restricted stock and the reduction of registered capital, which may impact the company's capital structure [16]
Altman对话YC总裁:OpenAI的开源模型将远超期待
Hu Xiu· 2025-06-23 02:27
Group 1 - OpenAI is set to release a powerful open-source model, GPT-5, which will be a significant step towards achieving full multimodal capabilities [2][3] - GPT-5 is expected to support various input types including voice, images, code, and video, enhancing user interaction and application development [3][4] - The cost of using AI models is rapidly decreasing, with GPT-3's costs dropping to one-fifth within a week, indicating a trend that will continue [5][4] Group 2 - This year is referred to as the "Year of the Agent," where AI agents are expected to perform tasks similar to entry-level employees, potentially replacing many computer-based jobs [6][8] - OpenAI's vision for AGI is categorized into five levels, with GPT-5 being a step towards achieving deeper reasoning and real-time content generation capabilities [9][27] Group 3 - Entrepreneurs are encouraged to seize the technological transformation opportunities, as this is considered the best time in tech history for startups [10][42] - Successful startups should focus on unmet market needs rather than replicating existing products like OpenAI's core chat assistant [11][32] Group 4 - The integration of AI into everyday life is evolving, with users beginning to treat ChatGPT as an operating system that connects various data sources [21][24] - The future of human-computer interaction is expected to minimize traditional interfaces, allowing for more seamless and intuitive user experiences [40][41] Group 5 - OpenAI aims to create a comprehensive model capable of reasoning and generating real-time video content, which would revolutionize computer interfaces [27][26] - The potential for robotics is highlighted, with expectations that humanoid robots will soon be able to perform useful tasks in the real world [28][29] Group 6 - The discussion emphasizes the importance of building defensible companies in the face of competition from larger entities like OpenAI, suggesting that unique and innovative approaches are crucial for success [30][33] - The conversation reflects on the rapid advancements in AI and the need for new infrastructure to support these developments, drawing parallels to the historical impact of the transistor [62][63]
Sam Altman重磅官宣:OpenAI将推出开源模型,GPT5迈向完全多模态(万字完整实录)
3 6 Ke· 2025-06-23 02:22
Group 1 - OpenAI is set to release a powerful open-source model, GPT-5, which will support multiple input modalities including voice, images, code, and video, marking a significant step towards achieving full multimodal capabilities [1][18] - GPT-5 is expected to launch in the summer of this year and will enhance AI technology's accessibility and innovation [1][18] - The ultimate goal for OpenAI is to develop a fully multimodal model capable of deep reasoning, real-time video generation, and extensive code writing [1][18] Group 2 - Current AI models, such as GPT-3, have capabilities that exceed existing product applications, indicating a vast "product overflow" potential for new product development [2] - The cost of using AI models is rapidly decreasing, with GPT-3's costs dropping fivefold in just one week, suggesting a continuing trend of improved price-performance ratios [3][12] - ChatGPT's memory feature is evolving to create a more integrated user experience, allowing it to function as an operating system that connects various data sources [3][15] Group 3 - This year has been termed the "Year of the Agent," with AI agents being described as "Level 3 AGI" capable of performing tasks independently like a junior employee [4] - OpenAI's AGI framework categorizes the development of AGI into five levels, from conversational agents to organizational agents [4] Group 4 - Entrepreneurs are encouraged to seize the current technological transformation as the best time in history for startups, with AI expected to significantly enhance quality of life [5] - The rapid evolution of technology often leads to the downfall of large companies while smaller firms can iterate faster and at lower costs [5][30] Group 5 - OpenAI aims to foster an ecosystem where startups can leverage its platform to create innovative applications rather than merely replicating existing products like ChatGPT [21][22] - The company envisions a future where AI can seamlessly integrate into daily life, functioning as a proactive assistant that understands user needs [14][15]