Agent Era
After Doubao Model's Daily Token Usage Tops 50 Trillion, Volcano Engine Bets Its Main Battleground on Agents
Tai Mei Ti APP · 2025-12-19 10:05
Core Insights
- The release of Doubao Model 1.8 and Seedance 1.5 pro marks a significant update in AI capabilities, particularly in multi-modal understanding and Agent functionalities [2][4]
- Daily token usage of the Doubao model has passed 50 trillion, a tenfold increase from the previous year, and more than 100 enterprise clients have each used over 1 trillion tokens [2][5]
- The advancements in Agent capabilities are seen as a pivotal development, allowing for complex applications in enterprise scenarios [4][7]
Group 1: Model Updates
- Doubao Model 1.8 has significantly improved its tool-calling ability, allowing for the simultaneous use of over 20 tools, reducing planning steps by 37% and increasing execution success rates by 21% [5]
- The model has enhanced capabilities in visual understanding, long video comprehension, and document structuring, along with native support for intelligent context management [5][6]
- Seedance 1.5 pro is designed to meet the growing demand for video creation, featuring cinematic narrative tension and breakthroughs in audio-visual synchronization technology [2][5]
Group 2: Industry Trends
- The industry is still in its early stages, with ongoing technical limitations, but there is a strong demand for multi-modal models [3][7]
- The Agent era is expected to continue its growth, with predictions of enterprises utilizing 50 to 200 Agents by 2025, necessitating improved management and operational capabilities [10]
- Key sectors such as internet, retail, automotive, and education are rapidly adopting Agent technologies, while traditional industries are slower but have high potential [7][10]
Group 3: Competitive Landscape
- Major players like Anthropic, Google, and OpenAI are refining their models to enhance practical applications, with a focus on economic value and real-world utility [8][10]
- Competition among large model vendors is expected to intensify as Agent capabilities become more critical in the market [10]
"AI Prodigy" Luo Fuli Makes Her Xiaomi Debut
Xin Lang Cai Jing· 2025-12-17 16:16
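Zhitong Finance reporter Fan Jialai
Xiaomi has open-sourced a self-developed large model for the first time, and Luo Fuli, the closely watched "AI prodigy," has made her first appearance at a launch event. On December 17, at the Xiaomi Human x Car x Home full-ecosystem partner conference, Xiaomi Group partner and president Lu Weibing announced that Xiaomi's self-developed AI large model, Xiaomi MiMo-V2-Flash, has been officially open-sourced and released. He called it a new language foundation for the move toward the Agent era. According to the timeline Lu Weibing presented for the self-developed MiMo series, Xiaomi has so far released the reasoning model MiMo-7B, the visual reasoning model MiMo-VL, the native end-to-end audio generation model MiMo-Audio, the on-device vision-language model MiMo-VL-Miloco, and the embodied model MiMo-Embodied. Notably, Xiaomi founder and CEO Lei Jun did not attend. After Lu Weibing opened with his speech, Luo Fuli, a former core member of DeepSeek known in the industry as the "AI prodigy," also made her first appearance at a Xiaomi launch event; her current position is head of the Xiaomi MiMo large model. Lu Weibing also broke down the "Xiaomi Human x Car x Home ecosystem" on stage for the first time: products comprise personal devices, mobility devices, and home devices; core technologies comprise chips, OS, and AI; smart manufacturing comprises phone, car, and major home appliance factories. Luo Fuli was undoubtedly the focus of outside attention at this event. Earlier rumors held that Lei Jun had recruited her with an annual salary of around ten million yuan, but at the time neither Luo Fuli nor Xiaomi Group formally responded ...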
Xiaomi's Self-Developed Large Model MiMo-V2-Flash Officially Open-Sourced; Lu Weibing: A New Language Foundation for the Agent Era
Xin Lang Cai Jing· 2025-12-17 02:34
Sina Tech, morning of December 17: At today's 2025 Xiaomi Human x Car x Home full-ecosystem partner conference, Xiaomi Group partner and president Lu Weibing delivered a keynote titled "Moving Forward Together, a Surging Future." He announced that Xiaomi's self-developed AI large model, Xiaomi MiMo-V2-Flash, has been officially open-sourced and released, calling it a new language foundation for the move toward the Agent era. According to the timeline Lu Weibing presented for the self-developed MiMo series, Xiaomi has so far released the reasoning model MiMo-7B, the visual reasoning model MiMo-VL, the native end-to-end audio generation model MiMo-Audio, the on-device vision-language model MiMo-VL-Miloco, and the embodied model MiMo-Embodied. ...
Doubao and OpenAI Are Betting on the Same Future
Tai Mei Ti APP · 2025-12-04 01:00
Core Insights
- The article discusses the launch of Doubao Mobile Assistant, which allows users to perform complex tasks through voice commands, potentially transforming the mobile internet landscape [3][4][5]
- Doubao's strategy involves collaborating with smartphone manufacturers to gain access to system-level permissions, enabling it to operate across applications and redefine user interaction with mobile devices [4][9][12]
Group 1: Product Launch and Features
- Doubao Mobile Assistant was released on December 1, with a retail price of 3499 yuan, quickly selling out and reselling for 3999 to 4999 yuan on secondary markets [3]
- The assistant can perform tasks such as price comparison, booking restaurants, and even opening a car trunk, showcasing its ability to streamline user interactions [4][5]
Group 2: Market Impact and User Behavior
- The introduction of Doubao Mobile Assistant may lead to a significant shift in user behavior, as reliance on traditional apps like Taobao and Meituan could diminish, turning them into tools invoked by AI [5][7]
- Users have reported issues with the assistant, such as restrictions when logging into WeChat, indicating potential challenges in its integration with existing applications [3][8]
Group 3: Competitive Landscape
- The article highlights the competitive dynamics in the AI and mobile sectors, with major players like Apple and Google also enhancing their AI capabilities within their operating systems [10][11]
- Doubao's push into the operating system level poses a threat to existing app ecosystems, as it could disrupt the flow of user traffic and advertising revenue [7][8]
Group 4: Future Outlook
- The integration of AI into mobile devices is seen as a potential "iPhone moment," with the possibility of redefining mobile interaction and creating new business models [9][12]
- The outcome of this competition remains uncertain, as it could lead to either the evolution of existing devices or the emergence of entirely new AI-centric hardware [12]
Product Managers May Have to Do Their Jobs in Reverse
36Ke · 2025-11-24 02:23
Core Insights
- The role of product managers is being fundamentally transformed due to advancements in AI technology, particularly large language models, which are changing how software interacts with users [1][10][12]
Group 1: Historical Context of Software Development
- Early computers operated on command-line interfaces, requiring users to input specific commands without understanding [2][4]
- The introduction of graphical user interfaces in the 1980s, such as the Macintosh, allowed users to interact with computers through visual elements, making software more user-friendly [3][5]
- The evolution of mobile devices, particularly the iPhone, further simplified interactions by breaking down functionalities into individual apps [4][6]
Group 2: Limitations of Traditional Software Design
- Traditional software design has led to increasingly complex and bloated products due to the need for manual design of interfaces, processes, and functionalities [6][8]
- Customization demands from clients have resulted in software that resembles a marketplace rather than a streamlined product, complicating user experience [8][9]
Group 3: Impact of AI on Software Paradigms
- The emergence of large language models has the potential to eliminate the need for traditional software components like interfaces and processes, as these models can understand user intent and execute tasks autonomously [10][12]
- Current software products are evolving along two main paths: foundational reconstruction and chatbot integration, with the latter serving as a transitional tool for users accustomed to traditional interfaces [15][23]
Group 4: Future of Software as Intelligent Agents
- The future of software is envisioned as "living entities" that continuously engage with users, adapting to their needs and preferences, rather than static tools [30][35]
- This shift requires a rethinking of product design, focusing on user scenarios and interaction methods, moving away from traditional button-based interfaces to more intuitive, context-aware systems [36][39]
- Product managers will need to design these intelligent agents with capabilities such as intent understanding, emotional sensing, and long-term memory, while the coding aspect can be handled by AI [40][41]
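Musk: Phones Will Look Very Different in 5-6 Years! Sci-Tech Innovation Artificial Intelligence ETF Huaxia (589010) Consolidates Weakly in the Afternoon as Market Sentiment Turns Cautious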
Mei Ri Jing Ji Xin Wen· 2025-11-04 06:43
Group 1: Market Performance
- The Sci-Tech Innovation Artificial Intelligence ETF (589010) is trading at 1.386 yuan, with a decline of 2.39%, maintaining a downward trend throughout the day [1]
- Only one constituent stock is up, while 29 are down, indicating significant pressure on the AI sector, with some stocks like Aobi Zhongguang and Xinghuan Technology experiencing declines exceeding 7% [1]
- Recent net capital inflow has significantly decreased, with approximately 12.71 million yuan on November 3, down from previous levels around 60 million yuan, reflecting cautious market sentiment [1]
Group 2: Technological Insights
- From a technical-economic perspective, the Transformer model has created three structural benefits for AIGC:
  1. Scale effects on the research side, where a unified architecture allows for the reuse of underlying CUDA kernels and optimizations across various tasks, significantly reducing average training costs [3]
  2. Decreasing marginal costs on the deployment side, where the same inference engine can handle requests from any modality, enhancing GPU utilization and increasing output per unit of computing power [3]
  3. A "flywheel effect" on the data side, where multi-modal models continuously improve through high-quality data feedback, enhancing model accuracy and coverage [3]
- The Transformer model is expected to continue evolving towards a scale of trillions of parameters, integrating various modalities into a unified attention framework, thus supporting the upcoming Agent era with a foundational algorithmic base [3]
Group 3: Future Predictions
- Elon Musk predicts that within the next 5-6 years, traditional smartphones and apps will disappear, with most content being AI-generated, transforming user devices into AI inference nodes [2]
- Musk envisions a future where user devices will primarily serve as interfaces for AI communication, generating real-time content based on user preferences [2]
Group 4: Investment Opportunities
- The Sci-Tech Innovation Artificial Intelligence ETF closely tracks the Shanghai Stock Exchange's AI index, covering high-quality enterprises across the entire industry chain, benefiting from high R&D investment and policy support [3]
- The ETF's 20% price fluctuation limit and small-cap elasticity are positioned to capture significant moments in the AI industry [3]
DeepSeek: UE8M0 FP8 Is Designed for the Upcoming Next-Generation Domestic Chips
Zhitong Finance · 2025-08-21 08:23
Core Insights
- DeepSeek has released version 3.1, marking a significant step towards the "Agent Era" [1]
- The new version utilizes UE8M0 FP8 Scale parameter precision, indicating advancements in technology [1]
- There are notable adjustments in the tokenizer and chat template compared to the previous version, DeepSeek-V3 [1]
- The UE8M0 FP8 is specifically designed for an upcoming next-generation domestic chip [1][2]
Company Developments
- The official webpage, app, mini-program, and API platform have all been updated to incorporate the new model [2]
- Users have expressed anticipation for additional features, such as image and video functionality [2]
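The summary above does not explain what a UE8M0 scale is. Read literally, the name suggests an unsigned value with 8 exponent bits and 0 mantissa bits, meaning a scale factor that can only be a power of two. The sketch below illustrates that reading under stated assumptions: the bias of 127 and the use of 0xFF for NaN follow the E8M0 scale convention of the OCP Microscaling formats, and the function names are hypothetical. How DeepSeek-V3.1 actually applies the scale is not described in the source.

```python
import math

E8M0_BIAS = 127   # assumed bias (OCP Microscaling E8M0 convention)
E8M0_NAN = 0xFF   # assumed NaN encoding in that convention


def decode_ue8m0(byte: int) -> float:
    """Decode an exponent-only byte into its power-of-two scale."""
    if not 0 <= byte <= 0xFF:
        raise ValueError("expected a single byte")
    if byte == E8M0_NAN:
        return math.nan
    return math.ldexp(1.0, byte - E8M0_BIAS)  # 2 ** (byte - 127)


def encode_ue8m0(scale: float) -> int:
    """Round a positive scale up to the next power of two and encode it."""
    if not (scale > 0 and math.isfinite(scale)):
        raise ValueError("scale must be positive and finite")
    mantissa, exponent = math.frexp(scale)  # scale == mantissa * 2**exponent
    if mantissa == 0.5:                     # already an exact power of two
        exponent -= 1
    return max(0, min(0xFE, exponent + E8M0_BIAS))  # clamp to finite codes


if __name__ == "__main__":
    for value in (0.75, 1.0, 3.0, 8.0):
        code = encode_ue8m0(value)
        print(value, "->", code, "->", decode_ue8m0(code))
```

Rounding up rather than to nearest is a design choice in this sketch: a scale that is never smaller than requested keeps the scaled values from overflowing the narrow FP8 range.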
DeepSeek-V3.1 Officially Released: The First Step Toward the Agent Era
Hua Er Jie Jian Wen· 2025-08-21 06:39
Group 1
- DeepSeek officially released DeepSeek-V3.1, featuring a hybrid reasoning architecture that supports both thinking and non-thinking modes [1]
- The new version, DeepSeek-V3.1-Think, offers higher thinking efficiency, providing answers in a shorter time compared to DeepSeek-R1-0528 [1]
- Enhanced agent capabilities have been achieved through post-training optimization, significantly improving performance in tool usage and intelligent tasks [1]
Group 2
- Starting from September 6, 2025, the pricing for API calls on the DeepSeek open platform will be adjusted: input will cost 0.5 yuan per million tokens on a cache hit and 4 yuan per million tokens on a cache miss, while output will cost 12 yuan per million tokens [1]
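To make the pricing concrete, the following is a minimal cost estimate that takes 0.5 yuan per million tokens for cache-hit input, 4 yuan for cache-miss input, and 12 yuan for output. The function name and the sample token counts are hypothetical, chosen only for illustration, and actual billing rules (rounding, minimum charges) are not covered by the source.

```python
# Prices in yuan per million tokens, as quoted in the summary above.
PRICE_INPUT_CACHE_HIT = 0.5
PRICE_INPUT_CACHE_MISS = 4.0
PRICE_OUTPUT = 12.0
TOKENS_PER_PRICING_UNIT = 1_000_000


def estimate_cost_yuan(hit_tokens: int, miss_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of a workload from its three token counts."""
    if min(hit_tokens, miss_tokens, output_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        hit_tokens * PRICE_INPUT_CACHE_HIT
        + miss_tokens * PRICE_INPUT_CACHE_MISS
        + output_tokens * PRICE_OUTPUT
    )
    return total / TOKENS_PER_PRICING_UNIT


if __name__ == "__main__":
    # Hypothetical workload: 2M cached input, 1M uncached input, 0.5M output.
    print(estimate_cost_yuan(2_000_000, 1_000_000, 500_000))  # 1 + 4 + 6 = 11.0
```

At these prices, output tokens dominate the bill for generation-heavy workloads, while a high cache-hit ratio cuts input cost by a factor of eight.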
With Musk's Enthusiastic Likes, What Makes Lovart the World's First Design Agent?
Sou Hu Cai Jing· 2025-07-12 05:18
Core Insights
- Lovart, also known as "星流AI" in China, has rapidly gained attention in the AI application field, with significant engagement on social media and a surge of users seeking trial invitations [1][3]
- The emergence of Lovart signifies a shift from traditional AI tools to a new model of creative collaboration, redefining the relationship between creators and AI [3][19]
Group 1: Old World Challenges
- The previous generation of AI tools, referred to as AIGC 1.0, only addressed the initial stages of the creative process, leaving creators to handle the majority of integration and editing tasks manually [6]
- The introduction of workflow tools like ComfyUI marked the AIGC 2.0 era, but their complexity deterred most designers, making them more suitable for AI experts rather than general creators [6][7]
Group 2: New Model Introduction
- Lovart's founder, Chen Mian, identified that creators need a comprehensive solution rather than just advanced tools, likening the new model to a "chef team" that handles all aspects of creative work [7][8]
- The core idea of Lovart is to transform AI from a mere tool into a "Creator Team," allowing users to act as clients who provide input while AI manages the execution [8][19]
Group 3: Interaction Redefined
- Lovart's product design emphasizes a natural interaction model, using a metaphor of a "table" where creators can easily communicate their needs and see the results in real-time [9][11]
- The interface consists of a large canvas for visual work and a dialogue box for user instructions, streamlining the creative process and enhancing user experience [10][11]
Group 4: Market Positioning
- Lovart strategically targets the overlooked "creative individual" and professional consumer segments, avoiding direct competition with industry giants like Adobe and Midjourney [14]
- The company focuses on creating unique user experiences by integrating domain knowledge with AI capabilities, rather than simply improving existing tools [14][15]
Group 5: Future Outlook
- Lovart is positioned at the forefront of the emerging Agent era, which is expected to revolutionize the creative industry by enhancing collaboration and efficiency [15][19]
- The founder believes that the true potential of AI lies in its ability to replace not just individual tools but entire collaborative teams, fundamentally changing the creative landscape [19][21]
HDC2025 | Huawei Releases the Harmony Agent Framework White Paper, Fully Entering the Agent Era
Sou Hu Cai Jing· 2025-06-23 07:20
Core Viewpoint
- Huawei's Developer Conference 2025 (HDC2025) introduced the Harmony Agent Framework (HMAF) and the white paper titled "Agent Era, Harmony Applications Born Intelligent," marking a significant step towards integrating intelligent agents into the Harmony ecosystem [1][3].
Group 1: Harmony Agent Framework
- The Harmony Agent Framework establishes a new value network for intelligent agents, providing structural support for the intelligent upgrade of third-party applications [6].
- Key components include new interaction methods for agents, upgraded protocols, efficient development processes, and enhanced security, facilitating deep collaboration between applications and agents within the Harmony system [6].
Group 2: Xiaoyi Intelligent Agent Open Platform
- The Xiaoyi Intelligent Agent Open Platform aims to accelerate the evolution of Harmony applications into intelligent agents, offering comprehensive solutions that support various development models [9].
- Developers can access over 50 Harmony system plugins and utilize an upgraded intent framework to quickly implement multi-agent collaboration across different scenarios [12].
Group 3: Launch of Pioneer Intelligent Agents
- The first batch of over 50 pioneer Harmony intelligent agents is set to launch, featuring capabilities such as generating clothing recommendations based on weather data and creating playlists through voice commands [13][15].
- Notable partners like Shenzhen Airlines and Ximalaya are also developing their intelligent agents, indicating a rapid expansion of the ecosystem [15].
Group 4: AI Capabilities and Integration
- The Harmony system has integrated AI capabilities into over 4,000 applications, with 240+ standard intents connected to more than 470 services, enhancing the system's intelligent interaction capabilities [17].
- Xiaoyi has evolved into a more capable intelligent agent, offering features like real-time dialogue, AI photo editing, and contextual memory across various devices [17].
Group 5: Developer Engagement and Ecosystem Building
- The Harmony Agent Framework and Xiaoyi Intelligent Agent Open Platform empower developers to easily access intelligent agent services, fostering a collaborative ecosystem for global developers and enterprises [19].