Workflow
智能体
icon
Search documents
企业级AI新赛道:字节跳动HiAgent如何与扣子协同布局?
Sou Hu Cai Jing· 2025-06-11 15:33
Ground Truth數据准备 品质优化 安全加固 合规治 Prompt优化 AB试验 场景模型 成本分析与优化 知识库治理 火山引擎总裁谭待在会上表示,深度思考、多模态和工具调用等模型能力的提升,是构建企业级智能体的关键。同时,为了推动智能体的规模化应用,降低 模型使用成本也至关重要。因此,豆包1.6按"输入长度"区间定价,使得企业能够以更低的成本享受到高级模型的能力。 事实上,大模型技术要真正进入企业的核心生产场景,并非易事。除了模型能力的提升外,还需要大量的工程化实践。火山引擎副总裁张鑫透露,一年多 前,HiAgent企业级智能体开发平台就推出了首个版本。如今,该平台已进行了大版本更新,旨在进一步补全企业级智能体的版图。 在科技领域,一项新技术的诞生总是伴随着质疑与探索。当大模型技术初次亮相时,业界普遍对其能否应用于企业的生产环境持保留态度,特别是其自带 的"幻觉"特性,让人对其准确性产生疑虑。毕竟,与聊天机器人不同,企业生产环境需要的是确定性和高效性,而非随意的"幻觉"。 然而,随着技术的不断进步,这些疑虑逐渐消散。2025年6月11日,字节跳动旗下的火山引擎在Force原动力大会上,发布了豆包大模 ...
腾讯研究院AI速递 20250612
腾讯研究院· 2025-06-11 14:31
生成式AI 一、 OpenAI发布推理新王o3-pro,网友实测:很强,也很慢 1. OpenAI发布推理新模型o3-pro,以推理能力最强、速度最慢为卖点,输入价格20美元/百 万tokens,输出80美元/百万tokens; 2. 在科学分析、写作、编程和数据分析领域,o3-pro比o3领先约14%,但在ARC-AGI-2测 试中几乎无提升,成本却大幅增加; 3. 用户测试显示o3-pro擅长复杂推理任务且环境感知能力强,但推理速度极慢,不适合简单 问题,主要面向专业用户。 https://mp.weixin.qq.com/s/xp_7KNVxYS7zLeBUOcWqNA 二、 Mistral首个开源强推理模型Magistral ,对比手快10倍 1. Mistral AI发布强推理模型Magistral,包括企业版Medium和开源版Small(24B参数),在 AIME2024等多项测试中表现优异; 2. Magistral通过自主研发的可扩展强化学习流水线实现多语言保真推理,适用于英法西德意 阿俄中等语言; 3. 利用Flash Answers技术,Magistral Medium实现比竞品快10倍 ...
豆包大模型的降价逻辑变了
Bei Jing Shang Bao· 2025-06-11 12:58
Core Insights - The launch of the Doubao 1.6 model by ByteDance at the Volcano Engine Force Conference marks a significant price reduction strategy, positioning it as the lowest-priced model in the industry, following a similar move a year prior with Doubao 1.5 [2][3] - The new pricing strategy based on "input length" reflects the evolving trends in large model development and the increasing demand for deep thinking capabilities and multimodal models [2][4] Pricing Strategy - The Doubao 1.6 model's pricing is set at 0.8 yuan per million tokens for input and 8 yuan per million tokens for output, which is one-third the cost of the previous Doubao 1.5 deep thinking model [3][5] - The Seedance 1.0 pro model is priced at 0.015 yuan per thousand tokens, with a cost of 3.67 yuan for generating a 5-second 1080P video [3][5] - The new pricing model aims to optimize costs for enterprises, allowing them to benefit from technological advancements and accelerate their AI development [4] Model Capabilities - Doubao 1.6 supports multimodal understanding and graphical interface operations, enabling it to handle real-world problems effectively [3] - The model has demonstrated its capabilities in various applications, including e-commerce image recognition and automated hotel booking [3] - The Doubao model's daily token usage has surged to over 16.4 trillion, a 137-fold increase since its initial release in May 2024 [5] Market Context - The competitive landscape in the large model sector is characterized by a price war, with significant enthusiasm for intelligent agents overshadowing ongoing price competition [3] - According to IDC, the total token usage on public clouds in China is projected to reach 114.2 trillion tokens in 2024, with Volcano Engine, Baidu Cloud, and Alibaba Cloud leading the market [5]
黄仁勋巴黎演讲:AI的下一波浪潮是机器人,数据中心将成为“AI工厂”
Feng Huang Wang· 2025-06-11 11:46
Core Insights - AI technology is fundamentally reshaping the future of computing and industry, marking the arrival of a new industrial revolution driven by "AI factories" [1] - Traditional data centers are evolving into AI factories that generate "intelligent tokens," providing power across various industries [1] - NVIDIA's new architecture, Blackwell, is designed to meet the increasing inference demands of AI models, achieving a significant performance leap [1] Group 1 - Huang Renxun predicts the next phase of AI, termed Agentic AI, which will understand tasks, reason, plan, and execute complex tasks, with robots as its physical embodiment [2] - The demonstration of a robot named "Greg" showcased the ability to learn and interact within a digital twin environment before being deployed in the physical world [2] - Major companies like BMW, Mercedes-Benz, and Toyota are utilizing Omniverse to create digital twins of their factories or products [2] Group 2 - NVIDIA has made significant progress in quantum computing, viewing it as a pivotal moment, and plans to connect quantum processors (QPU) with GPUs for enhanced computational tasks [2] - The entire cuQuantum quantum computing algorithm stack is now capable of accelerating on the Grace Blackwell system [2] - Huang Renxun emphasized deep collaboration with European partners, including the establishment of a large AI cloud with French company Mistral and partnerships with Schneider Electric for future AI factory design [2] Group 3 - NVIDIA is establishing AI technology centers in seven different countries to promote local ecosystem development and collaborative research [3] - A new computing era has begun, with NVIDIA providing a full-stack platform from chips to software and AI models to empower global developers and enterprises [3]
三六零周鸿祎:一个员工领导100个智能体将成常态
news flash· 2025-06-11 11:14
Core Viewpoint - The future will see employees managing multiple AI agents, leading to the emergence of "super individuals" and "super companies" with a high ratio of digital employees [1] Group 1 - On June 11, 360 (601360) held a launch event for its "Nano AI Super Search Intelligent Agent" [1] - Zhou Hongyi emphasized that it will become common for one employee to lead 100 intelligent agents [1] - Companies with a high proportion of digital employees are expected to become "super companies" [1]
银行业智能化转型:AI智能体的变革力量与未来展望 | 金融与科技
清华金融评论· 2025-06-11 10:51
Core Viewpoint - The development of AI agents is transforming the banking industry, enhancing operational efficiency and creating new growth opportunities, despite facing multiple challenges in deployment [2][3][9]. Group 1: AI Agent Overview - AI agents are intelligent entities capable of perceiving their environment, making decisions, and taking actions to achieve specific goals, marking a shift from basic functions to complex task execution [5][6]. - The architecture of AI agents typically includes four core modules: perception, decision-making, execution, and learning, each serving distinct functions [6]. Group 2: Applications in Banking - AI agents are being integrated into various banking functions, including customer service, wealth management, risk management, and operational efficiency [10][12][13]. - Examples include intelligent customer service agents like "工小智" and "招小宝" in China, and "Erica" in the US, which enhance customer interaction and operational efficiency [10][12]. Group 3: Implementation Challenges - Banks face challenges such as data privacy and security requirements, algorithmic bias, integration with existing IT infrastructure, and regulatory compliance [3][15][16]. - The need for a gradual and phased approach to implementing AI agents is emphasized to manage risks effectively while maximizing benefits [22][24]. Group 4: Strategic Development Path - The strategic implementation of AI agents in banks is proposed in four phases: focusing on cost reduction and efficiency, enhancing risk management, improving research capabilities, and driving business growth [22][24]. - Each phase aims to build foundational capabilities that support the overall transformation and innovation within the banking sector [22][24]. Group 5: Future Trends - Future developments in AI agents will include multi-modal interactions, deeper integration of generative AI, and the establishment of collaborative networks among different agents [26][27]. - The focus will also be on building trustworthy and responsible AI frameworks to ensure sustainable application and user trust [27].
区域型银行如何实现AI战略突围?
麦肯锡· 2025-06-11 09:24
Core Viewpoint - The competition for generative AI in regional banks has shifted from technological exploration to value realization, making it essential for these banks to capture AI value and implement applications effectively [1]. Group 1: Current State of Generative AI in Banking - Generative AI applications are expanding from internal use to client-facing services, transforming operational models and customer service methods within banks [2]. - The emergence of multi-agent systems is providing comprehensive solutions that can cover complex processes, allowing generative AI agents to act as virtual colleagues [3]. Group 2: Impact on Profitability - Generative AI is expected to significantly enhance productivity across industries, with banking projected to see a potential productivity increase of $200 billion to $340 billion, translating to a 14%-24% potential profit increase, which could rise to 60%-80% over the next three years [4]. Group 3: Challenges in AI Adoption - Despite the apparent technological benefits, regional banks face significant barriers to large-scale AI application, including data silos and a shortage of hybrid talent, with an estimated talent gap of 5 million in China by 2030 [7]. - Regional banks must address three core questions: how to focus on high-value scenarios with limited resources, how to balance short-term wins with long-term strategies, and how to manage innovation and ecosystem collaboration [7]. Group 4: High-Value AI Application Scenarios - Six high-value AI application scenarios are emerging as key areas for regional banks to leverage AI capabilities, transitioning from experimental phases to growth drivers [8]. - These scenarios include credit risk management, customer relationship management, software development efficiency, intelligent customer service, hyper-personalized services, and knowledge management [10]. Group 5: Strategic Pathways for Regional Banks - Regional banks must choose between three strategic models: "builders" who deeply reconstruct core business, "innovators" who enhance middle and back-office processes, and "adopters" who focus on efficiency improvements [14]. - A comprehensive AI transformation framework is necessary, integrating AI with overall business strategy and ensuring that AI investments are directly linked to financial metrics [15][16]. Group 6: Collaboration and Ecosystem Development - Finding suitable ecosystem partners is crucial for regional banks to quickly develop strategies and implement use cases, allowing them to leverage existing solutions and accelerate their AI adoption [17]. - The future of banking will see AI not just as a tool for efficiency but as a core competitive advantage for enhancing customer service, optimizing risk management, and improving operational resilience [18].
AI生物学家诞生!我国学者开发元生智能体,自主发现抗癌新靶点并设计验证实验,能力超越人类专家和主流大模型
生物世界· 2025-06-11 09:22
Core Viewpoint - The discovery and identification of therapeutic targets remain a critical bottleneck in drug development, with over 90% of candidate drugs failing in clinical development due to flawed initial hypotheses regarding biological function, disease relevance, or druggability [2][3]. Group 1: Target Discovery Challenges - Traditional target discovery relies on disease biologists integrating various independent biomedical data to form testable hypotheses, which is a slow and costly process, often exceeding $2 million per target [2][3]. - The failure rate in clinical development is largely attributed to issues with the selected targets rather than the compounds themselves [2]. Group 2: Introduction of OriGene - A new multi-agent virtual disease biologist system named "OriGene" has been developed, focusing on target discovery and clinical translation value assessment, outperforming human experts and leading AI models in target discovery capabilities [2][3][9]. - OriGene autonomously discovered new targets for liver cancer and colorectal cancer, demonstrating its ability to generate original targets validated through experiments [3][27]. Group 3: System Features and Functionality - OriGene integrates over 500 expert tools and organized biomedical databases, supporting multi-modal reasoning across genomics, transcriptomics, proteomics, phenomics, and pharmacology [11][12]. - The system features a multi-agent collaborative decision-making architecture, including a Coordinator Agent, Planning Agent, Reasoning Agent, Critic Agent, and Reporting Agent, enabling a closed-loop autonomous scientific decision-making process [12][13]. Group 4: Performance Evaluation - A specialized benchmark test set for target discovery, TRQA, was created, covering 1,921 multi-dimensional validation questions, demonstrating OriGene's superior performance in accuracy, recall, and robustness compared to human experts and other AI models [18][21]. - The system's self-evolving capabilities allow it to improve its reasoning ability over time through iterative learning and feedback from experiments [14][16]. Group 5: Practical Validation - In liver cancer, OriGene identified G protein-coupled receptor GPR160 as a key target, showing significant expression in cancer tissues and potential as a new immune checkpoint [23]. - For colorectal cancer, the system selected arginase ARG2 as a target, confirming its high expression in cancer tissues and demonstrating effective tumor suppression in patient-derived organoid models [25][27]. Group 6: Implications for Drug Development - The research signifies a major advancement in using AI to accelerate therapeutic target discovery, providing a scalable and adaptable platform for identifying mechanism-based treatment targets [27]. - As generative AI models and biomedical data resources mature, frameworks like OriGene are expected to facilitate AI-driven end-to-end drug discovery, enhancing the potential for precision medicine [27].
GPTBots亮相WaytoAGI东京黑客松,展示企业级AI智能体创新落地成果
Ge Long Hui· 2025-06-11 08:16
Core Insights - GPTBots.ai successfully hosted the "WaytoAGI Global AI Conference - Tokyo 2025" hackathon, attracting over 300 developers from Japan and around the world, showcasing innovative solutions based on the GPTBots enterprise-level AI framework [1][2] Group 1: Hackathon Highlights - The hackathon featured four main competition areas: enterprise process automation, customer interaction, data analysis insights, and open innovation [2] - Notable projects included a marketing management platform for Web3 that analyzes social media sentiment, a nail design AI that reduces design time from hours to minutes, and a modular video production AI that cuts content production costs by 40% [2][4] Group 2: AI Framework Capabilities - The core capabilities of the GPTBots AI framework include large model integration, workflow orchestration, and RAG knowledge retrieval technology [5] - The framework supports practical case studies from customer service to data analysis, demonstrating scalable and secure enterprise AI deployment best practices [5] Group 3: Regional Innovation and Trends - The hackathon highlighted regional innovation differences, with Japanese teams focusing on retail services and Chinese developers excelling in Web3 and decentralized finance [7] - The event revealed three major trends in enterprise AI applications: automation of contract compliance, customized design, and market analysis and marketing optimization [8] Group 4: Accelerating AI Adoption - GPTBots facilitates rapid prototype development, secure deployment, and system integration, helping enterprises unlock AI value [8] - The event underscored the importance of scalable and adaptable enterprise AI solutions as a core competitive advantage for global companies aiming to enhance operational efficiency and drive innovation growth [8]
端到端GUI智能体首次实现“犯错-反思-修正”闭环,模拟人类认知全过程
量子位· 2025-06-11 08:07
端到端多模态GUI智能体有了"自我反思"能力!南洋理工大学MMLab团队提出框架GUI-Reflection。 随着多模态大模型的发展, 端到端GUI智能体 在手机、电脑等设备上的自动化任务中展示出巨大潜力。它们能够看懂设备屏幕,模拟人类去 点击按钮、输入文本,从而完成复杂的任务。 然而,当前端到端GUI多智能体的训练范式仍存在明显的瓶颈:当前模型往往使用几乎完美的离线演示轨迹进行训练,使得模型缺乏反思和改 正自身错误的能力,并进一步限制了通过在线强化学习激发和提升能力的可能。 GUI-Reflection 的核心思想是在智能体的各个训练阶段引入 "反思与纠错"机制 ,这一机制贯穿 预训练、监督微调和在线训练 全过程,模 拟了人类 "犯错→反思→重试" 的认知过程。 1. GUI预训练阶段: GUI-Reflection 团队 投稿 量子位 | 公众号 QbitAI 提出GUI-Reflection Task Suite任务套件, 将反思纠错能力进一步分解,让模型在预训练阶段框架让模型初步接触反思类任务,为后续打 下基础。 2. 离线监督微调阶段: 构建自动化数据管道,从已有离线无错轨迹中构建带有反思和纠错的 ...