Workflow
智能体
icon
Search documents
从高考到实战,豆包大模型交卷了
机器之心· 2025-06-12 06:08
机器之心报道 编辑:Panda 高考余热尚在,依然还是有不少博主和媒体在测试各家 AI 模型解答最新高考题的能力。而现在,一个正被火热评测的主流模型迎来了重磅升级! 在 AI 基础设施方面,火山引擎围绕 AI 应用的实际需求,打造了三个套件: AgentKit 、 TrainingKit 和 ServingKit 。顾名思义,这三个套件的作用分别是智能 体、模型训练和模型 serving。 整场 Force 大会下来,我们最大的感受是火山引擎对「 AI 云原生 」概念的反复强调。简单来说,「AI 云原生」是指将云原生理念应用到 AI 领域,专注于 AI 工 作负载的云端构建和部署;其核心理念是以 AI 负载为中心,重新优化计算、存储与网络架构,让 GPU 可直接访问存储和数据库,降低 IO 延迟,并提供高速互联 和端到端安全保护。事实上,正如我们以前多次报道的那样,「AI 云原生」已经成为火山引擎「下一个十年」所遵循的云计算新范式。 事情发生在昨天、北京国家会议中心、火山引擎原动力大会 Forc e 202 5 现场。 火山引擎一口气发布了豆包大模型 1.6、Seedance 1.0 Pro、AI 云原生平台 ...
豆包最新版模型降价2/3 ,对话谭待:降价促进AI智能体普及
Bei Ke Cai Jing· 2025-06-12 05:05
Core Viewpoint - OpenAI and ByteDance have launched their latest models on the same day, with significant price reductions announced for their respective models, aiming to accelerate the development of AI applications in the industry [1][2]. Group 1: Pricing Strategy - OpenAI's o3 model has seen an 80% price reduction, while ByteDance's new model, Doubao 1.6, is priced at one-third of its previous version [1]. - The pricing strategy is aimed at releasing technological dividends to benefit developers and enterprises, thereby accelerating AI application development [2]. - The price war initiated by DeepSeek in May 2024 has led to rapid follow-up price reductions from other models, including Doubao [2]. Group 2: Cost Structure and Model Capabilities - The cost of large models is primarily influenced by the context length rather than the model's capabilities such as deep thinking or multimodal support [3][4]. - Doubao 1.6 integrates various capabilities into a unified pricing model, optimizing based on context length distribution [4]. - The pricing for Doubao 1.6 is structured as follows: for input ranges of 32K-128K, the cost is 1.2 yuan per million tokens for input and 16 yuan for output; for 128K-256K, it is 2.4 yuan for input and 24 yuan for output [6]. Group 3: Market Impact and Future Outlook - The majority of enterprises currently utilize large models within the 0-32K range, where Doubao 1.6's pricing is significantly lower than previous models, leading to a 63% cost reduction [6]. - The enhancements in Doubao 1.6's capabilities, including support for 256K context, are crucial for building complex intelligent agents [6]. - The company anticipates that 2024 will be the year of large model applications in China, while 2025 may see the widespread adoption of intelligent agents across various business processes [7].
企业级AI新赛道:字节跳动HiAgent如何与扣子协同布局?
Sou Hu Cai Jing· 2025-06-11 15:33
Ground Truth數据准备 品质优化 安全加固 合规治 Prompt优化 AB试验 场景模型 成本分析与优化 知识库治理 火山引擎总裁谭待在会上表示,深度思考、多模态和工具调用等模型能力的提升,是构建企业级智能体的关键。同时,为了推动智能体的规模化应用,降低 模型使用成本也至关重要。因此,豆包1.6按"输入长度"区间定价,使得企业能够以更低的成本享受到高级模型的能力。 事实上,大模型技术要真正进入企业的核心生产场景,并非易事。除了模型能力的提升外,还需要大量的工程化实践。火山引擎副总裁张鑫透露,一年多 前,HiAgent企业级智能体开发平台就推出了首个版本。如今,该平台已进行了大版本更新,旨在进一步补全企业级智能体的版图。 在科技领域,一项新技术的诞生总是伴随着质疑与探索。当大模型技术初次亮相时,业界普遍对其能否应用于企业的生产环境持保留态度,特别是其自带 的"幻觉"特性,让人对其准确性产生疑虑。毕竟,与聊天机器人不同,企业生产环境需要的是确定性和高效性,而非随意的"幻觉"。 然而,随着技术的不断进步,这些疑虑逐渐消散。2025年6月11日,字节跳动旗下的火山引擎在Force原动力大会上,发布了豆包大模 ...
腾讯研究院AI速递 20250612
腾讯研究院· 2025-06-11 14:31
生成式AI 一、 OpenAI发布推理新王o3-pro,网友实测:很强,也很慢 1. OpenAI发布推理新模型o3-pro,以推理能力最强、速度最慢为卖点,输入价格20美元/百 万tokens,输出80美元/百万tokens; 2. 在科学分析、写作、编程和数据分析领域,o3-pro比o3领先约14%,但在ARC-AGI-2测 试中几乎无提升,成本却大幅增加; 3. 用户测试显示o3-pro擅长复杂推理任务且环境感知能力强,但推理速度极慢,不适合简单 问题,主要面向专业用户。 https://mp.weixin.qq.com/s/xp_7KNVxYS7zLeBUOcWqNA 二、 Mistral首个开源强推理模型Magistral ,对比手快10倍 1. Mistral AI发布强推理模型Magistral,包括企业版Medium和开源版Small(24B参数),在 AIME2024等多项测试中表现优异; 2. Magistral通过自主研发的可扩展强化学习流水线实现多语言保真推理,适用于英法西德意 阿俄中等语言; 3. 利用Flash Answers技术,Magistral Medium实现比竞品快10倍 ...
豆包大模型的降价逻辑变了
Bei Jing Shang Bao· 2025-06-11 12:58
Core Insights - The launch of the Doubao 1.6 model by ByteDance at the Volcano Engine Force Conference marks a significant price reduction strategy, positioning it as the lowest-priced model in the industry, following a similar move a year prior with Doubao 1.5 [2][3] - The new pricing strategy based on "input length" reflects the evolving trends in large model development and the increasing demand for deep thinking capabilities and multimodal models [2][4] Pricing Strategy - The Doubao 1.6 model's pricing is set at 0.8 yuan per million tokens for input and 8 yuan per million tokens for output, which is one-third the cost of the previous Doubao 1.5 deep thinking model [3][5] - The Seedance 1.0 pro model is priced at 0.015 yuan per thousand tokens, with a cost of 3.67 yuan for generating a 5-second 1080P video [3][5] - The new pricing model aims to optimize costs for enterprises, allowing them to benefit from technological advancements and accelerate their AI development [4] Model Capabilities - Doubao 1.6 supports multimodal understanding and graphical interface operations, enabling it to handle real-world problems effectively [3] - The model has demonstrated its capabilities in various applications, including e-commerce image recognition and automated hotel booking [3] - The Doubao model's daily token usage has surged to over 16.4 trillion, a 137-fold increase since its initial release in May 2024 [5] Market Context - The competitive landscape in the large model sector is characterized by a price war, with significant enthusiasm for intelligent agents overshadowing ongoing price competition [3] - According to IDC, the total token usage on public clouds in China is projected to reach 114.2 trillion tokens in 2024, with Volcano Engine, Baidu Cloud, and Alibaba Cloud leading the market [5]
黄仁勋巴黎演讲:AI的下一波浪潮是机器人,数据中心将成为“AI工厂”
Feng Huang Wang· 2025-06-11 11:46
Core Insights - AI technology is fundamentally reshaping the future of computing and industry, marking the arrival of a new industrial revolution driven by "AI factories" [1] - Traditional data centers are evolving into AI factories that generate "intelligent tokens," providing power across various industries [1] - NVIDIA's new architecture, Blackwell, is designed to meet the increasing inference demands of AI models, achieving a significant performance leap [1] Group 1 - Huang Renxun predicts the next phase of AI, termed Agentic AI, which will understand tasks, reason, plan, and execute complex tasks, with robots as its physical embodiment [2] - The demonstration of a robot named "Greg" showcased the ability to learn and interact within a digital twin environment before being deployed in the physical world [2] - Major companies like BMW, Mercedes-Benz, and Toyota are utilizing Omniverse to create digital twins of their factories or products [2] Group 2 - NVIDIA has made significant progress in quantum computing, viewing it as a pivotal moment, and plans to connect quantum processors (QPU) with GPUs for enhanced computational tasks [2] - The entire cuQuantum quantum computing algorithm stack is now capable of accelerating on the Grace Blackwell system [2] - Huang Renxun emphasized deep collaboration with European partners, including the establishment of a large AI cloud with French company Mistral and partnerships with Schneider Electric for future AI factory design [2] Group 3 - NVIDIA is establishing AI technology centers in seven different countries to promote local ecosystem development and collaborative research [3] - A new computing era has begun, with NVIDIA providing a full-stack platform from chips to software and AI models to empower global developers and enterprises [3]
三六零周鸿祎:一个员工领导100个智能体将成常态
news flash· 2025-06-11 11:14
Core Viewpoint - The future will see employees managing multiple AI agents, leading to the emergence of "super individuals" and "super companies" with a high ratio of digital employees [1] Group 1 - On June 11, 360 (601360) held a launch event for its "Nano AI Super Search Intelligent Agent" [1] - Zhou Hongyi emphasized that it will become common for one employee to lead 100 intelligent agents [1] - Companies with a high proportion of digital employees are expected to become "super companies" [1]
银行业智能化转型:AI智能体的变革力量与未来展望 | 金融与科技
清华金融评论· 2025-06-11 10:51
Core Viewpoint - The development of AI agents is transforming the banking industry, enhancing operational efficiency and creating new growth opportunities, despite facing multiple challenges in deployment [2][3][9]. Group 1: AI Agent Overview - AI agents are intelligent entities capable of perceiving their environment, making decisions, and taking actions to achieve specific goals, marking a shift from basic functions to complex task execution [5][6]. - The architecture of AI agents typically includes four core modules: perception, decision-making, execution, and learning, each serving distinct functions [6]. Group 2: Applications in Banking - AI agents are being integrated into various banking functions, including customer service, wealth management, risk management, and operational efficiency [10][12][13]. - Examples include intelligent customer service agents like "工小智" and "招小宝" in China, and "Erica" in the US, which enhance customer interaction and operational efficiency [10][12]. Group 3: Implementation Challenges - Banks face challenges such as data privacy and security requirements, algorithmic bias, integration with existing IT infrastructure, and regulatory compliance [3][15][16]. - The need for a gradual and phased approach to implementing AI agents is emphasized to manage risks effectively while maximizing benefits [22][24]. Group 4: Strategic Development Path - The strategic implementation of AI agents in banks is proposed in four phases: focusing on cost reduction and efficiency, enhancing risk management, improving research capabilities, and driving business growth [22][24]. - Each phase aims to build foundational capabilities that support the overall transformation and innovation within the banking sector [22][24]. Group 5: Future Trends - Future developments in AI agents will include multi-modal interactions, deeper integration of generative AI, and the establishment of collaborative networks among different agents [26][27]. - The focus will also be on building trustworthy and responsible AI frameworks to ensure sustainable application and user trust [27].
区域型银行如何实现AI战略突围?
麦肯锡· 2025-06-11 09:24
Core Viewpoint - The competition for generative AI in regional banks has shifted from technological exploration to value realization, making it essential for these banks to capture AI value and implement applications effectively [1]. Group 1: Current State of Generative AI in Banking - Generative AI applications are expanding from internal use to client-facing services, transforming operational models and customer service methods within banks [2]. - The emergence of multi-agent systems is providing comprehensive solutions that can cover complex processes, allowing generative AI agents to act as virtual colleagues [3]. Group 2: Impact on Profitability - Generative AI is expected to significantly enhance productivity across industries, with banking projected to see a potential productivity increase of $200 billion to $340 billion, translating to a 14%-24% potential profit increase, which could rise to 60%-80% over the next three years [4]. Group 3: Challenges in AI Adoption - Despite the apparent technological benefits, regional banks face significant barriers to large-scale AI application, including data silos and a shortage of hybrid talent, with an estimated talent gap of 5 million in China by 2030 [7]. - Regional banks must address three core questions: how to focus on high-value scenarios with limited resources, how to balance short-term wins with long-term strategies, and how to manage innovation and ecosystem collaboration [7]. Group 4: High-Value AI Application Scenarios - Six high-value AI application scenarios are emerging as key areas for regional banks to leverage AI capabilities, transitioning from experimental phases to growth drivers [8]. - These scenarios include credit risk management, customer relationship management, software development efficiency, intelligent customer service, hyper-personalized services, and knowledge management [10]. Group 5: Strategic Pathways for Regional Banks - Regional banks must choose between three strategic models: "builders" who deeply reconstruct core business, "innovators" who enhance middle and back-office processes, and "adopters" who focus on efficiency improvements [14]. - A comprehensive AI transformation framework is necessary, integrating AI with overall business strategy and ensuring that AI investments are directly linked to financial metrics [15][16]. Group 6: Collaboration and Ecosystem Development - Finding suitable ecosystem partners is crucial for regional banks to quickly develop strategies and implement use cases, allowing them to leverage existing solutions and accelerate their AI adoption [17]. - The future of banking will see AI not just as a tool for efficiency but as a core competitive advantage for enhancing customer service, optimizing risk management, and improving operational resilience [18].
AI生物学家诞生!我国学者开发元生智能体,自主发现抗癌新靶点并设计验证实验,能力超越人类专家和主流大模型
生物世界· 2025-06-11 09:22
Core Viewpoint - The discovery and identification of therapeutic targets remain a critical bottleneck in drug development, with over 90% of candidate drugs failing in clinical development due to flawed initial hypotheses regarding biological function, disease relevance, or druggability [2][3]. Group 1: Target Discovery Challenges - Traditional target discovery relies on disease biologists integrating various independent biomedical data to form testable hypotheses, which is a slow and costly process, often exceeding $2 million per target [2][3]. - The failure rate in clinical development is largely attributed to issues with the selected targets rather than the compounds themselves [2]. Group 2: Introduction of OriGene - A new multi-agent virtual disease biologist system named "OriGene" has been developed, focusing on target discovery and clinical translation value assessment, outperforming human experts and leading AI models in target discovery capabilities [2][3][9]. - OriGene autonomously discovered new targets for liver cancer and colorectal cancer, demonstrating its ability to generate original targets validated through experiments [3][27]. Group 3: System Features and Functionality - OriGene integrates over 500 expert tools and organized biomedical databases, supporting multi-modal reasoning across genomics, transcriptomics, proteomics, phenomics, and pharmacology [11][12]. - The system features a multi-agent collaborative decision-making architecture, including a Coordinator Agent, Planning Agent, Reasoning Agent, Critic Agent, and Reporting Agent, enabling a closed-loop autonomous scientific decision-making process [12][13]. Group 4: Performance Evaluation - A specialized benchmark test set for target discovery, TRQA, was created, covering 1,921 multi-dimensional validation questions, demonstrating OriGene's superior performance in accuracy, recall, and robustness compared to human experts and other AI models [18][21]. - The system's self-evolving capabilities allow it to improve its reasoning ability over time through iterative learning and feedback from experiments [14][16]. Group 5: Practical Validation - In liver cancer, OriGene identified G protein-coupled receptor GPR160 as a key target, showing significant expression in cancer tissues and potential as a new immune checkpoint [23]. - For colorectal cancer, the system selected arginase ARG2 as a target, confirming its high expression in cancer tissues and demonstrating effective tumor suppression in patient-derived organoid models [25][27]. Group 6: Implications for Drug Development - The research signifies a major advancement in using AI to accelerate therapeutic target discovery, providing a scalable and adaptable platform for identifying mechanism-based treatment targets [27]. - As generative AI models and biomedical data resources mature, frameworks like OriGene are expected to facilitate AI-driven end-to-end drug discovery, enhancing the potential for precision medicine [27].