Workflow
多智能体协作
icon
Search documents
「AI 100」榜单启动招募,AI产品“年会”不能停丨量子位智库
量子位· 2025-12-30 03:57
Core Insights - The article discusses the emergence of numerous keywords in the AI product sector by 2025, highlighting transformative AI products that are leading the market [4] - The "AI 100" list by Quantum Bit Think Tank aims to evaluate and recognize the top AI products in China, reflecting the industry's evolution and future trends [4][12] Group 1: AI 100 List Overview - The "AI 100" list is divided into three main categories: "Flagship AI 100," "Innovative AI 100," and the top three products in ten popular sub-sectors [6] - The "Flagship AI 100" will focus on the strongest AI products of 2025, showcasing those that have achieved significant technological breakthroughs and practical application value [7] - The "Innovative AI 100" aims to identify products that are expected to emerge in 2026, representing cutting-edge AI technology and potential industry disruptors [8] Group 2: Sub-sector Focus - The ten hottest sub-sectors for the top three products include AI Browser, AI Agent, AI Smart Assistant, AI Workbench, AI Creation, AI Education, AI Healthcare, AI Entertainment, Vibe Coding, and AI Consumer Hardware [9] Group 3: Application and Evaluation - The evaluation of the "AI 100" list employs a dual assessment system combining quantitative and qualitative measures, focusing on user data and expert evaluations [13] - Quantitative metrics include user scale, growth, activity, and retention, while qualitative assessments consider long-term potential, technology, market space, and user experience [13]
OxyGent 多智能体协作框架新版本发布
Sou Hu Cai Jing· 2025-12-18 17:10
Core Insights - OxyGent has released a new version of its multi-agent collaboration framework, introducing features such as multi-modal information transfer, fine-grained message control, MCP reconnection, and front-end streaming output [1] - The framework has been effectively utilized in various business scenarios within JD.com and external communities, supporting the implementation of agent technology and driving AI application development [1] Group 1: OxyGent Framework Features - OxyGent allows developers to flexibly combine and build multi-agent systems by abstracting Agents, Tools, LLMs, and Flows into pluggable atomic AI components [2] - The framework follows a Stateless design principle and incorporates AOP design concepts, providing extreme scalability and full-link decision traceability [2] - OxyGent enables dynamic planning based on defined permission relationships between agents, allowing real-time generation of actual call flow diagrams [2] Group 2: OxyGent Execution Lifecycle - The execution lifecycle of Oxy includes several steps to manage and coordinate different components within the multi-agent system, ensuring data is processed, recorded, saved, and sent at the right time [3][4][5][6][7][8][9] - Key steps include data preprocessing, logging tool calls, saving data, formatting input, executing main logic, and post-processing results [3][4][5][6][7][8][9] Group 3: Data Scopes in OxyGent - OxyGent provides four data scopes: Application, SessionGroup, Request, and Node, allowing flexible read/write operations to enhance data management efficiency and development convenience [11][12] Group 4: Practical Applications of OxyGent - OxyGent has been effectively implemented in JD.com's internal business scenarios, such as SOP processes, data analysis, tool invocation, and multi-level classification [14][15][16][17][18] - In SOP scenarios, the framework supports breaking down business processes into multiple agents, improving execution efficiency and facilitating process tracking [15] - In data analysis, OxyGent automates data collection, preprocessing, analysis, and visualization, enhancing the efficiency of data-driven scenarios [16] Group 5: Community Feedback and Use Cases - OxyGent has demonstrated usability and scalability through community developer feedback, showcasing applications in metric queries, automated tool invocation, flowchart generation, and slow SQL governance [19] - Notable use cases include natural language to SQL conversion, web information extraction, and automated slow SQL diagnosis, validating OxyGent's practicality as an enterprise-level open-source agent platform [19] Group 6: Developer Support and Community Engagement - The company has provided detailed tutorials for developers to quickly get started with OxyGent, covering the entire process from environment setup to distributed agent deployment [20] - Community engagement has led to the resolution of common issues and the emergence of innovative external cases and code contributions through competitions [21]
王江平详解如何破除AI科学发现“堰塞湖”
Zhong Guo Xin Wen Wang· 2025-12-16 08:21
Core Insights - The rapid growth of AI prediction results is not matched by human verification and industrialization capabilities, creating a "bottleneck" in scientific discovery application [3][4] - The disparity between the exponential increase in AI predictions and the linear growth of human validation leads to a significant backlog of unverified results [3] Group 1: Reasons for the Bottleneck - The limitations of predictive models, including insufficient logical reasoning, depth of knowledge, and the presence of black box issues and hallucination risks [3] - The absence of standards and evaluation systems makes it difficult to determine the accuracy and composability of numerous prediction results [3] - Insufficient experimental validation capabilities due to poor environmental adaptability, low cross-platform interoperability, and a lack of a closed-loop system for autonomous experiments [3] Group 2: Proposed Solutions - Strengthening the construction of datasets, high-value knowledge centers, and evaluation standards for AI prediction results to reduce redundancy and establish authoritative assessment systems [4] - Accelerating the development of AI autonomous laboratories by promoting open-source and modular approaches, and exploring hybrid augmented intelligence that involves human participation [5] - Enhancing pilot testing platforms to leverage China's application scenarios and foster engineering innovation, while promoting collaboration between academia and industry [5]
2025年AI智能体在未来产业创新上的前沿应用与发展趋势报告(1)
Sou Hu Cai Jing· 2025-12-02 21:04
Core Insights - The report outlines the evolution of AI from large language models (LLMs) to Agentic AI, emphasizing a shift towards a closed-loop system of perception, decision-making, action, and learning [1][6] - The global Agentic AI market is projected to grow from approximately $5.29 billion in 2024 to $46-47 billion by 2030, with a compound annual growth rate (CAGR) exceeding 40% [15] - Key industry applications include finance, healthcare, education, manufacturing, and collaborative office environments, with a significant transformation expected in organizational operations and employment structures by 2028 [25][28] Industry Trends - The transition from model intelligence to behavioral intelligence marks a significant macro trend in the AI industry, moving towards a focus on closed-loop systems [6] - The report identifies five major evolutionary trends in Agentic AI, including a shift from application-driven to ecosystem-driven models and from single-agent to multi-agent collaboration [29] - The anticipated inflection point for large-scale application of AI agents is 2025, with expectations that 33% of enterprise software will integrate AI agent functionalities by 2028 [23] Market Dynamics - North America is identified as the primary funding pool for Agentic AI, while Europe focuses on privacy compliance and efficiency tools, and China leans towards outbound application services [15] - The report highlights the emergence of ten innovative solutions in Agentic AI technology, including retrieval-augmented generation (RAG) and multi-agent collaboration [30][32] - The expected impact of Agentic AI on traditional industries includes a 40% reduction in operational costs and a 20% increase in revenue by 2028 [25] Employment and Skills - The rise of AI agents is expected to lead to job displacement in repetitive and rule-based roles, while simultaneously creating new positions in AI development, training, and maintenance [28] - There will be a shift in skill requirements, with increased demand for creativity, strategic thinking, and emotional intelligence [28] Technological Innovations - Future breakthroughs in Agentic AI are anticipated in areas such as multi-modal integration, enhanced autonomous decision-making, and improved collaboration capabilities among multiple agents [38] - The report emphasizes the importance of safety and risk governance, proposing strategies for reliability, compliance, and ethical considerations in AI deployment [10][12]
AI营销新前沿:AI直播脚本如何重塑万亿直播电商场
Sou Hu Cai Jing· 2025-11-21 08:50
深夜十二点,某新锐美妆品牌的直播运营负责人李诚,正对着电脑屏幕上三份大同小异的直播脚本叹气。 为了迎接即将到来的"618"大促,他的团队已经连续两周"三班倒",但产出的脚本依旧面临"爆款难复刻,创意已枯竭"的窘境。 点击、转化、停留时长,这些冰冷的数据指标像一座座大山,压得整个内容团队喘不过气。当晚,他第一次在团队工作群里抛出了那个在业内流传已久、却 又充满争议的问题:"AI写直播脚本,到底靠不靠谱?" 这个问题背后,是2025年直播电商行业即将面临的深刻矛盾。 据《2025中国直播电商生态白皮书》预测,届时中国直播电商市场规模将突破5万亿元大关,日均开播场次预计超过1000万场(蓝皮书·P12)。 然而,与之相对的是,内容生产的"人力作坊"模式已然触顶。海啸般的流量需求与小舢板式的内容产能之间的巨大鸿沟,正迫使整个行业寻找新的生产力引 擎。 即便如此,一个标准(3-4小时)直播脚本的平均生产周期仍在2-3天左右。面对瞬息万变的市场热点和用户反馈,这种迭代速度无异于"刻舟求剑"。 这不仅是对AI工具的疑问,更是对AI营销未来形态的探索。人工智能,尤其是以多智能体(Multi-Agents)为代表的AIGC技术 ...
成本暴降99%!万人大会系统全是AI生成的,Vibe Coding终于真上战场了
量子位· 2025-11-17 12:00
Core Insights - The article discusses the evolution of AI tools from being mere toys to becoming essential business solutions, exemplified by Baidu's "秒哒" platform which can generate complete applications from simple natural language inputs [1][2][3]. Group 1: AI Application Development - The "秒哒" platform has evolved to version 2.0, significantly reducing development costs by 99% compared to traditional methods [4]. - It allows users to create full-stack applications without writing code, integrating backend logic, databases, and payment systems seamlessly [6][7]. - The platform has already generated over 400,000 applications, indicating a strong demand for such tools [55]. Group 2: User Experience and Functionality - Users can create various applications, such as e-commerce platforms and games, in just a few minutes, showcasing the platform's ease of use [25][32]. - The platform supports a wide range of functionalities, including payment processing, image editing, and video generation, all without requiring additional development [23][48]. - Applications can be published directly to the internet and integrated with search engines for visibility [39][40]. Group 3: Technological Framework - The platform operates through a multi-agent collaboration system, where different AI agents handle various aspects of application development, mimicking a micro-development team [42]. - It leverages Baidu's ecosystem, allowing for easy integration of services like maps, SMS, and payment processing [44][46]. - Continuous upgrades to backend capabilities ensure that applications can handle complex data management and user interactions effectively [48]. Group 4: Market Potential and Community Engagement - The platform targets a broad audience, enabling individuals without coding skills to transform their ideas into functional applications, thus tapping into a previously underserved market [56]. - Baidu has initiated a hackathon to encourage non-programmers to create innovative applications, further expanding the community around the platform [58]. - The international version, MeDo, has also gained traction, indicating the global appeal of such AI-driven development tools [70].
硅谷风投正集体押注一批“反叛”的AI实验室,一个月砸下25亿美元,AI研究需要巨头体系外的新范式
Xi Niu Cai Jing· 2025-11-13 07:43
Core Insights - A new wave of investment is emerging in "AI laboratories," referred to as neolabs, which aim to redefine AI research paradigms rather than replicate the paths of giants like OpenAI and Anthropic [1] - Five neolab startups have raised or negotiated up to $2.5 billion in funding within the past month, indicating a significant shift in capital allocation towards fundamental research [1] - The giants' dominance has created a paradox where their scale and processes hinder rapid experimentation, presenting an opportunity for smaller, agile teams to explore innovative theories [1] Neolab Startups - Isara, founded by former OpenAI researcher Eddie Zhang, is developing a software system for thousands of AI agents to collaborate on complex tasks, with a target valuation of $1 billion [2] - Humans&, founded by ex-xAI researcher Eric Zelikman, aims to create emotionally intelligent AI and is in discussions for $1 billion funding at a $4 billion valuation [3] - Periodic Labs, founded by a former OpenAI research head, focuses on automating scientific research, while Reflection AI, founded by ex-DeepMind researchers, challenges the closed-source model of giants [6] Investment Trends - Investors are drawn to neolabs not only out of curiosity but also because they offer a "safer risk" profile, with the potential for a "half-exit" by selling to giants like Amazon or Microsoft [5] - The trend indicates a shift from a competition of singular capabilities to a focus on multi-agent collaboration, long-term learning, and explainability in AI research [6] Challenges Ahead - The high cost of computing resources remains a significant challenge for neolabs, as giants dominate the high-end GPU supply chain [7] - There is a lack of mature evaluation systems for long-term tasks and agent collaboration quality, complicating the assessment of these new AI systems [7] - Neolabs must establish viable business models that connect foundational research to industry applications, ensuring a closed loop of "research-product-revenue" to avoid becoming mere incubators for larger companies [7]
艾瑞咨询:2025年中国营销智能体研究报告
Sou Hu Cai Jing· 2025-11-04 14:11
Core Insights - The report by iResearch focuses on the development of marketing intelligence agents, which utilize generative AI or machine learning algorithms to automate marketing tasks, highlighting their transformative value in the marketing sector [1] Group 1: Development Background - The global marketing environment is undergoing three significant changes: accelerated iteration of platform advertising rules, increased privacy regulations, and rising digital marketing investments, with digital channels expected to account for 61.1% of marketing spend by 2025 [8][12] - Chinese companies face challenges in overseas marketing due to cultural differences, complex channels, compliance, and cross-border payment issues, which marketing intelligence agents can help address through multilingual content generation and compliance checks [13][15] Group 2: Technological Evolution - Marketing tools have evolved from single advertising platforms to intelligent agents capable of market insights, content generation, ad optimization, and performance reporting, enabling cross-channel automation [10][24] - The key capabilities of marketing intelligence agents include market insights, content generation, ad optimization, and performance evaluation, which collectively enhance marketing efficiency and decision-making quality [24] Group 3: Industry Ecosystem - The ecosystem consists of upstream technology providers (both domestic and international), advertising channels, midstream toolchain companies, and downstream sectors focusing on cross-border e-commerce, brands, and gaming [1][32] - Major players in the ecosystem include domestic models like Wenxin Yiyan and international models like ChatGPT, with advertising channels such as Douyin and Google Ads serving as platforms for deployment [1][32] Group 4: Business Models - The primary business models in this sector include revenue sharing from ad placements, subscription models, and value-added services such as creative production and consulting [1][29] - The market for intelligent marketing agents in China is expected to exceed 100 billion yuan by 2030, indicating significant growth potential [1] Group 5: Benchmark Cases - Notable examples of marketing intelligence applications include Meta's Advantage+ automated advertising product, which streamlines the entire shopping and app advertising process, and Tiandong Technology's Navos marketing AI Agent, which optimizes market analysis and ad placement [1][15]
达摩院推出多智能体框架ReasonMed,打造医学推理数据生成新范式
机器之心· 2025-11-03 04:04
Core Insights - The article discusses the development of ReasonMed, a new paradigm for generating high-quality medical reasoning data, addressing the challenges in constructing large-scale medical reasoning datasets [2][3][27]. Data Challenges - There is a scarcity of high-quality medical reasoning data, with existing datasets being limited in scale and lacking a systematic pipeline for large-scale construction [2]. - Current datasets often rely on a single model for generation, failing to leverage diverse knowledge domains from multiple pre-trained models [2]. - The cost of constructing high-quality medical reasoning datasets is prohibitively high, requiring significant computational and human resources [2]. ReasonMed Framework - ReasonMed integrates knowledge from four authoritative medical question benchmarks, aggregating approximately 195,000 medical questions across various specialties [3]. - The framework employs multiple proprietary models to collaboratively generate and validate medical reasoning paths, enhancing knowledge coverage and logical consistency [3]. - A multi-agent interaction system is designed to validate and optimize reasoning data across multiple dimensions, balancing quality and cost [3]. Data Generation Process - The data generation process consists of three main steps: data collection, multi-agent reasoning generation and validation, and layered optimization and refinement [12]. - ReasonMed has successfully generated a dataset of 370,000 high-quality medical reasoning samples, significantly outperforming existing public datasets in quality metrics [13]. Model Performance - Models trained on the ReasonMed dataset, such as ReasonMed-7B and ReasonMed-14B, have demonstrated superior performance on various authoritative medical question benchmarks, achieving an accuracy of 82.0% on PubMedQA, surpassing larger models like LLaMA3.1-70B [22][21]. - The hybrid training strategy combining reasoning paths and summary answers has proven to be the most effective, achieving a comprehensive accuracy of 69.6% [23]. Cost Efficiency - The layered optimization mechanism of ReasonMed has reduced data construction costs by over 70%, demonstrating a cost-effective approach to generating complex reasoning chains [25]. - The project illustrates a scalable framework for generating reasoning data that can be applied to other knowledge-intensive fields, such as life sciences and materials science [27]. Community Impact - ReasonMed has garnered positive feedback from the research community, being recognized as a new paradigm for high-quality reasoning data generation and gaining significant attention on platforms like Hugging Face [30].
AI玩狼人杀战绩如何,今年的B站超级科学晚也追人工智能热点
Xin Jing Bao· 2025-11-01 09:03
Core Insights - The event showcased AI-driven research in various scientific fields, emphasizing the intersection of entertainment and scientific innovation [1][2] - The AI's performance in the game "Werewolf" demonstrated a competitive edge over human players, indicating advancements in AI capabilities [1] Group 1: AI Research and Applications - The AI players in the "Werewolf" game achieved a win rate of 70% against human players, while the average human win rate was 67% [1] - The research on AI playing "Werewolf" was conducted by a team from Tsinghua University, highlighting the academic backing of such AI applications [1] Group 2: Scientific Research and Industry Relevance - Eight foundational scientific research projects were awarded, covering fields such as mathematics, physics, robotics, medicine, chemistry, artificial intelligence, biology, and quantum technology [1] - One notable project focused on creating a magnetic fluid material to address challenges in drug delivery for chemotherapy, aiming to improve the precision of drug targeting to only 0.7% of current capabilities [2]