ChatGPT agent

Search documents
DeepSeek新模型曝光,梁文锋亲自督战,要和OpenAI硬碰硬
3 6 Ke· 2025-09-05 12:48
近日,彭博社援引知情人士消息称,DeepSeek 正在开发一款"真正的AI 智能体",即具备更高阶AI Agent 功能的模型,以抢占下一代人机交互入口。并 且,新模型将直接对标业内"老大哥" OpenAI ,与之展开竞争。 发布时间上,据称,DeepSeek 创始人梁文锋正在督战,目标在今年四季度发布这款新模型。 传DeepSeek年底将推出超强 AI 智能体 2025年的版本答案是Agent,几乎成为一种业内共识。 图源:微博 新的AI系统的核心特征为:1、能基于过往的行动进行学习和自我完善,也充分的成长空间;2、根据用户最少的指令,自动完成多个步骤的复杂任务。 值得一提的是,此前针对DeepSeek流量下滑,360创始人周鸿祎给出了自己的看法和爆料。 周鸿祎表示,DeepSeek官网流量下降是因为梁文锋"就没想认真做to C的App",在流量高峰期时,即使网站速度"慢得要死",梁文锋也不在乎。周鸿祎还 提到,"DeepSeek可能在憋大招"。 由此,或许也进一步验证了关于布局AI Agent的爆料。目前,针对此消息,DeepSeek官方并未给出具体回应。 距离年初DeepSeek R1的发布已经过去将近 ...
Delegate work to ChatGPT agent
OpenAI· 2025-08-06 23:10
Product Overview - Chat GPT Agent enables users to research, write code, and take action online [1] - The agent can connect to internal data sources via admin-approved connectors or utilize the web for information gathering [1] - It offers a library of starter prompts for various tasks [1] - The agent can adapt its next step based on findings during the task execution [6] Key Features & Functionality - Chat GPT Agent spins up a VM to write code, research, interact with websites, and build documents [3] - It uses tools like search, image generation, deep research, and operator to complete tasks [4] - Users can monitor the agent's chain of thought in activity view and interrupt or take over control if needed [4] - The agent can build spreadsheets with relevant articles, historical and projected growth rates, and customizable business plan calculators [7] - It can also generate research reports with analysis of e-commerce growth rates, success drivers, and future projections [7] Potential Benefits - Chat GPT Agent can automate tasks that could take hours or days, freeing up time for other work [8] - It can analyze e-commerce growth and build charts and new tabs [2] - The agent can create a business launch plan with formulas [2]
经济学人:英美情报界如何使用AI模型?
Sou Hu Cai Jing· 2025-07-31 06:22
【文/经济学人】 就在唐纳德·特朗普宣誓就任总统的那一天,一家名为深度求索(DeepSeek)的中国公司发布了一款世 界级的大语言模型(LLM)。特朗普后来形容,这对美国AI行业敲响了"警钟"。美国参议院情报委员 会副主席马克·华纳(Mark Warner)表示,美国情报界(由18个机构和组织组成)"被打了个措手不 及"。 2024年,拜登政府开始担心中国的情报部门和军方可能会在人工智能(AI)应用上抢占先机。于是, 拜登政府下令情报机构、五角大楼以及(负责核武器研发的)能源部更激进地试验尖端的AI模型,并 加强与"前沿性"AI实验室的合作,重点包括AI初创公司Anthropic、谷歌DeepMind和OpenAI。 7月14日,五角大楼向Anthropic、谷歌、OpenAI以及埃隆·马斯克旗下的xAI(该公司的聊天机器人在最 近一次更新后一度自视为希特勒)等企业分别授予了最高达2亿美元的合同。这些公司将测试"代理 型"(agentic)AI模型。此类模型能够代替用户执行任务,并将复杂任务拆分为若干步骤,还可以操控 其他设备,比如汽车或计算机。 这些前沿实验室不仅活跃在军事领域,也正深度介入欧美的情报界。早 ...
英美情报界如何使用AI模型?
Guan Cha Zhe Wang· 2025-07-31 05:52
Core Insights - The emergence of DeepSeek's large language model (LLM) has raised concerns in the U.S. regarding China's advancements in AI, particularly in intelligence and military applications [1][8] - The Biden administration is responding by accelerating AI experimentation within intelligence agencies and the Department of Defense, collaborating with leading AI firms like Anthropic, Google, and OpenAI [1][2] - The U.S. intelligence community is increasingly utilizing AI models, with significant contracts awarded to companies for developing "agentic" AI models capable of executing complex tasks [1][2] Group 1: U.S. Developments - The Pentagon awarded contracts up to $200 million to companies like Anthropic and Google for testing agentic AI models [1] - All U.S. intelligence agencies are now widely using AI models, with firms customizing models based on specific agency needs [2] - Despite advancements, the application of AI in national security is still not meeting expectations, with agencies struggling to adapt existing technologies effectively [4] Group 2: European Initiatives - The UK intelligence community is also integrating advanced LLM capabilities, with companies like Mistral leading efforts in Europe [3] - Mistral's Saba model is specifically trained for regional language processing, enhancing its utility in intelligence operations [3] - The Israeli military has significantly increased its use of OpenAI's GPT-4 model, indicating a growing reliance on advanced AI technologies in military contexts [3] Group 3: Challenges and Concerns - Experts express concerns about the reliability and transparency of AI models, emphasizing the need for consistency in intelligence applications [6][7] - The current focus on developing advanced agentic models may overlook the necessity for models that can perform causal reasoning and understand real-world logic [7] - There are warnings that China may be advancing faster in AI applications for military and intelligence purposes, potentially outpacing U.S. efforts [7][8]
硬核「吵」了30分钟:这场大模型圆桌,把AI行业的分歧说透了
机器之心· 2025-07-28 04:24
Core Viewpoint - The article discusses a heated debate among industry leaders at the WAIC 2025 forum regarding the evolution of large model technologies, focusing on training paradigms, model architectures, and data sources, highlighting a significant shift from pre-training to reinforcement learning as a dominant approach in AI development [2][10][68]. Group 1: Training Paradigms - The forum highlighted a paradigm shift in AI from a pre-training dominant model to one that emphasizes reinforcement learning, marking a significant evolution in AI technology [10][19]. - OpenAI's transition from pre-training to reinforcement learning is seen as a critical development, with experts suggesting that the pre-training era is nearing its end [19][20]. - The balance between pre-training and reinforcement learning is a key topic, with experts discussing the importance of pre-training in establishing a strong foundation for reinforcement learning [25][26]. Group 2: Model Architectures - The dominance of the Transformer architecture in AI has been evident since 2017, but its limitations are becoming apparent as model parameters increase and context windows expand [31][32]. - There are two main exploration paths in model architecture: optimizing existing Transformer architectures and developing entirely new paradigms, such as Mamba and RetNet, which aim to improve efficiency and performance [33][34]. - The future of model architecture may involve a return to RNN structures as the industry shifts towards agent-based applications that require models to interact autonomously with their environments [38]. Group 3: Data Sources - The article discusses the looming challenge of high-quality data scarcity, predicting that by 2028, existing data reserves may be fully utilized, potentially stalling the development of large models [41][42]. - Synthetic data is being explored as a solution to data scarcity, with companies like Anthropic and OpenAI utilizing model-generated data to supplement training [43][44]. - Concerns about the reliability of synthetic data are raised, emphasizing the need for validation mechanisms to ensure the quality of training data [45][50]. Group 4: Open Source vs. Closed Source - The ongoing debate between open-source and closed-source models is highlighted, with open-source models like DeepSeek gaining traction and challenging the dominance of closed-source models [60][61]. - Open-source initiatives are seen as a way to promote resource allocation efficiency and drive industry evolution, even if they do not always produce the highest-performing models [63][64]. - The future may see a hybrid model combining open-source and closed-source approaches, addressing challenges such as model fragmentation and misuse [66][67].
腾讯研究院AI每周关键词Top50
腾讯研究院· 2025-07-25 10:21
Group 1: Core Insights - The article highlights the top 50 keywords related to AI developments from July 21 to July 25, showcasing significant advancements and trends in the industry [1] - Key players such as OpenAI, NVIDIA, and Tencent are actively involved in various AI applications and model developments, indicating a competitive landscape [2][4] Group 2: Applications - OpenAI's ChatGPT agent and Tencent's QQ Music integration demonstrate the growing application of AI in consumer products [2][4] - The introduction of various AI tools like MiniMax Agent and CodeBuddy AI IDE reflects the trend towards enhancing productivity and user experience in software development [2][4] Group 3: Models and Technologies - The K2 ranking by Kimi and updates on models like Qwen3 and OpenReasoning-Nemotron signify ongoing improvements in AI model performance and capabilities [2][4] - Innovations in ASR technology by Tencent and other companies highlight the focus on enhancing voice recognition and interaction [4] Group 4: Opinions and Trends - Insights from industry leaders such as Eric Schmidt and Huang Renxun emphasize the importance of learning loops and the role of the Chinese supply chain in AI development [5] - Discussions on AI's potential to drive GDP growth and the evolution of AI agents indicate a broader economic impact and investment interest in the sector [5]
2025 年 7 月 21 日全球科技新闻汇总
Haitong Securities International· 2025-07-21 04:48
Investment Rating - The report does not explicitly provide an investment rating for the industry or specific companies. Core Insights - Arm's entry into the cloud ASIC market raises concerns as it competes with established IC design firms like Broadcom and Marvell, which have also expanded into ASIC services. Arm has yet to secure significant orders from major cloud service providers (CSPs) [8] - Yangtze Memory Technologies Corp (YMTC) aims for a fully domestic production line and targets a 15% global market share by 2026, leveraging local suppliers and overcoming previous production bottlenecks [9] - The demand for NVIDIA's GB200 servers and ASIC servers is strong, indicating robust growth in the cloud service provider sector, despite concerns over AWS layoffs affecting future growth [10] Summary by Sections Arm's ASIC Market Entry - Industry insiders suggest that Arm's move into the ASIC business is not entirely competitive against its customers, as established firms are also entering this space. Arm has not yet secured significant cloud ASIC orders, and market leaders still dominate [8] YMTC's Domestic Production Strategy - YMTC is collaborating with Chinese suppliers to implement a fully domestic production line, aiming to match international standards in 3D NAND technology. The company has received substantial funding to support its semiconductor manufacturing advancements [9] CSP Demand and Server Shipments - The strong demand for GB200 servers and ASIC servers is expected to yield positive results for U.S. CSPs. Despite tariff-related challenges, customer orders remain robust, suggesting continued growth in the AI-driven cloud market [10]
腾讯研究院AI速递 20250721
腾讯研究院· 2025-07-20 16:02
Group 1 - Kimi K2 surpasses DeepSeek to become the top open-source model globally, ranking fifth overall and closely following leading closed-source models [1] - K2 inherits the DeepSeek V3 architecture with parameter adjustments, including an increase in expert numbers and a reduction in attention heads [1] - Two of the top 10 open-source models are from China, challenging the perception that "open-source equals weak performance" [1] Group 2 - Decart releases MirageLSD, the first real-time, unlimited diffusion video model capable of processing any video stream with a 40-millisecond delay [2] - Karpathy invests as an angel investor, foreseeing broad applications in real-time film production, game development, and AR [2] - The breakthrough lies in the real-time stream diffusion architecture, addressing error accumulation through frame-by-frame generation and historical enhancement methods [2] Group 3 - Suno V4.5+ offers layered generation and fusion of vocals and instruments, allowing users to upload personal vocals or accompaniments for AI-assisted creation [3] - The new "Inspire" mode enables users to upload personal dry vocals for AI to learn and create music that matches their vocal characteristics [3] - The platform has optimized creative thresholds and enhanced AI collaboration efficiency with the launch of Suno V4.5+ [3] Group 4 - Tencent Yuanbao App integrates QQ Music services, enabling users to search for songs with a phrase and play them instantly without leaving the chat interface [4] - The technology is driven by a dual-engine system combining mixed models and DeepSeek-R1, capable of recognizing vague music descriptions and providing contextual recommendations [4] - User experience improvements include seamless account connectivity, multimodal interaction, and creative assistance, reflecting the evolution of AI assistants from tools to partners [4] Group 5 - OpenAI's ChatGPT agent faces criticism from competitors like Manus and Genspark, highlighting its limitations despite integrating multiple functionalities [5] - The ChatGPT agent can automate tasks like retirement planning and shopping lists, but its output is considered simplistic compared to competitors [5] Group 6 - PhysRig, developed by UIUC and Stability AI, introduces a framework for character animation with micro-physical binding, embedding rigid skeletons into elastic soft bodies [6] - This method replaces traditional techniques with micro-physical simulations, addressing issues of volume loss and deformation artifacts [6] - The framework outperforms traditional methods across 17 character types and 120 animation tests, supporting cross-species motion transfer [6] Group 7 - OpenAI's mysterious general reasoning model achieved a gold medal level in IMO 2025 by solving five problems and scoring 35 points [7] - The model demonstrates deep creative thinking capabilities lasting several hours, surpassing previous AI's minute-level reasoning [7] - This achievement is a result of breakthroughs in general reinforcement learning rather than task-specific training, although the model will not be released [7] Group 8 - The creator of Claude Code emphasizes that the best AI tools should empower users, advocating for simple, universal tools rather than complex systems [8] - The focus is on providing foundational capabilities that allow users to control their workflows rather than having the tools dictate them [8] - Effective workflows should involve exploration and planning followed by user confirmation before coding, utilizing test-driven development for iterative improvement [8] Group 9 - The focus on agents, open-source, and the choice of DSV3 architecture is justified by the need to stimulate model capabilities without relying on external products [9] - Open-sourcing enhances visibility and community contributions, ensuring genuine model progress rather than superficial improvements [9] - The DSV3 architecture has been proven superior in experiments, allowing for cost-effective adjustments without introducing ineffective variables [9] Group 10 - Many current AI products are expected to be replaced as they do not adhere to scaling laws, with a focus on enhancing model capabilities rather than merely expanding tools [10] - Current AI models exhibit lower data efficiency compared to humans, indicating that algorithm improvements are more critical than simply increasing data scale [10] - Research on multi-agent systems is evolving to explore not just interactions but also extending reasoning capabilities from minutes to hours or even days [10]
ChatGPT Agent遭暴击,国产AI轮番“公开处刑”
Hu Xiu· 2025-07-19 04:00
Core Insights - The excitement surrounding the release of OpenAI's ChatGPT agent is primarily felt by competing companies rather than end users, indicating a competitive landscape in the agent market [5][6]. - Companies like Manus and Genspark are actively comparing their products with ChatGPT, suggesting a fierce competition and positioning themselves as superior alternatives [1][4][50]. Product Comparisons - Manus has released multiple tweets highlighting its agent's capabilities compared to OpenAI's, claiming to be faster and more efficient [1]. - Genspark showcased a demo that emphasizes its agent's ability to complete tasks more smoothly than ChatGPT, indicating a focus on user experience [4]. - The ChatGPT agent has been rolled out to Pro users, with demand exceeding expectations, leading to a phased rollout for Plus and Team users [6]. User Experience and Performance - A user tested the ChatGPT agent by generating a comprehensive retirement plan presentation, which took about 20 minutes to complete, but the final product was deemed simplistic [12][14]. - The agent's process involved automatic information gathering without user intervention, showcasing its efficiency [13]. - Comparisons with Manus and Genspark revealed that while ChatGPT can generate presentations, the quality and aesthetics of the outputs from competitors were often superior [50][105]. Market Dynamics - The launch of the ChatGPT agent is perceived as a significant event in the agent market, akin to a "competitive bomb" being dropped, which has prompted other companies to enhance their offerings [5]. - The competitive landscape is characterized by rapid responses from companies like Manus and Genspark, who are eager to demonstrate their products' advantages over ChatGPT [1][4][50]. Financial Independence and Retirement Planning - The article discusses a financial independence model (FIRE) for a high-income individual aiming to retire at 30 with $5 million, highlighting the challenges of achieving such goals in a high-cost city like Vancouver [156][160]. - The analysis indicates that even with high savings rates (80-90%), the target of $5 million may not be feasible without extraordinary investment returns or additional income sources [157][159].
Say hello to ChatGPT agent.
OpenAI· 2025-07-18 18:08
[Music] So we have been on this journey of like not just improving our models but the tools the model can use and it's kind of like a symbiosis of some kind like the better the tools are the better the agent can use it the better the agent is the more powerful tool it can use and it like goes on and on. Every once in a while I'm just you know taken back by it a little bit. it does something that I didn't expect or it's better than I realized.Yeah, I probably have that moment like once a week at least. I'm g ...