Workflow
大语言模型
icon
Search documents
云南探索创建智能执法大模型
Core Viewpoint - Yunnan Province is launching an innovative application project for ecological environment intelligent supervision based on a large language model to address issues such as a shortage of enforcement personnel, broad regulatory scope, and high regulatory difficulty [1] Group 1: Data Foundation - The model relies on the Yunnan Provincial Ecological Environment Data Resource Center's extensive data resources, integrating over 500 legal documents, 100 typical enforcement cases, and more than 60,000 pollution source data entries to create a high-quality corpus for training the large language model [2] - The corpus includes ecological environment-related laws, standards, technical guidelines, and enforcement cases, ensuring that enforcement personnel can generate answers based on the most accurate and comprehensive information [2] Group 2: Intelligent Applications - The model has developed four intelligent assistants: Environmental Knowledge Assistant, Pollution Source Assistant, Violation Behavior Analysis Assistant, and Pollution Source Statistical Analysis Assistant, significantly reducing time costs associated with traditional manual data retrieval and analysis [3] - Each assistant serves specific functions, such as providing intelligent Q&A based on legal regulations, offering quick references for on-site enforcement, generating penalty suggestions for violations, and automating statistical analysis to improve efficiency [3] Group 3: Future Directions - Yunnan Province plans to further activate ecological environment data potential and explore AI applications, gradually embedding AI technology into various ecological environment regulatory tasks to accelerate the digital transformation of ecological environment supervision [4]
新东方海外营收增速放缓,暂无开发大模型计划
Di Yi Cai Jing· 2025-04-23 13:50
海外备考和咨询业务增长放缓是受到宏观经济形势和国际关系变化的影响。 | (in thousands US$, except per ADS(1) data) | 3Q FY2025 | 31 | | --- | --- | --- | | Net revenues | 1,183,055 | | | Operating income | 124.519 | | | Non-GAAP operating income (2)(3) | 142,056 | | | Net income attributable to New Oriental | 87,255 | | | Non-GAAP net income attributable to New Oriental (2)(3) | 113,344 | | | Net income per ADS attributable to New Oriental - basic | 0.54 | | | Net income per ADS attributable to New Oriental - diluted | 0.54 | | | Non-GAAP net ...
AI动态汇总:openAI发布GPT-4.1,智谱发布GLM-4-32B-0414系列
China Post Securities· 2025-04-23 07:54
- GPT-4.1 significantly improved coding capabilities, achieving 54.6% in SWE-bench Verified tests, outperforming GPT-4o by 21.4% and GPT-4.5 by 26.6%[12][13][15] - GPT-4.1 demonstrated enhanced instruction-following ability, scoring 38.3% in Scale's MultiChallenge benchmark, a 10.5% improvement over GPT-4o[12][13][17] - GPT-4.1 achieved new SOTA in long-context understanding, scoring 72.0% in Video-MME benchmark, surpassing GPT-4o by 6.7%[12][13][22] - GLM-4-32B-0414 utilized 15T high-quality data for pretraining and applied reinforcement learning techniques to improve instruction-following, engineering code, and function-calling capabilities[26][28][30] - GLM-Z1-32B-0414 enhanced mathematical and logical reasoning through stack-sorting feedback reinforcement learning, significantly improving complex task-solving abilities[31][33] - GLM-Z1-Rumination-32B-0414 focused on deep reasoning and open-ended problem-solving, leveraging extended reinforcement learning and search tools[34] - Seed-Thinking-v1.5 adopted MoE architecture with 200B parameters, achieving 86.7% on AIME 2024 and 55.0% on Codeforces benchmarks, showcasing strong STEM and coding reasoning capabilities[35][37][41] - Seed-Thinking-v1.5 employed dual-track reward mechanisms for training, combining verifiable and non-verifiable data strategies to optimize model outputs[36][38][40] - GPT-o3/o4-mini introduced visual reasoning into the chain of thought (CoT), achieving 96.3% accuracy in V* benchmark, marking a major breakthrough in multimodal reasoning[42][46][48] - Video-R1 model applied T-GRPO algorithm to incorporate temporal reasoning in video tasks, achieving 35.8% accuracy in VSI-Bench, surpassing GPT-4o[63][65][68] - Pangu Ultra, a dense model with 135B parameters, achieved top performance in most English and all Chinese benchmarks, rivaling larger MoE models like DeepSeek-R1[69][73][74]
Agent、DeepSeek、多模态热点炸场!60+重磅嘉宾共探AI未来,2025全球机器学习技术大会完美收官!
AI科技大本营· 2025-04-21 10:24
以下文章来源于CSDN ,作者CSDN CSDN . 成就一亿技术人 作者 | 《新程序员》编辑部 出品 | CSDN(ID:CSDNnews) 在万物向 "智 " 生长的 2025 年,AI 领域的热潮持续升温,正引领着技术革新与产业探索的新浪潮。 了新的破解思路?围绕这些关键问题,欢迎回看大会首日视频,看众多技术大咖如何从理论、算法到实际应用层面进行了深度剖析 ,以此 了解 AI 技术 的更多最新进展: 大模型技术创新驱动的 AI 生态和应用演进 李建忠 CSDN 高级副总裁、 Boolan 首席技术专家 4 月 18-19 日,由 CSDN 联合高端 IT 咨询与教育平台 Boolan 举办的 2025 全球机器学习技术大会(ML-Summit 2025),在上海虹桥西郊庄园丽笙 大酒店隆重拉开帷幕。本次大会围绕 AI 最前沿的发展趋势与落地实践,聚焦大语言模型技术演进、AI 智能体、具身智能、DeepSeek 技术解析与行业 实践等 12 大专题,邀请了超 60 位来自全球顶尖科技企业与学术机构的重磅嘉宾齐聚一堂,全面呈现 AI 领域的技术风向与应用前沿。 在生成式 AI 重构技术边界的浪潮下,产业实 ...
类脑智能是AI新突破关键,上海全链条布局产业新赛道
Di Yi Cai Jing· 2025-04-19 05:49
中科院院士蒲慕明表示,我们要借鉴大脑的结构和计算特点,让人工智能突破算力、数据和参数规模的限制,实现更高级别的通用人工智能。 自上世纪80年代类脑计算的概念被首次提出起,有关类脑人工网络、类脑机器学习、类脑芯片等领域的技术研究不断涌现。而随着算力、芯片、算法三要素 的不断突破,以及多学科交叉融合的逐步深入,类脑智能发展正迎来新的发展契机。 4月18下午,在2025全国类脑智能产业创新发展推进会上,类脑智能产业创新发展联盟发起成立,类脑智能未来产业基金矩阵在会上首次亮相。 类脑智能未来产业基金矩阵由上海未来产业基金、博康共赢基金、道禾基金、杨浦科创集团等10家投资机构联合发起,将聚焦类脑智能产业新赛道,支持类 脑产业前沿技术研究、落地与应用,完善类脑产业布局,推动区域类脑产业发展。 于2017年就在全国率先开展类脑智能布局的上海,通过深化基础原创理论研究、加快关键核心技术攻关、承接国家重大战略任务等举措,在类脑计算芯片、 类脑视觉系统研发等领域取得了重要成果。 当前,规模定律(Scaling Law)已将达到算力和数据的瓶颈,人工智能的性能提升将会放缓。新算法与人工网络模型的进一步优化,借鉴低功耗但复杂而 精巧 ...
奥特曼自诩:达到或接近天才水平!OpenAI,重磅发布!
Zheng Quan Shi Bao· 2025-04-17 04:31
Core Insights - OpenAI has launched two new reasoning models, o3 and o4-mini, which are capable of image-based reasoning, marking a significant advancement in the o series [1][6] Group 1: Model Performance - The o3 model is described as the most powerful reasoning flagship model, excelling in programming, mathematics, science, and visual perception benchmarks [1][8] - The o4-mini model is optimized for cost-effective reasoning, providing a balance between performance and affordability [1][8] - In external evaluations, o3 made 20% fewer significant errors in challenging real-world tasks compared to its predecessor, particularly in programming and creative tasks [8] Group 2: Image Reasoning Capabilities - Both models can integrate images into reasoning processes, allowing for "thinking with images" [10] - Users can upload various types of images, and the models can interpret them even if they are of low quality [10] - For example, o3 can analyze a photo of a notebook and deduce the written content through reasoning [10] Group 3: Task Execution and Tool Utilization - o3 and o4-mini can autonomously execute tasks by accessing tools within ChatGPT and utilizing custom user tools via API [13] - The models can perform complex tasks such as searching for data, generating code, and creating visual representations based on user queries [13] Group 4: Future Developments - OpenAI's CEO, Sam Altman, indicated that o3 will soon be upgraded to a professional version, o3-pro [4] - The company has been releasing models at a rapid pace, including the recent launch of the GPT-4.1 series, which aims to attract users with cost-effective options [15] - There is ongoing anticipation for the release of GPT-5, which has faced delays due to integration challenges [16]
解读国内首个DeepResearch AI Agent 智谱沉思模型
2025-04-15 14:30
Summary of Conference Call on Zhipu's AI Product Company and Industry - The conference call focused on Zhipu, a company specializing in AI technology, particularly its latest product, the Autoglm Chen Si version, which is a deep research AI agent. This product has garnered significant market attention due to its unique capabilities in deep research and operational functions [1][14]. Core Points and Arguments - **Product Launch**: Zhipu AI introduced its latest AI agent product, Autoglm Chen Si version, at the Zhongguancun Forum, highlighting its dual capabilities in deep research and operational tasks [1]. - **Technical Architecture**: The product utilizes a chain model combined with a test-type scaling technology, designed specifically for completing tasks efficiently [2]. - **Local vs. Cloud Operation**: A key distinction between Zhipu's product and competitors like Manus is that Zhipu's agent operates on local client machines, while Manus runs in a cloud environment. This local operation addresses user concerns about data security and reduces computational costs [3][4]. - **Model Development**: Zhipu's agent is developed in-house, tailored for specific tasks, which enhances its performance in task planning compared to general models used by competitors like Linus [5]. - **User Experience**: Current limitations include the inability to handle multiple windows simultaneously and the overall processing time for tasks, which has been noted as relatively long [6][8]. - **Computational Power**: The company has invested approximately 2000 units of computational power, equivalent to around 2000 to 3000 NVIDIA 4090 graphics cards, to support the X400 product [9]. - **Marketing Strategy**: The decision to offer the product for free is attributed to missed market opportunities and competitive pressures from free offerings by other companies like Deep Sync [10]. - **Future Business Model**: There are considerations for evolving the business model, potentially introducing a membership system based on market conditions [10]. - **Data Source Challenges**: The product faces challenges in accessing certain online information sources, particularly academic databases like Google Scholar, which limits its functionality [11][12]. Other Important Content - **Browser Compatibility**: Currently, the agent primarily supports Chrome, with plans to improve compatibility with other browsers like IE and Safari [8]. - **User Authentication**: The agent requires users to log in to access certain features, with login information stored locally for convenience [12]. - **Investor Engagement**: The call included a Q&A session for investors, indicating a focus on transparency and engagement with stakeholders [13]. This summary encapsulates the key aspects of the conference call regarding Zhipu's AI product, highlighting its technological innovations, market positioning, and future strategies.
元戎启行周光:智驾最终拼的是 AI 技术,不只是规模丨具身智能对话#13
晚点Auto· 2025-04-14 13:47
以下文章来源于晚点LatePost ,作者晚点团队 晚点LatePost . 晚一点,好一点 先有一个移动能力的 "通才",才有更强的智驾系统。 文 丨 张家豪 编辑 丨 程曼祺 全无人驾驶,始终被视作自动驾驶行业皇冠上的明珠,就像登顶珠穆朗玛峰有 19 条路线一样,不同的公司选 择了不同的路线通往无人驾驶的最终目标。 Waymo、小马们选择了基于高精地图的 RoboTaxi 路线,在特定的路线已经实现了 RoboTaxi,为市民提供没有 司机的出行服务;以特斯拉为代表的车企与供应商,则是通过渐进式路线,卖车搭配辅助驾驶方案,收集数据 一步步迭代方案,试图逼近技术极限。 没有人能笃定哪条路线一定能成功登顶,也还有不同的公司,在尝试不同的登顶路线。 在今年的英伟达 GTC( GPU Technology Conference)上,元戎启行周光提出了一套新的解法,他说,大语言 模型的发展,经历了从弱专家模型(初代 Siri)、到通才(ChatGPT)、再到强专家模型(垂直模型)的过 程。智驾也可以复制这样的路线,一个移动能力的通才,能开好汽车、能骑好摩托车、能让配送小车随时找到 你,之后就可能进化到强专家模型—— ...
Circle to Search, XOXO:「圈定即搜」功能交互解析与入门指南
3 6 Ke· 2025-04-14 07:52
Core Points - Circle to Search is a new interaction method introduced by Google, first showcased at the Galaxy Unpack event, and is available on Samsung Galaxy S24 and Google Pixel 8 series devices [2][6] - The feature allows users to select screen areas with simple gestures and provides results based on the selection, enhancing the overall user experience on Android devices [3][7] Group 1: Functionality and User Experience - Circle to Search enables users to quickly access search results by selecting text or images on the screen, with additional tools for translation and music identification [3][8] - The design emphasizes rapid response, low error rates, and intuitive screen segmentation, making it more user-friendly compared to Google Lens [8][9] - The development team utilized a streamlined approach, focusing on prototype creation rather than extensive documentation, which led to a more agile development process [9][10] Group 2: Comparison with Competitors - Circle to Search significantly simplifies the search process compared to similar features in other operating systems like HyperOS and Flyme, which require waiting for screen recognition before initiating actions [15][17] - The integration of Google Search provides a robust backend for Circle to Search, allowing for immediate and relevant results, unlike some competitors that rely on slower AI models [17][20] Group 3: Technical Implementation - The feature leverages the new Android Ink API for smooth and responsive gesture recognition, enhancing the user experience with minimal latency [25][27] - Circle to Search supports various Android devices, including Google Pixel and select Samsung models, with a straightforward activation process [29][31] Group 4: Practical Applications - Users can utilize Circle to Search for various tasks, such as quickly sharing screenshots, translating text, and navigating to locations based on selected text [49][51] - The feature's versatility allows for seamless integration into daily activities, making it a valuable tool for information retrieval and sharing [52][53] Group 5: Future Prospects - Google aims to enhance Circle to Search by integrating AI capabilities, potentially improving its functionality while maintaining a smooth user experience [56][57] - The ongoing development reflects a commitment to creating a comprehensive and user-friendly search tool that adapts to the evolving needs of smartphone users [58][59]
人类一生所学不过4GB,加州理工顶刊新研究引热议
量子位· 2025-04-13 04:08
西风 一水 发自 凹非寺 量子位 | 公众号 QbitAI 24小时不间断学习且不遗忘,一辈子也只有 4GB 的"知识储量"? 科学家们最新研究,计算出了人类学习积累上限,就这么多~~ (甚至还不如一块U盘能装) 。 这是来自Cell旗下神经科学顶刊Neuron上的一项工作,它提出了一个发人深省的悖论: 人类信息处理速度仅为每秒10bit,而我们的感官系统却能以每秒10亿bit的速率收集数据。 大语言模型 每个参数就能存储2bit 知识,一个70亿参数的模型就能存储140亿bit的知识。 △ 结论来自华人学者朱泽园"Physics of Language Models"系列论文 难怪研究人员还提出了一项推论: 随着算力的不断提升,机器在各类任务中的表现超越人类只是时间问题。 另外,按照这项研究的结论,马斯克目前的脑机接口研究也有问题了。 研究人员表示: $\mathbf{S}i=\text{Sifting Number}=\dfrac{\text{Sensory information rate}}{\text{Bewiavel throughput}}=\dfrac{1\text{Gbit/s}}{10\t ...