Software and Internet

Doubling Down on Multimodal Capabilities, Quark Launches an All-New "AI Camera"
Guan Cha Zhe Wang· 2025-04-28 09:29
Core Viewpoint
- Quark AI Super Box has launched a new AI camera feature called "Photo Ask Quark," enhancing the search experience through visual understanding and reasoning capabilities [1][12].
Group 1: Product Features
- The AI camera can identify locations from photos, assist in travel planning, and provide translations for foreign menus [3].
- It can also remove unwanted objects from images, adjust facial expressions, and generate social media captions [3].
- The camera acts as a life assistant by diagnosing appliance issues and suggesting purchases for damaged items [5].
Group 2: Health Applications
- The AI camera can interpret medical reports, generate personalized health plans, and provide medication guidelines [7].
- It can create a tailored weekly meal plan based on health conditions like high uric acid levels [7].
Group 3: Work and Learning Support
- The AI camera can enhance productivity by completing contracts from handwritten notes, solving complex calculations from images, and assisting with coding by adding annotations [10].
Group 4: Industry Context
- The launch of the AI camera aligns with the growing trend of multimodal capabilities in AI, with competitors like OpenAI and Google also enhancing their models [13].
The Impact of MCP on AI Applications
2025-04-27 15:11
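• MCP (multi-channel platform) development in China lags behind that abroad, mainly in multi-task planning capability and ecosystem building. Overseas super AI agents such as Manners and CodeBot can already carry out complex task calls on their own, while China has no comparable application yet; Alibaba is accelerating its rollout by bringing third-party ecosystems in through DingTalk, Quark, and the Bailian platform.
• The Manas super-agent entry point consumes markedly more tokens when handling complex tasks, roughly 150,000 to 300,000 tokens per run, yet enterprise users accept it readily. More than a thousand developers have connected through MCP, and daily token calls have reached 350 billion to 450 billion, indicating strong market demand.
• Zinus products are priced high, at USD 199 or USD 299 per month, but the market response has been positive: developers are numerous, and daily token calls are likewise between 350 billion and 450 billion. The technology commands a strong premium, though price competition may lie ahead.
• Within the Alibaba ecosystem, DingTalk and Quark are positioned as the entry-point AI applications for ToB and ToC respectively. DingTalk focuses on commercial rollout and revenue, while Quark focuses on DAU growth and token consumption; each will drive the Alibaba ecosystem forward in its own domain.
• Alibaba's Qwen model fees are expected to be cut by 30% to 50% in 2025, with the aim of lowering enterprises' usage costs and raising market ...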
Baidu's Post-DeepSeek Era: Everything for Applications
Bei Jing Shang Bao· 2025-04-27 09:50
Group 1
- The core viewpoint emphasizes the importance of applications over models in the AI landscape, as articulated by Baidu's founder, Li Yanhong, during the Create2025 Baidu AI Developer Conference [2].
- Baidu launched a "nine-piece set" of tools and models aimed at reducing costs and enhancing capabilities for developers, including two new models with price reductions of up to 80% [3].
- The rapid iteration of models raises questions about the longevity of application value, but Li Yanhong asserts that finding the right scenarios and models will ensure applications remain relevant [2][3].
Group 2
- Baidu introduced two new models, Wenxin Model X1 Turbo and 4.5 Turbo, which are multi-modal and strong reasoning models, indicating a shift towards multi-modal models as the future standard [3].
- The company is also focusing on no-code programming tools like Miaoda and the general-purpose intelligent agent "Xinxiang," which can generate applications and provide comprehensive solutions to complex user problems [4].
- The industry is witnessing a rapid evolution in application development, with major tech companies like Alibaba and Tencent also launching competitive products and services to support developers [4].
Hands-On with ByteDance's "Coze Space": AI Agents Are Leveling Professional Barriers, and Building a Game Takes Just 3 Minutes
Tai Mei Ti A P P· 2025-04-27 04:23
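By 大模型之家
AI agents, as the core form in which AI technology is being put to practical use, are rapidly spreading into every kind of everyday scenario.
Figure: "Coze Space" was asked to generate a cover image; it did not produce one and returned two websites instead.
What appeals most to office workers is that the user issues a single command and the AI agent completes the required work to specification, as if AI agents had become a "godsend for working people."
So, at this stage, can AI agents really be a godsend for working people?
To find out, 大模型之家 ran a full evaluation of ByteDance's "Coze Space," which bills itself as a "digital office workhorse." How does Coze Space actually perform across content domains and on product performance?
"Over the next few years, millions of AI agents will build an entirely new economic ecosystem and push the global industrial landscape into an era of 'agent density competition.'" In 2025, this prediction is fast becoming reality.
大模型之家 notes the sequence so far: OpenAI released Operator, described as the world's first AI agent, on January 24 this year; Zhipu launched the agent framework GLM-PC 1.1; the Monica team launched the general-purpose AI agent Manus; and ByteDance has now officially released "Coze Space."
Office use case: document writing and table generation
For a "digital office workhorse," the first order of business has to be document writing.
大模型之家 gave it this task: write an article on how Shanghai's tea-drink industry has developed in recent years and ...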
Interaction Design Helps Innovate Pathways for Cultural Heritage Dissemination
Xin Hua Ri Bao· 2025-04-27 02:34
Core Insights
- The report emphasizes the need to enhance the protection of cultural heritage and historical culture in urban and rural construction, highlighting its significance as a carrier of human civilization [1].
- The integration of modern technology, such as H5 interactive design, XR technology, mobile applications, and social media platforms, is crucial for innovating the dissemination of cultural heritage [1].
Group 1: H5 Interactive Technology
- H5 interactive technology serves as a core medium for cultural heritage dissemination due to its low threshold, cross-platform capabilities, and social characteristics [2].
- It deconstructs complex cultural content into accessible interactive units, reducing cognitive load and enhancing user engagement [2].
- H5 can integrate location-based services (LBS) and real-time data interfaces to intelligently push cultural content based on user location and behavior, creating a virtual-physical integration [3].
Group 2: XR Technology
- XR technology, encompassing VR, AR, and MR, provides a multi-dimensional solution for the digital narrative and interactive experience of cultural heritage [4].
- It transforms static cultural entities into interactive digital twins using high-precision digitization, allowing users to engage with multi-modal information in a virtual-physical context [5].
- The non-linear narrative model of XR technology enables users to become active explorers, constructing personalized cognitive maps through autonomous interactions [5].
Group 3: Mobile Applications and Social Media
- Mobile applications and social media platforms create an ecological matrix for cultural heritage dissemination through terminal adaptability, data-driven approaches, and community operations [6].
- They facilitate the transformation of physical spaces into intelligent cultural narrative environments, enhancing user interaction through ergonomic design and multi-dimensional feedback mechanisms [6].
- User-generated content and algorithmic recommendations on social media activate community-driven cultural heritage dissemination, while data analytics provide quantitative support for cultural strategies [7].
From Full-Stack AI Deployment to Taking Short Dramas Global: The Growth Secrets in Kunlun Wanwei's Financial Report
Sou Hu Cai Jing· 2025-04-27 01:54
Core Insights
- The article highlights Kunlun Wanwei's successful transition into a "platform-level" AI company, showcasing its robust financial performance and strategic growth in 2024 [2][21].
- The company has diversified its business model, focusing on AI-driven applications across various sectors, including social media, short dramas, music, and search [4][18].
Financial Performance
- Kunlun Wanwei reported a total revenue of 5.66 billion yuan in 2024, marking a year-on-year growth of 15.2% [2].
- The overall gross margin reached 73.6%, with R&D expenses amounting to 1.54 billion yuan, reflecting a significant increase of 59.5% [2].
- Overseas business revenue accounted for 91% of total income, reaching 5.15 billion yuan, which is a 21.9% increase year-on-year [4].
AI Application Growth
- The AI social product Linky achieved a monthly revenue exceeding 1 million USD, with 3 million active users and over 10 million downloads [12].
- The AI music platform Mureka generated an annual recurring revenue (ARR) of approximately 12 million USD, with a monthly revenue of around 1 million USD [12].
- The short drama platform DramaWave reached an ARR of about 120 million USD, with a monthly revenue of 10 million USD, surpassing Netflix in the Korean market with its hit series "Engagement Storm" [5][15].
Strategic Initiatives
- Kunlun Wanwei's "All in AGI and AIGC" strategy has been fully implemented, creating a diversified business matrix that includes AI social, short dramas, music, and search [4][18].
- The company has adopted a dual approach in the short drama sector, developing both a distribution platform (DramaWave) and an AI-driven content creation platform (SkyReels) [14][15].
- The launch of SkyReels, an AI short drama creation platform, allows creators to generate scripts and videos using natural language input, significantly streamlining the production process [14][15].
Market Position and Future Outlook
- The company aims to achieve an ARR of 360 million USD by the end of 2025, positioning the short drama business as a key growth driver [18].
- Kunlun Wanwei's self-developed models and technology have established it as a competitive player in the global AI industry, with significant advancements in various AI applications [19][20].
- The market has recognized Kunlun Wanwei as a mature and diversified "AI industry matrix," reflecting confidence in its business model and technological capabilities [21][22].
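The revenue figures quoted in this summary can be cross-checked with simple arithmetic. The sketch below assumes that ARR here means the latest monthly revenue multiplied by twelve, which the summary does not state explicitly; the function names are hypothetical and only the input numbers come from the text.

```python
def annualized_run_rate(monthly_revenue_usd: int) -> int:
    """Assumed definition: ARR = monthly revenue x 12 (not stated in the summary)."""
    return monthly_revenue_usd * 12


def share_of_total(part: float, total: float) -> float:
    """Fraction of `total` represented by `part`."""
    return part / total


# Figures quoted in the summary.
dramawave_arr = annualized_run_rate(10_000_000)  # DramaWave: 10 million USD per month
mureka_arr = annualized_run_rate(1_000_000)      # Mureka: about 1 million USD per month
overseas_share = share_of_total(5.15, 5.66)      # billion yuan, overseas vs. total revenue

print(f"DramaWave ARR: {dramawave_arr:,} USD")
print(f"Mureka ARR: {mureka_arr:,} USD")
print(f"Overseas share of revenue: {overseas_share:.1%}")
```

Under that assumption the monthly figures are consistent with the reported ARR of about 120 million USD and 12 million USD, and 5.15 billion out of 5.66 billion yuan matches the stated 91% overseas share.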
Li Yanhong Calls DeepSeek Expensive and Slow; Netizens: That's a Bit Like Wanting It Both Ways
程序员的那些事· 2025-04-26 15:13
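The following article is from MaxAIBox, by Max.
On February 14, Baidu announced that the Wenxin large model would not only be free but also open source.
On the evening of February 16, Baidu Search and the Wenxin agent platform each announced full integration of DeepSeek and the Wenxin large model's latest deep search feature. On February 18, the full-strength DeepSeek-R1 went live in search on the Baidu app.
In addition, on the evening of February 18, Li Yanhong said at the fourth-quarter and full-year 2024 earnings release:
"One thing we learned from DeepSeek is that open-sourcing the very best models for everyone to use can greatly promote their adoption, because people will naturally want to try an open-source model out of curiosity, which in turn drives its wider adoption."
1
As is well known, Baidu long stuck to a closed-source path, but after DeepSeek broke into the mainstream and companies across many industries adopted the full-strength DeepSeek-R1, Baidu followed suit.
2
On April 25, Baidu held an AI developer conference in Wuhan, where Li Yanhong took the stage to deliver a speech titled "A World of Models, a Realm of Applications."
He said, "As long as you find the right scenario, pick the right foundation model, and learn a little about tuning models, the applications you build will not become obsolete." He added, "Without applications, chips and models have no ...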
"DeepSeek Is Not a Cure-All": Li Yanhong Bets on AI Applications This Year, Slashing Model Prices Again and Focusing on Multi-Agent and Multimodal
AI前线· 2025-04-25 08:25
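By Chu Xingjuan and Hua Wei
At the Baidu Create developer conference on April 25, Baidu founder Li Yanhong released two models and several popular AI applications, and announced that Baidu would help developers fully embrace MCP. Baidu also formally switched on China's first fully self-developed 30,000-card cluster, which can carry full training runs of several hundred-billion-parameter models at once and supports 1,000 users fine-tuning ten-billion-parameter models simultaneously.
"All of these releases are meant to let developers stop worrying about model capability, stop worrying about model cost, and stop worrying about development tools and platforms, so they can focus on building applications, and build the best applications!" Li Yanhong said.
Li Yanhong said that large-model vendors are locked in cutthroat competition, releasing new models almost every week, yet developers hesitate to use them boldly for fear that their applications will quickly be overtaken by model iteration. He sees this as a double-edged sword: on one hand, developers do need to understand where the technology is heading; on the other, so many increasingly powerful models offer more choices and open up more possibilities.
"As long as you find the right scenario, pick the right foundation model, and sometimes learn a little about tuning models, the applications built on that basis will not become obsolete," he said. He stressed, "Without applications, chips and models have no value. There will be many models, but what will truly rule this world in the future is applications. Applications are king."
Two new models released, with prices cut by up to 80%
Wenxin large model 4.5 Turbo and Wenxin large model X1 Tur ...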
Big-Tech Experts Discuss Agents and Coze
2025-04-24 01:55
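Big-Tech Experts Discuss Agents and Coze, 20250423 and 20250416
Summary
• The low-code AI agent development platform offers a one-stop solution: it can generate a chatbot in 30 seconds with no code, integrates nearly 500 plugins, and safeguards user data security and privacy, giving enterprises an efficient and convenient way to develop AI applications.
• Coze Space, an AI collaborative-office ecosystem product, uses the MCP protocol and expert certification to automate workflows and call APIs dynamically, with strict permission management and data encryption, markedly improving work efficiency while protecting user privacy.
• The MCP protocol has been integrated with leading vendors in finance, maps, and other fields, as well as with expert-level model APIs, covering all industries. 40% of the capabilities are incubated by ByteDance and 60% are contributed by developers, with a review mechanism to ensure data security.
• ByteDance is running an internal beta of a full multimodal model based on Doubao. It supports text, image, and voice interaction, emphasizes image and visual understanding, and uses sentiment analysis to identify a person's inner character and emotional expression.
• A developer ecosystem is in place: the app store offers nearly 800 AI applications, developers receive a 70% revenue share, and nearly 150,000 developers have joined so far, covering all industries, with promotion through advertising and other channels.
Q&A
How does combining Coze with Agent technology affect privacy?
The combination of Coze and Agent technology has significant advantages for privacy protection. First, Coze ...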
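I Finally Get How to Talk to AI! A Full Breakdown of Google's Official 69-Page Prompting Playbook, with a Free Chinese Edition to Download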
AI科技大本营· 2025-04-22 10:26
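By Wang Qilong | Produced by CSDN (ID: CSDNnews)
Google recently published an official 69-page [Prompt Engineering whitepaper], arguably the most systematic and authoritative "guide to communicating with AI" available today. We translated it right away and are giving it to everyone [for free]!
How do you get it? Simple: finish reading this article and take part in the short activity at the end!
For now, let's talk about why this whitepaper suddenly flooded everyone's feeds, and why it is being called a "must-learn playbook."
(You don't need to be a data scientist or a machine learning engineer – everyone can write a prompt.)
You explain patiently at length, and it seizes on one irrelevant word and starts improvising ...
You ask for A, and it confidently hands you B, along with a long-winded, seemingly flawless chain of wrong logic ...
The same question: yesterday it understood you, today it plays dumb, and the results come down to "luck" ...
Google's whitepaper is not one blogger's personal takeaways or a scattered collection of tips. It is a systematic synthesis by Google itself, grounded in a deep understanding of large language models (LLMs), of ...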