Workflow
Accio Agent
icon
Search documents
AI动态汇总:智元推出机器人世界模型平台genieenvesioner,智谱上线GLM-4.5a视觉推理模型
China Post Securities· 2025-08-25 11:47
- The Genie Envisioner platform introduces a video-centric world modeling paradigm, directly modeling robot-environment interactions in the visual space, which retains spatial structure and temporal evolution information. This approach enhances cross-domain generalization and long-sequence task execution capabilities, achieving a 76% success rate in long-step tasks like folding cardboard boxes, outperforming the π0 model's 48%[12][13][16] - The Genie Envisioner platform comprises three core components: GE-Base, a multi-view video world foundation model trained on 3000 hours of real robot data; GE-Act, a lightweight 160M parameter action decoder enabling real-time control; and GE-Sim, a hierarchical action-conditioned simulator for closed-loop strategy evaluation and large-scale data generation[16][17][19] - The GLM-4.5V visual reasoning model, with 106B total parameters and 120B activation parameters, achieves state-of-the-art (SOTA) performance across 41 multimodal benchmarks, including image, video, document understanding, and GUI agent tasks. It incorporates 3D-RoPE and bicubic interpolation mechanisms to enhance 3D spatial relationship perception and high-resolution adaptability[20][21][22] - GLM-4.5V employs a three-stage training strategy: pretraining on large-scale multimodal corpora, supervised fine-tuning with "chain of thought" samples, and reinforcement learning with RLVR and RLHF techniques. This layered training enables superior document processing capabilities and emergent abilities like generating structured HTML/CSS/JavaScript code from screenshots or videos[23][24][26] - VeOmni, a fully modular multimodal training framework, decouples model definition from distributed parallel logic, enabling flexible parallel strategies like FSDP, HSDP+SP, and EP. It achieves 43.98% MFU for 64K sequence training and supports up to 192K sequence lengths, reducing engineering complexity and improving efficiency by over 90%[27][28][31] - VeOmni introduces asynchronous sequence parallelism (Async-Ulysses) and COMET technology for MoE models, achieving linear scalability in training throughput for 30B parameter models under 160K sequence lengths. It also integrates dynamic batch processing and FlashAttention to minimize memory waste and optimize operator-level recomputation[31][32][34] - Skywork UniPic 2.0, a unified multimodal framework, integrates image understanding, text-to-image (T2I) generation, and image-to-image (I2I) editing within a single model. It employs a progressive dual-task reinforcement strategy (Flow-GRPO) to optimize image editing and T2I tasks sequentially, achieving superior performance in benchmarks like GenEval and GEdit-EN[35][38][39] - UniPic 2.0 leverages Skywork-EditReward, an image-editing-specific reward model, to provide pixel-level quality scores. This design enables precise recognition of image elements and generation of corresponding textual descriptions, achieving 83.5 points in MMBench, comparable to 19B parameter models[38][42][43] - FlowReasoner, a query-level meta-agent framework, dynamically generates personalized multi-agent systems for individual queries. It employs GRPO reinforcement learning with multi-objective reward mechanisms, achieving 92.15% accuracy on the MBPP dataset and outperforming baseline models like Aflow and LLM-Blender[63][64][68] - FlowReasoner utilizes a three-stage training process: supervised fine-tuning with synthetic data, SFT fine-tuning for workflow generation, and RL with external feedback for capability enhancement. It demonstrates robust generalization, maintaining high accuracy even when the base worker model is replaced[66][68][69]
360集团官宣“All in Agent”战略;Accio邀请码在海外爆火,阿里国际站回应:正在扩容丨AIGC日报
创业邦· 2025-08-16 01:10
Group 1 - 360 Group announced its "All in Agent" strategy, encouraging all employees to establish AI beliefs and measure business processes and roles through intelligent integration rates [2] - Alibaba's Accio Agent gained rapid popularity overseas, facilitating market research, product development, and supplier communication for cross-border e-commerce, with a large influx of users leading to an invitation-only system [2] - Tencent Cloud launched the upgraded CloudBase AI CLI, which integrates AI tools into a unified management platform, potentially reducing coding workload by 80% [2] Group 2 - Qiniu Intelligent introduced the "Lingxi AI" natural interaction solution, aiming to address the core challenges of embodied intelligence by providing an open interaction platform for hardware manufacturers and developers [2] - The solution is designed to eliminate barriers related to complex algorithms and insufficient computing power, allowing innovators to focus on application and scenario innovation [2] - Qiniu's CEO emphasized the future exploration of voice interaction and embodied intelligence integration to enhance AI's role in human life [2]
影石就向员工“撒钱”致歉;多位投资人辟谣DeepSeek完成7亿美元C轮融资;京东完成收购香港佳宝超市丨邦早报
创业邦· 2025-08-16 01:10
Group 1 - YingShi Innovation's founder Liu Jingkang apologized for a viral video showing cash being thrown at employees during a team-building event, clarifying it was meant to celebrate the launch of their A1 drone after extensive overtime work [3] - Weibo's official account denied rumors about IP location being precise to the city level, emphasizing that the platform's IP location display aims to reduce impersonation and misinformation [5] - DeepSeek's reported $700 million Series C funding was labeled as false by multiple investors, with claims that the company had not previously raised funds before entering this round [6] Group 2 - Xia Haijun, former CEO of Evergrande, was reported to be hiding in California, with evidence showing his wife holds assets worth $24 million in the U.S. [11][14] - JD.com completed the acquisition of Hong Kong's Jia Bao supermarket, aiming to enhance its supply chain and retail presence in the Greater Bay Area [15] - Meta's market capitalization surpassed $2 trillion for the first time, making it the sixth U.S. company to reach this milestone [21] Group 3 - WeChat denied rumors about a palm payment service franchise, stating it is still in the internal testing phase and warning users against scams [17] - Amazon founder Jeff Bezos's mother passed away at 78, with her early investment in Amazon contributing to the family's wealth [19] - Tencent Cloud launched CloudBase AI CLI, a tool that can reduce coding workload by 80% for developers [28] Group 4 - The National Bureau of Statistics reported significant growth in the manufacturing value of smart drones and vehicle equipment, with increases of 80.8% and 21% respectively in July [30] - Li Lai announced a $1.3 billion deal with AI pharmaceutical company Superluminal to accelerate drug development for obesity and heart diseases [26] - WeRide secured a multi-million dollar investment from Grab to deploy L4 Robotaxis in Southeast Asia [27]
Perplexity疯砸345亿抢谷歌;AI Agent接管中小企业生意链条?;AGI的4层突破与3大难关 |混沌AI一周焦点
混沌学园· 2025-08-15 12:07
Core Trends - Perplexity attempts to acquire Google's Chrome browser for $34.5 billion, targeting its 3 billion users and aiming to challenge Google's market dominance, although the likelihood of success is low [3][12] - Alibaba's Accio Agent automates the entire business chain for small and medium enterprises, enabling them to bypass human bottlenecks and drive growth directly [4][13] - NVIDIA's Cosmos and Jetson Thor empower robots with reasoning and autonomous decision-making capabilities, presenting opportunities for intelligent transformation in traditional industries like retail and healthcare [5][16] - The software industry is undergoing a reshuffle as tools like Meituan's NoCode and Baidu's 秒哒 enable non-experts to create software applications, democratizing innovation [6][20][25] AI Events - The "2025 China AI Gala" will showcase various AI and robotics performances, featuring robots like智元A2 and傅利叶GR-2, highlighting the integration of AI in entertainment [7] - At the WAIC conference, notable figures in AI were recognized, including 夏立雪, who was awarded "AI Person of the Year" [8] AI Innovations - NVIDIA's upgraded Cosmos model allows robots to understand and predict object states and environmental changes, enhancing their operational capabilities in various settings [16] - Baichuan's new medical reasoning model, Baichuan-M2-32B, outperforms existing open-source models, facilitating the deployment of AI medical assistants in healthcare [18][22] Business Developments - xAI's Grok 4 is now available for free globally, potentially igniting a price war in the AI model market [20] - The World Robot Conference featured over 200 companies and numerous new products, showcasing advancements across various sectors [21][24]
Accio邀请码在海外爆火 阿里国际站回应:正在扩容
Core Insights - Alibaba International Station's Accio Agent has gained rapid popularity among overseas buyers, facilitating key aspects of cross-border e-commerce such as market research, product development, and supplier communication [1] Group 1 - Accio Agent was launched recently and has seen a surge in user interest, leading to an invitation-only access model [1] - A large number of invitation codes were quickly claimed by users from over ten countries, indicating strong demand [1] - Alibaba International Station is currently working on expanding the service to allow more global SMEs to experience this new AI-driven trade approach [1]
阿里国际站回应Accio邀请码在海外爆火:正在扩容
Sou Hu Cai Jing· 2025-08-15 04:48
Core Insights - Alibaba International Station has launched Accio Agent, touted as the world's first AI Agent capable of conducting business, targeting overseas buyers [1] - The AI Agent automates key processes in cross-border e-commerce, including market research, product development, and supplier communication [1] - The service is highly valued, with similar fully automated e-commerce workflow services priced at $250,000 [1] User Engagement - The launch day saw a significant influx of users, leading to an invitation-only access model [1] - A large number of invitation codes were quickly claimed by users from over ten countries [1] - Alibaba International Station is currently working on expanding capacity to allow more global SMEs to experience this new AI trading method [1]
第一个能帮你做生意的Agent来了。
数字生命卡兹克· 2025-08-12 01:05
Core Viewpoint - Accio Agent, recently upgraded by Alibaba International Station, is positioned as a transformative tool for international trade, enabling businesses to streamline their operations and enhance efficiency in sourcing and product development [1][4][7]. Group 1: Accio Agent Overview - Accio Agent has accumulated 2 million enterprise-level customers, indicating significant traction in the ToB sector [4][5]. - The platform is designed primarily for foreign trade and overseas markets, but it also offers valuable functionalities for domestic users [9][10]. Group 2: User Experience and Functionality - The initial experience with Accio involved creating custom merchandise, highlighting the challenges faced in sourcing manufacturers and understanding product specifications [11][14]. - Accio simplifies the process of finding suppliers by providing a curated list of manufacturers based on specific requirements, such as small batch orders and customization options [26][30]. - The platform allows users to send inquiries directly to suppliers without the need for extensive manual searching, significantly reducing the time and effort required [32][80]. Group 3: Advanced Capabilities - Accio can assist in product design and supplier sourcing for more complex projects, demonstrating its ability to handle multifaceted requests [38][60]. - The platform effectively analyzes user input and generates comprehensive reports, including venue selection and vendor recommendations for events, showcasing its versatility [66][78]. - Accio's systematic approach to project management, from ideation to execution, sets it apart from traditional models, emphasizing its strength in vertical industry applications [81][82].