Multimodal data processing
Come to Volcano Engine's "Operator Square" to Process Multi-Modal Data with One Click
Cai Fu Zai Xian · 2025-08-15 10:59
Core Insights
- The article covers the launch of "Operator Square", a feature of Volcano Engine's AI Data Lake Service (LAS) aimed at strengthening multi-modal data processing for enterprises [1][3][8]

Group 1: Product Development
- Volcano Engine released the AI Data Lake Service (LAS) in June; it is designed to support multi-modal data and provides core capabilities for lake storage, management, and computation [3]
- "Operator Square" lowers the development threshold for multi-modal data processing by encapsulating complex AI algorithms as pre-set operators, so enterprises can build data processing workflows quickly instead of starting from scratch [3][4]

Group 2: User Experience and Efficiency
- Users assemble modular workflows by visual drag and drop, which reduces the need for complex coding and the reliance on specialized data scientists and algorithm engineers [4]
- "Operator Square" includes more than 100 plug-and-play standardized operators and integrates mainstream open-source operator libraries, covering a range of multi-modal data processing scenarios [3][4]

Group 3: Application and Impact
- The multi-modal data lake solution has been used to automate content review systems, reaching a content review coverage rate of 99.5% and significantly improving the accuracy and timeliness of identifying unstructured audio and video data [6]
- The solution schedules resources dynamically by task type, which supports high-concurrency performance and addresses challenges in multi-modal data integration and resource management [8]
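To make the "pre-set operators assembled into a workflow" idea concrete, here is a minimal Python sketch. It is not the LAS or "Operator Square" API: the registry, the operator names (`normalize_text`, `tag_modality`) and the record fields are all hypothetical, chosen only to show how named, reusable operators could be chained in order.

```python
from typing import Callable, Dict, List

# A record flowing through the workflow; the field names are hypothetical.
Record = Dict[str, object]
Operator = Callable[[Record], Record]

# Hypothetical registry standing in for a catalogue of pre-set operators.
REGISTRY: Dict[str, Operator] = {}


def register(name: str) -> Callable[[Operator], Operator]:
    """Decorator that stores an operator in the registry under a name."""
    def wrap(fn: Operator) -> Operator:
        REGISTRY[name] = fn
        return fn
    return wrap


@register("normalize_text")
def normalize_text(rec: Record) -> Record:
    # Trim whitespace and lowercase the text field.
    return {**rec, "text": str(rec.get("text", "")).strip().lower()}


@register("tag_modality")
def tag_modality(rec: Record) -> Record:
    # Guess the modality from the file extension (illustrative mapping only).
    path = str(rec.get("path", ""))
    suffix = path.rsplit(".", 1)[-1].lower() if "." in path else ""
    modality = {"mp4": "video", "wav": "audio", "jpg": "image"}.get(suffix, "text")
    return {**rec, "modality": modality}


def build_workflow(names: List[str]) -> Operator:
    """Chain registered operators in the given order into one callable."""
    ops = [REGISTRY[n] for n in names]

    def run(rec: Record) -> Record:
        for op in ops:
            rec = op(rec)
        return rec

    return run


workflow = build_workflow(["normalize_text", "tag_modality"])
result = workflow({"path": "clip.MP4", "text": "  Hello  "})
```

In this sketch the list of operator names plays the role that the drag-and-drop canvas plays in the article: the workflow is defined by which operators are chosen and in what order, not by new algorithm code.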
Danghong Technology Officially Launches the BlackEye Vision Ultra-Long-Distance Robot Remote Control System
Zheng Quan Ri Bao Wang · 2025-08-06 07:15
Core Viewpoint
- Hangzhou Danghong Technology Co., Ltd. has officially launched the BlackEye Vision ultra-long-distance robot remote control system, a significant step toward commercializing ultra-low-latency remote control solutions in China [1][2]

Group 1: Technology Breakthroughs
- The BlackEye Vision system is built on three major technological breakthroughs that address key pain points in remote robot control [1]
- The self-developed frame-level encoding technology achieves ultra-low-latency transmission within 80 milliseconds, which the article describes as 20 milliseconds under the limit of human neural response [1]
- The system uses the BlackEye multimodal audiovisual model to compress video by a factor of 10 to 100, keeping video transmission high-definition even in narrow-bandwidth environments [1]

Group 2: System Capabilities
- The system can process multimodal data simultaneously, including video, audio, LiDAR, images, signaling, and text, which enables intelligent extraction of key information and semantic understanding of the environment [1]
- It supports AI capabilities at the edge, which extends the overall functionality of the remote control system [1]

Group 3: Application and Future Plans
- The system has been deployed in both pre-installed and retrofitted forms in products such as robotic dogs and inspection robots, and has proven effective in emergency rescue and industrial inspection scenarios [2]
- The company plans to extend the BlackEye Vision system to lighter-weight scenarios such as home environments, enabling real-time interaction and remote control [2]
- The launch is the company's first step into the robotics field, alongside its ongoing efforts to promote the adoption and innovation of intelligent video technology across scenarios [2]
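As a rough illustration of what the quoted figures mean in practice, the Python sketch below works through the arithmetic. The 8,000 kbps source bitrate is a hypothetical value, not a number from the article; only the 10x to 100x compression range and the 80 ms and 20 ms figures come from the summary.

```python
def compressed_bitrate_kbps(source_kbps: float, ratio: float) -> float:
    """Bitrate after compressing a stream by the given ratio."""
    if ratio <= 0:
        raise ValueError("compression ratio must be positive")
    return source_kbps / ratio


# Hypothetical source stream; the article does not state a bitrate.
SOURCE_KBPS = 8000.0

# Compression range quoted in the summary: 10x to 100x.
at_10x = compressed_bitrate_kbps(SOURCE_KBPS, 10)
at_100x = compressed_bitrate_kbps(SOURCE_KBPS, 100)

# The summary quotes 80 ms latency, 20 ms under the human response limit,
# so the reference limit implied by those two figures is their sum.
LATENCY_MS = 80
MARGIN_MS = 20
implied_limit_ms = LATENCY_MS + MARGIN_MS

print(f"10x:  {at_10x:.0f} kbps")
print(f"100x: {at_100x:.0f} kbps")
print(f"implied response limit: {implied_limit_ms} ms")
```

Under that assumed source bitrate, the stream would need between 80 and 800 kbps after compression, which is the sense in which the claim about narrow-bandwidth links can be read.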