Workflow
人机协作
icon
Search documents
科学与健康|机“慧”共生 人形机器人在2025世界人工智能大会展现澎湃动力
Xin Hua She· 2025-07-27 10:46
Core Insights - The World Artificial Intelligence Conference in Shanghai showcased over 60 intelligent robots, highlighting advancements in both industrial and service applications [2][4] - The event emphasized the transition of humanoid robots from mere demonstrations to practical solutions in various sectors, indicating a significant shift towards commercialization [6][9] Group 1: Industry Developments - The exhibition area exceeded 70,000 square meters for the first time, featuring robots capable of diverse tasks, from combat to service [2][3] - Shanghai Electric introduced a humanoid robot designed for industrial environments, capable of autonomous identification and handling of various box sizes, enhancing warehouse efficiency [3] - The XMAN-F1 service robot by Qianlang Intelligent showcased its ability to prepare popcorn and customize drinks, indicating the potential for future service applications [4] Group 2: Technological Advancements - The progress in robot components, such as tactile grippers using innovative sensory technology, allows robots to handle fragile items with precision [5] - The collaboration between large models for task planning and small models for execution is expected to advance robotic capabilities significantly [7] Group 3: Human-Robot Interaction - The conference highlighted the importance of understanding in human-robot collaboration, suggesting that true cooperation relies on mutual comprehension of intentions [9] - Industry leaders emphasized the need for ethical frameworks and policies to ensure the safe integration of robots into society, moving beyond their role as mere tools [9][10]
三工视频 · 新360行之生成式人工智能导演丨AI不是对手,而是超级助手
Huan Qiu Wang Zi Xun· 2025-07-26 07:55
Core Insights - The rapid development of AI technology has transitioned from simple image generation to high-quality video content creation, leading to the emergence of generative AI directors in the film industry [1][3] - The film "Tracing Back" directed by Wang Xingjian and Yang Xiaolu showcases the full integration of AI in the filmmaking process, raising questions about the core value of directors when machines can generate images [1][3] Group 1: AI in Filmmaking - Generative AI is now used throughout the entire production process, from script visualization to final rendering, demonstrating the speed of technological iteration [1] - The directors emphasize that while AI enhances efficiency, emotional expression still requires human intervention, with AI improving animation production efficiency by 50% [3][5] Group 2: Human-AI Collaboration - The directors express cautious optimism regarding the potential of AI, suggesting that human emotions and feelings will guide AI's use in filmmaking [5] - A collaborative model is envisioned where AI handles bulk rendering, allowing humans to focus on emotional decision-making, potentially reshaping the film industry workflow [5] Group 3: Philosophical Reflections - The directors reflect on the essence of creativity, with Wang comparing AI's role to that of photography and painting, suggesting that directors should capture genuine emotional moments [5] - Yang views AI as an "accelerator of imagination," emphasizing its value in freeing creators from repetitive tasks to focus on more creative narrative construction [5]
弗吉尼亚大学提出Moving Out:实现物理世界人机无缝协作!
具身智能之心· 2025-07-25 07:11
Core Insights - The article emphasizes the need for a benchmark that simulates physical interactions and diverse collaboration scenarios to enhance the adaptability and generalization capabilities of intelligent agents in human-robot collaboration [3][6]. Group 1: Key Innovations - Introduction of the Moving Out benchmark, a physically-grounded human-robot collaboration environment that simulates various collaborative modes influenced by physical properties and constraints [8]. - Design of two evaluation tasks aimed at assessing the adaptability of intelligent agents to human behavioral diversity and their ability to generalize to unknown physical properties [10][11]. - Proposal of the BASS method, which enhances collaboration performance in physical environments through behavior augmentation, simulation, and action selection [13][14]. Group 2: Experimental Results - The BASS method demonstrated superior performance in both AI-AI and human-robot collaboration compared to baseline methods such as MLP, GRU, and Diffusion Policy [15][18]. - Evaluation metrics included Task Completion Rate (TCR), Normalized Final Distance (NFD), Waiting Time (WT), and Action Consistency (AC), with BASS showing significant improvements in these areas [16][17]. - User studies indicated that BASS significantly outperformed Diffusion Policy in terms of usefulness and physical understanding, reducing issues like object handover failures and delays in assistance [18]. Group 3: Related Work - Existing human-AI collaboration research has limitations, and Moving Out addresses these by providing a physically-grounded environment, diverse collaboration modes, and continuous state-action spaces [19][21]. - Previous works often focused on discrete environments with limited physical attributes or lacked independent task division, highlighting the need for more comprehensive evaluation methods that consider physical interactions [21].
AI来了,打工人能快乐摸鱼吗?
腾讯研究院· 2025-07-22 08:41
Core Viewpoint - The article emphasizes that AI is not meant to replace humans but to alleviate their workload by taking over repetitive and low-value tasks, allowing employees to focus on more meaningful work [2][5][27]. Group 1: AI's Role in the Workplace - A significant portion of the workforce is already utilizing AI for various tasks, with 36% of jobs seeing AI involvement in at least 25% of daily tasks [2]. - The Stanford study reveals that employees prefer AI to handle mundane tasks such as scheduling appointments and data entry, rather than creative or high-judgment tasks [6][12]. - Over 46% of evaluated tasks were rated highly by workers as tasks they would like AI to take over, particularly those that are repetitive and low-value [8]. Group 2: Task Classification and Human Agency - The study categorized tasks into five levels based on human involvement, with a majority of respondents favoring a collaborative approach (H3) rather than complete AI takeover (H1) [17][18]. - The "Human Agency Scale" indicates that most workers are not opposed to AI but seek a partnership where AI handles routine tasks while humans retain decision-making roles [18][19]. Group 3: Skills and Future Workforce Dynamics - The research indicates a shift in the value of skills, with traditional high-paying skills becoming more automated, while interpersonal and management skills are becoming increasingly valuable and irreplaceable [20][23]. - The future workforce will prioritize skills such as judgment, empathy, and cross-team communication, which AI cannot easily replicate [25][26]. Group 4: Misalignment of AI Development and User Needs - There is a notable mismatch between the tasks AI developers focus on and the actual needs of users, leading to potential inefficiencies in AI deployment [14][17]. - Many AI companies are investing in areas where user willingness to adopt AI is low, which could hinder the overall acceptance and effectiveness of AI solutions in the workplace [15][17]. Group 5: The Ideal AI Partnership - The article concludes that the ideal AI should not be a replacement but a partner that understands when to step back, allowing humans to focus on tasks that require creativity and interpersonal interaction [28][30].
生成式AI引领微软工作革新:UX设计与人机协作的新篇章
Sou Hu Cai Jing· 2025-07-20 08:07
Core Insights - Generative AI is becoming a key driver of transformation in the UX design field, with Microsoft Digital Division leading the charge by integrating advanced AI technologies like Microsoft Copilot into daily design and development workflows [1][2] - The introduction of AI is fundamentally changing the collaboration dynamics within product development teams, allowing for real-time alignment of plans and goals, thus breaking away from traditional linear workflows [4][5] Group 1: Impact on Design Processes - The workflow of product designers has been revolutionized, shifting from meticulous prototyping for each interface to a more dynamic and adaptive process facilitated by generative AI [1][2] - Designers are now focusing on prompt expression logic and dynamic adaptive card design, moving away from standard UI design [2][4] Group 2: Collaboration and Efficiency - AI enables designers to collaborate directly with product managers and engineers, significantly enhancing work efficiency by allowing simultaneous design prompt discussions [4][5] - The design philosophy has evolved into an open and abstract framework, presenting both challenges and opportunities for closer collaboration among team members [4][5] Group 3: User Testing and Experience - Generative AI has the potential to optimize user testing by covering all usage scenarios, leading to more precise and efficient testing outcomes [5] - AI is reshaping user experience by enabling a more integrated and comprehensible interaction across various interfaces and applications [5][6] Group 4: Broader Applications of AI - AI is seen as a powerful tool for assisting employees in managing daily tasks, from finding optimal parking spots to integrating feedback for performance evaluations [6][10] - The relationship between humans and machines is evolving from simple interaction to deep collaboration, emphasizing the need for adaptive and personalized design [10]
AI如何赋能科技教育?看深圳罗外实现精准赋能到创意迸发
Nan Fang Du Shi Bao· 2025-07-18 00:41
Core Insights - AI is transforming education from a mere tool to a "transformational engine" that injects innovative momentum into the educational ecosystem [2] - The integration of AI in classrooms enhances creativity and collaboration among students, allowing them to become "AI collaborative designers" [4][5] - The role of teachers is evolving into "intelligent teaching architects," while students take on the role of "project managers" in the design process [5][7] Group 1: AI in Creative Education - The course "AI-Driven Campus Cultural Innovation" focuses on the relationship between user needs, technology implementation, and design aesthetics [2] - Students engage in a creative process that involves inputting ideas, generating designs through AI, and iterating on those designs [4] - The use of laser engraving technology allows students to see their digital designs materialize into physical objects within 15 minutes [5] Group 2: AI in Programming Education - The "AI Empowered Python Basics" course combines coding logic with AI safety, making complex programming concepts more accessible [7] - Students practice real-world tasks like building calculators, enhancing their understanding of programming logic and AI's role in simplifying code [7] - A focus on "AI safety defense" teaches students about the risks associated with code execution, reinforcing secure programming practices [7] Group 3: AI in Interdisciplinary Learning - The course "When Museums Meet AI" merges technology with humanities, allowing students to explore AI's applications in museum curation [8] - Students act as "digital curators," using AI to assist in the entire process from theme conception to technical planning [8] - The course emphasizes the balance between AI creativity and historical authenticity, ensuring that students ground their AI-generated content in factual research [9] Group 4: AI as a Collaborative and Evaluative Tool - AI serves as a "creative catalyst," helping students expand their ideas through cross-disciplinary knowledge [10] - During the creation process, AI acts as a "work incubator," facilitating the transition from concept validation to final product [11] - In the iterative phase, AI provides multidimensional evaluation, offering precise optimization suggestions based on industry standards [13]
议程公布 | 2025智能机器人关键技术大会——具身智能专题论坛、康养机器人专题论坛
机器人圈· 2025-07-17 13:40
Core Viewpoint - The "2025 Intelligent Robot Key Technology Conference" will be held in Qiqihar City from July 22-24, 2025, focusing on advancements in intelligent robotics and their applications across various industries [1]. Group 1: Embodied Intelligence Forum - The "Embodied Intelligence Forum" will take place on the afternoon of July 23, 2025, emphasizing core technological innovations and cross-industry applications in embodied intelligence [2]. - The forum will feature expert reports and PhD flash presentations aimed at promoting the full-chain development of embodied intelligence from theoretical breakthroughs to industrial implementation [2]. Group 2: Expert Reports - Key presentations include: - "Cognitive Navigation Technology for Embodied Intelligence" by Professor Yue Yufeng from Beijing Institute of Technology, addressing dynamic environment perception and autonomous decision-making [3]. - "High-Quality Development Path for Mining Embodied Intelligent Robots" by Wang Lei, focusing on intelligent solutions for specialized scenarios [3]. - "Dynamic Locomotion Control of Legged Robots" by Professor Zhang Guoteng, innovating adaptive technologies for complex terrains [3]. - "Human-Machine Collaboration Driven by Cross-Modal Embodied Intelligence" by Associate Professor Yang Kun, exploring multi-modal perception integration and operational optimization [3]. - "Fall Prediction Research Based on Transfer Learning and Attention Fusion ResNet" by Professor Wu Chuanyan, enhancing intelligent health monitoring systems [3]. - "Skill Learning for Robot Manipulation of Flexible Objects" by Fu Tianyu, tackling challenges in unstructured environments [3]. Group 3: PhD Flash Presentations - The forum will also showcase young scholars presenting cutting-edge research on the application innovations of embodied intelligence in industrial and medical fields, highlighting the youthful energy driving technological implementation [4]. Group 4: Health and Rehabilitation Robots Forum - The "Health and Rehabilitation Robots Forum" will be held on the morning of July 24, 2025, addressing technological solutions to aging challenges [6]. - Expert reports will cover topics such as: - "Robot Empowerment Paths for China's Aging Population" by Zhang Jianhua, outlining technological routes to address aging society issues [6]. - "Technological Innovation in Elderly Care Services and Applications of Care Robots" by Lan Zhi, discussing care scenarios across institutions, communities, and homes [6]. - "Key Technologies and Clinical Research of Lower Limb Rehabilitation Exoskeleton Robots" by Guo Zhao, revealing new mechanisms for gait reconstruction and neural compensation [6]. - "Intelligent Gait Analysis and Clinical Applications" by Ji Bing, driving innovations in AI-enabled rehabilitation assessment paradigms [6]. - "Design and Implementation of Acupuncture Robot Systems" by He Zhaoshui, overcoming automation challenges in traditional therapies [6]. - "Bionic Arm Systems with Multi-Modal Tactile Perception" by Zhang Ting, exploring fine manipulation challenges in human-robot interaction [6]. - "Personalized Rehabilitation Assessment and Motion Control Optimization Driven by Muscle Coordination" by Sheng Yixuan, pioneering personalized functional reconstruction solutions [6]. Group 5: Youth Innovation Reports - The forum will feature flash presentations from young scholars on topics such as: - "Minimum Impact Trajectory Planning for Lower Limb Rehabilitation Robots" by Wang Xincheng [7]. - "Cardiovascular Health Risk Perception Technology Based on Multi-Sensor Fusion" by Xie Shiqin [7]. - "Design and Analysis of Multi-Posture Lower Limb Rehabilitation Robots" by Yu Hongfei [7]. - "Development of Portable Multi-Channel fNIRS Systems" by Xiang Jiayao [7].
这家AI律所爆火,1小时搞定合同审核,红杉、贝恩都看上了
3 6 Ke· 2025-07-15 04:19
Core Insights - Crosby has secured $5.8 million in seed funding from Sequoia Capital and Bain Capital, focusing on enhancing sales contract review efficiency for high-growth go-to-market (GTM) startups [2][3] - The company aims to reduce traditional contract review times from 2-7 days to under 60 minutes, achieving an 80% acceleration in the review process [2][12] - Crosby's innovative approach combines AI pre-processing with licensed attorney review, creating a unique dual-entity operational model that enhances both speed and compliance [11][13] Company Overview - Crosby targets the $300 billion legal services market, addressing the significant pain point of slow contract processing that hampers business progress [4][5] - The company positions contracts as "APIs," promoting a seamless integration of legal processes into business operations [5][7] - Crosby's automated contract processing system includes features like risk clause identification, integration with tools like Slack and CRM, and real-time contract status updates [9][15] Market Strategy - The company adopts a proactive approach, directly addressing the needs of high-growth GTM startups that require rapid contract processing to avoid revenue delays [13][19] - Crosby's fixed pricing model replaces traditional hourly billing, providing startups with predictable cost structures [19] - The company has successfully implemented pilot projects with clients like Cursor, Clay, and UnifyGTM, showcasing its efficiency in contract processing and driving growth through real user feedback [16][20] Team Composition - Crosby's team consists of experienced legal professionals from prestigious backgrounds, ensuring high-quality contract review alongside its AI capabilities [22] - The engineering team, with experience from major tech companies, supports rapid iteration and development of the company's hybrid architecture [22]
重磅!Science子刊最新封面!里程碑突破:机器人首次自主手术100%成功!
机器人大讲堂· 2025-07-11 10:35
Core Viewpoint - The article highlights a significant breakthrough in surgical automation with the introduction of the SRT-H surgical robot, which can independently perform complex soft tissue surgeries without direct human intervention, achieving a 100% success rate in gallbladder removal surgeries [1][3]. Group 1: Technological Advancements - The SRT-H robot completed 8 gallbladder surgeries autonomously, demonstrating the ability to handle 17 different task instructions seamlessly [2][3]. - The robot can self-correct during procedures, averaging 6 self-corrections per surgery, showcasing its adaptability in complex surgical environments [3][21]. - The robot's design includes a "layered brain" architecture, separating high-level strategy and low-level execution, allowing it to understand and execute commands in natural language [11][15]. Group 2: Surgical Complexity and Training - Gallbladder surgery was chosen for testing due to its commonality and moderate difficulty, requiring precise control and coordination [7][10]. - The research team trained the robot using over 16,000 recorded trajectories from experienced surgeons, ensuring a robust dataset for learning [18][20]. - The robot's ability to recognize and adapt to varying anatomical structures during surgery was validated through diverse gallbladder samples [10][12]. Group 3: Performance Comparison - In a head-to-head comparison, the robot demonstrated superior precision and stability in surgical tasks compared to an experienced human surgeon, although the surgeon was faster [31][34]. - The robot's average motion stability was significantly better, with lower mean jerk values compared to human performance [33][34]. Group 4: Future Implications - The ultimate goal is to develop a "universal surgical robot" capable of performing various surgical procedures autonomously, currently classified at Level IV autonomy [38][40]. - The potential for robots to assist in remote or extreme environments, such as space or deep-sea operations, is emphasized, indicating a transformative impact on healthcare delivery [40][41].
Devin 教你做 Agent:把 AI 当做需要指导的初级开发者
Founder Park· 2025-07-07 12:08
Core Insights - The article emphasizes the importance of treating AI as a junior developer that requires clear guidance rather than a magical tool, highlighting the need for engineers to adapt their management style to effectively utilize programming agents [1][3][9] - Senior engineers are found to be the quickest adopters of these tools, which can save approximately 80% of time on medium to large tasks [1][8][24] Introduction - The article introduces a practical guide based on two years of experience building Devin, an autonomous programming agent, and aims to share valuable insights from customer feedback and internal practices [1][3] Getting Started: Basics and Daily Applications - Key principles for effective communication with agents include providing specific instructions, indicating starting points, anticipating potential errors, and establishing a feedback loop [10][11][13][15] - The guide suggests integrating agents into daily workflows to enhance personal efficiency, such as handling new requests without interrupting deep work and managing urgent issues on the go [17][19][20] Intermediate: Managing Complex Tasks - For complex tasks, the article recommends having agents draft initial versions and collaborating on implementation plans, while also setting checkpoints to ensure alignment with expectations [23][25][26] - It emphasizes the importance of teaching agents how to validate their work and increasing testing coverage in areas frequently modified by AI [28][29] Advanced: Automation and Customization - The article discusses creating automation templates for repetitive tasks and implementing intelligent code reviews using agents [30][33] - It highlights the need for a unified development environment to enhance agent performance and suggests building custom tools to empower agents [35][36] Practical Considerations: Embracing Change - The article outlines the limitations of autonomous agents, such as their debugging capabilities and knowledge cut-off dates, advising users to manage expectations and time effectively [39][42][43] - It concludes by asserting that the value of software engineers will not diminish, as deep technical knowledge and understanding of business codebases remain essential in the evolving landscape of software development [50]