Workflow
语音助手
icon
Search documents
钉钉联手通义实验室发布Fun-ASR语音识别大模型,支持企业专属模型定制训练
Xin Lang Ke Ji· 2025-08-22 05:21
Core Insights - The collaboration between DingTalk and Tongyi Laboratory has led to the launch of a new voice recognition model, Fun-ASR, which can understand industry-specific jargon across ten sectors, including home decoration and animal husbandry [1][2] - Fun-ASR has been integrated into various DingTalk functionalities such as meeting subtitles, simultaneous interpretation, smart minutes, and voice assistants [1] Technical Highlights - Fun-ASR enhances the recognition capability of industry-specific vocabulary, trained on over one hundred million hours of audio data, and co-created with real scenarios from DingTalk's multi-industry clients [1] - The model features improved contextual awareness and understanding, utilizing existing information within DingTalk, such as contact lists and schedules, to optimize inference and provide reliable transcription results [1] - Fun-ASR supports customized voice recognition model training for enterprises with advanced needs, allowing for algorithm optimization based on real scenario voice data provided by the companies [1] Future Plans - The voice team leader from Tongyi Laboratory expressed excitement about the partnership with DingTalk, aiming to expand the data and model scale of Fun-ASR to enhance the replicability of voice intelligence solutions for enterprise clients [2] - DingTalk's CTO highlighted the rapid development of Fun-ASR within three months of collaboration, achieving recognition from leading clients and marking a significant breakthrough towards industry leadership [2]
J.D. Power研究:先进配置渗透率大幅上升
Core Insights - The 2025 China Automotive Intelligent Experience Study (TXI) by J.D. Power reveals a significant increase in the intelligent innovation index, marking a new high in the automotive industry's smart technology integration [1][2] - The study indicates that the penetration rate of advanced configurations has risen sharply, reflecting the rapid development of AI-assisted driving and chip computing power [2] Group 1: Intelligent Innovation Index - The intelligent innovation index increased by 38 points to 588 in 2025, following a 22-point rise in 2024 [2] - The market depth index, which measures the penetration of advanced configurations, rose by 80 points to 275 [2] Group 2: Market Trends - The new energy vehicle market depth saw a significant year-on-year increase of 86 points, reaching 339, solidifying its leading position in intelligent development [2] - There is a noticeable polarization in brand performance, with domestic brands rapidly iterating and responding to user demands, while international brands focus on technological maturity and functional stability [2] Group 3: User Experience and Expectations - User complaints regarding technology configurations have increased, with an average of 10.7 complaints per advanced technology configuration, indicating a shift towards optimizing user experience rather than merely adding features [3] - The perception of advanced driver-assistance systems (ADAS) has significantly improved, with consumers increasingly expecting human-like driving experiences [3] Group 4: Quality and Usability Challenges - Overall quality complaints in the industry rose by 34.8 per 100 vehicles, with usability issues becoming more prominent, particularly in smart cockpit configurations [4] - The proportion of complaints related to "difficult to understand/use" issues in smart cockpit configurations increased to 36% [4] Group 5: Safety as a New Focus - Consumer demand for safety features has surged, with high usage frequency and repurchase intent for safety-related configurations such as departure assistance and driver monitoring [4]
一文读懂数据标注:定义、最佳实践、工具、优势、挑战、类型等
3 6 Ke· 2025-07-01 02:20
Group 1 - The importance of data annotation for AI and ML is highlighted, as it enables machines to recognize patterns and make predictions by providing meaningful labels to raw data [2][5] - According to MIT, 80% of data scientists spend over 60% of their time preparing and annotating data rather than building models, emphasizing the foundational role of data annotation in AI [2][5] - Data annotation is defined as the process of labeling data (text, images, audio, video, or 3D point cloud data) to enable machine learning algorithms to process and understand it [3][5] Group 2 - The data annotation field is rapidly evolving, significantly impacting AI development, with trends including the use of annotated images and LiDAR data for autonomous vehicles, and labeled medical images for healthcare AI [5][6] - The global data annotation tools market is projected to reach $3.4 billion by 2028, with a compound annual growth rate of 38.5% from 2021 to 2028 [5][6] - AI-assisted annotation tools can reduce annotation time by up to 70% compared to fully manual methods, enhancing efficiency [5][6] Group 3 - The quality of AI models is heavily dependent on the quality of their training data, with well-annotated data ensuring models can recognize patterns and make accurate predictions [5][6] - A 5% improvement in annotation quality can lead to a 15-20% increase in model accuracy for complex computer vision tasks, according to IBM research [5][6] - Organizations typically spend between $12,000 to $15,000 per month on data annotation services for medium-sized projects [5][6] Group 4 - Currently, 78% of enterprise AI projects utilize a combination of internal and outsourced annotation services, up from 54% in 2022 [5][6] - Emerging technologies such as active learning and semi-supervised annotation methods can reduce annotation costs by 35-40% for early adopters [5][6] - The annotation workforce has shifted significantly, with 65% of annotation work now conducted in specialized centers in India, the Philippines, and Eastern Europe [5][6] Group 5 - Various data annotation types include image annotation, audio annotation, video annotation, and text annotation, each requiring specific techniques to ensure effective machine learning model training [9][11][14][21] - The process of data annotation involves several steps, from data collection to quality assurance, ensuring high-quality and accurate labeled data for machine learning applications [32][37] - Best practices for data annotation include providing clear instructions, optimizing annotation workload, and ensuring compliance with privacy and ethical standards [86][89]
专家建议:App适老化并非简单做“加减法”
Xin Jing Bao· 2025-06-01 02:17
Core Viewpoint - The article emphasizes the need for a comprehensive approach to app adaptation for the elderly, moving beyond superficial changes to create a user-friendly ecosystem that caters to their specific needs [1][2][3]. Group 1: Current Challenges in App Adaptation - Many apps only implement superficial changes like font enlargement and simplified interfaces, failing to address deeper usability issues [1]. - Complex interaction processes and low voice recognition success rates hinder elderly users, leading to operational failures [1]. - Some apps reduce functionality instead of enhancing it, limiting the choices available to elderly users [1]. Group 2: Systematic Optimization Suggestions - Experts advocate for systematic interaction optimization rather than mere reduction of features, focusing on core functions relevant to elderly users [2]. - A user stratification design strategy is recommended, offering different interface complexities for "digital immigrants" (under 70) and "digital refugees" (over 75) [2]. - The design should allow for flexible interface complexity adjustments based on individual user capabilities and preferences [3]. Group 3: Multi-Sensory Feedback and Interaction - Emphasis on multi-sensory feedback is crucial, integrating visual, auditory, and tactile cues to enhance user experience and reduce errors [3][5]. - Voice interaction is highlighted as a key alternative to traditional interfaces, with suggestions for creating a voice corpus tailored to elderly users [4]. - The importance of emotional prioritization in voice assistant interactions is noted, advocating for customizable speech parameters to improve user comfort [5]. Group 4: Hardware and Ecosystem Considerations - The concept of "product ecosystem adaptation" is introduced, suggesting that elderly-friendly design should extend beyond apps to include hardware solutions [6]. - Development of "screenless voice devices" is proposed to meet basic needs without the complications of touchscreens [6]. - Community and family involvement is essential for effective voice system integration, with suggestions for remote assistance features [7]. Group 5: Policy and Community Support - The article calls for government-led initiatives to establish standards and certifications for elderly-friendly apps, ensuring accessibility and usability [7]. - Community resources should be mobilized to provide digital literacy training for elderly users, enhancing their confidence and skills [8]. - The need for a holistic approach that combines app adaptation with real-world support systems is emphasized, ensuring a seamless user experience [9]. Group 6: Towards an Inclusive Digital Environment - The shift from "elderly adaptation" to "age-inclusive design" is advocated, promoting designs that cater to all users regardless of age [9][10]. - The ultimate goal is to create a digital environment where elderly users do not feel they are using a "special version" of an app, but rather a universally accessible tool [10].