Workflow
深度学习
icon
Search documents
远洋鱿钓渔情预报系统“苍鹭”发布
Core Insights - The "Canglu" AI squid fishing forecast system has been officially launched, developed by Shanghai Ocean University and two subsidiaries of China National Fisheries Corporation, filling a gap in intelligent forecasting for distant squid fishing in China [1][2] Group 1: System Features - The system provides precise fishing ground forecasts for the next five days and resource abundance predictions for the following year, enhancing operational efficiency in squid fishing [1] - It integrates multiple platforms including web, onboard, and mobile, allowing real-time access to nearly 20 marine factors related to safe and efficient production [1] - The system employs AI, big data mining, and deep learning technologies, combined with knowledge from biology, ecology, and fishery science, to improve the accuracy of squid fishing forecasts [1] Group 2: Impact on Industry - The application of the "Canglu" system on distant fishing vessels has the potential to increase single-vessel output to approximately 500 tons by October 2025, representing a 48% increase compared to the historical annual output of over 330 tons [2] - The system has successfully extended the squid fishing season in the North Pacific to November for the first time, indicating a significant enhancement in operational capabilities [2]
研判2025!中国文本转语音技术行业发展历程、产业链、发展现状、竞争格局及趋势分析:作为人机交互的重要组成部分,行业应用需求不断扩大[图]
Chan Ye Xin Xi Wang· 2025-11-10 00:59
Core Insights - The text-to-speech (TTS) technology is becoming a crucial part of social development, enhancing information accessibility and providing equal opportunities for special groups [1][10] - The market size of China's TTS technology industry is projected to reach 18.76 billion yuan in 2024, reflecting a year-on-year increase of 22.77% [1][11] - The industry is experiencing a shift from early mechanical simulations to advanced AI-driven systems capable of generating human-like speech [1][11] Industry Overview - TTS technology converts text into speech, allowing users to hear content without reading, thus breaking the limitations of information transmission [4][10] - The technology's core value lies in enabling human-machine interaction through natural speech [4][10] Technical Mechanism - The TTS process involves three main components: text preprocessing, speech synthesis, and speech output [5][6] - Text preprocessing includes tasks like word segmentation and semantic understanding, while speech synthesis uses complex algorithms to generate speech signals [5][6] Industry Chain - The TTS industry chain consists of upstream (hardware and algorithm support), midstream (core technology), and downstream (application fields like education, finance, and media) [8][10] - In education, TTS technology is used for personalized learning experiences, aiding students with reading disabilities [8][10] Market Dynamics - The network audio-visual industry, a key segment of new media, is increasingly utilizing TTS technology for content creation, with the user base expected to reach 1.091 billion by 2024 [9][10] Competitive Landscape - The TTS industry is characterized by international technology leadership and domestic market focus, with major players like Google and Microsoft in high-end markets, while domestic companies excel in Chinese language applications [11][12] - Key domestic companies include iFlytek, Baidu, and Yunzhisheng, with competition expected to intensify around edge computing and ethical technology [11][12] Future Trends - The industry is moving towards human-like expression and long-scene adaptability, with emotional expression becoming a core breakthrough point [14][15] - Multi-modal integration is anticipated to enhance TTS capabilities, allowing for collaborative content production across various media [15][16] - As the industry grows, regulatory frameworks will strengthen, focusing on data privacy and voice copyright protection [16]
“我不想一辈子只做PyTorch!”PyTorch之父闪电离职,AI 圈进入接班时刻
AI前线· 2025-11-08 05:33
Core Insights - Soumith Chintala, the founder of PyTorch, announced his resignation from Meta after 11 years, marking a new leadership phase for the popular open-source deep learning framework [2][4] - PyTorch has become a core pillar in global AI research, supporting exascale AI training tasks and achieving over 90% adoption among major AI companies [2][9] Group 1: Chintala's Contributions and Career - Chintala played a pivotal role in advancing several groundbreaking projects at Meta's FAIR department, including GAN research and the development of PyTorch [5][12] - He rose from a software engineer to vice president in just eight years, a rapid ascent closely tied to the rise of PyTorch [5][10] - His departure comes amid significant layoffs at Meta AI, affecting around 600 positions, including those in the FAIR research department [4][6] Group 2: PyTorch's Development and Impact - PyTorch, created in 2016, evolved from the earlier Torch project and has become the standard framework in both academic and industrial settings [12][15] - The framework's success is attributed to its community-driven approach, user feedback, and the integration of features that meet real-world needs [15][16] - PyTorch has gained a reputation for its ease of use and flexibility, making it a preferred choice among researchers and developers [15][16] Group 3: Future Directions and Chintala's Next Steps - Chintala expressed a desire to explore new opportunities outside of Meta, emphasizing the importance of understanding the external world and returning to a state of "doing small things" [20][21] - He acknowledged the strong leadership team now in place at PyTorch, which gives him confidence in the framework's future [21]
AI六巨头同台:AGI,不再是“未来”的事了
3 6 Ke· 2025-11-08 01:43
Core Insights - The roundtable discussion among AI pioneers indicates that General Artificial Intelligence (AGI) is no longer a distant goal but is beginning to manifest in real-world applications [1] - The conversation highlights a paradigm shift in AI development, with varying perspectives on the timeline and nature of AGI [21][32] Group 1: Evolution of AGI - The emergence of AGI is a result of 40 years of gradual evolution rather than a sudden breakthrough [2] - Key figures in AI, such as Geoffrey Hinton and Yoshua Bengio, shared their pivotal moments that led them to pursue AI research, emphasizing the foundational work that has shaped today's AI landscape [3][4][10][14] - The collective contributions of these pioneers have created a historical framework for understanding AI's development, with each playing a unique role in advancing the field [20] Group 2: Perspectives on AGI's Timeline - Different experts provided varied timelines regarding the realization of human-level intelligence, reflecting their distinct understandings of intelligence itself [21][34] - Yann LeCun suggested that AGI will evolve gradually over the next five to ten years, rather than appearing as a singular event [23] - Fei-Fei Li pointed out that certain AI capabilities have already surpassed human abilities in specific areas, indicating that some aspects of AGI are already present [25] - Huang Renxun emphasized that AGI-level intelligence is already being applied in practical scenarios today [28] - Geoffrey Hinton predicted that machines will outperform humans in debates within the next 20 years, signaling a significant advancement in AI capabilities [29] - Yoshua Bengio noted the exponential growth in AI's planning abilities over the past six years, suggesting that AI could reach engineer-level capabilities within five years, though he cautioned against making definitive predictions [31] Group 3: Transition from Language to Action - The discussion highlighted a shift from AI's focus on language capabilities to the need for action-oriented intelligence [35] - Fei-Fei Li stressed the importance of spatial intelligence and the ability to perform tasks, which current AI models struggle with [37] - LeCun argued that existing large language models are far from achieving true intelligence and emphasized the need for self-organizing learning methods [39][41] - Huang Renxun described AI's evolution from a tool to a production system, capable of executing tasks in real-time, thus marking a significant paradigm shift in AI's role [43][44] Conclusion - The dialogue concluded that AGI is not a product that will launch on a specific date but is already permeating various sectors and processes [48] - The rapid advancements in AI suggest that the landscape will continue to evolve, potentially leading to a different world in the near future [49][50]
一文读懂人工智能在供应链领域的典型应用
3 6 Ke· 2025-11-07 06:31
Overview - The article discusses the transformative impact of artificial intelligence (AI) and machine learning (ML) on marketing and supply chain management, emphasizing the need for businesses to adapt to these technologies for improved decision-making and operational efficiency [1][6]. AI Terminology Overview - AI encompasses a broad field focused on creating machines capable of tasks requiring human-like intelligence, while ML is a subset of AI that enables computers to learn from data without explicit programming [2][4]. Importance of AI - AI is being rapidly adopted across industries as it directly correlates with business efficiency, profitability, and competitiveness, moving beyond experimental phases to practical applications in daily operations [6][9]. Applications of AI in Marketing - AI is utilized in marketing through personalized recommendations, customer service chatbots, and predictive analytics, enhancing customer engagement and operational effectiveness [10][12]. Marketing's Impact on Supply Chain - Marketing activities can trigger demand shocks, necessitating a responsive supply chain to avoid stockouts and missed revenue opportunities, highlighting the interconnectedness of marketing and supply chain functions [13][15]. Challenges in Modern Supply Chains - Modern supply chains face challenges such as complexity, uncertainty, speed expectations, and sustainability, driving the need for AI to enhance demand forecasting and proactive measures [19][20]. AI in Demand Forecasting and Planning - AI enhances demand forecasting and planning by integrating time series analysis with machine learning, allowing for more accurate predictions and operational actions [20][22]. AI in Inventory Optimization - AI aids in inventory management by determining optimal stock levels based on real-time data and demand forecasts, balancing availability and cost [24][26]. AI in Logistics and Transportation - AI transforms logistics by optimizing delivery routes, predicting arrival times, and enabling predictive maintenance, thus improving efficiency and reliability [27][29]. AI in Supplier and Risk Management - AI strengthens supplier and risk management through continuous performance analysis and real-time monitoring of external events, allowing for proactive risk mitigation [33][34]. AI in Warehousing and Automation - AI automates and optimizes warehousing processes, improving accuracy and efficiency in inventory handling and order fulfillment [37][38]. AI in Sustainability and ESG - AI supports sustainability efforts by optimizing processes to reduce waste and emissions, facilitating the transition to circular supply chains [38][40]. Unified Perspective on Marketing and Supply Chain - Understanding AI's value requires viewing marketing and supply chain as interconnected systems, where AI synchronizes demand creation and fulfillment [61][63]. Emerging Trends in AI-Driven Supply Chains - New trends in AI include digital twins for simulation, proactive AI agents for planning, and visual models for real-time monitoring, indicating a shift towards more autonomous and intelligent supply chain operations [66][67].
ICML 2026新规「避坑」指南:参会非必须、原稿将公开、互审设上限
机器之心· 2025-11-06 05:28
Core Points - The ICML 2026 conference will take place from July 7 to July 12, 2026, in Seoul, South Korea, with a double-blind review process for all submitted papers [4] - Authors of accepted papers can choose whether to attend the conference in person or only have their papers included in the proceedings [7] - The original submission versions of accepted papers will be made publicly available, and authors of rejected papers can also choose to make their original submissions public [10] Submission Requirements - Papers must be submitted as a single file, with a maximum of 8 pages for the main text, while references, impact statements, and appendices have no page limit [5] - There will be no separate submission deadline for supplementary materials, and authors can add one extra page to the final version of accepted papers [6] - Papers that do not comply with the submission requirements will be rejected without review [11] Important Dates - The submission website will open on January 8, 2026, with the abstract submission deadline on January 23, 2026, and the full paper submission deadline on January 28, 2026 [14][15] Review Process - Authors are required to participate in the review process, with specific mutual review requirements for both papers and authors [17] - The double-blind review policy prohibits simultaneous submissions to multiple conferences or journals [18] - All submissions must be anonymized and should not contain any information that could reveal the authors' identities [21] Ethical Guidelines - Each paper must include a potential societal impact statement, which should be placed at the end of the paper and will not count towards the page limit [23] - Authors must submit a plain language summary to communicate the significance of their research to the public [24] - Violations of the review process or ethical guidelines may result in sanctions or rejection of the submission [22][23]
奥特曼和纳德拉,艰难重组后首次对谈:「我们是天作之合」
3 6 Ke· 2025-11-03 00:23
Group 1 - The core of the article revolves around the significant partnership between Microsoft and OpenAI, which aims to reshape the future of AI through a newly established agreement [3][8][60] - The partnership began in 2019 when Microsoft invested $1 billion in OpenAI, providing essential funding and cloud computing resources to support AI model training [5][7] - The recent agreement marks a new phase in their relationship, with OpenAI restructuring to create a Public Benefit Corporation (PBC) under its non-profit foundation, allowing for both profit and public good [8][10] Group 2 - OpenAI's foundation now holds shares valued at $130 billion, making it one of the largest charitable foundations globally, with plans to invest $25 billion in healthcare and AI safety [10][12] - Microsoft holds approximately 32.5% of OpenAI's shares, valued at around $135 billion, binding the fates of both companies together [15][16] - The partnership has evolved into one of the most successful collaborations in the industry, with both leaders expressing optimism about the future value of OpenAI [17][18] Group 3 - The new agreement includes exclusive deployment of OpenAI's advanced AI models on Microsoft's Azure cloud platform for the next seven years, making Azure a central hub for AI development [19][20] - Microsoft reported a 27% year-over-year revenue increase in its intelligent cloud segment, driven by Azure's growth, particularly in AI-related contracts [20][23] - OpenAI has committed to a $250 billion pre-purchase contract for Azure resources, ensuring ample computing power for its AI model training [20][22] Group 4 - Both companies face challenges related to computing power shortages, which have limited OpenAI's ability to onboard new users and expand its models [24][26] - Microsoft has significantly increased its capital expenditures to build data centers and acquire AI chips, yet still struggles to meet the soaring demand for computing resources [26][27] - The leaders predict that AI computing power will remain tight for the next few years, despite potential future advancements in technology [28][29] Group 5 - The partnership is also driving a transformation in software paradigms, with AI changing how users interact with applications, moving from traditional interfaces to conversational agents [33][34] - Microsoft is integrating AI capabilities into its Office products, enhancing their value and user engagement, while also exploring new business models for AI assistants [36][39] - The collaboration is expected to boost productivity and economic growth, with predictions of a potential return to 4% annual growth in the U.S. economy due to AI advancements [52][53] Group 6 - The partnership between Microsoft and OpenAI is not just about profit but also focuses on ensuring that AI benefits humanity as a whole [64][65] - Both companies are actively involved in shaping regulations and standards for AI to promote safe and responsible development [62][64] - The collaboration exemplifies a blend of idealism and pragmatism, aiming to harness technological innovation for the greater good [64]
端到端和VLA,这些方向还适合搞研究
自动驾驶之心· 2025-11-03 00:04
Core Viewpoint - The article discusses the evolution of autonomous driving technology, highlighting the transition from rule-based systems to end-to-end models represented by companies like Ideal and XPeng, and currently to the world model phase represented by NIO, emphasizing the continuous presence of deep learning throughout these changes [1]. Group 1: Course Introduction - The course covers the development from modular production algorithms to end-to-end systems and now to VLA, focusing on core algorithms such as BEV perception, visual language models (VLM), diffusion models, reinforcement learning, and world models [5]. - Participants will gain a comprehensive understanding of the end-to-end technology framework and key technologies, enabling them to reproduce mainstream algorithm frameworks like diffusion models and VLA [5]. - Feedback indicates that students completing the course can achieve approximately one year of experience as end-to-end autonomous driving algorithm engineers, benefiting from the training for internships and job recruitment [5]. Group 2: Instructor Profile - The main instructor, Jason, holds a C9 undergraduate degree and a PhD from a QS top 50 university, with multiple published papers in CCF-A and CCF-B journals [6]. - He is currently an algorithm expert at a leading domestic manufacturer, engaged in the research and production of cutting-edge algorithms, with extensive experience in the development and delivery of autonomous driving perception and end-to-end algorithms [6]. Group 3: Research Guidance - The program aims to enhance practical skills and knowledge in cutting-edge topics, with a focus on helping students publish high-level papers to improve their academic prospects [8]. - The community includes over 300 instructors specializing in autonomous driving and embodied intelligence, with a high manuscript acceptance rate of 96% over the past three years [8]. Group 4: Research Process - The guidance process includes selecting research topics based on student interests, explaining key concepts, and providing essential foundational knowledge and recommended learning materials [11]. - Students will learn how to critically read literature, conduct research, and write various sections of a paper, including methods and experimental results, with continuous feedback and support throughout the process [11].
Meta裁员、OpenAI重组:万字复盘谷歌起笔的AI史诗,如何被「群雄」改写剧本?
机器之心· 2025-11-02 01:37
Core Insights - The AI industry is transitioning from a phase of rapid investment and growth to a more competitive and cost-conscious environment, as evidenced by layoffs and restructuring among major players like Meta, OpenAI, and AWS [1][2]. Group 1: Historical Context of AI Development - Google was founded with AI as a core principle, influenced by co-founder Larry Page's background in machine learning [5][9]. - The term "Artificial Intelligence" was first coined in 1956, but the field faced significant setbacks due to limitations in computing power and data, leading to two major "AI winters" [8]. - Larry Page's vision for Google included the belief that AI would be the ultimate version of their search engine, aiming to understand everything on the web [9][10]. Group 2: Key Innovations and Breakthroughs - Google's early AI efforts included the development of the PHIL language model, which significantly improved search functionalities and contributed to the company's revenue through AdSense [14][15][16]. - The introduction of neural networks and deep learning at Google was catalyzed by the arrival of key figures like Geoff Hinton, who advocated for the potential of deep learning [19][21]. - The "cat paper," which demonstrated a deep learning model's ability to recognize images without supervision, marked a significant milestone for Google Brain and had profound implications for YouTube's content understanding [30][34]. Group 3: Competitive Landscape and Strategic Moves - The success of AlexNet in 2012 revolutionized deep learning and established GPU as the core hardware for AI, leading to a surge in interest and investment in AI talent [35][39]. - Google acquired DNN Research, further solidifying its leadership in deep learning, while Facebook established its own AI lab, FAIR, to compete in the space [41][43]. - The acquisition of DeepMind by Google in 2014 expanded its AI capabilities but also led to internal conflicts between DeepMind and Google Brain [56][57]. Group 4: Emergence of OpenAI and Market Dynamics - OpenAI was founded in 2015 with a mission to promote and develop friendly AI, attracting talent from Google and other tech giants [66][68]. - The launch of ChatGPT in late 2022 marked a pivotal moment in the AI landscape, rapidly gaining users and prompting a competitive response from Google [97][99]. - Google's response included the rushed launch of Bard, which faced criticism and highlighted the challenges of adapting to disruptive innovations [102][103]. Group 5: Future Directions and Challenges - Google is now focusing on the Gemini project, aiming to unify its AI efforts and leverage its extensive resources to compete effectively in the evolving AI landscape [105][106]. - The competitive dynamics in the AI industry are shifting, with emerging players in China and the ongoing evolution of established companies like OpenAI and Meta [109][110].
一名科学家试着成为更好的CEO |WAVES
3 6 Ke· 2025-10-30 17:42
袁进辉坦言,过去的屡败屡战都是自找的。在被光年之外并购前,OneFlow始终挣扎在生死线边缘,最 困难的时候选择了降薪、减员。他后来补充说,整个OneFlow六七年都是这样的状态。 文 | 施嘉翔 编辑 | 刘旌 袁进辉和OneFlow一度是ChatGPT爆发后最走运的那个: 在王慧文发布"英雄帖"后,OneFlow被光年之 外并购,这家成立6年的公司一跃成为牌桌上资源最好的团队。 但仅仅两个月后,一切又戏剧化地烟消 云散。 在聚光灯下经历跌宕起伏的四个月后,袁进辉和OneFlow团队决定重新创业。从有全行业最充沛的资 源,到突然消失,而其他玩家已经身处高位,心理冲击毋庸置疑。但袁进辉说,若仍想留在牌桌上,再 创业是唯一选择。 原因在于,王慧文生病的消息传出后,有公司冲到光年之外办公室挖人,OneFlow骨干成员也接到过超 千万的年薪,但没有一家公司能同时留住所有人。创业是避免团队被肢解的唯一方法。 在光年之外以前,袁进辉的创业同样起伏。和大部分公司不同,OneFlow成立之初就像实验室,前四年 没有任何收入,依赖外界投入。他们在做的事情在当时听起来匪夷所思:颠覆巨头重金投入的深度学习 系统框架( 比如Goo ...