多模态交互

Search documents
首届国际通用人工智能大会:东西方视角共探AGI未来
Huan Qiu Wang Zi Xun· 2025-05-26 09:52
Core Insights - The first International Conference on General Artificial Intelligence (AGI) was held in Beijing, focusing on the development of AGI and the need for China to establish an independent narrative in this field [1][3] - The conference featured over 40 prominent speakers from renowned institutions worldwide, showcasing cutting-edge research and advancements in AGI [3][5] - A new publication titled "Standards, Ratings, Testing, and Architecture for General Artificial Intelligence" was released, providing a mathematical definition of AGI and filling a gap in international standards [7] Group 1: Conference Overview - The conference took place from May 24 to 25, gathering nearly a thousand experts and scholars from various countries to discuss AGI technologies [1] - The event included four keynote speeches and six thematic meetings, highlighting the latest breakthroughs in AGI research [3][8] - The conference aimed to inject new momentum into the exploration of AGI and foster international collaboration in overcoming cognitive boundaries [14] Group 2: Keynote Presentations - Professor Zhu Songchun introduced the "CUV framework theory" based on Eastern philosophy, emphasizing the need for China to create its own AGI technology narrative [3] - Notable presentations covered topics such as embodied intelligence, natural intelligence, and generative artificial intelligence, reflecting the latest advancements in the AGI field [5] Group 3: Thematic Meetings - The six thematic meetings focused on various aspects of AGI, including multi-agent systems, cognitive and social intelligence, and the integration of AI with law, economics, and art [8][11] - Discussions included the latest research on multi-modal interaction, social behavior simulation, and the design of AI chips and systems for AGI [10][11] Group 4: Youth Engagement - The conference provided a platform for young researchers to showcase over a hundred innovative research outcomes, with 18 popular posters selected by attendees [12]
直击科博会:从“+AI”到“AI+” 大模型重构产业格局
Zheng Quan Ri Bao· 2025-05-11 16:27
Group 1 - The 27th China Beijing International Science and Technology Industry Expo showcased over 800 technology companies and institutions, featuring more than 600 globally debuting and industry-first technological achievements [1] - AI technology is rapidly transforming industry dynamics, shifting from a "+AI" integration model to an "AI+" scenario-driven model, significantly reshaping production and lifestyle [1] - The "Beijing Action Plan for Promoting 'Artificial Intelligence+'" focuses on the precise implementation of "large models + vertical scenarios," driving the digital and intelligent transformation of enterprises [1] Group 2 - Companies are advised to anchor their strategies on "high-value scenario exploration and data asset accumulation," utilizing standardized solutions for general business scenarios and customized development tools for specific scenarios [2] - The education technology sector is witnessing innovation, with products like the AI answering pen from NetEase Youdao providing immersive learning experiences through deep reasoning engines [2] - The financial technology sector is also seeing deep AI penetration, with institutions showcasing applications of large models in credit risk control, wealth management, and intelligent investment research [2] Group 3 - General large models possess strong knowledge generalization and language understanding capabilities but have high resource consumption and training costs, while vertical models focus on specialized knowledge and offer better business adaptability [3] - The mainstream industry path is a collaborative architecture of "general models + industry-specific models," enhancing practical application effectiveness [3] - In the financial sector, vertical large models may become the main battlefield for differentiated competition, with data quality and specialized knowledge bases being core barriers [3] Group 4 - Several technology companies showcased collaborative innovation results in building an open-source ecosystem for AI, which significantly promotes technological innovation and knowledge sharing [4] - The open-source ecosystem enhances the accessibility of AI technology, allowing companies to invest more economically and flexibly in acquiring and deploying AI solutions [4] - Companies can focus on application selection and data and knowledge mining, generating more commercially valuable AI applications [4]
2025年中国GEO行业研究生成即流量,GEO智启全域增长
Tou Bao Yan Jiu Yuan· 2025-05-08 00:35
Investment Rating - The report indicates a strong growth potential for the GEO industry, with a projected compound annual growth rate (CAGR) of 189.8% from 2024 to 2028, reaching a market size of 365 billion by 2028 [33][37]. Core Insights - The GEO industry is rapidly replacing traditional SEO due to the rise of AI search technologies, which enhance user experience by providing direct answers rather than requiring users to navigate through multiple links [11][41]. - GEO is characterized by its ability to generate high-quality content that aligns closely with user intent, leveraging AI to improve relevance and personalization [21][22]. - The market for GEO has grown significantly from 7.2 billion in 2019 to an estimated 16.7 billion in 2024, marking 2024 as a pivotal year for explosive growth [33][37]. Summary by Sections GEO Era Background - GEO (Generative Search Engine Optimization) enhances content to better match user search intent, improving search engine rankings through AI technology [19][21]. - The market for GEO is expected to expand rapidly, driven by the increasing user base of AI search engines, which grew from 310 million in January 2024 to 1.98 billion by February 2025, a growth rate of 538.7% [41][42]. GEO Era Development Analysis - The definition of high-quality content in the GEO era is evolving to emphasize innovative thinking, structured reproducibility, and fresh data [50][52]. - GEO service providers are focusing on building competitive barriers through authority, real-time adaptation, and multi-modal content compatibility [54][56]. Market Potential and Participants - The GEO market is still in its early stages, with major players including traditional search engines, cloud service providers, and specialized SEO agencies competing for market share [45][47]. - The report highlights that GEO will reshape over 300 billion in market value in the next five years, becoming a critical strategic point for brands seeking sustainable growth [39].
2025年中国GEO行业研究:生成即流量,GEO智启全域增长
Tou Bao Yan Jiu Yuan· 2025-05-07 13:10
Investment Rating - The report indicates a strong growth potential for the GEO industry, with a projected compound annual growth rate (CAGR) of 189.8% from 2024 to 2028, reaching a market size of 365 billion by 2028 [35][39]. Core Insights - The GEO industry is rapidly evolving due to breakthroughs in AI technology, which are reshaping information retrieval and user decision-making processes. Traditional SEO is declining, while GEO is emerging as a key method for meeting the demand for efficient, precise, and trustworthy information [3][11]. - GEO is defined as Generative Search Engine Optimization, which utilizes generative AI to create content that closely matches user intent, enhancing search engine rankings and user experience [19][21]. - The market for GEO has grown significantly since its inception in 2019, expanding from 7.2 billion to an expected 16.7 billion in 2024, marking a critical turning point for explosive growth [35][39]. Summary by Sections GEO Era Background - GEO is characterized by its ability to optimize existing content to better align with user search intent, thereby improving search engine rankings [19][21]. - The market size for GEO is projected to grow exponentially, with a CAGR of 189.8% from 2024 to 2028, contrasting sharply with the declining traditional SEO market [35][39]. - The GEO market is driven by advancements in large language models (LLMs) and their integration into search applications, significantly enhancing information retrieval efficiency [40][41]. GEO Era Development Analysis - The standards for high-quality content in the GEO era are being redefined, focusing on innovative thinking, structured reproducibility, and fresh data that holds high citation value in AI models [52][54]. - GEO is expected to lead a market value transformation exceeding 3 trillion in the next five years, becoming a critical strategic lever for brands seeking sustainable growth [41]. - The future of GEO is marked by three key trends: building authoritative credibility, personalized dynamic adaptation, and multi-modal compatibility [56][59].
魔法原子人形机器人“小麦”落地导购、主持人、理发师等多重场景
Bei Jing Qing Nian Bao· 2025-03-27 00:54
魔法原子人形机器人"小麦"落地导购、主持人、理 发师等多重场景 原标题:魔法原子人形机器人"小麦"落地导购、主持人、理发师等多重场景 3月26日,具身智能机器人公司魔法原子首次公开旗下人形机器人商业场景落地应用的视频,视频 显示,人形机器人小麦在交通引导、汽车导购、餐饮服务以及美容美发等多个场景展现了人机交互、视 觉识别及理解能力。 在商场停车场,人形机器人小麦作为交通疏导员,可以实时感知商场停车场信息,有序指引车流、 指挥停车,并引导顾客乘坐电梯。作为汽车导购的小麦机器人,则凭借VLM视觉语言大模型能力,识 别分析用户特征,基于相关信息预测用户偏好,并推荐相关车型。此外,在理发店,小麦智能识别顾客 发质后,可以主动调节吹风机工作模式,提供更具针对性的服务项目。餐厅服务员小麦,则通过大语言 模型与用户交流,根据顾客喜好推荐菜品,高效完成下单上菜。 作为头部手部齐全、外观相对完整、身材与人类相近的人形机器人,小麦全身具备42个自由度,能 够高度模仿人类的动作与姿态,可以通过智能语音、智能化面部表情、肢体动作与商业服务场景中的顾 客自然流畅地进行多模态交互。 通过多模态感知算法以及大模型技术,搭配以超声波传感器、 ...
速递丨智谱完成新一轮超10亿元融资,京杭联手重仓押注下一个Deepseek!
Z Finance· 2025-03-03 01:41
Core Viewpoint - The article highlights the strategic investment in Zhipu AI, marking a significant move in the generative AI sector in Hangzhou, with over 1 billion yuan raised from local investment funds, indicating a strong governmental push towards AI development [1]. Group 1: Zhipu AI's Strategic Positioning - Zhipu AI's open-source model is reshaping the global AI innovation landscape, showcasing two main paths for domestic AI breakthroughs: algorithm innovation reducing computational power dependency and building an open-source ecosystem that attracts global developers [2]. - The DeepSeek-R1 model demonstrates a cost-effective approach, achieving performance comparable to billion-dollar models at a cost of $5.6 million, challenging the traditional paradigm of "computational power equals competitiveness" [2]. - Zhipu AI's ChatGLM series has gained significant traction, with over 50,000 stars on GitHub and 30 million downloads, reflecting the value of open-source models as a technological foundation [3]. Group 2: Technological Advancements and Market Trends - The shift towards multi-modal interaction and physical world manipulation is evident, with Zhipu's GLM series models capable of understanding various inputs and executing complex tasks, enhancing efficiency in sectors like finance and education [6]. - The anticipated "open-source week" by DeepSeek and Zhipu's upcoming AutoGLM framework signify a transition from single-model to toolchain open-sourcing, potentially transforming development paradigms and innovation focus [3]. - The AI Agent technology is expected to see significant advancements by 2025, with applications in both enterprise efficiency and personalized consumer services, indicating a structural shift in organizational roles and decision-making processes [5]. Group 3: Regional Economic Impact - Hangzhou's digital economy now accounts for over 28.8% of its GDP, transitioning from e-commerce to hard technology, with strategic goals to become a hub for computational power and AI industry development [7]. - The collaboration between Zhipu AI and local industries is set to enhance the integration of large model technology into various sectors, driving intelligent upgrades across manufacturing, healthcare, finance, and government [8].