百度文心一言
Search documents
阿里AI双线炸场:千问硬刚ChatGPT ,二十年铺垫藏暗棋
Sou Hu Cai Jing· 2025-11-24 03:49
哈喽大家好,今天小无带大家聊聊互联网圈的年度大瓜! 退休大佬亲自压阵、千亿资金狂砸、两款AI产品密集炸场,一套组合拳下来,直接把AI赛道的竞争拉到白热化! 现象级联动 事情得从蚂蚁集团园区的那场"低调现身"说起。退休多年的马云戴着白色鸭舌帽、挂着阿里工牌,在董事长井贤栋、CEO韩歆毅的陪同下现 身,帽檐压得很低,模样十分低调。 但这时间点也太巧了——当天上午蚂蚁刚发布全模态AI助手灵光,支持30秒生成可编辑、可交互的小应用。 前一天阿里千问App刚开启公测,凭着免费优势和生活场景生态,直接对标ChatGPT进军C端市场。合着这是"大佬站台和产品亮剑"的同步操 作啊! 要我说,马云这趟绝对不是来怀旧打卡的。AI赛道现在有多卷?百度文心一言、字节豆包早就深耕C端了,阿里现在让千问从B端API服务转 向独立C端App,再加上蚂蚁灵光的协同,明显是要补齐AI生态的最后一块短板。 而马云的出现,更像是给团队和市场注入"强心针",暗示这场AI战役的级别,远超普通业务布局。 更绝的是,阿里的AI实力已经有了实打实的证明。刚结束的史上最长双11,成了AI落地的"试金石":近600个品牌成交破亿,3.4万个品牌成交 翻倍,1. ...
大模型技术学习过程梳理:Agent、RAG、通用大模型等......
自动驾驶之心· 2025-11-23 02:04
点击下方 卡片 ,关注" 大模型之心Tech "公众号 戳我-> 领取大模型巨卷干货 做大模型社区也有几个月的时间了,柱哥最近也和不少同学交流了心得。 很多刚研一或者直博的同学非常焦虑,本科学的内容完全用不上。 上来就被transformer、Lora、多模态大模 型、Agent唬的一愣一愣的,接触的深度学习框架也往往不知从何入手。 这时候是最容易迷茫和焦虑的,实验室如果没人交流更是雪上加霜。近期我也和社区内部的同学开了一个小范 围的交流会,一些同学能从我们分享中抓到关键的部分,跟着社区里面的路线进步较快。有前沿的文章速递, 一些工具使用的配套介绍,也有行业的新闻动态等等。基础不错的同学已经可以顺利微调自己的大模型。 但还有相当多的同学卡住了,比如算力的问题,自建数据集的问题,还有模型优化、项目实战的问题等。关于 算力,前面分享过很多轻量化的方法,也能做出不错的性能,甚至SOTA,这能够适配一些算力不足的同学。 以上为我们的大模型社区:大模型之心tech知识星球的分享,也欢迎更多需要入门进阶的同学加入我们的社 区。近一年的搭建,社区内已经完成了技术路线分享、直播、问答、求职、赛事等多个版块的分享。实现了产 业 ...
从DeepSeek到千问灵光,杭州AI梦之队引领2025 AI风口
第一财经· 2025-11-18 06:30
Core Insights - Alibaba and Ant Group are intensifying their AI application efforts, with the launch of the Qwen model and the Lingguang AI assistant, aiming to compete directly with ChatGPT for overseas users [1][3][6] - The AI application landscape is rapidly evolving, with significant advancements from major players like Alibaba and Ant Group, marking 2025 as a pivotal year for AI applications [3][6][10] AI Application Developments - Ant Group's Lingguang AI assistant supports multi-modal outputs, including 3D, audio, and interactive maps, and can generate daily life applications in 30 seconds, positioning it as a leading general-purpose AI assistant [3][4][6] - Alibaba's Qwen app enhances previous AI offerings and aims to cover various life scenarios, indicating a strategic push towards comprehensive AI solutions [6][10] Competitive Landscape - The competition between Alibaba and Ant Group, along with other tech giants, is characterized by a "South Alibaba, North Byte" dynamic, highlighting the regional competition in AI applications [6][10] - The AI application market is witnessing a user engagement surge, with a reported 2.49 billion users of generative AI products in China by the end of 2024, representing 17.7% of the population [8][10] Strategic Focus and Future Outlook - Ant Group has released 18 large models, entering the trillion-parameter model category, and is focusing on application-driven AI strategies [7][8] - The AI industry is expected to see significant growth, with predictions that by 2028, at least 15% of daily tasks will be autonomously completed by AI agents [8][10] Regional Development and Talent Acquisition - Hangzhou is emerging as a key AI innovation hub, with government support for AI infrastructure and a focus on multi-route breakthroughs in large models [10][11] - The average salary for AI product managers in Hangzhou is the highest in the country, indicating a competitive talent market [13] User Engagement and Market Dynamics - As of October 2025, the daily active users (DAU) for the leading AI applications show significant engagement, with Doubao leading at 54.1 million DAU [15][16] - The ongoing battle for user attention among major tech firms is shaping the future of AI applications, with a focus on establishing differentiated market positions [15][17]
阿里公测千问对标ChatGPT,但9.9和9.11谁大还是“翻车”了
Di Yi Cai Jing· 2025-11-17 08:31
Core Insights - The article discusses the performance of various AI models, particularly focusing on Alibaba's Qwen model, in answering a simple mathematical question about the comparison between 9.9 and 9.11, highlighting the challenges AI faces in common-sense reasoning [1][9][10]. Group 1: AI Model Performance - Alibaba's Qwen model initially answered incorrectly that 9.11 is greater than 9.9, but later corrected itself after a breakdown of the reasoning process [1][9]. - Other prominent AI models, including ChatGPT-4o and Google's Gemini Advanced, also failed to answer the question correctly, indicating a broader issue in AI's handling of basic arithmetic and common-sense reasoning [10][11]. Group 2: Self-Correction and Learning - The Qwen model demonstrated self-correction capabilities by analyzing its initial mistake and providing the correct answer upon further questioning [9][10]. - The initial error was attributed to a mismatch between the reasoning process and the final conclusion, as well as cognitive biases related to the numerical representation of 9.11 [9]. Group 3: Market Position and Strategy - Alibaba is positioning the Qwen model as a competitive alternative to ChatGPT in the global market, with plans to integrate it into various consumer applications such as maps, food delivery, and shopping [11]. - The Qwen series has achieved significant global traction, with over 600 million downloads, showcasing its growing influence and competitiveness in the AI landscape [10][11].
大模型方向适合去工作还是读博?
具身智能之心· 2025-10-16 00:03
Core Insights - The article discusses the decision-making process for individuals in the large model field regarding whether to pursue a PhD or engage in entrepreneurial ventures related to agents [1][2] Group 1: Importance of Foundation in Large Models - A solid foundation in large models is crucial, as the field encompasses various directions such as generative models, multi-modal models, fine-tuning, and reinforcement learning [1] - Many mentors lack sufficient expertise in large models, leading to a misconception among students about their readiness for related positions [1] Group 2: Role of a Pioneer in Research - The suitability of an individual to take on the role of a "pioneer" in research is essential, especially in a field with many unexplored directions [2] - The ability to independently explore and endure failures is emphasized as a key trait for those aiming to innovate from scratch [2] Group 3: Community and Learning Resources - The "Large Model Heart Tech Knowledge Planet" community offers a comprehensive platform for beginners and advanced learners, featuring videos, articles, learning paths, and Q&A sections [2] - The community aims to provide a space for technical exchange and collaboration among peers in the large model domain [4] Group 4: Learning Pathways - The community has compiled detailed learning pathways for various aspects of large models, including RAG, AI Agents, and multi-modal training [4][9] - Each learning pathway includes clear technical summaries, making it suitable for systematic learning [4] Group 5: Benefits of Joining the Community - Members gain access to the latest academic advancements and industrial applications related to large models [7] - The community facilitates networking with industry leaders and provides job recommendations in the large model sector [7][68] Group 6: Future Plans and Engagement - The community plans to host live sessions with industry experts, allowing for repeated viewing of valuable content [65] - A focus on building a professional exchange community with contributions from over 40 experts from renowned institutions and companies is highlighted [66]
具身领域的大模型基础部分,都在这里了......
具身智能之心· 2025-09-20 16:03
Core Viewpoint - The article emphasizes the importance of a comprehensive community for learning and sharing knowledge about large models, particularly in the fields of embodied AI and autonomous driving, highlighting the establishment of the "Large Model Heart Tech Knowledge Planet" as a platform for collaboration and technical exchange [1][3]. Group 1: Community and Learning Resources - The "Large Model Heart Tech" community aims to provide a platform for technical exchange related to large models, inviting experts from renowned universities and leading companies in the field [3][67]. - The community offers a detailed learning roadmap for various aspects of large models, including RAG, AI Agents, and multimodal models, making it suitable for beginners and advanced learners [4][43]. - Members can access a wealth of resources, including academic progress, industrial applications, job recommendations, and networking opportunities with industry leaders [7][70]. Group 2: Technical Roadmaps - The community has outlined specific learning paths for RAG, AI Agents, and multimodal large models, detailing subfields and applications to facilitate systematic learning [9][43]. - For RAG, the community provides resources on various subfields such as Graph RAG, Knowledge-Oriented RAG, and applications in AIGC [10][23]. - The AI Agent section includes comprehensive introductions, evaluations, and advancements in areas like multi-agent systems and self-evolving agents [25][39]. Group 3: Future Plans and Engagement - The community plans to host live sessions with industry experts, allowing members to engage with leading figures in academia and industry [66]. - There is a focus on job sharing and recruitment information to empower members in their career pursuits within the large model domain [70].
但我还是想说:建议个人和小团队不要碰大模型训练!
自动驾驶之心· 2025-09-20 16:03
Core Viewpoint - The article emphasizes the importance of utilizing open-source large language models (LLMs) and retrieval-augmented generation (RAG) for businesses, particularly for small teams, rather than fine-tuning models without sufficient original data [2][6]. Group 1: Model Utilization Strategies - For small teams, deploying open-source LLMs combined with RAG can cover 99% of needs without the necessity of fine-tuning [2]. - In cases where open-source models perform poorly in niche areas, businesses should first explore RAG and in-context learning before considering fine-tuning specialized models [3]. - The article suggests assigning more complex tasks to higher-tier models (e.g., o1 series for critical tasks and 4o series for moderately complex tasks) [3]. Group 2: Domestic and Cost-Effective Models - The article highlights the potential of domestic large models such as DeepSeek, Doubao, and Qwen as alternatives to paid models [4]. - It also encourages the consideration of open-source models or cost-effective closed-source models for general tasks [5]. Group 3: AI Agent and RAG Technologies - The article introduces the concept of Agentic AI, stating that if existing solutions do not work, training a model may not be effective [6]. - It notes the rising demand for talent skilled in RAG and AI Agent technologies, which are becoming core competencies for AI practitioners [8]. Group 4: Community and Learning Resources - The article promotes a community platform called "大模型之心Tech," which aims to provide a comprehensive space for learning and sharing knowledge about large models [10]. - It outlines various learning pathways for RAG, AI Agents, and multi-modal large model training, catering to different levels of expertise [10][14]. - The community also offers job recommendations and industry opportunities, facilitating connections between job seekers and companies [13][11].
真的花了好久才汇总的大模型技术路线......
具身智能之心· 2025-09-16 00:03
Core Insights - The article emphasizes the transformative impact of large models on various industries, highlighting their role in enhancing productivity and driving innovation in fields such as autonomous driving, embodied intelligence, and generative AI [2][4]. Group 1: Large Model Technology Trends - The large model industry is undergoing significant changes characterized by technological democratization, vertical application, and open-source ecosystems [2]. - There is a growing demand for talent skilled in technologies like RAG (Retrieval-Augmented Generation) and AI Agents, which are becoming core competencies for AI practitioners [2][4]. - The article introduces a comprehensive learning community focused on large models, offering resources such as videos, articles, learning paths, and job exchange opportunities [2][4]. Group 2: Learning Pathways - The community provides detailed learning pathways for various aspects of large models, including RAG, AI Agents, and multimodal models [4][5]. - Specific learning routes include Graph RAG, Knowledge-Oriented RAG, and Reasoning RAG, among others, aimed at both beginners and advanced learners [4][5]. - The pathways are designed to facilitate systematic learning and networking among peers in the field [5]. Group 3: Community Benefits - Joining the community offers benefits such as access to the latest academic advancements, industrial applications, and job opportunities in the large model sector [7][9]. - The community aims to create a collaborative environment for knowledge sharing and professional networking [7][9]. - There are plans for live sessions with industry leaders to further enrich the community's offerings [65][66].
虚拟数字人:在技术迭代中进化
Jing Ji Ri Bao· 2025-09-14 21:53
Core Insights - The virtual digital human industry has shifted from initial hype to facing significant operational challenges, including high costs and low returns, leading to a decline in market interest [1][2][3] Group 1: Industry Trends - The rise of virtual beauty influencers like "Liu Yexi" in 2021 sparked a wave of brand engagement with virtual endorsers, leading to a surge in stock prices for related companies [2] - The initial belief in quick returns from virtual digital humans has been challenged by high production costs and diminishing user interest, resulting in many virtual endorsers being removed from platforms [2][3] - A report from QuestMobile indicates that in 2023, the GMV of virtual streamers was less than one-fifth that of real streamers, with a significant drop in average viewing time and a high fan attrition rate [3] Group 2: Technological Advancements - The development of generative AI has led to the evolution of virtual digital humans into "smart humans," utilizing advanced technologies for more human-like interactions [4] - Companies are leveraging modular tools and SaaS platforms to reduce production costs and deploy digital humans in practical applications across various sectors, moving away from purely entertainment-focused roles [4][5] Group 3: Market Expansion - Despite existing challenges, the virtual digital human market is projected to grow significantly, with estimates suggesting a core market size exceeding 48 billion yuan by 2025 [6] - Investment activity in the sector has increased, with 23 funding cases reported in 2025, totaling over 3.5 billion yuan, indicating renewed interest from capital markets [6] - Government initiatives are supporting the development of the digital human sector, with various regions launching new digital human projects to enhance service delivery [6] Group 4: Legal and Ethical Considerations - The emergence of legal issues surrounding virtual digital humans, including copyright and data privacy concerns, is becoming more prominent, as evidenced by recent court rulings [7] - Platforms are enhancing governance measures to mitigate risks associated with AI-generated content, including the identification and removal of misleading accounts [7] Group 5: Future Outlook - The consensus in the industry is that within the next five years, virtual digital humans will transition from being seen as mere novelties to becoming essential tools for digital transformation and economic growth [7]
从算力到应用:港股“科技七巨头”如何接棒AI浪潮第三阶段?
Sou Hu Cai Jing· 2025-08-18 11:46
Group 1 - The core viewpoint is that the Hong Kong technology sector presents significant valuation attractiveness, characterized by low valuations, high growth potential, and policy catalysts, making it an ideal choice for medium to long-term capital allocation [2][5] - The Hang Seng Technology Index's dynamic PE is approximately 25.8 times, which is about 20% lower than the Nasdaq 100 Index, and the valuation gap between leading tech companies in China and the US is between 10-20 times [5] - The overall PE of the Hang Seng Index is 10.2 times, lower than the S&P 500 (22.3 times) and Nikkei 225 (18.6 times), highlighting the valuation advantage of the technology sector [5] Group 2 - The current PE of the Hang Seng Technology Index is at the 8th percentile of the past five years, significantly below the historical median, especially after the internet sector has fully digested valuation bubbles during the 2023-2024 adjustment [5] - Leading companies like Alibaba and Baidu are transitioning their valuation focus from "consumer stocks" to "technology growth stocks," although their stock prices have not yet fully reflected the potential of technological upgrades [5] Group 3 - Factors driving the sector include improved earnings expectations, with companies like Tencent and Lenovo exceeding forecasts, and accelerated AI commercialization potentially opening new growth avenues [4][5] - The domestic economy is experiencing a mild recovery supported by policies favoring the digital economy and normalized regulation of platform economies, leading to marginal improvements in the fundamentals of tech companies [5] - Continuous inflow of southbound funds, with a cumulative net purchase exceeding 300 billion HKD in 2025, enhances the pricing power of Hong Kong stocks [5] Group 4 - The current valuation levels imply a high margin of safety, and if subsequent earnings growth materializes, the sector may experience a "Davis Double" effect [6] - Recommended investment targets include the Hang Seng Technology ETF (07188.hk), technology index funds under the Stock Connect, and leading companies in AI computing (SMIC), platform economy (Tencent, Alibaba), and hard technology (05188.hk) [6]