Workflow
语言模型
icon
Search documents
联想申请信息处理方法专利,实现不同维度特征向语言模型的有效输入
Jin Rong Jie· 2026-02-04 11:38
Group 1 - Lenovo (Beijing) Co., Ltd. has applied for a patent titled "An Information Processing Method, Module Determination Method, and Information Processing Device," with publication number CN121457525A, and the application date is October 2025 [1] - The patent abstract reveals a method that includes obtaining a first input feature, which can be one of text, image, video, or audio features, with dimensions differing from the first language model [1] - The method involves converting the first input feature into first probability information, which represents the probability distribution of vectors in a predefined vector set, and then determining a second input feature based on this probability information [1] Group 2 - Lenovo (Beijing) Co., Ltd. was established in 1992 and is located in Beijing, primarily engaged in the manufacturing of computers, communications, and other electronic devices [2] - The company has a registered capital of 565 million HKD and has invested in 108 enterprises, participated in 5,000 bidding projects, and holds 1,743 trademark records and 5,000 patent records [2] - Additionally, the company possesses 238 administrative licenses [2]
王小川时隔一年多再露面谈医疗行业痛点:百川智能一定会“出海”,也会走上IPO道路
Xin Lang Cai Jing· 2026-01-14 12:26
Core Insights - Wang Xiaochuan reaffirms Baichuan's commitment to the medical AI sector, indicating a strategic shift to focus solely on healthcare applications after diversifying into other areas previously [1][3] - The healthcare industry is experiencing a transformation with major AI companies entering the medical field, suggesting that large models are beginning to be applied effectively in healthcare [3] Group 1: Industry Challenges - Wang identifies two core issues in the healthcare sector: "insufficient supply" of qualified doctors and "structural imbalance" in the medical system [4] - The emergence of AI doctors is seen as a potential solution to the long-standing problem of doctor shortages, with expectations that by 2025, AI capabilities will surpass those of human doctors [4] - The existing medical system often leads to a disconnect between patients and doctors, where patients lack understanding of treatment options and risks [4][5] Group 2: Technological Approach - Wang emphasizes that the core of AI technology in healthcare should focus on language and symbols rather than multi-modal approaches, arguing that intelligence is derived from the ability to abstract problems [7][8] - He believes that many current healthcare issues are fundamentally decision-making problems, and that future AI applications will likely involve specialized models for image interpretation, with results processed by language models [9] - Wang critiques the overemphasis on data quality in model development, asserting that the essence of successful AI lies in the knowledge extraction from literature rather than raw data [9] Group 3: Future Plans - Baichuan plans to launch two consumer-facing products in the first half of 2026, focusing on directly assisting patients rather than serving healthcare providers [10] - The company aims to charge for services that provide value in decision-making for patients, while maintaining a cautious approach to regulatory boundaries [10] - Wang outlines Baichuan's competitive advantages as having a leading model, targeting high-value scenarios, and maintaining a different innovation pace compared to larger firms [11] Group 4: Market Expansion and IPO - Baichuan intends to expand internationally, with Wang asserting that companies that do not pursue global markets are not viable [11] - The company is also considering an IPO in the future, acknowledging that while it may take longer than other AI firms, it aims to optimize its business model before going public [12]
王小川时隔一年再露面谈行业痛点:医疗大模型进入医院内是“隔山打牛” 不认可多模态是主战场
Mei Ri Jing Ji Xin Wen· 2026-01-14 06:53
Core Insights - Wang Xiaochuan reaffirms Baichuan's commitment to the medical AI sector, indicating a strategic shift to focus solely on healthcare applications after diversifying too broadly in the past [1] - The healthcare industry is facing significant challenges, primarily due to a shortage of qualified doctors and an imbalance in the power dynamics between patients and healthcare providers [2] - The emergence of AI in healthcare is seen as a transformative opportunity, with the potential for AI capabilities to surpass human doctors by 2025 [2] - The relationship between patients and doctors is expected to evolve, with AI facilitating better communication and understanding of medical decisions [3] Industry Challenges - The core issues in the healthcare sector are identified as "supply shortage" and "structural imbalance," with a long-standing lack of good doctors [2] - The existing medical system often leads to a disconnect between patients and doctors, where patients are passive recipients of medical decisions [2] - Wang emphasizes that the future of healthcare will involve a shift in decision-making power towards patients, aided by AI [3] Technological Perspective - Wang argues against the mainstream view that multi-modal AI is the primary battleground, asserting that language and symbols are central to AI's intelligence [5] - He categorizes natural language, mathematical language, and code as formal languages, emphasizing that true intelligence lies in the ability to abstract and reason [6] - The focus in healthcare should be on decision-making rather than just image recognition, with AI expected to enhance the interpretative capabilities of medical data [6] Market Strategy - Baichuan plans to target the consumer market directly, moving away from traditional hospital-centric models, and aims to launch two products in the first half of the year [7] - The company is cautious about regulatory boundaries, ensuring that it does not cross into areas of direct diagnosis or prescription but focuses on aiding patient understanding and decision-making [7] - Wang believes that the significant growth potential for AI in healthcare lies outside of hospital settings, particularly in home healthcare scenarios [7] Future Outlook - Baichuan aims to expand internationally, with Wang stating that companies that do not pursue global markets are not competitive [8] - The company is preparing for an eventual public listing, with a focus on refining its business model and ensuring a favorable revenue-cost structure [9] - Wang's long-term vision is driven by a fascination with the complexities of life and the desire to find underlying mathematical models, which he believes AI can help elucidate [9]
为什么蔚来会押注世界模型?
自动驾驶之心· 2025-12-31 06:27
Core Insights - The article discusses the recent promotion of NIO's NWM 2.0, highlighting its positive reception and the potential of world models in intelligent driving [1] - It emphasizes that the true limit of intelligent driving lies in world models, which utilize video as a core component to understand spatiotemporal and physical laws, enabling machines to comprehend environments like humans do [1] Group 1: World Model Concept - World models address spatiotemporal cognition, while language models focus on conceptual cognition, with the former being more effective in modeling the real world's four-dimensional space-time [1] - The article mentions that many AI giants are developing general world models, including projects like Li Feifei's Marble, Yann LeCun's V-JEPA 2, and DeepMind's Genie 3 [1] Group 2: Challenges in Understanding World Models - The definition of world models remains vague, leading to confusion among newcomers in the field, who often spend significant time navigating challenges without clear guidance [1] - The article notes that understanding world models and completing tasks like data generation and closed-loop simulation can be particularly difficult for beginners [1] Group 3: Course Overview - A course is being offered to help individuals understand the world model domain in autonomous driving, featuring insights from industry algorithm experts [2][6] - The course will cover various aspects of world models, including their historical development, application cases, and different schools of thought within the field [6][10] Group 4: Course Structure - The course consists of six chapters, starting with an introduction to world models and their connection to end-to-end autonomous driving [6] - Subsequent chapters will delve into background knowledge, discussions on general world models, video generation-based models, OCC generation models, and industry applications [6][8][9][10] Group 5: Expected Outcomes - The course aims to equip participants with the skills to reach a level comparable to a world model autonomous driving algorithm engineer within a year [14] - Participants will gain a deeper understanding of key technologies such as BEV perception, multimodal large models, and generative models, enabling them to apply their knowledge in practical projects [14]
自变量王潜:具身智能是物理世界的独立基础模型|MEET2026
具身智能之心· 2025-12-22 01:22
Core Viewpoint - The article discusses the debate on whether embodied intelligence should be viewed as an application or as an independent foundational model, asserting that it is a foundational model specifically designed for the physical world, parallel to language and multimodal models [6][12][60]. Group 1: Differences Between Physical and Virtual Worlds - There is a fundamental difference between the physical world, characterized by randomness and continuous processes, and the virtual world, which is highly reproducible and low in randomness [2][10]. - Existing models based on language and visual modalities are inadequate for accurately representing the complexities and randomness of physical interactions [16][22]. Group 2: Need for a Separate Foundational Model - A separate foundational model for embodied intelligence is necessary due to the unique characteristics of the physical world, which often leads to unpredictable outcomes even under identical conditions [10][11]. - The current architectures and training methods struggle to capture the high randomness present in physical events, necessitating a new approach to model design [12][20]. Group 3: Future of Multimodal Models - Shifting the perspective to view embodied intelligence as an independent foundational model can lead to significant changes in model architecture and data utilization [9][23]. - The learning and perception processes in the physical world differ fundamentally from those in the virtual world, suggesting that future multimodal models should incorporate these differences [24][29]. Group 4: Scaling Laws and Data Utilization - The article emphasizes the importance of scaling laws in the development of large models, particularly in the context of robotics, where data acquisition and utilization are critical [46][51]. - A phased approach to training, utilizing both pre-training and post-training data, is recommended to enhance model performance [48][52]. Group 5: Hardware and AI Integration - The integration of AI in defining hardware is crucial for the development of embodied intelligence, advocating for a simultaneous evolution of both software and hardware [53][54]. - The potential for embodied intelligence to drive exponential growth in resources and capabilities is highlighted, suggesting a transformative impact on the future of artificial general intelligence (AGI) [59][60].
艾瑞观察:语言模型的价值重构与生态突围
艾瑞咨询· 2025-12-18 00:05
Core Insights - By 2025, the global focus of technological competition has shifted to language models, marking a transition from a "Spring and Autumn" period of "hundred models war" to a "Warring States" era where major companies prioritize "value realization" over mere parameter scale competition [1] - Language models are reshaping the underlying logic of the digital economy, with tech giants investing billions in R&D to transform these models from novelty tools into essential national-level utilities [1] Industry Overview - The AI industry is experiencing rapid expansion and deep technological iteration, driven by language models as the core engine [2] - Key trends include multi-modal integration, embodiment intelligence, and the practical application of intelligent agents, with language models serving as the indispensable "central nervous system" [2] Language Model Sub-industry - The language model sub-industry is generally positive but faces three core pain points in consumer applications: insufficient practicality, fragmented scenarios, and cost-ecosystem imbalance [3] - The recent launch of Alibaba's Qianwen APP has seen significant success, with over 10 million downloads within a week of public testing and monthly active users exceeding 30 million within 23 days [4] Qianwen APP's Strategic Approach - Qianwen APP's rise is attributed to its strategic adjustment addressing industry pain points through a "technology + scenario + ecosystem" framework, validating Alibaba's "user-first, AI-driven" strategy [6] - The app leverages Alibaba's Qwen series models, which are competitive with leading closed-source models, enhancing its capabilities in logical reasoning and long-text processing [6][8] Future Development Trends - The language model industry is expected to enter a new development cycle characterized by technological integration, ecological symbiosis, and value orientation [9] - Future models will focus on deep multi-modal integration and vertical precision, with open-source models driving innovation and reducing costs for small and medium enterprises [9] Conclusion - The language model industry is at a critical juncture, transitioning from technological explosion to industrial prosperity, with Qianwen representing a significant breakthrough in both domestic and global markets [10]
腾讯混元2.0上线
Di Yi Cai Jing· 2025-12-05 14:13
Core Insights - Tencent's latest language models, Tencent HY 2.0 Think and Tencent HY 2.0 Instruct, have been officially released, showcasing advancements in AI technology [2] Group 1: Model Specifications - The HY 2.0 model utilizes a mixture of experts (MoE) architecture, featuring a total of 406 billion parameters and 32 billion active parameters [2] - The model supports a context window of 256,000 tokens, enhancing its capability for processing large amounts of data [2] Group 2: Improvements Over Previous Version - Compared to the previous version (Hunyuan-T1-20250822), HY 2.0 Think has significantly improved pre-training data and reinforcement learning strategies [2]
观点分享:VLA解决的是概念认知,无法有效的建模真实世界的四维时空?
自动驾驶之心· 2025-10-14 07:12
Core Viewpoint - The article discusses the importance of world models in intelligent driving, emphasizing that true understanding of the environment requires a high-bandwidth cognitive system rather than merely extending language models [2][3][5]. Summary by Sections World Model vs. Language Model - The world model focuses on spatiotemporal cognition, while the language model addresses conceptual cognition. Language models have low bandwidth and sparsity, making them ineffective for modeling the real world's four-dimensional space-time [2][3]. - The world model aims to establish capabilities directly at the video level, rather than converting information into language first [3][4]. VLA and WA - VLA (Vision-Language Architecture) is essentially an extension of language models, adding new modalities but still rooted in language. In contrast, the world model seeks to create a comprehensive cognitive system [3][5]. - The ultimate goal of autonomous driving is to achieve open-set interactions, allowing users to express commands freely without being limited to a fixed set of instructions [3][4]. Importance of Language - Language remains crucial for three main reasons: 1. Incorporating physical laws such as gravity and inertia into the model [6]. 2. Understanding and predicting object movements in three-dimensional space over time [6]. 3. Absorbing vast amounts of data from the internet, which aids in training autonomous driving systems [7]. Integration of Models - The combination of language models (conceptual cognition) and world models (spatiotemporal cognition) is essential for advancing towards Artificial General Intelligence (AGI) [8]. Industry Trends - The autonomous driving industry is experiencing intense competition, with many professionals considering transitioning to embodied AI due to the saturation of current technologies [9]. - The ongoing debate between VLA and WA represents a larger industry transformation, highlighting the need for innovative solutions to break through current limitations [9]. Community and Resources - A community platform has been established to facilitate knowledge sharing and collaboration among professionals in the autonomous driving field, featuring resources such as learning routes, technical discussions, and job opportunities [25][26].
Qwen3-Max-Preview 上线,官方称系通义千问系列最强大的语言模型
Sou Hu Cai Jing· 2025-09-06 10:03
Core Insights - Alibaba's Tongyi Qwen has launched the latest Qwen-3-Max-Preview model, which is described as the most powerful language model in the Tongyi Qwen series [1] - The Qwen-3-Max model offers significant improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage compared to the January 2025 version [1][3] - The model supports over 100 languages and is optimized for retrieval-augmented generation (RAG) and tool invocation, although it does not include a dedicated "thinking" mode [1][3] Pricing and Performance - The input price for using the Qwen-3-Max model is $1.20 per million tokens, while the output price is $6 per million tokens [2][5] - The model can handle a context of up to 256,000 tokens, with a maximum output of 32,800 tokens [5] Technical Enhancements - Qwen-3-Max provides higher accuracy in mathematical, coding, logic, and scientific tasks, and it reliably follows complex instructions in both Chinese and English [1][3] - The model reduces hallucinations and generates higher-quality responses for open-ended questions, writing, and conversation [1][3]
大佬面对面!斯坦福2025 CS336课程全公开:从零开始搓大模型~
自动驾驶之心· 2025-06-24 11:47
Core Viewpoint - The article discusses the launch of Stanford University's CS336 course "Language Models from Scratch," which aims to provide a comprehensive understanding of language models through practical development and implementation [5][7]. Course Overview - The course focuses on the foundational aspects of language models, which are essential for modern natural language processing (NLP) applications. It emphasizes the importance of understanding language models for scientists and engineers in the fields of AI and ML [5][7]. - The course is structured into five major modules: Foundations, Systems, Extensions, Data, and Alignment & Reinforcement Learning [7]. Course Requirements - Students are expected to have proficiency in Python, as most assignments will require extensive coding. The course will provide minimal scaffolding, resulting in a higher volume of code written by students compared to other AI courses [7]. - A background in deep learning and system optimization is necessary, particularly familiarity with PyTorch and basic system concepts like memory hierarchy [7]. - Foundational knowledge in calculus, linear algebra, probability, and statistics is required, along with a basic understanding of machine learning principles [7]. Assignments - The course includes several assignments that cover various aspects of language model development, such as implementing a BPE tokenizer, training models on specific datasets, and optimizing performance on GPUs [8]. - Assignments are designed to simulate real-world challenges, including data processing and model alignment, with a focus on practical application and hands-on experience [8]. Course Schedule - The course is structured with a detailed schedule that outlines topics, materials, and deadlines for assignments, ensuring a systematic approach to learning [9].