深度思考
Search documents
国产大模型高考出分了:裸分683,选清华还是北大?
量子位· 2025-06-26 06:25
Core Insights - The article discusses the performance of various AI models in a simulated high school examination, comparing their scores and capabilities in different subjects [2][12]. Group 1: Overall Performance - Gemini achieved the highest score in science with 655 points, while Doubao scored 683 points in humanities, also ranking first [2]. - Doubao excelled in six subjects, maintaining top scores except in mathematics, chemistry, and biology [3][4]. Group 2: Subject-Specific Analysis - In the subject breakdown, Doubao scored 128 in Chinese, 141 in mathematics, and 144 in English, while Gemini scored 126 in Chinese and 140 in mathematics [3]. - The models showed significant improvement in mathematics compared to previous years, with most scoring around 140 points [13]. - Doubao and Gemini demonstrated better performance in visual comprehension tasks compared to other models, particularly in chemistry [22][42]. Group 3: Evaluation Methodology - The evaluation used a combination of national and provincial exam papers, with a total score of 750 points [9]. - Scoring was conducted through a mix of automated assessments and human evaluations, ensuring a fair testing environment [10][11]. Group 4: Model Development and Improvement - Doubao's advancements are attributed to three key strategies: multi-modal integration, enhanced reasoning capabilities, and dynamic thinking abilities [30][33][40]. - The model's training involved a three-phase process focusing on text, multi-modal data, and long-context support, significantly improving its performance in reading comprehension and reasoning tasks [35][36]. Group 5: Future Directions - The article suggests that combining text and image inputs can significantly enhance model performance, indicating a promising area for future exploration [42][43].
习惯问“为什么”和“怎么做”的人,差距到底有多大?
洞见· 2025-06-24 10:06
Core Viewpoint - The article emphasizes the importance of asking "why" instead of "how" to foster deep thinking and uncover fundamental solutions to problems [3][6][85]. Group 1: Importance of Asking "Why" - Individuals who habitually ask "why" engage in deep thinking and seek their own answers, leading to personal growth and success [6][85]. - In contrast, those who only ask "how" tend to rely on others for solutions, which can hinder their ability to think critically and solve problems independently [4][5]. Group 2: Stories Illustrating the Concept - A factory faced a production defect where some soap boxes were empty. Instead of spending millions on high-tech solutions, a worker suggested using a fan to blow away the empty boxes, costing only $200 [8][15]. - A teacher, noticing students skipping class, asked why they were doing so and learned that the class was boring. He then transformed his teaching method into a game, making the class more engaging [22][26]. - In a Stanford entrepreneurship class, one group of students asked why they were limited to making money with $5. They realized they could sell their time instead, earning $650 by selling 10 minutes of class time to a company [31][38]. - Two fruit salesmen faced complaints about rotten fruit. One offered refunds, while the other investigated the root cause and implemented a promotional strategy to reduce the sales cycle, leading to his promotion [42][49]. Group 3: The Process of Deep Thinking - The article discusses the "5 Why" method, which involves asking "why" multiple times to reach the root cause of a problem. This method can lead to effective solutions rather than superficial fixes [68][70]. - An example is given where a museum's wall corrosion was traced back to the use of a corrosive cleaner due to an abundance of bird droppings, which was ultimately caused by light attracting insects. The solution was to install blackout curtains [71][80]. Group 4: Conclusion on Deep Thinking - The article concludes that asking "why" leads to a deeper understanding of issues, allowing individuals to address the root causes rather than just the symptoms. This approach is essential for effective problem-solving and personal development [53][84].
长脑子最快的方式,是去做这6件事
洞见· 2025-06-16 10:19
Core Viewpoint - The article emphasizes the importance of deep thinking and diverse knowledge acquisition to combat cognitive decline and enhance personal growth in an era dominated by superficial information consumption [3][6][80]. Group 1: Importance of Documentaries - Documentaries serve as a means to broaden perspectives and combat ignorance, providing insights into human nature and the world [9][11]. - Watching documentaries is highlighted as a valuable way to gain knowledge without the need for significant financial investment [9]. Group 2: TED Talks - TED Talks are presented as influential platforms where top experts share profound insights, helping individuals navigate life's challenges [22][25]. - The article lists ten recommended TED Talks that cover various themes, including personal growth and productivity [26][27]. Group 3: High-Quality Films - High-rated films are suggested as a medium to experience diverse human emotions and cultural narratives, enhancing understanding of life [30][34]. - The article includes a list of ten acclaimed films that explore themes of humanity, emotion, and personal growth [38]. Group 4: Debate Competitions - Watching debate competitions is recommended for improving critical thinking and logical reasoning skills [41][42]. - The article suggests ten notable debates that showcase sharp arguments and diverse viewpoints [45]. Group 5: Biographies - Reading biographies is encouraged as a way to learn from the experiences of influential figures, providing guidance for personal development [56][59]. - The article lists ten biographies of notable individuals that can inspire and inform readers [60][64]. Group 6: Online Courses - The article advocates for engaging with online courses to enhance knowledge and skills, leveraging the accessibility of the internet [70][72]. - A selection of ten recommended online courses is provided, covering various subjects that promote critical thinking and understanding of human behavior [76].
守护孩子的记忆力(纵横)
Ren Min Ri Bao· 2025-06-11 22:11
Group 1 - The core issue highlighted is the phenomenon of "digital amnesia" among adolescents, which is attributed to the deep integration of digital scenarios into daily life, accelerated modern life pace, and unhealthy lifestyle choices [1] - Information overload and fragmented reading are identified as primary causes, leading to shallow reading and thinking among the youth due to constant exposure to vast amounts of online information [1] - Poor lifestyle habits, such as insufficient sleep, unbalanced diet, and lack of physical exercise, are noted to weaken the brain's memory functions [1] Group 2 - Recommendations for improving memory retention in children include enhancing media literacy education to help them discern the authenticity and value of information, thereby reducing distractions from ineffective information [2] - It is suggested to manage digital life wisely, using technology as a tool to enhance cognitive abilities rather than a crutch, while creating opportunities for cognitive training in daily life [2] - Emphasizing the importance of a healthy lifestyle, including quality sleep, balanced diet, and regular physical activity, is crucial for nurturing brain development and memory enhancement [2]
读书是一种被高估的美德
Hu Xiu· 2025-06-10 05:57
Group 1 - Reading is often perceived as a sacred virtue, equating readers with being cultured and deep thinkers, where the thickness of one's bookshelf symbolizes the depth of thought [1][2] - In a commodified society, reading individuals are positioned at a moral and intellectual high ground, often used by the wealthy and celebrities to enhance their cultural image [2] - The act of reading is fundamentally a passive cognitive activity, where individuals may lose their critical thinking abilities over time, as it involves repeating the author's thought process rather than creating original thoughts [3][4] Group 2 - Reading is primarily an input behavior that is simple and does not require significant cognitive resources compared to writing or research, leading to an overestimation of its value [6] - The cultural context often associates reading with moral superiority, ignoring the fact that the barriers to reading are low, thus it cannot be a reliable measure of moral character [7] - Reading is a neutral activity that does not inherently elevate or degrade one's character, and its consumption can be likened to other forms of entertainment, such as watching short videos [8] Group 3 - The true value of reading lies not in the quantity of books read but in the ability to internalize knowledge and thoughts through critical thinking and reconstruction [8][9] - A rational perspective on reading involves being aware of the illusion that "input equals justice" and maintaining a critical stance towards the material consumed [9] - Output is emphasized as being more important than input, advocating for conscious engagement in active output to enhance the learning process [10][12] Group 4 - The capabilities of artificial intelligence in processing large volumes of text highlight the need for human skills in knowledge reconstruction and deep thinking, which cannot be easily replaced by technology [11][12] - The ability to think critically, create, and apply knowledge is what distinguishes human cognition from mere data processing [12]
别让AI替你做判断
虎嗅APP· 2025-06-05 23:46
Core Viewpoint - The article discusses the phenomenon of "cognitive outsourcing" due to the increasing reliance on AI for information processing and decision-making, which may lead to a decline in critical thinking and independent analysis skills. Group 1: Cognitive Outsourcing - The reliance on AI tools is creating a dependency on "cognitive outsourcing," where individuals are encouraged to think less and rely more on AI for information processing [2][3][4]. - AI's ability to reduce cognitive load through features like one-click summaries and intelligent recommendations is leading to a decrease in active information filtering and judgment [3][4][5]. - The trend of cognitive outsourcing is evident as more people trust AI tools, resulting in diminished confidence in independent analysis when faced with complex problems [4][5][6]. Group 2: Impact on Critical Thinking - Frequent reliance on AI has been linked to difficulties in independent reasoning, with users experiencing a decline in cognitive sharpness compared to periods without AI assistance [5][6][9]. - Companies are systematically integrating AI into workflows, which, while seemingly increasing efficiency, may also weaken critical thinking abilities among employees [6][7]. - The article highlights a shift in academic environments, where students increasingly use AI for research and analysis, leading to a passive learning approach [7][8]. Group 3: The Role of Experience and Understanding - The article argues that experience is becoming a "compressed capsule," with individuals relying on AI to generate solutions rather than internalizing knowledge through experience [17][18]. - Certain types of knowledge and experience, particularly those requiring intuition and hands-on practice, cannot be replaced by AI, emphasizing the need for a balance between AI tools and human judgment [18][19]. - The understanding of complex concepts requires a foundational knowledge that cannot solely depend on AI, as true comprehension involves active engagement and critical thinking [15][16]. Group 4: The Future of Human-AI Interaction - The article suggests that as AI becomes more integrated into daily tasks, individuals must find their role in this evolving landscape, transitioning from creators to users of AI technology [25][26]. - There is a call for individuals to maintain their judgment and creativity in the face of increasing AI influence, ensuring that technology serves as a tool rather than a replacement for human thought [26][27]. - The ultimate boundary of AI's role is proposed to be in processing "what" and "how," while the "why" must remain a human domain, highlighting the importance of maintaining human agency in decision-making [23][24].
赚钱第一步,学会深度思考
洞见· 2025-05-29 18:21
洞见 ( DJ00123987 ) —— 不一样的观点,不一样的故事, 3000 万人订阅的微信大号。点击标题下蓝字 " 洞见 " 关注,我们将为您提供有价值、有意思的 延伸阅读。 作者:yy 来源:每晚一卷书 (ID: JYXZ89896) 靠体力和时间,永远赚不到大钱。 ♬ 点上方播放按钮可收听洞见主播佳音 朗读音频 "硅谷投资教父"纳瓦尔曾提出一个观点: 现代社会中,想要赚钱,努力只是一个常规要素,并非决定性因素。 工作多年,他见到那些职场中最忙碌的人,普遍薪水不高。 先讲个商铺老板的故事。 在"小家电之乡"余姚,有成千上万个小家电铺子。 为了拉订单,初中学历的他,到处翻看商业案例,学习商业知识。 有一次,他忽然注意到了"差异化经营"的字眼。 01 不论客户要什么小家电,不出一个镇,商户都能迅速找齐配件,组装成品。 所以家家户户都做全品类生意,谁能拿到订单全凭运气。 而谷文杰,就是这些商铺老板中的一员,刚开始时,生意也是不温不火。 在他看来,一个人赚不到钱,不是不够努力,而是思考太少。 因为思考一少,整个人就如生活在流水线之中,每一天都是机械地重复。 作家李尚龙说:靠体力和时间,永远赚不到大钱。 赚钱的 ...
一场对话,我们细扒了下文心大模型背后的技术
量子位· 2025-05-22 12:34
Core Viewpoint - The article discusses the advancements in large models, particularly focusing on the performance of Baidu's Wenxin models, which have achieved high ratings in recent evaluations, indicating their strong capabilities in reasoning and multimodal integration [1][2]. Group 1: Model Performance and Evaluation - The China Academy of Information and Communications Technology (CAICT) recently evaluated large model reasoning capabilities, with Wenxin X1 Turbo achieving the highest rating of "4+" in 24 assessment categories [1]. - Wenxin X1 Turbo scored 16 items at 5 points, 7 items at 4 points, and 1 item at 3 points, making it the only large model in China to pass this evaluation [1]. Group 2: Technological Innovations - Wenxin models emphasize two key areas: multimodal integration and deep reasoning, with the introduction of technologies such as multimodal mixed training and self-feedback enhancement [6][11]. - The multimodal mixed training approach unifies text, image, and video modalities, improving training efficiency by nearly 2 times and enhancing multimodal understanding by over 30% [8]. - The self-feedback enhancement framework allows the model to self-improve, addressing challenges in data production and significantly reducing model hallucinations [13]. Group 3: Application Scenarios - In practical applications, Wenxin X1 Turbo demonstrates its capabilities in solving physics problems and generating code, with AI-generated code now accounting for over 40% of new code added daily [42][44]. - The technology supports over 100,000 digital human anchors, achieving a 31% conversion rate in live broadcasts and reducing broadcast costs by 80% [48]. Group 4: Market Potential and Future Directions - The global online education market is projected to reach 899.16 billion yuan by 2029, with large models playing a crucial role in this growth [49]. - The digital human market is expected to reach 48.06 billion yuan this year, nearly quadrupling from 2022, indicating significant opportunities for large model applications [49]. Group 5: Long-term Strategy and Vision - Baidu's approach to large models emphasizes continuous technological exploration and deepening, focusing on long-term value rather than short-term trends [57][58]. - The company maintains a dynamic perspective on the rapid evolution of technology, aiming to prepare for future industry transformations [58].
ICML 2025 | 大模型深度思考新范式:交替「推理-擦除」解决所有可计算问题
机器之心· 2025-05-15 06:04
Core Viewpoint - The article introduces a new deep thinking paradigm called PENCIL, which alternates between generation and erasure to efficiently solve complex reasoning tasks, outperforming traditional Chain-of-Thought (CoT) methods [1][3]. Group 1: PENCIL Paradigm - PENCIL operates by dynamically erasing unnecessary intermediate results during the reasoning process, allowing for a more efficient generation of final answers [3][6]. - The paradigm addresses limitations of traditional CoT, such as exceeding context window limits, difficulty in retrieving key information, and decreased generation efficiency as context length increases [5][10]. Group 2: Mechanism and Design - The erasure mechanism in PENCIL is inspired by logical rewriting rules and stack frame memory management in functional programming, utilizing special tokens to manage the process [8][9]. - PENCIL supports various reasoning modes, allowing for the simplification of complex thought processes and efficient backtracking during problem-solving [10][13]. Group 3: Training and Experimental Results - PENCIL demonstrates superior accuracy in solving larger-scale reasoning problems compared to CoT, maintaining high accuracy rates even as problem size increases [15][21]. - The training efficiency of PENCIL is enhanced by reducing the context length required for each token, leading to significant savings in computational resources [12][17]. Group 4: Theoretical Implications - Theoretically, PENCIL can simulate any Turing machine's operations with optimal time and space complexity, making it capable of efficiently solving all computable problems [23][24]. - PENCIL's approach allows it to maintain a context length that is polynomial in relation to the problem size, contrasting with the exponential context length required by traditional CoT methods [25][28].
为什么你的工作运总是不顺?
3 6 Ke· 2025-04-24 09:10
Core Viewpoint - The article emphasizes the importance of identifying the root causes of problems rather than getting distracted by superficial issues, advocating for a strategic approach to problem-solving in the workplace and beyond [2][4][22]. Group 1: Problem Identification - Many workplace issues are manifestations of larger underlying problems, and focusing solely on these surface-level issues leads to ineffective solutions [2][4]. - The article suggests that individuals should look beyond immediate concerns and identify the overarching challenges that need to be addressed [4][22]. Group 2: Competitive Positioning - Achieving a high salary or job position is linked to one's value ranking in the job market, which is influenced by competition [8][9]. - The article highlights that even if one cannot be among the top in the country, striving to be among the best in a local context can still yield significant benefits [9][10]. Group 3: Market Dynamics - The concept of information asymmetry is discussed, where individuals or companies may create a perception of higher value than their actual capabilities, leading to inflated market prices [11][14]. - The prevalence of "sham" companies that appear successful but lack substance is noted, driven by a scarcity of genuinely skilled professionals [13][14]. Group 4: Personal Strategy - Individuals facing ethical dilemmas in the workplace should evaluate their personal goals and decide whether to adapt to the existing environment or seek new opportunities [19][20]. - The article advises against trying to change the game without the necessary power or resources, suggesting a more pragmatic approach to navigating workplace challenges [21][22].