大语言模型幻觉 - filings, earnings calls, financial reports, news

大语言模型幻觉

Search documents

ICLR 2026 放榜了！28%接收率，欢迎投稿机器之心

机器之心· 2026-01-27 09:45

作为机器学习领域的顶级会议， ICLR 2026 将于 2026 年 4 月 23 日至 27 日在巴西里约热内卢举行。官方今年收到了有效投稿约 19000 篇，总录取率约为 28%，该录取率涵盖了所有经过同行评审的完整论文投稿，无论其是否撤稿。网友晒出成绩单录用通知一出来，网友们也坐不住了。社交平台上，很快被各种成绩单刷屏： | 11894 Optimal Sparsity of Mixture-of-Experts | 4 Official Reviews Submitted | ICLR 2026 Conference Submission | | --- | --- | --- | | Language Models for Reasoning Tasks | Reviewer fnio: Rating: 8 / Confidence: 4 | Recommendation: | | L Download PDF | Read Official Review | | | | Reviewer tKtK: Rating: 6 / Confidence: 3 | Accept | | Taishi ...

中泰资管天团 | 王路遥：投研人员的DeepSeek打开方式

中泰证券资管· 2025-03-06 08:58

Core Viewpoint - DeepSeek-R1 has achieved performance comparable to OpenAI's O1 model, indicating significant advancements in AI capabilities and its integration into everyday life, with over 1.1 billion app downloads and nearly 97 million weekly active users [1][6]. Group 1: Problem-Solving Approach - The initial step in problem-solving is redefining complex issues into clear, actionable sub-questions, which can be facilitated by DeepSeek's ability to break down problems into manageable parts [2][3]. - DeepSeek can help users generate a structured thought process, allowing for a more systematic approach to tackling complex problems, thus bridging the gap between broad questions and specific solutions [2][3]. Group 2: Question Formulation - Effective questioning is crucial; narrow and specific questions yield better responses from AI models. For instance, asking "What are the energy bureau's generator assembly targets?" is more effective than asking about broader industry trends [3][4]. - Users can leverage DeepSeek's contextual understanding to refine questions further, enhancing the depth of inquiry and leading to more insightful answers [3][4]. Group 3: AI as an Assistant - DeepSeek should be viewed as an assistant rather than a definitive source of truth, as it may generate inaccurate or misleading information, a phenomenon referred to as "hallucination" [4][5]. - The hallucination rate for DeepSeek-R1 is reported at 14.3%, highlighting the importance of verifying information and using the model for brainstorming and idea generation rather than for precise answers [5][6]. Group 4: Implications for Work and Life - The increasing integration of AI into daily tasks suggests that repetitive jobs will be increasingly automated, necessitating a shift towards independent thinking and judgment in professional settings [6][7]. - The ability to think critically and independently will become a key differentiator between human capabilities and AI, emphasizing the need for professionals to adapt to this evolving landscape [6][7].