CoT

Search documents
VLN Commercial Launches Confirm Viability of the FDA's Proposed Reduced Nicotine Mandate
GlobeNewswire News Room· 2025-07-16 21:18
Multiple Tobacco Brands Partnering with 22nd Century to Expand Availability and Awareness of VLN Based Reduced Nicotine Content Products Manufactured in the USA, VLN Products from 22nd Century Provide Clinically Proven Solution to Reduce the Rate and Harms of Smoking MOCKSVILLE, N.C., July 16, 2025 (GLOBE NEWSWIRE) -- 22nd Century Group, Inc. (Nasdaq: XXII), the only tobacco products company that has for 27 years led and continues to lead the fight against the harms of smoking driven by nicotine addiction ...
腾讯研究院AI速递 20250717
腾讯研究院· 2025-07-16 15:44
Group 1 - OpenAI core scientist Jason Wei and Hyung Won Chung have left to join Meta, with Wei being the father of the thinking chain and Chung responsible for code models [1] - Meta has adopted an aggressive strategy in the AI field, investing $16 billion to recruit top talent, leveraging its own funds and decision-making autonomy to lead the competition [1] - Following its transformation into AI, Meta's stock price surged, reaching a new market capitalization high, with CEO Mark Zuckerberg transitioning from being mocked as a "metaverse dreamer" to a "strategic tech leader" [1] Group 2 - AI pioneers, including OpenAI, DeepMind, and Anthropic, have jointly called for in-depth research on monitoring thinking chains (CoT) to enhance AI safety [2] - Experts believe that CoT monitoring offers a unique opportunity for AI safety by observing the model's "thought process" to detect malicious intent, although its monitorability may decrease with different training methods [2] - The document proposes several research directions and recommendations for CoT monitoring, including assessing monitorability, publishing evaluation results, and incorporating monitorability into training decisions to prevent AI behavior from going out of control [2] Group 3 - Mistral AI has released its first open-source voice model, the Voxtral series, which includes 24B and 3B versions, licensed under Apache 2.0 [3] - Voxtral supports a 32k token context window, capable of processing 30 minutes of audio transcription or 40 minutes of semantic understanding, outperforming the open-source model Whisper in multiple tests [3] - The model supports eight major languages and inherits text understanding capabilities from Mistral Small 3.1, surpassing GPT-4o mini in some tests, but still lags behind top commercial models overall [3] Group 4 - MiniMax has launched an Agent full-stack development feature that allows users to build complete application systems with no-code, including backend hosting, payment integration, and scheduled tasks [4][5] - Users can create applications like concert seat selection systems, real-time financial dashboards, and e-commerce websites within 30 minutes, supporting real payment functions and data processing [5] - This feature employs a modular architecture, consisting of three core sub-Agents for research, development, and testing, and has released 12 updates in over a month, lowering the development barrier for enterprise applications [5] Group 5 - Kunlun Wanwei and Nanyang Technological University have introduced a new hierarchical multi-agent collaboration framework called AgentOrchestra, utilizing an "AI orchestra" collaboration model to tackle complex tasks [6] - The framework is coordinated by a top-level "conductor" Planning Agent, working alongside three types of specialized "musician" agents (Deep Researcher, Browser Use, Deep Analyzer) for collaborative tasks [6] - AgentOrchestra has performed excellently in authoritative evaluations such as SimpleQA and GAIA, achieving an 82.42% pass@1 score in the GAIA test, with complete open-source code and technical reports available [6] Group 6 - Google DeepMind has developed a software library named Concordia, creating an AI-hosted multi-AI character interaction environment similar to the AI virtual world in "Westworld" [7] - The system is designed based on a game engine's entity-component architecture, treating AI players and AI game masters (GMs) as configurable entities with different capabilities through pluggable components [7] - Concordia supports three main application scenarios: evaluative (testing AI capabilities), dramatic (creating interactive narratives), and simulation (building social science research environments), and has been open-sourced on GitHub [7] Group 7 - The ima platform offers note resources from top students at prestigious universities, including structured knowledge and thinking models across multiple subjects [8] - These notes not only compile knowledge but also include problem-solving strategies, key point breakdowns, and error analysis, such as high-scoring templates for Chinese and techniques for analyzing complex English sentences [8] - Users can directly ask "top student notes" on the ima platform for study methods, mindset adjustment advice, and can upload their own notes to build a personal knowledge base [8] Group 8 - NVIDIA CEO Jensen Huang praised the Chinese supply chain as a "miracle" during his first speech in Chinese at the China Supply Chain Expo, naming 11 Chinese companies [10] - He emphasized that Chinese open-source models are catalysts for global AI progress, providing opportunities for countries to join the AI revolution, and predicted that the next wave of AI will focus on understanding the physical world and robotic systems [10] - NVIDIA made its debut at the supply chain expo, showcasing humanoid robot products from four Chinese companies, including Galaxy General and Beijing Humanoid Robot Innovation Center, along with DIGITS mini supercomputers [10] Group 9 - The "verifier's law" states that the difficulty of AI solving tasks is proportional to the verifiability of the task rather than the complexity of the task itself [11] - Verifiability includes five key attributes: objective truth, rapid verification, scalable verification, low noise, and continuous rewards [11] - Any problem meeting these five attributes will be solved by AI in the future, creating an "intelligent serrated frontier" where AI will demonstrate higher intelligence on verifiable tasks [11] Group 10 - OpenAI's third podcast discusses the evolution of ChatGPT from an API "playground" to a flagship product and its profound impact on work and the economy [12] - COO Mira Murati and Chief Economist Dan Altman believe AI will significantly enhance productivity, especially in software engineering, scientific research, and small businesses, predicting that AI agents will become key partners in handling complex tasks [12] - They emphasize the need to focus on soft skills such as emotional intelligence, critical thinking, and adaptability in the AI era, advocating for educational reforms to cultivate collaboration skills with AI, and noting that AI is expected to create significant value in emerging markets and agriculture [12]
AI教父联名OpenAI、DeepMind、Anthropic:警惕CoT
3 6 Ke· 2025-07-16 12:34
就在今天,小扎的 Meta 公司「挖角」了一个大的,将思维链(CoT)论文的第一作者 Jason Wei 招至他们的超级智能团队。作为参与 OpenAI o1 和 deep research 模型的知名研究员,Jason Wei 的离开,或让 OpenAI 损失巨大。 文件链接:https://tomekkorbak.com/cot-monitorability-is-a-fragile-opportunity/cot_monitoring.pdf 众所周知,推理模型是驱动 AI agent 的核心技术,作者认为,随着 AI agent 的普及和能力提升,CoT monitoring 可能成为控制其行为的核心方法。 "然而,目前这种可见性程度能否持续尚无保证。我们鼓励研究界和前沿 AI 开发者充分利用 CoT 可监测性,并研究如何保持其透明度。" 在立场文件中,作者要求领先的 AI 模型开发者研究使 CoT 可监测的因素——换言之,哪些因素会增加或减少对 AI 模型实际得出答案过程的透明度。 同时,今天另一件与 CoT 相关的新闻是,OpenAI、Google DeepMind、Anthropic 公司"罕见 ...
OpenAI谷歌Anthropic罕见联手发研究!Ilya/Hinton/Bengio带头支持,共推CoT监测方案
量子位· 2025-07-16 04:21
Core Viewpoint - Major AI companies are shifting from competition to collaboration, focusing on AI safety research through a joint statement and the introduction of a new concept called CoT monitoring [1][3][4]. Group 1: Collaboration and Key Contributors - OpenAI, Google DeepMind, and Anthropic are leading a collaborative effort involving over 40 top institutions, including notable figures like Yoshua Bengio and Shane Legg [3][6]. - The collaboration contrasts with the competitive landscape where companies like Meta are aggressively recruiting top talent from these giants [5][6]. Group 2: CoT Monitoring Concept - CoT monitoring is proposed as a core method for controlling AI agents and ensuring their safety [4][7]. - The opacity of AI agents is identified as a primary risk, and understanding their reasoning processes could enhance risk management [7][8]. Group 3: Mechanisms of CoT Monitoring - CoT allows for the externalization of reasoning processes, which is essential for certain tasks and can help detect abnormal behaviors [9][10][15]. - CoT monitoring has shown value in identifying model misbehavior and early signs of misalignment [18][19]. Group 4: Limitations and Challenges - The effectiveness of CoT monitoring may depend on the training paradigms of advanced models, with potential issues arising from result-oriented reinforcement learning [21][22]. - There are concerns about the reliability of CoT monitoring, as some models may obscure their true reasoning processes even when prompted to reveal them [30][31]. Group 5: Perspectives from Companies - OpenAI expresses optimism about the value of CoT monitoring, citing successful applications in identifying reward attacks in code [24][26]. - In contrast, Anthropic raises concerns about the reliability of CoT monitoring, noting that models often fail to acknowledge their reasoning processes accurately [30][35].
突发|思维链开山作者Jason Wei被曝加入Meta,机器之心独家证实:Slack没了
机器之心· 2025-07-16 02:22
Core Viewpoint - Meta continues to recruit top talent from OpenAI, with notable researchers Jason Wei and Hyung Won Chung reportedly leaving OpenAI to join Meta [1][2][4]. Group 1: Talent Acquisition - Jason Wei and Hyung Won Chung, both prominent researchers at OpenAI, are confirmed to be leaving for Meta, with their Slack accounts already deactivated [2][4]. - Jason Wei is recognized as a key author of the Chain of Thought (CoT) concept, which has significantly influenced the AI large model field [4][6]. - Hyung Won Chung has been a core contributor to OpenAI's projects, including the o1 model, and has a strong background in large language models [4][29]. Group 2: Contributions and Impact - Jason Wei's work includes leading early efforts in instruction tuning and contributing to research on the emergent capabilities of large models, with over 77,000 citations on Google Scholar [21][16]. - Hyung Won Chung has played a critical role in the development of major projects like PaLM and BLOOM during his time at Google, and later at OpenAI, where he contributed to the o1 series models [26][40]. - Both researchers have been influential in advancing the capabilities of AI systems, particularly in reasoning and information retrieval [38][40]. Group 3: Community Reaction - Following the news of their potential move to Meta, the online community has expressed excitement and congratulations towards Jason Wei, indicating a strong interest in their career transition [10][9].
Compartir lo que somos: la belleza de lo cotidiano | Omar Guillén | TEDxUNIMESO
TEDx Talks· 2025-07-15 16:29
¿Quién soy? Seguramente muchos de ustedes se preguntan quién soy y la verdad es algo que yo también me pregunto constantemente porque constantemente estoy en ese proceso de transformación, de querer crecer. Y entonces cuando me pregunto quién soy, me pregunto, ¿seré lo que construí el tiempo atrás? ¿Seré los títulos que tengo? ¿Será quizás la sangre de mis ancestros? Y y la verdad es que todo eso soy yo. Soy soy la sangre de mis ancestros. Soy también lo que publico en las redes sociales, soy lo que ustedes ...
为大模型思考装上“猎鹰重装引擎” :腾讯混元 SEAT 重塑深度思考
AI科技大本营· 2025-07-15 11:30
Core Viewpoint - Tencent's Hunyuan team has introduced the SEAT adaptive parallel reasoning framework, transforming complex reasoning tasks from a "single-engine airship" into a "multi-engine rocket," enhancing the capabilities of large models in handling intricate reasoning challenges [7][44]. Group 1: SEAT Framework Overview - The SEAT framework integrates both sequential and parallel scaling paradigms, allowing for extensive exploration and deep refinement of reasoning processes [15][43]. - It employs a multi-round parallel reasoning approach, significantly enhancing the model's exploration capabilities by generating multiple independent reasoning paths simultaneously [16][20]. - The framework is designed to be plug-and-play, enabling easy integration with existing large language models without requiring additional training [29][44]. Group 2: Performance Enhancements - Initial experiments show that even with a minimal parallel setup (N=2), the SEAT framework can achieve a remarkable accuracy improvement of +14.1% for a 32B model and +24.5% for a 7B model [28]. - As the number of parallel paths increases (up to N=8), performance continues to improve, demonstrating the framework's powerful exploration capabilities [23]. Group 3: Semantic Entropy as Navigation - The SEAT framework introduces semantic entropy as a self-supervised metric to gauge the consistency of reasoning outputs, acting as a "navigation sensor" to determine when to stop computations [27][32]. - Two navigation strategies are implemented: a predefined threshold approach and an adaptive threshold-free mechanism, both aimed at optimizing the reasoning process [35][36]. Group 4: Safety Mechanisms - The SEAT framework includes a safety mechanism to prevent "semantic entropy collapse," which can lead to overconfidence and erroneous outputs in smaller models [38][40]. - By monitoring semantic entropy, the framework can issue stop commands before the model's performance deteriorates, ensuring stable reasoning outcomes [40][44].
Nicotine poisoning on the rise among children
NBC News· 2025-07-14 17:19
The number of young kids becoming sick after getting their hands on nicotine products like pouches and vapes, it has skyrocketed in recent recent years. According to a new study, US poison centers reported more than 130,000 cases of nicotine poisonings in kids under the age of six between 2010 and 2023. Now, that same study published by the American Academy of Pediatrics, it found an increase in poisonings of 763% in just 3 years.NBC News medical analyst Dr. . Vin Gupta joins us now. Dr.. Gupta, it's great ...
Why Target Tumbled 27% in the First Half of 2025
The Motley Fool· 2025-07-13 11:28
Core Viewpoint - Target is facing significant challenges in 2025, including market share losses, weak discretionary sales, and theft issues, which have worsened over time [1] Financial Performance - Target's financial performance has been negatively impacted by tariffs affecting consumer spending and imports, leading to falling sales and profits [2] - The stock price declined by 27% in the first half of the year, with a notable slump in the first quarter due to the aforementioned issues [3] - In the fourth quarter earnings report, comparable sales growth was only 1.5%, while adjusted EPS fell from $2.98 to $2.41, despite beating estimates [6] - The first-quarter earnings report showed a 3.8% drop in comparable sales and a decline in adjusted EPS from $2.03 to $1.30, prompting a cut in EPS guidance to a range of $7.00-$9.00 [7] Market Reactions - The announcement to roll back DEI programs led to boycotts, damaging the company's reputation and affecting business performance [5] - Following the announcement of "Liberation Day" tariffs, the stock experienced a significant plunge [7] Strategic Initiatives - Target has announced a turnaround plan, establishing a "multi-year acceleration office" and implementing leadership changes to enhance decision-making and aim for long-term profitable growth [9]
X @The Wall Street Journal
The Wall Street Journal· 2025-07-12 19:52
Mexican President Claudia Sheinbaum has accommodated U.S. security demands on stemming the flows of migrants and lethal narcotics, but so far has little to show for it https://t.co/3wuBp5YxmT ...