Workflow
Artificial Intelligence
icon
Search documents
好家伙!GPT-4o 学习“波多野结衣”的次数,比“您好”还多 2.6 倍
程序员的那些事· 2025-10-20 14:39
Core Viewpoint - The article discusses the findings of a recent research paper that reveals significant contamination in the training data of large language models (LLMs) like ChatGPT, particularly highlighting the prevalence of inappropriate content related to adult film star "波多野结衣" [5][8][9]. Group 1: Research Findings - Researchers from Tsinghua University and Nanyang Technological University discovered that over 23% of long Chinese tokens in GPT's vocabulary are associated with gray content such as pornography and gambling, indicating severe contamination in the model's Chinese vocabulary [5][8]. - The study identified and quantified these contaminated tokens, termed "污染中文词元" (PoC Tokens), suggesting that content related to "波多野结衣" may constitute as much as 0.5% of the training data for GPT-4o, which is 2.6 times more frequent than the common greeting "你好" [9][11]. - The presence of PoC Tokens poses a risk to AI, as it may lead to erratic responses and a lack of coherence when processing pure Chinese content [12][11]. Group 2: Implications for AI Models - The findings highlight a significant bias in the training data, which may explain why some models struggle with authentic and clean Chinese language processing [11]. - The widespread existence of PoC Tokens reflects the serious challenges faced by the current Chinese web corpus used for training LLMs, suggesting a need for improved data curation [14]. - The article also references a recent lawsuit against Meta by adult film companies for allegedly using pirated content to train AI, further emphasizing the ongoing issues surrounding content sourcing for AI training [14].
云知声:基础大模型和智能体平台商业化取得实质性进展
Zhi Tong Cai Jing· 2025-10-20 14:38
Core Viewpoint - The company, Yunzhisheng (09678), has made significant progress in the commercialization of its foundational large model and intelligent agent platform, collaborating with well-known domestic enterprises to implement projects valued at nearly RMB 200 million, with delivery planned by the end of 2025 [1] Group 1 - The company has established deep cooperation with domestic partners to advance its foundational large model and intelligent agent projects [1] - The implementation of these projects is based on the company's proprietary computing scheduling platform, Shanhai large model, and Shouya Agent intelligent platform [1] - This progress signifies a solid step forward in the commercialization process of the company's foundational large model and intelligent agent platform [1] Group 2 - The advancements further validate the company's technological leadership in foundational large models and intelligent agent platforms [1] - The company plans to deepen its research and commercialization efforts in the smart living sector, leveraging its strong advantages [1]
Buy 3 Mid & Small-Cap AI Infrastructure Stocks to Enrich Gains in 2026
ZACKS· 2025-10-20 13:56
Industry Overview - The artificial intelligence (AI) infrastructure segment is experiencing significant momentum, driven by a bullish demand scenario, with expectations of transformative changes across various fields over the next five years, including hyperscale automation, robotics, healthcare, energy, materials, financials, and cybersecurity [1] Company Summaries Innodata Inc. (INOD) - Innodata is positioned as a key player in the AI revolution, providing essential data for training advanced language models, with a focus on long-term demand from big tech, enterprises, federal agencies, and healthcare [3][4] - The company has launched a GenAI Test and Evaluation Platform aimed at validating large language models, enhancing its integration with major tech investments [5] - Innodata's expected revenue growth rate is 42.8% for the current year, with earnings growth projected at -6.7%, while next year's revenue and earnings growth rates are expected to be 26.6% and 46.6%, respectively [9][10] Five9 Inc. (FIVN) - Five9 provides intelligent cloud software for contact centers, benefiting from rising subscription revenues and the adoption of AI tools, particularly through its Intelligent CX Platform powered by Five9 Genius AI [11][13] - The platform includes features such as interactive virtual agents and AI insights, optimizing customer interactions across multiple channels [12] - Five9's expected revenue growth rate is 10.1% for the current year, with earnings growth at 16.6%, and for the next year, both revenue and earnings growth are projected at 9.6% and 8.5%, respectively [15] UiPath Inc. (PATH) - UiPath offers an end-to-end automation platform with embedded AI, machine learning, and natural language processing capabilities, enhancing decision-making and information processing [16][17] - The company has introduced new generative AI features to improve automated AI models tailored for specific business needs [17] - UiPath's expected revenue growth rate is 10.1% for the current year, with earnings growth at 22.6%, and for the next year, revenue and earnings growth rates are projected at 8.1% and 11.3%, respectively [18]
云知声(09678):基础大模型和智能体平台商业化取得实质性进展
智通财经网· 2025-10-20 13:41
Core Insights - The company, Yunzhisheng (09678), has made significant progress in the commercialization of its foundational large model and intelligent agent platform [1] - The company has partnered with well-known domestic enterprises to implement projects valued at nearly RMB 200 million, with delivery planned by the end of 2025 [1] - This advancement indicates the company's leading position in the technology of foundational large models and intelligent agent platforms [1] Summary by Categories Commercialization Progress - The company has achieved substantial progress in the commercialization of its foundational large model and intelligent agent platform [1] - The implementation of projects worth approximately RMB 200 million is underway, showcasing the company's ability to attract significant partnerships [1] Technological Leadership - The advancements signify further recognition of the company's technological leadership in foundational large models and intelligent agent platforms [1] - The company plans to deepen its research and commercialization efforts in the smart living sector, leveraging its strong advantages [1]
云知声(09678) - 自愿性公告最新业务进展
2025-10-20 13:35
UNISOUND AI TECHNOLOGY CO., LTD. 雲知聲智能科技股份有限公司 (於中華人民共和國註冊成立的股份有限公司) (股份代號:9678) 香港交易及結算所有限公司及香港聯合交易所有限公司對本公告的內容概不負責,對其準確性或完整性亦不發表 任何聲明,並明確表示概不就因本公告全部或任何部分內容而產生或因倚賴該等內容而引致的任何損失承擔任何 責任。 本公告乃由雲知聲智能科技股份有限公司(「本公司」)自願作出,旨在知會本公司股東 及潛在投資者有關本公司的最新業務發展情況。 本公司董事會(「董事會」)欣然宣佈,本公司基礎大模型和智能體平台商業化取得實質 性進展,近日與國內知名企業等合作夥伴深入合作,將基於自有知識產權的算力調度平 台、山海大模型以及獸牙Agent智能體平台,實現價值近人民幣2億元的基礎大模型和智 能體項目實施部署,計劃將於2025年底前完成交付。 該進展意味著本公司在基礎大模型和智能體平台商業化落地進程中邁出了堅實的一步, 同時也標誌著公司基礎大模型和智能體平台技術的領先性得到進一步認可。本公司將 在具備深厚優勢的智慧生活領域進一步深耕大模型和智能體的研發及商業化落地。 承董事會 ...
DeepSeek开源新模型!单张A100日处理可超20万页数据
Di Yi Cai Jing· 2025-10-20 13:23
Core Insights - DeepSeek has released a new OCR model named DeepSeek-OCR, which focuses on context optical compression to address the challenges faced by large language models in processing long texts [1][4][6] Group 1: Model Features - The DeepSeek-OCR model utilizes visual modalities to efficiently compress text information, achieving nearly 10 times lossless context compression while maintaining an OCR accuracy of over 97% [4][5] - The model consists of two main components: DeepEncoder for image feature extraction and compression, and DeepSeek3B-MoE for reconstructing text from compressed visual tokens [5] - The model's architecture allows it to have the expressive power of a 30 billion parameter model while maintaining the inference efficiency of a 500 million parameter model [5] Group 2: Potential Applications - The research indicates potential applications in long context compression and the memory forgetting mechanisms of large models, simulating human memory decay by gradually reducing the size of rendered images over time [5][6] - This approach could represent a significant breakthrough in handling ultra-long contexts, balancing theoretically infinite context information [6] Group 3: Industry Reception - The release of the DeepSeek-OCR model has garnered positive attention, receiving over 1,400 stars on GitHub shortly after its launch [7] - There are mixed opinions in the market regarding DeepSeek's pace of innovation, with some suggesting that the company is focusing on internal development for future models [8]
世界模型:机器能否理解现实?
3 6 Ke· 2025-10-20 13:01
Core Concept - The article discusses the concept of "world models" in artificial intelligence (AI), which are internal representations of the environment that AI systems use to evaluate predictions and decisions before executing tasks [1][4]. Group 1: Definition and Importance of World Models - World models are considered essential for building intelligent, scientific, and safe AI systems, as emphasized by leading figures in deep learning [1]. - The idea of a world model has historical roots, dating back to Kenneth Craik's 1943 proposal of a "small-scale model" in the brain that allows organisms to simulate various scenarios [2]. Group 2: Historical Context and Evolution - Early AI systems like SHRDLU demonstrated the use of world models but struggled with scalability and complexity in real-world environments [3]. - The rise of machine learning and deep learning has revitalized the concept of world models, allowing AI to build internal approximations of environments through trial and error [3]. Group 3: Current Challenges and Perspectives - Despite the potential of world models, there is still a lack of consensus among researchers regarding their definition, content, and verification methods [2]. - Current generative AI models, such as large language models (LLMs), exhibit heuristic rules but lack a coherent and unified world model, leading to inconsistencies in their outputs [4][6]. Group 4: Future Directions and Research Focus - Researchers are exploring how to develop robust and verifiable world models, which could enhance AI's reliability and interpretability [6][7]. - There are differing opinions on how to create these models, with some suggesting that sufficient multimodal training data could naturally lead to their emergence, while others advocate for entirely new architectures [7].
OpenAI也缺卡,僧多粥少,自曝内部抢卡抢到发疯
3 6 Ke· 2025-10-20 12:54
Core Insights - OpenAI is facing a severe scarcity of computing power, which has become a critical issue for the company and the AI industry as a whole [1][10] - The competition for computing resources within OpenAI is intense, impacting the release of innovative products [3][9] Resource Allocation Mechanism - OpenAI has a structured resource allocation mechanism that divides resources between research and application sides, with decisions made by senior management [5] - The allocation within the research domain is determined by the Chief Scientist and the Research Director [7] - A dedicated team is responsible for managing and reallocating idle GPU resources to meet the demands of various projects [8] Industry Implications - The internal competition for computing power at OpenAI reflects broader trends in the AI industry, where computing power directly influences AI capabilities [9][10] - OpenAI's significant investment in computing power, amounting to $7 billion last year, indicates its commitment to securing a competitive edge in the AI market [10] - Other companies, such as Meta, are also recognizing the importance of computing resources as a competitive advantage [12][13]
AI进化速递 | 宇树发布H2仿生机器人
Di Yi Cai Jing· 2025-10-20 12:51
Group 1 - Yushu Technology has launched the H2 bionic humanoid robot, which stands 180 cm tall and weighs 70 kg [1] - The DeepSeek team has open-sourced a new model called DeepSeek-OCR [1] - A Chinese research team has made significant breakthroughs in robotic algorithms, proposing the world's first unified theory of "force-position hybrid control algorithms" [1] Group 2 - Meta has introduced parental control options for its AI chatbot to ensure the safety of teenagers [1] - IBM is collaborating with the American AI company Groq to accelerate enterprise AI deployment [1]
C3.ai, Inc. Class Action: The Gross Law Firm Reminds C3.ai Investors of the Pending Class Action Lawsuit with a Lead Plaintiff Deadline of October 21, 2025 - AI
Prnewswire· 2025-10-20 12:45
Core Viewpoint - The Gross Law Firm has issued a notice to shareholders of C3.ai, Inc. regarding a class action lawsuit due to allegations of misleading statements and concealment of material facts that negatively impacted the company's stock performance [1][2]. Summary by Relevant Sections Allegations - The complaint alleges that C3.ai's management provided overly positive statements while concealing significant issues, particularly the health of the CEO, which affected the company's ability to close deals [1]. - The company failed to execute its profit and growth potential due to these undisclosed challenges [1]. Financial Impact - On August 8, 2025, C3.ai announced disappointing preliminary financial results for Q1 of fiscal 2026 and reduced its revenue guidance for the full fiscal year 2026 [1]. - The stock price dropped from $22.13 per share on August 8, 2025, to $16.47 per share on August 11, 2025, marking a decline of approximately 25.58% in just one day [1]. Class Action Details - Shareholders who purchased shares during the specified class period (February 26, 2025, to August 8, 2025) are encouraged to register for the class action [2]. - The deadline for seeking lead plaintiff status is October 21, 2025, and there is no cost to participate in the case [2].