开源大语言模型
Search documents
开源首次追平GPT-5!DeepSeek-V3.2:推理与效率兼得
自动驾驶之心· 2025-12-18 09:35
DeepSeek-V3.2 与其同类模型的基准测试结果。 开源模型的三大痛点 要理解DeepSeek-V3.2的突破性,首先需要正视当前开源模型普遍面临的三大核心困境。 从 架构层面 看,传统开源模型大多依赖 标准注意力机制(vanilla attention) ,这种机制在处理长序列文本时,计算复杂度会随序列长度的平方增长 (O(L²)),不仅导致推理速度缓慢,更限制了模型在长上下文场景中的部署与后续训练优化。 点击下方 卡片 ,关注" 大模型之心Tech "公众号 戳我-> 领取大模型巨卷干货 在 大语言模型 (LLM)的发展赛道上,闭源与开源阵营的实力差距曾一度呈现扩大态势。随着OpenAI等巨头持续加码算力与数据投入,其闭源模型在 复杂推 理、工具使用 等核心能力上不断突破;而开源社区虽不乏创新尝试,但受限于架构效率、训练资源等多重因素,在高端任务场景中始终难以望其项背。这种不 平衡的发展格局,让业界对开源模型的上限充满疑虑——开源LLM是否注定只能成为闭源模型的"简化版替代品"? 面对这一趋势,DeepSeek团队并未止步,而是通过系统性技术创新,推出了 DeepSeek-V3.2 。这款兼顾计算效 ...
OpenAI时隔六年再开源
Cai Jing Wang· 2025-08-06 03:37
Core Insights - OpenAI has released two new open-source AI models, GPT-oss-120b and GPT-oss-20b, marking the first introduction of new open-source large language models since the release of GPT-2 in 2019 [1] - The release was initially planned for March but was delayed until August 6, following a global open-source movement sparked by DeepSeek earlier this year [1] - Both models are released under a permissive Apache 2.0 license, allowing businesses to use them commercially without prior payment or licensing [1] - OpenAI CEO Sam Altman described GPT-oss as a significant breakthrough, claiming it offers advanced open-weight inference capabilities comparable to o4-mini, and can be run locally on personal computers or smaller devices [1]
速递|10亿美金挑战DeepSeek,红杉、光速资本押注,Reflection AI开源模型守塔
Z Potentials· 2025-08-05 02:59
Core Insights - Reflection AI, a startup founded by former Google DeepMind researchers, is negotiating over $1 billion in funding to develop open-source large language models, competing with companies like DeepSeek, Mistral, and Meta [1] - The company has raised $130 million in venture capital from investors such as Lightspeed Venture Partners and Sequoia Capital, with a previous valuation of $545 million [1] - The founders aim to position Reflection AI as a leading provider of open-source AI models in the U.S., driven by the rising popularity of Chinese AI models [1] Funding and Valuation - Reflection AI is in discussions for a funding round exceeding $1 billion, with specific valuation details yet to be disclosed [1] - The company has successfully raised $130 million in its previous funding round, achieving a valuation of $545 million [1] Product Development - Reflection AI has been developing a programming assistant named Asimov, which analyzes enterprise data to generate relevant application code [3] - The product has launched a preview version and is beginning to generate revenue from enterprise clients [3] Market Dynamics - The demand for AI models in the Chinese market is driving Reflection AI's expansion into open-source AI model development [3] - Open-source models are seen as more cost-effective and flexible compared to proprietary models, allowing companies to fine-tune models for specific business processes [4] Competitive Landscape - As of now, no open-source models in the top 30 rankings on LMArena are developed by U.S. companies, highlighting a competitive gap [3] - Meta, a prominent open-source AI developer, is restructuring its AI business after its latest model underperformed compared to DeepSeek [2] Cost of AI Model Training - Training AI models is expensive, with OpenAI projecting to spend over $7 billion on model training this year, potentially reaching $17 billion by 2026 [5]