Workflow
Seek .(SKLTY)
icon
Search documents
Llama 4发布:我看到了DeepSeek的影子
Hu Xiu· 2025-04-06 07:36
本文来自微信公众号:赛博禅心,作者:金色传说大聪明,题图来自:视觉中国 Llama 4 发布了。(https://huggingface.co/meta-llama) Llama 4 的三款模型 但这次,它没有高调宣称参数量"遥遥领先",而是通过三款模型来重新布局: MoE 大概就是这样 过去,MoE 更多还是"实验室选项",自 DeepSeek 大火后,很多厂商开始尝试将其用于主力模型,比如这次的 Meta。在 Llama 4 中,模型 Scout 配置 16 专 家,而 Maverick 则是 128 专家,推理时都只激活两个,17B的量。 回顾一下,DeepSeek 在 R1 和 V3 中也是类似:671B 总参数,37B 激活,用更可控的计算开销,换来模型能力密度的提升。 一个用、一主力、一教学,不卷彼此,也不试图通吃所有任务。 讲道理,看这个发布的时候,我总隐隐有当时读 DeepSeek V3 技术报告的感觉:拥抱 MoE,拥抱合成数据。 架构转向:MoE 登上主舞台 Lllma 3 是 Dense,哪怕 400B 的模型都是 Dense;而 Llama 4 是 MoE 架构。 (关于架构的问题,推 ...
关税刷屏的一周,AI圈也“暗流涌动”:Llama 4来了,O3和O4-mini也要来了,DeepSeek R2和GPT-5也不远了?
Hua Er Jie Jian Wen· 2025-04-06 07:01
本文作者:鲍奕龙 来源:硬AI 本周全球被关税议题占据头条,但科技界的目光却聚焦在AI领域的密集动作上。 周末,Meta深夜突袭发布Llama 4系列,号称"原生多模态+千万级上下文窗口",并首次披露单卡H100可运行的轻量化版本。此前OpenAI则宣布O3 和O4-mini模型即将在几周内上线,同时确认GPT-5因技术整合和算力部署问题推迟数月。 DeepSeek则与清华大学的研究团队本周联合发布了一篇关于推理时Scaling的新论文,提出了一种名为自我原则点评调优(SPCT)的学习方法,并 构建了DeepSeek-GRM系列模型。结合元奖励模型实现推理时扩展,性能接近671B大模型,暗示DeepSeek R2临近。 Meta强势推出Llama 4,多模态与超长上下文成亮点 周六,Meta正式发布了Llama 4系列模型,Llama 4全系采用混合专家(MoE)架构,并实现了原生多模态训练,彻底告别了Llama 3纯文本模型 的时代。此次发布的模型包括: 此次公布的Llama 4 Maverick 和 Llama 4 Scout 将是开源软件。然而,Llama 4 的新许可证对使用有一定限制,例如月活用户超 ...
击败DeepSeek V3?Meta强势炸场,史上最强Llama 4开源!
Ge Long Hui· 2025-04-06 06:22
自DeepSeek掀起新一轮大模型热潮来,全球科技巨头的AI军备赛依旧热火朝天。 经历多次延期之后,Meta最新推出超级"王炸",开源王座一夜易主。 Llama 4 系列登场 当地时间周六(4月5日),Meta推出了其最强大的开源AI大模型Llama 4。 据Meta介绍,Llama4 是多模态大模型,能处理整合多种数据,能在不同格式间实现内容转换。 其中,Scout最高支持1000万上下文的输入(10 M Context),击败了OpenAI的模型。 在广泛基准测试中,Scout分数超过Gemma 3、Gemini 2.0 Flash-Lite、Mistral 3.1。 Maverick则仅用一半参数,使其推理编码能力与DeepSeek-v3-0324实力相当。 另外在编程、推理、多语言、长上下文和图像基准测试中碾压了GPT-4o和Gemini 2.0等同类模型。 即日起,用户可从llama.com和Hugging Face可下载Llama 4 Scout和Llama 4 Maverick模型。 这些模型很快也将在主流云和数据平台、边缘芯片和全球服务集成商上提供。 2万亿巨无霸将至 该系列首次采用混合专家( ...
DeepSeek消费电子行业大模型新型应用最佳实践分享
Sou Hu Cai Jing· 2025-04-05 03:31
Core Insights - The report focuses on the application of large models in the consumer electronics industry, highlighting the industry ecosystem, advantages of the DeepSeek model, capabilities of Tencent Cloud's TI platform, and practical applications of large model development platforms [1][2]. Group 1: Large Model Industry Ecosystem - The large model industry chain consists of four levels: self-developed model structures by companies like Google and Microsoft, pre-trained model developers such as Huawei Cloud and Zhipu AI, and enterprises like Changan Automobile and Kingdee that fine-tune models based on data or directly call APIs [1][2]. - There is significant progress in open-source models, with a diverse range of domestic and international large models emerging [1]. Group 2: DeepSeek Model Advantages - The DeepSeek series has made notable achievements in natural language processing, with DeepSeek-V3 being a powerful mixed expert language model trained on 14.8 trillion high-quality tokens and possessing 671 billion parameters, excelling in knowledge tasks [4][5]. - DeepSeek-R1, based on DeepSeek-V3-Base, demonstrates outstanding performance in complex reasoning tasks such as mathematics and code generation [6]. Group 3: Tencent Cloud TI Platform Support - Tencent Cloud's TI platform provides comprehensive technical support from model development to application, including AI modeling deployment, large model fine-tuning, and data construction capabilities [2][10]. - The platform supports large-scale training with multiple machines and cards, automatic fault recovery, and offers various fine-tuning modes and inference acceleration capabilities [2][10]. Group 4: Large Model Application Development Platform Practices - Tencent Cloud's large model knowledge engine offers three application modes: standard mode for intelligent customer service, workflow mode for complex business scenarios, and agent mode for autonomous task planning and tool invocation [2][11]. - The platform facilitates rapid development and deployment of large model applications, catering to diverse developer needs and accelerating the application of large models across industries [2][11].
新凯来火爆出圈 中国半导体设备或迎DeepSeek时刻
Core Viewpoint - The domestic semiconductor equipment industry is experiencing a potential breakthrough moment, highlighted by the popularity of Shenzhen Xinkailai Industrial Machinery Co., Ltd. at the SEMICON China 2025 exhibition, where the company showcased 31 new devices covering the entire semiconductor manufacturing process [1][2]. Company Overview - Shenzhen Xinkailai was established in June 2022 and is backed by Shenzhen State-owned Assets Supervision and Administration Commission, indicating strong governmental support [2]. - The company has connections to Huawei, having originated from Huawei's "Starlight Engineering Division" [2][3]. - The core team of Xinkailai possesses over 20 years of experience in electronic equipment technology development [3]. Product Launch and Technology - Xinkailai unveiled a range of products including EPI, ALD, PVD, ETCH, CVD, and measurement equipment, which are critical to semiconductor manufacturing processes [4]. - The company claims that its equipment achieves 100% domestic control, with all core components being either self-developed or sourced through strategic partnerships [3][4]. - The new high-precision thin film deposition equipment is reported to have technology parameters close to international leading levels, aiming to disrupt the existing market dominated by international giants [4][5]. Market Trends - The semiconductor equipment market in China is projected to reach approximately 240 billion yuan in 2024, with significant revenue growth expected for several domestic companies [7][8]. - The overall domestic semiconductor equipment market is seeing accelerated product upgrades and increased domestic substitution, with the comprehensive domestic equipment localization rate expected to exceed 50% by 2025 [8]. - The demand for semiconductor equipment is anticipated to grow due to advancements in AI, IoT, and automotive electronics, contributing to a market recovery [8]. Industry Dynamics - The semiconductor equipment industry is entering a phase of consolidation and collaborative development, with companies pursuing mergers and acquisitions to enhance resource integration and competitiveness [9]. - Notable domestic players like North Huachuang are expected to see significant revenue growth, indicating a strengthening position in the global market [7][9].
专访世界经济论坛金融服务技术与创新主管德鲁・普罗普森:DeepSeek促进良性竞争,“算法透明度”是AI治理中的关键议题
Mei Ri Jing Ji Xin Wen· 2025-04-03 15:16
Core Insights - The rapid development of the technology industry necessitates innovative financial regulation, particularly in the context of AI applications in finance [1] - Mobile payments are emerging as a new paradigm for financial inclusion globally, but they also face significant risks [7] - Hong Kong is positioned as a key hub for technological innovation and experimentation, especially in the context of virtual asset legalization [10] Group 1: AI and Technology Development - DeepSeek is recognized for its rapid growth, promoting healthy competition and industry advancement at lower costs [2] - The importance of talent reserves in technology companies is emphasized, with a focus on establishing effective training mechanisms and resource integration [3][4] - The AI industry must address energy consumption issues and seek sustainable energy solutions to support technological applications [2] Group 2: Global Expansion and Communication - Effective global expansion relies on clear communication to ensure that the broader ecosystem understands the business [3] - Asian markets are highlighted for their advanced technology and emerging brands, providing opportunities for global outreach [3] Group 3: Algorithm Transparency and Regulation - Algorithm transparency is crucial for fair financial decision-making, impacting loan approvals and interest rates [5] - Regulatory frameworks should enforce algorithm transparency and encourage private sector compliance to build trust in the financial system [5][6] Group 4: Mobile Payments and Risks - Mobile payments have significantly contributed to reducing financial inclusion gaps, especially in remote areas [7] - Two major risks associated with mobile payments include infrastructure dependency and rising fraud incidents, with a noted 50% annual increase in fraud cases over the past five years [7] - The establishment of a favorable regulatory environment is essential to protect consumers and support the growth of mobile payments [8] Group 5: Hong Kong's Role in Technological Innovation - Hong Kong is seen as a "super connector" for technology and finance, facilitating innovation and sharing of experiences [10] - The regulatory approach should adapt to new technologies while maintaining existing frameworks to address emerging risks in the digital asset space [9]
《传奇世界》拥抱DeepSeek,对话盛趣游戏副总裁任霆:智能NPC只是开始,原生AI游戏才是未来
Mei Ri Jing Ji Xin Wen· 2025-04-03 12:37
Core Insights - The gaming industry is undergoing a significant transformation driven by the rapid development of AI technology, exemplified by the launch of the intelligent NPC "Xuan Xuan Lao Ren" in the game "Legendary World" [1][11] - The integration of AI in gaming is seen as a potential leading indicator for profitability in other sectors, as the gaming environment allows for higher tolerance for AI errors compared to real-world applications [2] Group 1: AI Integration in Gaming - The introduction of AI-driven NPCs changes the traditional interaction model, allowing for more dynamic and engaging player experiences [3][5] - The DeepSeek model was chosen for its strong text generation capabilities and open-source nature, facilitating easier deployment and optimization [5] - AI technology enhances player experience by allowing natural language queries for tasks and providing richer emotional expressions from NPCs [5][6] Group 2: Challenges and Solutions - The main challenge in integrating AI is ensuring content compliance while maintaining AI creativity, requiring a balance between real-time responsiveness and strict content control [5][8] - The company has developed a system combining RAG technology with a local knowledge base to ensure NPC responses remain within game boundaries [5] Group 3: Future Directions and Innovations - The company plans to expand the application of intelligent NPCs beyond information queries to include task and social systems, aiming for a more comprehensive and personalized player service [6][11] - AI is also being utilized in content production, significantly improving efficiency in art asset creation and automated testing processes [7] - The concept of "AI-native games" is being explored, characterized by dynamic content generation, intelligent interactions, and user-generated content [9][10] Group 4: Strategic Initiatives - In 2024, the company launched the sub-brand "Shu Long AI" and organized a global AI gaming and application innovation competition to further its AI initiatives [11] - The ongoing AI-driven revolution in gaming is transitioning from concept to reality, with the company's practices serving as a reference for the industry [11]
深度融合DeepSeek和多模态,百度文小言找到了自己的开放之道
Sou Hu Cai Jing· 2025-04-03 11:30
Core Insights - The rise of DeepSeek has prompted various AI model manufacturers to adapt their strategies, with companies like Tencent and Baidu embracing it, while others like ByteDance and Alibaba choose to compete directly [1][2] - Baidu's recent AI Day revealed an upgraded version of its AI assistant, Wen Xiaoyan, which integrates its own models with DeepSeek-R1 and other third-party models, enhancing user experience through automatic model selection and multi-modal capabilities [1][2] Group 1: Multi-Modal Capabilities - The upgraded Wen Xiaoyan can perform various tasks in one application, eliminating the need for users to switch between different AI models for different tasks [2][6] - The integration of Baidu's new models, Wenxin X1 and Wenxin 4.5, with DeepSeek-R1 allows for seamless switching and automatic mode selection, enhancing user interaction [6][7] - Wenxin X1 is designed for deep thinking and can autonomously call various tools for continuous tasks, while Wenxin 4.5 focuses on multi-modal interactions and understanding [6][7] Group 2: User-Centric Features - The new AI assistant can generate design ideas for home renovations by combining search and drawing tools, providing users with clear visual outputs and explanations [8][10] - A unique "problem-solving teacher" feature allows users to take pictures of homework questions, generating detailed explanations and video tutorials, enhancing the learning experience for children [10][11] - The upgraded voice model can recognize children's speech patterns and respond fluidly, accommodating interruptions and switching between various character voices and dialects [11] Group 3: Competitive Landscape - The competition in the AI model space is shifting from single-model capabilities to how effectively AI can meet diverse user needs [12] - Baidu's extensive experience in AI, particularly in Chinese language processing, provides a significant localization advantage over many foreign products [12] - The company's strategy of "model matrix + automatic scheduling + ecosystem openness" aims to create a sustainable competitive edge in the AI market [12]
学习“AI技术与DeepSeek运用” 赋能罗平民营经济发展
Sou Hu Cai Jing· 2025-04-03 10:28
Core Viewpoint - The event aims to promote the application of AI technology, specifically DeepSeek, to assist private enterprises in achieving digital transformation and seizing opportunities in AI development [1][2]. Group 1: Event Overview - The seminar was organized by the Luoping County Federation of Industry and Commerce in collaboration with the United Front Work Department, attracting over 200 representatives from various non-public economic organizations [2]. - The founder and chairman of Wanno Data Technology (Yunnan) Group, Liang Guanxing, delivered an in-depth presentation on the background, core functions, and diverse applications of DeepSeek technology [2]. Group 2: Key Insights from the Presentation - Liang Guanxing provided a comprehensive explanation of DeepSeek's capabilities in data mining, analysis, and prediction, showcasing its innovative applications in key sectors such as government services, education, healthcare, and industrial manufacturing [2]. - The discussion included a thorough analysis of the challenges faced by traditional enterprises and how DeepSeek technology can help overcome these obstacles for significant growth [2]. Group 3: Impact on Enterprises - Attendees expressed that the training enabled them to grasp the core application value of DeepSeek technology and provided practical pathways for driving intelligent transformation in their businesses [3]. - Small and medium-sized enterprises are encouraged to focus on their industry-specific data resources and collaborate with technology partners to develop lightweight, high-precision applications, prioritizing AI pilot projects that can yield quick results [3]. - The seminar not only created a platform for learning and exchange among private enterprises but also injected new vitality into the innovative development of the private economy in Luoping County [3].
一个基层民警的忠告:不要拿DeepSeek等AI工具刻舟求剑
Hu Xiu· 2025-04-03 07:31
Group 1 - The report highlights various community incidents in March 2025, including fraud, parking disputes, and other social issues, indicating a need for better community engagement and problem-solving strategies [2][3][23] - A notable fraud case involved an individual being scammed out of 10,000 yuan through a fake tax refund scheme, showcasing the adaptability of scammers to current events [4][6] - Parking violations and disputes are prevalent, with one individual reportedly evading parking fees 50 times, reflecting ongoing issues with enforcement and community accountability [7][8] Group 2 - Incidents of "picking up" or "scamming" in restaurants are reported, with one case involving a false claim of finding a bug in food, leading to a free meal for the scammer [16] - A significant case involved a community member being accused of "picking up" after a confrontation with security, highlighting the risks of aggressive behavior in public spaces [17][18] - The report discusses the challenges of managing public safety and community relations, particularly in light of recent changes to fire safety regulations [30][41] Group 3 - The report emphasizes the importance of community vigilance and the role of local law enforcement in addressing social issues, including the need for better communication and understanding between residents and police [3][41] - A specific incident involving a property dispute illustrates the complexities of landlord-tenant relationships and the potential for conflict when legal rights are not clearly understood [31][34] - The report concludes with a call for improved safety measures in public spaces, urging businesses to maintain operational surveillance systems to protect customers [45]