SmolLM3

Search documents
昆仑万维发布并开源Skywork-R1V 3.0版本;浙江大学发布高精准基因组设计AI模型丨AIGC日报
创业邦· 2025-07-10 00:00
2.【Hugging Face开源小参数模型SmolLM3】北京时间7月9日凌晨,Hugging Face首席执行官克莱门特·德朗 格(Clement Delangue)宣布,Hugging Face发布并开源小参数模型SmolLM3。拥有128k上下文窗口;支持 英语、法语、西班牙语、德语等6种语言;支持深度思考和非思考双推理模式。(财联社) 3.【浙江大学发布高精准基因组设计AI模型】浙江大学郭国骥教授团队开发出一款用于基因组预测设计的深度学 习AI模型"女娲CE",能够以超过90%的准确率预测基因组调控区域发生突变之后带来的表型变化,并结合疾病 表型设计出相应的治疗位点。相关成果已发表于国际学术期刊《细胞》。 (财联社) 4.【Hugging Face 桌面机器人 Reachy Mini 开订:长相呆萌,支持超 170 万个 AI 模型】据外媒TechCrunch 报道,Hugging Face旗下最新桌面机器人Reachy Mini的订单现已正式开放,开发者现在已可动手组装与测试。 Reachy Mini将推出两个版本。无线版名为Reachy Mini Wireless,内置Raspberry 5 微 ...
腾讯研究院AI速递 20250710
腾讯研究院· 2025-07-09 14:49
Group 1: Veo 3 Upgrade - The Google Veo 3 upgrade allows audio and video generation from a single image, maintaining high consistency across multiple angles [1] - The new feature is implemented through the Flow platform's "Frames to Video" option, enhancing camera movement capabilities, although the Gemini Veo3 entry is currently unavailable [1] - User tests indicate natural expressions and effective performances, marking a significant breakthrough in AI storytelling applicable in advertising and animation [1] Group 2: Hugging Face 3B Model - Hugging Face has released the open-source 3B parameter model SmolLM3, outperforming Llama-3.2-3B and Qwen2.5-3B, supporting a 128K context window and six languages [2] - The model features a dual-mode system allowing users to switch between deep thinking and non-thinking modes [2] - It employs a three-stage mixed training strategy, trained on 11.2 trillion tokens, with all technical details, including architecture and data mixing methods, made available [2] Group 3: Kunlun Wanwei Skywork-R1V 3.0 - Kunlun Wanwei has open-sourced the Skywork-R1V 3.0 multimodal model, achieving a score of 142 in high school mathematics and 76 in MMMU evaluation, surpassing some closed-source models [3] - The model utilizes a reinforcement learning strategy (GRPO) and key entropy-driven mechanisms, achieving high performance with only 12,000 supervised samples and 13,000 reinforcement learning samples [3] - It excels in physical reasoning, logical reasoning, and mathematical problem-solving, setting a new performance benchmark for open-source models and demonstrating cross-disciplinary generalization capabilities [3] Group 4: Vidu Q1 Video Creation - Vidu Q1's multi-reference video feature allows users to upload up to seven reference images, enabling strong character consistency and zero storyboard video generation [4] - Users can combine multiple subjects with simple prompts, with clarity upgraded to 1080P, and support for character material storage for repeated use [5] - Test results show it is suitable for creating multi-character animation trailers, supporting frame extraction and quality enhancement, reducing video production costs to less than 0.9 yuan per video [5] Group 5: VIVO BlueLM-2.5-3B Model - VIVO has launched the BlueLM-2.5-3B edge multimodal model, which excels in over 20 evaluations and supports GUI interface understanding [6] - The model allows flexible switching between long and short thinking modes, introducing a thinking budget control mechanism to optimize reasoning depth and computational cost [6] - It employs a sophisticated structure (ViT+Adapter+LLM) and a four-stage pre-training strategy, enhancing efficiency and mitigating the text capability forgetting issue in multimodal models [6] Group 6: DeepSeek-R1 System - The X-Masters system, developed by Shanghai Jiao Tong University and DeepMind Technology, has achieved a score of 32.1 in the "Human Last Exam" (HLE), surpassing OpenAI and Google [7] - The system is built on the DeepSeek-R1 model, enabling smooth transitions between internal reasoning and external tool usage, using code as an interactive language [7] - X-Masters employs a decentralized-stacked multi-agent workflow, enhancing reasoning breadth and depth through collaboration among solvers, critics, rewriters, and selectors, with the solution fully open-sourced [7] Group 7: Zhihui Jun's Acquisition - Zhihui Jun's Zhiyuan Robot has acquired control of the listed company Shuangwei New Materials for 2.1 billion yuan, aiming for a 63.62%-66.99% stake [8] - Following the acquisition, Shuangwei New Materials' stock resumed trading with a limit-up, reaching a market value of 3.77 billion yuan, with the actual controller changing to Zhiyuan CEO Deng Taihua and core team members including "Zhihui Jun" Peng Zhihui [8] - This acquisition, conducted through "agreement transfer + active invitation," is seen as a landmark case for new productivity enterprises in A-shares following the implementation of national policies [8] Group 8: AI Model Usage Trends - In the first half of 2025, the Gemini series models captured nearly half of the large model API market, with Google leading at 43.1%, followed by DeepSeek and Anthropic at 19.6% and 18.4% respectively [9] - DeepSeek V3 has maintained a high user retention rate since its launch, ranking among the top five in usage, while OpenAI's model usage has fluctuated significantly [9] - The competitive landscape shows differentiation: Claude-Sonnet-4 leads in programming (44.5%), Gemini-2.0-Flash excels in translation, GPT-4o leads in marketing (32.5%), and role-playing remains highly fragmented [9] Group 9: AI User Trends - A report by Menlo Ventures indicates that there are 1.8 billion AI users globally, with a low paid user rate of only 3%, and a high student usage rate of 85%, while parents are becoming heavy users [10] - AI is primarily used for email writing (19%), researching topics of interest (18%), and managing to-do lists (18%), with no single task dependency exceeding one-fifth [10] - The next 18-24 months are expected to see six major trends in AI: rise of vertical tools, complete process automation, multi-person collaboration, explosion of voice AI, physical AI in households, and diversification of business models [10]
AI日报丨五大投行集体唱多美股!“科技七巨头”扛起盈利大旗
美股研究社· 2025-07-09 11:25
在这个快速变化的时代,人工智能技术正以前所未有的速度发展,带来了广泛的机会 。 《AI日 报 》致力于挖掘和分析最新的AI概念股公司和市场趋势,为您提供深度的行 业 洞察和价 值 分 析。 整理 | 美股研究社 A I 快 报 1. 今天凌晨,全球著名大模型开放平台Hugging Face开源了 ,顶级小参数模型SmolLM3。 SmolLM3只有30亿参数,性能却大幅度超过了Llama-3.2-3B 、Qwen2.5-3B等同类开源模型。 拥有128k上下文窗口,支持英语、法语、西班牙语、德语等6种语言。支持深度思考和非思考双 推理模式,用户可以灵活切换。 2. 周二(7月8日),美国科技股七巨头(Magnificent 7)指数跌0.07%,报173.55点。 特斯拉反弹1.32%——马斯克创立"美国党"后该公司市值在周一蒸发680亿美元,英伟达涨 1.12%,Meta Platforms和被扎克伯格挖走AI模型高管的苹果至多涨0.32%,微软则收跌 0.22%,谷歌A跌1.37%,亚马逊在Prime Day会员日跌1.84%。 此外,AMD收涨2.24%,礼来制药涨0.62%,巴菲特旗下伯克希尔哈撒韦B ...