SmallThinker

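Ten-billion-parameter models run smoothly on hardware costing a few hundred yuan! Shanghai Jiao Tong University and Zenergize AI (本智激活) open-source an on-device-native large model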
QbitAI (量子位) · 2025-07-27 09:01
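The next battleground for AI is not in the cloud but in your pocket. iPhone, Huawei, Samsung, Xiaomi, OPPO, and nearly every other major phone maker are working to put large models into handsets, and on-device AI has become fiercely contested ground. The logic is clear and firm: the AI that knows you best must be able to access your personal data safely (emails, photos, schedules), and the precondition for all of this is keeping computation local and returning privacy to the user.

Getting AI to run smoothly on a local device, however, is far harder than it sounds. The best evidence is Apple: despite deep pockets and tight hardware-software integration, its ambitious Apple Intelligence plan did not arrive on schedule, and its core AI features have been pushed back to next year. This sends the whole industry a clear signal: on-device AI is a tough nut to crack.

While global tech giants struggle along the road to on-device AI, an emerging force built on close collaboration between industry, academia, and research has offered its own route. Today, the IPADS institute and the School of Artificial Intelligence at Shanghai Jiao Tong University, together with the startup Zenergize AI (本智激活), open-sourced the on-device-native large model SmallThinker on HuggingFace. The model series uses an architecture designed natively for the compute, memory, and storage characteristics of on-device hardware, and is pretrained from scratch. It comprises sparse models in two sizes: SmallThinker-4B- ...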
Zenergize AI (本智激活) completes a seed round worth tens of millions of yuan to accelerate the full rollout of on-device AI
TMTPost App (钛媒体APP) · 2025-07-24 02:31
Core Insights
- Zenergize AI (本智激活, "BenZhi Activation") is a startup incubated at Shanghai Jiao Tong University's Institute of Parallel and Distributed Systems (IPADS), which is known for its expertise in operating systems and distributed systems and has ranked first globally in CSRankings for the past decade [2]
- The team, led by CEO Mi Zeyu, focuses on edge-native AI and aims to reshape personal AI by addressing the privacy concerns, high costs, and lack of personalization of cloud-based AI models [2][3]

Group 1: Technological Innovations
- The company proposes an "edge-native" full-stack design that rebuilds the software and hardware stack from the ground up, aiming for on-device intelligence without sacrificing model capability [3]
- The team reports significant breakthroughs in edge model algorithms and infrastructure, including PowerInfer, an edge model infrastructure system that runs a hundred-billion-parameter-scale model efficiently on a consumer-grade NVIDIA RTX 4090 GPU, reaching 90% of the performance of a data-center-class A100 GPU [4]
- PowerInfer-2, released in June 2024, runs a 47-billion-parameter model smoothly on smartphones, reportedly outperforming international baselines by a factor of 29 [4][5]

Group 2: Market Impact and Future Prospects
- The first batch of edge-native models will be released and open-sourced on July 26, 2025, featuring original architectures designed specifically for edge devices so that they run smoothly on budget hardware [5]
- Industry experts point to growing demand for privacy protection and low latency, which positions edge intelligence as a key entry point connecting the virtual and physical worlds; they see Zenergize AI as a leader in low-cost, efficient deployment of large models on mainstream devices [6]
- The company is described as one of the few worldwide with both top-tier R&D capabilities and mass-production experience in edge AI, which suggests strong potential for growth and innovation in the sector [6]