Workflow
量子位
icon
Search documents
“最美PM”宋紫薇获红杉蚂蚁投资,创业方向略有调整,转向AI护肤
量子位· 2026-03-09 06:05
Core Viewpoint - The article discusses the recent funding round for the AI hardware company "Wei Guang Dian Liang," founded by Song Ziwei, highlighting the interest from top investors and the company's strategic pivot towards AI skincare products [1][2][16]. Group 1: Company Overview - Wei Guang Dian Liang was founded in 2024 by Song Ziwei, focusing on AI Agent as the core of its innovative smart hardware [5]. - The company has recently increased its registered capital from 1.25 million to approximately 1.73 million [3]. - The first product aimed at young consumers is an AI makeup mirror, which has now shifted focus to AI skincare [19][20]. Group 2: Funding Details - The latest funding round included investors such as Sequoia China, Ant Group, and BlueRun Ventures, although the exact amount remains undisclosed [2][4][8]. - Reports indicate that the funding amount could range from several tens of millions to over 100 million [4][10]. - The ambiguity surrounding the funding round's classification (Angel+, Pre-A, A round) and amount reflects varying reports from different sources [9][10]. Group 3: Strategic Direction - The company has adjusted its focus from an interactive makeup mirror to AI skincare, indicating a strategic shift to address core consumer needs [20][21]. - This pivot suggests a strategy to avoid competition in saturated markets like smartphones and instead find new entry points in essential consumer products [21]. - The approach aims to leverage aesthetic value to counteract hardware commoditization [24]. Group 4: Background of the Founder - Song Ziwei, born in 1994, graduated from Shanghai University with a degree in physics and has a background in project management at Huawei and product management at Vivo [25][26][27]. - She gained popularity as the "most beautiful product manager" during her time at Vivo, where she played a significant role in product launches [27][29]. - After leaving Vivo, she made a brief appearance at Li Auto before establishing her own company, hinting at her transition into the AI smart hardware sector [32][37].
卡帕西开源Agent自进化训练框架,5分钟一轮实验,48h内揽星9.5k
量子位· 2026-03-09 06:05
Core Insights - The article discusses a new open-source project called autoresearch, developed by Karpathy, which enables AI to autonomously conduct research by following instructions written in Markdown documents [2][5]. - The framework is designed to be lightweight, consisting of only 630 lines of code, and can run on a single GPU [3][16]. - The project has gained significant attention, achieving over 9.5k stars on GitHub within two days of release [6]. Project Overview - Autoresearch automates the AI training loop, allowing the AI to modify code, run short experiments lasting five minutes, and evaluate results to determine the next steps [13][14]. - The system operates under two main rules: each experiment has a fixed training time of five minutes, and it evaluates based on the val_bpb metric, where lower values indicate better model performance [15]. Technical Structure - The project relies on three core files: prepare.py for setting constants and tools, train.py for AI modifications, and program.md for human-written instructions [20][24]. - The AI modifies train.py based on instructions from program.md, runs experiments, and decides whether to keep or discard changes based on performance metrics [30][32]. Future Aspirations - Karpathy envisions a future where thousands of AI agents collaborate asynchronously across numerous branches, enhancing research efficiency through collective intelligence [5][35]. - He draws parallels to the SETI@home project, aiming to create a decentralized, distributed exploration model for AI research [38][41]. Research Methodology - The autoresearch process involves AI iterating through modifications, training, evaluation, and decision-making, achieving a high efficiency that surpasses human capabilities [29][32]. - The project aims to shift the research paradigm from a linear, centralized approach to a more flexible, experience-based model that accommodates diverse research paths [49].
龙虾最大痛点被官方插件升级!对话永不忘记,GPT和Gemini最强模型都可接入
量子位· 2026-03-09 04:13
henry 发自 凹非寺 量子位 | 公众号 QbitAI 报!龙虾更新了! 刚刚,新的OpenClaw测试版(2026.3.7)已经推出,并光速上线OA两家最新模型 GPT-5.4 和 Gemini Flash 3.1 。 与此同时,一并更新的还有: ACP绑定在重启后依然可保留 精简版Docker多阶段构建 用于网关认证的SecretRef 可插拔的上下文引擎 支持HEIF图像格式 修复Zalo渠道问题 其中, 可插拔的上下文引擎(pluggable context engine) 可谓是这次更新的重点,不少网友纷纷表示: 相比于跑那个模型,上下文才是关键。 那么,这个上下文插件是怎么一回事?有啥用? 上下文管理插件化 总体来看,这次OpenClaw更新可以归纳为三个方面: 上下文管理插件化 、 Agent路由能力升级 (频道、topic、独立 session)以及 部署 与插件工程化 (Docker multi-stage、SecretRef、安全策略)。 其中,最值得关注的,就是 上下文管理插件化 。 根据官方的changelog,这次更新新增了ContextEngine插件插槽。 细心的你甚至还发现 ...
龙虾最佳适配模型,OpenClaw之父给出了推荐
量子位· 2026-03-09 04:13
Core Insights - The article discusses the rising popularity of lobster-related AI models and the challenges in selecting the most suitable model for OpenClaw, with a recommendation to refer to the PinchBench ranking system [1][3]. Group 1: PinchBench Overview - PinchBench is a benchmark specifically designed for evaluating AI models based on their success rate, speed, and cost, providing real-time updates [3][6]. - The benchmark has gained traction since its introduction in February, particularly due to the impressive performance of Chinese models [3][20]. - The ranking highlights that Chinese models excel in success rate and speed, although they lag behind in pricing compared to models from OpenAI and Google [7][15]. Group 2: Model Performance - The top three models in terms of success rate are: 1. Google Gemini 3 Flash with a success rate of 95.1% 2. MiniMax M2.1 with a success rate of 93.6% 3. Kimi K2.5 with a success rate of 93.4% [11]. - In terms of speed, MiniMax M2.5 outperformed other models, achieving the fastest completion time of 105.96 seconds [12][10]. - However, in pricing, the cheapest model from OpenAI, GPT-5-nano, offers significantly lower costs compared to the MiniMax models, with input prices at $0.05 per million tokens versus MiniMax M2.1's $2.1 [15][17]. Group 3: Evaluation Methodology - PinchBench employs a combination of automated checks and LLM evaluations to assess model performance across various real-world tasks, focusing on the ability to complete entire workflows rather than just answering questions [25][29]. - The benchmark includes 23 real tasks across categories such as productivity, research, writing, coding, analysis, email management, memory, and skills [26][28]. - The results indicate that larger models do not always outperform smaller, more efficient models, which has sparked discussions within the community [31][32].
一年一度最值得关注的AI榜单来啦!申报即日启动
量子位· 2026-03-09 04:13
Core Insights - The article discusses the transition of generative AI in China from a "new technology" to a "new tool" and now to a reality that businesses must confront, impacting various aspects such as content production, R&D efficiency, marketing methods, team collaboration, and decision-making processes [1] Group 1: Event Overview - The Fourth China AIGC Industry Summit will take place in May 2026, where Quantum Bit will announce the results of its evaluation of generative AI companies and products based on their performance and feedback over the past year [1][2] - The summit aims to invite millions of industry practitioners to witness the recognition of outstanding companies [2] Group 2: Evaluation Criteria for AIGC Companies - Companies eligible for evaluation must be based in China or have their main business operations in China, focus on generative AI or have widely applied AI in their core business, and have shown outstanding performance in technology/products and commercialization over the past year [7] - The evaluation will consider innovation, forward-looking potential, and scalability of the AI companies [4] Group 3: Evaluation Criteria for AIGC Products - Products eligible for evaluation must be based on generative AI capabilities, have mature technology already in the market with a certain user scale, and have significant technological innovations or functional iterations in the past year that impact the industry [13] - Evaluation dimensions for products include technical strength, innovation, market performance, and future potential [12] Group 4: Registration Information - Registration for the evaluation is open now and will close on April 27, with final results announced at the May summit [14] - Companies can register through a provided link or contact Quantum Bit staff for inquiries [14][16] Group 5: Summit Theme and Goals - The theme of the 2026 China AIGC Industry Summit is "Everyone, Let's AI Now," focusing on how to effectively utilize AI [17] - The summit aims to engage AI entrepreneurs, developers, and experienced players to clarify and implement AI, encouraging broader participation in AI technology [17]
打败GPT-5.2,嵌入真实工业生产,这个大模型什么来头?
量子位· 2026-03-09 04:13
Core Viewpoint - The article discusses the performance of various AI models in industrial practice exams, highlighting the limitations of general-purpose models in real industrial contexts and the superiority of IndustryGPT from Simo Technology in specialized industrial applications [2][4][6]. Group 1: Industrial AI Examination Results - A series of three industrial practice exams revealed that even top models like GPT-5.2 Thinking (high) and Gemini-3.1-Pro struggled in real industrial engineering contexts [2][4]. - IndustryGPT outperformed these general models in all three exams, demonstrating its capability in industrial knowledge breadth and depth [3][11]. - The exams highlighted the structural differences in AI requirements between general and industrial scenarios, emphasizing the need for compliance, rigor, and reliability in industrial applications [26][39]. Group 2: Assessment Methodology - The first exam assessed the breadth of industrial knowledge using the SuperGPQA dataset, where IndustryGPT achieved state-of-the-art (SOTA) results [9][11]. - The second exam focused on the depth of industrial knowledge, with IndustryGPT leading significantly, especially in high-difficulty questions, achieving over a 20% relative performance improvement [14][18]. - The third exam evaluated practical decision-making capabilities, aligning with professional qualification standards, where IndustryGPT again demonstrated superior performance in regulatory compliance and complex decision-making [20][24]. Group 3: Industrial AI Requirements - The article identifies three core capabilities that industrial AI must possess: boundary control, compliance with regulations, and task execution [39][40][42]. - IndustryGPT's training paradigm emphasizes these capabilities, ensuring that the model operates within safety boundaries and adheres to strict industrial standards [41][44]. - The discussion contrasts two main approaches to industrial AI: general models with industry fine-tuning versus native industrial models like IndustryGPT, which are designed from the ground up to meet industrial needs [46][49]. Group 4: Practical Applications and Impact - IndustryGPT has been successfully integrated into various industrial scenarios, significantly improving efficiency and reducing risks in processes such as quality inspection and complex production line management [28][29][36]. - The model's ability to automate the generation of manufacturing plans and manage complex production environments demonstrates its practical value in real-world applications [32][34][36]. - The article concludes that the true measure of AI in manufacturing is not just intelligence but its ability to be effectively implemented in production environments [53][54].
科研AI出了个狠角色:开源30B小模型,硬刚Gemini和Claude
量子位· 2026-03-09 02:01
Core Viewpoint - The article discusses the capabilities of the UniScientist model developed by UniPat AI, emphasizing its ability to conduct autonomous scientific research despite having only 30 billion parameters, outperforming larger closed-source models in various scientific benchmarks [2][3][36]. Group 1: Model Capabilities - UniScientist can autonomously propose hypotheses, collect evidence, execute reproducible deductions, and iteratively validate until conclusions are established [2][10]. - The model addresses the limitations of existing AI in scientific research, which often only mimic the appearance of research without true validation or reproducibility [7][8]. - It integrates a dynamic system approach to scientific research, allowing for continuous evolution of evidence states and hypothesis refinement [17][20]. Group 2: Data Engine and Research Process - The data engine of UniScientist is designed to balance the scale and diversity of data generated by the model with the quality and verifiability provided by human experts [12][16]. - The model's research process is formalized into a series of verifiable unit tests, breaking down open scientific questions into independent, verifiable rubric items [24][25]. - The dataset includes over 4,700 research-grade instances, covering more than 50 disciplines and 400 research directions, with each instance validated by experts [26][30]. Group 3: Performance and Benchmarking - UniScientist achieved a score of 28.3 on the FrontierScience-Research benchmark, surpassing several larger models, and reached a score of 33.3 in the results aggregation mode [36][37]. - The model's performance indicates that it has learned to integrate retrieval, deduction, validation, and writing into a coherent research workflow [42]. - Even without tools, the model demonstrated significant performance improvements, suggesting enhanced research reasoning capabilities through training [40][41]. Group 4: Future Directions - The next steps for UniScientist involve expanding its capabilities to include real-world experimental resources and computational infrastructure for controlled orchestration and execution [47]. - The integration of a code interpreter aims to transition the research process from narrative reasoning to a "test-correct" cycle, allowing hypotheses to be instantiated as computational experiments [44][45].
暴雪皮克斯老兵的AI社交实验:用声音匹配,MAU破260万,估值1.5亿美金
量子位· 2026-03-09 02:01
Core Viewpoint - The article discusses how AI is undermining the credibility of social interactions, leading to a new product, Gensen, which focuses on voice as a genuine signal for social connections in a world where visual and textual representations can be easily manipulated [4][11][12]. Group 1: The Problem with Current Social Media - The assumption that user-provided information is trustworthy is collapsing due to AI's ability to create fake images, videos, and texts [10][11]. - Users are experiencing fatigue from "performative socializing," where they feel pressured to curate an idealized self-image rather than engage authentically [28][29]. - Young people prefer genuine interactions over "performative" ones, as evidenced by their willingness to engage in offline games where they can be themselves [30][31]. Group 2: Gensen's Unique Approach - Gensen, a voice-based social product, has gained significant traction, reaching a peak of 2.6 million monthly active users (MAU) and ranking 17th on the iOS social charts during the Spring Festival [13][46]. - The founder, Li Zheyue, emphasizes that voice is the last real signal that cannot be easily faked by AI, making it a more reliable medium for social interaction [6][22]. - Gensen's philosophy is to create real interactions through games, using voice as a behavioral signal and AI to model personality traits [21][41]. Group 3: AI's Role in Gensen - Unlike other products that use AI to enhance user presentation, Gensen employs AI to improve interaction efficiency and match users based on their voice characteristics [38][40]. - The system analyzes anonymized voice features during gameplay to optimize user matching without requiring complex questionnaires [40][41]. - Gensen's approach allows for a natural display of users' personalities through gameplay, making it easier for them to connect with like-minded individuals [44][48]. Group 4: Market Potential and Growth - Gensen has raised over $45 million in funding, with a valuation of $150 million, indicating strong investor interest from firms like A16Z and Tencent [46]. - The product's design encourages social interactions that transition from weak to strong relationships, leveraging voice and gaming to foster genuine connections [48][49]. - The article concludes that as AI excels in content generation, understanding real human behavior through voice may become a key capability for future social products [51][52].
20岁大学生花10天VibeCoding一个开源项目,获盛大3000万投资
量子位· 2026-03-08 06:45
Core Viewpoint - MiroFish is an AI prediction engine that constructs a high-fidelity parallel digital world by extracting real-world seed information, allowing for dynamic variable injection and precise future projections [3][9]. Group 1: Project Overview - MiroFish has rapidly gained popularity, reaching over 5.7k stars on GitHub since late January [1]. - The project is a continuation of the previous work, BettaFish, which focused on public opinion analysis [9]. - MiroFish aims to create a closed-loop system from raw data to intelligent decision-making through multi-agent simulations [9]. Group 2: Functionality and Use Cases - MiroFish can simulate and predict major social events, analyze corporate strategies, and even explore complex literary character relationships [6]. - The system can generate detailed character relationship graphs and simulate interactions among agents based on the input data [14][20]. - An example case demonstrated the prediction of the lost ending of "Dream of the Red Chamber" using the first 80 chapters as input [11][30]. Group 3: Investment and Support - The project has received significant backing, including a 30 million RMB investment from the founder of Shengda Group, Chen Tianqiao, after only 10 days of development [8][54]. - The author, BaiFu, has garnered attention from major companies and investors following the success of BettaFish [36][51]. Group 4: Development Insights - BaiFu emphasizes the importance of market research and technical selection before coding, advocating for a structured approach to project development [47]. - The development process involves using AI to assist in various tasks, enhancing efficiency through parallel agent collaboration [41][46]. - The author notes the necessity of deep human-AI collaboration and code review to ensure project quality and coherence [46].
高中生AI创业,现在只招龙虾员工:每月成本2800
量子位· 2026-03-08 06:45
Core Viewpoint - The article discusses a unique business model where a company operates entirely with AI, without any human employees, showcasing how low-cost entrepreneurship can be effectively achieved through AI technology. Group 1: Company Structure and Operations - The company operates with a monthly cost of $400, acquiring over 450 paying users [2][8]. - It utilizes a complete organizational structure with various departments including design, development, research, content, and operations, all managed by AI [5][6]. - The main operational brain is an AI named Jarvis, which automates task allocation among different AI employees without human intervention [12][13]. Group 2: AI Utilization and Efficiency - The research department, led by Atlas, conducts deep research using multiple APIs to compile industry reports [15]. - The content team, consisting of Scribe and Trendy, produces high-quality articles and tracks trending topics to ensure timely content creation [16][17]. - The design department handles all visual needs with specialized AI tools for static images, videos, and animations [19][20]. Group 3: Development and Quality Assurance - The development and quality assurance are managed by Clawed and Sentinel, which review and optimize code regularly [21][22]. - Clawed reviews the codebase nightly and can initiate multiple AI to collaborate on development tasks [23]. - Sentinel performs quality checks every two hours to monitor code vulnerabilities [24]. Group 4: Entrepreneurial Background and Management - The founder of the company has no coding background and initially had limited knowledge of technology [26][27]. - The entrepreneur effectively communicates with AI through well-crafted prompts, establishing clear work standards and collaboration logic [29][31]. - The company aims to hire efficient managers with their own AI teams rather than traditional developers in the future [34].