Founder Park

Search documents
阶跃星辰发布新一代基模 Step 3,原生多模态推理模型,性能达到开源 SOTA
Founder Park· 2025-07-26 04:53
Core Viewpoint - The article discusses the launch of Step 3, a new generation foundational model by the company, aimed at enhancing intelligent applications and efficiency in the reasoning era, emphasizing the importance of meeting customer needs and real-world application scenarios [3][6]. Group 1: Step 3 Model Overview - Step 3 is positioned as the primary foundational model, designed for global enterprises and developers, and will be open-sourced on July 31 [3][20]. - The model features a total parameter count of 321 billion, with 38 billion active parameters, showcasing strong visual perception and complex reasoning capabilities [9]. - Step 3 aims to balance performance and cost, achieving state-of-the-art (SOTA) results in open-source multi-modal reasoning tasks [9][18]. Group 2: Technological Innovations - The model employs a Mixture of Experts (MoE) architecture, which allows for significant performance improvements while maintaining low operational costs [9][18]. - Step 3 has demonstrated a decoding efficiency that can reach up to 300% on domestic chips compared to previous models, and over 70% improvement in throughput on NVIDIA Hopper architecture [18][20]. Group 3: Industry Collaboration - The company has initiated the "MoXin Ecological Innovation Alliance" with leading chip and platform manufacturers to foster joint innovation across the model and chip industry [5][22]. - A strategic partnership with Shanghai State-owned Capital Investment Co., Ltd. has been established to enhance capital linkage and ecological business cooperation [5][22]. Group 4: Application and Market Focus - The company is focusing on key application scenarios such as automotive, mobile phones, and IoT devices, with significant collaborations with major domestic smartphone manufacturers and the automotive industry [23]. - The company aims to create scenario-based applications in vertical industries, collaborating with leading firms in finance, content creation, and retail [23].
怎么从 ChatGPT 拿流量?送上这九条实用建议
Founder Park· 2025-07-25 13:38
Core Insights - The article emphasizes the importance of Answer Engine Optimization (AEO) as a new growth path for brands and applications in the context of AI search engines, highlighting the shift from traditional SEO to AI-driven strategies [1][3][30] Group 1: AEO Strategies - Brands need to identify relevant question scenarios to optimize their presence in AI search engines, focusing on question frequency and alignment with product differentiation [5][6] - AEO strategies should be tailored to the specific platforms used by the target audience, such as ChatGPT or Google AI Overviews, rather than applying a one-size-fits-all approach [11] - Creating professional and specific content is crucial for being referenced in key questions, with a focus on providing authoritative and targeted answers [12] Group 2: Content Structure and Engagement - Content that is structured clearly, such as comparison lists, is favored by AI question engines, with about one-third of citations coming from comparative content [21][22] - Visual content is less effective for AI engines, which struggle to interpret images; structured tables are recommended instead [24][25] - Brands should focus on brand visibility metrics rather than just click-through rates, as many AI searches result in "zero clicks" [26] Group 3: Monitoring and Adaptation - The phenomenon of "citation drift" is significant, with nearly 50% of citation domains changing within a month, necessitating regular updates to keyword strategies [27][28] - Brands must remain sensitive to changes in citation dynamics and adjust their strategies accordingly to maintain relevance in AI search results [28][29]
万字对谈 Physical Intelligence(π):具身智能的卡点和下一步突破,到底在哪?
Founder Park· 2025-07-25 13:38
Core Insights - The current bottleneck in embodied intelligence is not hardware but the intelligent software that enables autonomous decision-making in robots [6][20][60] - The company has made significant progress in two of the three critical areas: capability and generalization, while performance remains the main challenge [6][10][28] - The general public tends to underestimate the value of universal robot foundational models, which could fundamentally change perceptions of intelligence in the physical world [52][60] Group 1: Current State of Embodied Intelligence - The company has released the π0.5 model, which enhances robots' ability to perform complex tasks in unfamiliar environments, demonstrating significant advancements in adaptability and generalization [6][9] - The primary challenges in achieving embodied intelligence are the ability to perform complex tasks, generalization to unknown environments, and high reliability in performance [6][8][10] - Robots are now capable of self-correcting and demonstrating resilience in task execution, which is a departure from previous models that required precise actions [13][14] Group 2: Comparison with Autonomous Driving - The challenges faced by robots in physical interaction with objects are fundamentally different from those encountered in autonomous driving, as robots must physically manipulate objects [14][15] - Both fields face similar long-tail performance challenges, where achieving high reliability requires handling numerous rare events [15] - The development trajectory of robotics may mirror that of autonomous driving, with potential breakthroughs occurring unexpectedly after prolonged periods of slow progress [15][26] Group 3: Data and Model Training - The company emphasizes the importance of collecting the right data rather than just a large quantity, as poor data can hinder model performance [16][35] - The current training approach involves using a combination of pre-trained visual language models and robot-specific data to enhance generalization without losing foundational capabilities [42][44] - The company is exploring methods to speed up training and inference processes, which are critical for efficient model deployment [45][46] Group 4: Future Predictions and Industry Outlook - The timeline for widespread deployment of robots capable of performing complex household tasks is estimated to be within the next 5 to 10 years, contingent on continued advancements [55][56] - The potential for a future where robots can be easily programmed or guided by users, akin to "vibe coding," is seen as a transformative shift in how robots will integrate into daily life [56][60] - The company believes that open-sourcing their models and findings is crucial for collaborative progress in the field, as collective efforts are necessary to overcome existing challenges [60]
保姆级教程:讲真,从零开始创办一家 AI 初创公司,要怎么做?
Founder Park· 2025-07-24 08:28
Core Viewpoint - The article provides a comprehensive guide for individuals interested in starting an AI startup, detailing the entire process from preparation, team building, to fundraising, based on the author's extensive experience in entrepreneurship and investment [1][5]. Group 1: Startup Preparation - It is advisable not to rush into quitting a stable job; instead, consider working while starting the business to ease the transition [6]. - Automating daily work can free up time for entrepreneurial activities, allowing for a smoother startup process [6]. - Achieving financial independence is prioritized over securing large investments; profitability is deemed more important than high valuations [3][6]. Group 2: Building a Personal Brand - Developing a personal brand is essential; engaging in open-source projects and publishing creative content can enhance visibility [7]. - Collaborating with reputable publishers or speaking at industry conferences can help in building a professional network and generating income [7]. Group 3: Team Formation - Finding the right partners is crucial; the company should focus on assembling a skilled team rather than following traditional hiring processes [8][9]. - Remote collaboration can reduce operational costs significantly, with a focus on a lean structure to minimize the burn rate [9][10]. Group 4: Financial Management - Every dollar spent should be strategically allocated; avoiding unnecessary expenses on incubators or paid introductions is recommended [10][11]. - It is suggested to maintain a low-cost lifestyle and explore alternative income sources to support the startup financially [12]. Group 5: Networking and Fundraising - Building a professional network is vital for resource expansion; leveraging platforms like LinkedIn can facilitate connections [14]. - Preparing for a potential two to three-year period without external funding is essential; self-sustainability should be a priority [15][16]. Group 6: Risk Management - Entrepreneurs should possess a high "risk IQ," enabling them to navigate uncertainties and maintain focus on long-term goals [17].
面向 AI Agent 的搜索服务,小宿科技有机会成为百亿美金的新巨头吗?
Founder Park· 2025-07-24 08:28
Core Viewpoint - The article discusses the evolving landscape of AI search services, particularly in light of Microsoft's decision to discontinue the Bing Search API, which has created a significant market opportunity for new players like Xiaosu Technology [1][3][22]. Group 1: Market Dynamics - Microsoft's withdrawal from the Bing Search API exposes a substantial market gap, prompting clients who relied on this service to seek alternatives, thus accelerating market share transfer to other service providers [3][4]. - The AI search market is likened to the early days of cloud computing, where large companies focused on consumer-facing services, inadvertently creating opportunities for smaller firms like Xiaosu Technology [8][9]. Group 2: Xiaosu Technology's Strategy - Xiaosu Technology has achieved an annual recurring revenue (ARR) of $25 million within months, indicating strong market traction [2]. - The company emphasizes three core capabilities: global service capacity, alignment with AI agent needs, and comprehensive expansion capabilities, which differentiate it from competitors [9][10][14]. - Xiaosu's intelligent search service covers over half of the leading AI native applications in China, showcasing its market penetration [15][22]. Group 3: Competitive Landscape - The competitive landscape post-Bing's exit includes both overseas and domestic players, with many lacking the necessary language capabilities or comprehensive service offerings to meet market demands [14][16]. - Xiaosu's competitive edge lies in its talent pool, which includes experienced professionals from early search companies, and its robust distributed infrastructure that supports low-latency global services [14][20]. Group 4: Future Outlook - The article suggests that the competition for AI infrastructure is just beginning, with a shift from consumer traffic battles to B2B foundational service contests [22][23]. - As AI evolves, the demand for real-time data and model invocation will become more complex, indicating a growing need for sophisticated search services tailored for AI applications [22].
ChatGPT Agent 团队专访:基模公司做通用 Agent,和 Manus 有什么不一样?
Founder Park· 2025-07-23 13:23
Core Insights - The article discusses the introduction of ChatGPT Agent by OpenAI, which combines deep research and operator capabilities to create a versatile agent capable of performing complex tasks without losing control over extended periods [1][6][13]. Group 1: ChatGPT Agent Overview - ChatGPT Agent is described as the first fully "embodied" agent on a computer, allowing seamless transitions between visual browsing, text analysis, and code execution [1][7]. - The agent can perform complex tasks for up to one hour without losing control, showcasing its advanced capabilities [13][19]. Group 2: Training Methodology - The training of ChatGPT Agent involved reinforcement learning (RL) where the model was given a variety of tools and allowed to discover optimal strategies independently [2][10]. - The agent utilizes a combination of a text browser and a graphical interface, enhancing its efficiency and flexibility in task execution [6][8]. Group 3: Functionality and Use Cases - ChatGPT Agent can handle various tasks, including deep research, online shopping, and creating presentations, making it suitable for both consumer and business applications [13][15]. - Users have reported practical applications such as data extraction from Google Docs and generating financial models, indicating its versatility [16][17]. Group 4: Future Developments - The team envisions continuous improvements in the agent's accuracy and capabilities, aiming to expand its functionality across a wide range of tasks [23][33]. - There is an emphasis on enhancing user interaction and exploring new paradigms for collaboration between users and the agent [34][36]. Group 5: Safety and Risk Management - The article highlights the increased risks associated with the agent's ability to interact with the real world, necessitating robust safety measures and ongoing monitoring [35][36]. - The development team is focused on creating a comprehensive safety framework to mitigate potential harmful actions by the agent [37][39].
小扎疯狂撬人,「HALO」正成为硅谷收购新形态
Founder Park· 2025-07-23 13:23
Core Viewpoint - The article discusses the emergence of a new transaction model in the AI industry known as HALO (Hire And License Out), which combines elements of hiring and licensing intellectual property, allowing startups to continue operating independently while providing financial returns to investors and employees [3][4]. Group 1: HALO Structure and Characteristics - HALO transactions are characterized by the hiring of a startup's core team while obtaining non-exclusive licensing of its intellectual property, resulting in substantial licensing fees distributed to investors and employees [3][6]. - The structure of HALO requires the startup, referred to as the "remaining company," to continue independent operations, which can lead to confusion about the nature of the transaction [6][9]. - HALO is seen as an evolution of the acquihire model, where the focus is on hiring talent rather than acquiring the company itself, allowing for higher premiums to be paid for strategic talent [10][11]. Group 2: Market Dynamics and Talent Value - The AI industry is witnessing a shift where talent is becoming more valuable than traditional assets, with companies willing to pay significant amounts to secure key personnel [4][17]. - The scarcity of experienced talent in AI is driving up competition, with valuations for newly established companies reaching billions, indicating a market belief in the value of human capital over technological assets [17][21]. - The article highlights that the current trend reflects a broader recognition that individuals may hold more value than the combined assets of a company, influencing investment strategies and valuations [21][22]. Group 3: Regulatory Environment and Future Implications - The rise of HALO is partly attributed to the increasingly politicized and uncertain nature of acquisition processes under current antitrust regulations, prompting companies to seek alternative methods for securing talent [13][14]. - HALO is positioned as a more honest representation of hiring practices, avoiding the complexities and risks associated with traditional acquisitions while still ensuring returns for stakeholders [14][15]. - The article suggests that while HALO is still in its infancy and may evolve, it represents a significant shift in how the industry values talent and structures transactions [23][28].
阿里开源最强编码模型 Qwen3-Coder:1M上下文,性能媲美 Claude Sonnet 4
Founder Park· 2025-07-23 08:21
Core Viewpoint - The article discusses the release and features of the Qwen3-Coder model by Alibaba Cloud, highlighting its advanced capabilities in coding and agentic tasks, as well as its competitive performance against other models in the market [3][4][5]. Group 1: Model Features - Qwen3-Coder series includes various versions, with Qwen3-Coder-480B-A35B-Instruct being the most powerful, featuring 480 billion parameters and supporting 256K tokens natively, expandable to 1 million tokens [4]. - The model has achieved state-of-the-art (SOTA) results in areas such as Agentic Coding, Browser Use, and Tool Use, comparable to Claude Sonnet4 [5][6]. - The training data for Qwen3-Coder amounts to 7.5 terabytes, with 70% being code, enhancing its programming capabilities while maintaining general and mathematical skills [12]. Group 2: Technical Details - The model utilizes a unique approach to reinforcement learning (RL) by focusing on real-world software engineering tasks, allowing for extensive interaction and decision-making [16]. - A scalable environment for RL has been established, enabling the simultaneous operation of 20,000 independent environments, which enhances feedback and evaluation processes [16]. Group 3: Tools and Integration - Qwen Code, a command-line tool for agentic programming, has been developed to maximize the performance of Qwen3-Coder in coding tasks [17]. - The integration of Qwen3-Coder with Claude Code is also highlighted, allowing users to leverage both models for enhanced coding experiences [22][26]. Group 4: User Experience - Users can access Qwen3-Coder through the Qwen Chat web version for free, providing an opportunity to experience its capabilities firsthand [6][7]. - Various demos showcasing the model's capabilities, such as simulating a solar system and creating visual effects in coding environments, are available for users [8][9][10].
Trae 核心成员复盘:从 Cloud IDE 到 2.0 SOLO,字节如何思考 AI Coding?
Founder Park· 2025-07-23 04:55
Core Insights - The article discusses the rapid development of Trae, particularly the introduction of the SOLO mode, which allows for a comprehensive AI-driven software development process, covering planning, coding, testing, and deployment through natural language input [1][2][36]. Group 1: Trae's Evolution - Trae's direction evolved from exploring Cloud IDE products like MarsCode and Coze, leading to the development of Trae Native IDE after recognizing the limitations of Cloud IDE in the market [3][11]. - The transition from MarsCode to Trae was driven by the realization that while Cloud IDE technology was strong, the market was not yet mature enough to support it [11][12]. Group 2: AI Coding Stages - AI coding is categorized into stages: AI-assisted programming, AI pair programming, and AI self-driving programming, with Trae's products currently focusing on AI pair programming [14][24]. - The first stage, AI-assisted programming, includes advancements in code completion and generation, with tools like Trae Cue enhancing the coding experience [17][20][23]. Group 3: SOLO Mode and AI's Role - The SOLO mode represents a shift where AI takes a leading role in the coding process, transforming the traditional dynamic where programmers primarily code while AI assists [36][38]. - The SOLO mode aims to improve task completion efficiency by reducing the number of interactions required to complete a task, leveraging AI's capabilities [37][40]. Group 4: Future of IDEs - The future of IDEs is expected to move away from being code-centric, with a focus on integrating AI as a core component of the development process [45][46]. - The company is committed to continuous improvement and innovation in AI coding tools, aiming to reshape developer experiences and expectations in the coming years [46].
8 月、上海,每年一度的谷歌开发者大会来了
Founder Park· 2025-07-22 12:27
本月有三场 AI 创业者大赛值得关注: 两场为 AI 低代码大赛,分别来自美团 NoCode 社区、 YouWare,以及还有一场 外滩大会主办的人工智能硬件科创大赛。 8 月,还有一场 2025 Google 开发者大会,将在上海举办。 此外,Founder Park 联合 Google 推出的「从模型到行动」系列 AI 工作坊活动,本周六将迎来最后一站 「北京站」,仍在火热报名中。 此前深圳站、上海站两场线下,现场开发者反馈收获满满。 我们还整理了近期值得参与的一些活动,对更多活动感兴趣的小伙伴,可以点击文末的 「阅读原文」 查看。 跟着 Google 出海:教你怎么落地 Gemini【最后一站】 主办方: Founder Park x Google 活动&报名时间: 7 月 26 日(周六) 14:00–17:00 @Google 北京办公室,7 月 24 日截止报名 活动亮点: 面向人群: 报名方式: https://mp.weixin.qq.com/s/WFScFd2yDeryo-kOeLiwRw NoCode 7 月「晒作品,赢奖励」 主办方: NoCode 社区 比赛时间: 2025.07 活动 ...