Founder Park
Search documents
AI Agent时代的「AWS」:Manus 背后的重要功臣 E2B 是何来头?
Founder Park· 2025-05-19 12:16
文章转载自「海外独角兽」 Multi agent 系统正成为新的突破方向的过程中,agent infra 也成为落地关键。在 computer use 带来范式创新的趋势下,virtual machine 将成为 潜在创业机会,E2B 就是这个领域的新兴参与者。 E2B 之所以受到市场关注很大程度上是因为 Manus,Manus agent 完成任务过程中的 virtual computer 支持正是来自于 E2B。E2B 成立于 2023 年,作为一个开源基础设施,允许用户在云端的安全隔离沙盒中运行 AI 生成的代码。E2B 本质上是一个可以快速启动(~150 毫秒)的 microVM, 它的底层类似于 AWS Firecracker 这个代表性的 MicroVM,在此基础上, AI Agents 可以在 E2B 中运行代码语言、使用浏览器、调用各种操作系 统中的工具。 随着 Agent 生态的繁荣,E2B 的 沙盒月创建量一年内从 4 万增长到 1500 万,一年内增长了 375 倍。 为什么 AI agents 需要专属的"电脑"? 为了更好地理解这个问题,「海外独角兽」编译了 CEO Vasek Ml ...
北大校友、OpenAI前安全副总裁Lilian Weng关于模型的新思考:Why We Think
Founder Park· 2025-05-18 07:06
Core Insights - The article discusses recent advancements in utilizing "thinking time" during testing and its mechanisms, aiming to enhance model performance in complex cognitive tasks such as logical reasoning, long text comprehension, mathematical problem-solving, and code generation and debugging [4][5]. Group 1: Motivating Models to Think - The core idea is closely related to human thinking processes, where complex problems require time for reflection and analysis [9]. - Daniel Kahneman's dual process theory categorizes human thinking into two systems: fast thinking, which is quick and intuitive, and slow thinking, which is deliberate and logical [9][13]. - In deep learning, neural networks can be characterized by the computational and storage resources they utilize during each forward pass, suggesting that optimizing these resources can improve model performance [10]. Group 2: Thinking in Tokens - The strategy of generating intermediate reasoning steps before producing final answers has evolved into a standard method, particularly in mathematical problem-solving [12]. - The introduction of the "scratchpad" concept allows models to treat generated intermediate tokens as temporary content for reasoning processes, leading to the term "chain of thought" (CoT) [12]. Group 3: Enhancing Reasoning Capabilities - CoT prompting significantly improves success rates in solving mathematical problems, with larger models benefiting more from increased "thinking time" [16]. - Two main strategies to enhance generation quality are parallel sampling and sequential revision, each with its own advantages and challenges [18][19]. Group 4: Self-Correction and Reinforcement Learning - Recent research has successfully utilized reinforcement learning (RL) to enhance language models' reasoning capabilities, particularly in STEM-related tasks [31]. - The DeepSeek-R1 model, designed for high-complexity tasks, employs a two-stage training process combining supervised fine-tuning and reinforcement learning [32]. Group 5: External Tools and Enhanced Reasoning - The use of external tools, such as code interpreters, can efficiently solve intermediate steps in reasoning processes, expanding the capabilities of language models [45]. - The ReAct method integrates external operations with reasoning trajectories, allowing models to incorporate external knowledge into their reasoning paths [48][50]. Group 6: Monitoring and Trustworthiness of Reasoning - Monitoring CoT can effectively detect inappropriate behaviors in reasoning models, such as reward hacking, and enhance robustness against adversarial inputs [51][53]. - The article highlights the importance of ensuring that models faithfully express their reasoning processes, as biases can arise from training data or human-written examples [55][64].
中国 AI 应用的终局:AI RaaS 和 AI 包工头模式
Founder Park· 2025-05-17 02:28
Core Viewpoint - The article discusses the emergence of the "AI Contractor Model" (AI 包工头模式) as a transformative approach in the AI application landscape, emphasizing its potential to disrupt traditional SaaS models and create significant profit opportunities through a results-oriented service framework [4][12][27]. Summary by Sections AI Application Payment Models - The essence of AI application payment models revolves around the value of AI products, with a focus on how to present unique value to users and achieve commercial revenue [2][3]. Traditional SaaS vs. AI Applications - Traditional SaaS products, which rely on standardized functions and private data accumulation, are at risk of being replaced by high-intelligence AI applications, losing favor in capital markets [4][27]. - The AI Contractor Model can potentially break the ceiling of digital profit pools, with profit margins varying significantly across different business models, achieving up to 60 times the profit space when combined with AI capabilities [4][32]. AI Contractor Model Characteristics - The AI Contractor Model is characterized by a results-oriented payment structure, binding the interests of AI service providers and clients closely [12][14]. - It requires a comprehensive delivery system, including investment in production equipment, management of personnel, and operational funding, encapsulated in the "package of work, materials, and results" concept [12][14]. Evolution Levels of AI Contractor Model - The model evolves through four levels: L1 focuses on basic efficiency, L2 on comprehensive efficiency, L3 on profit sharing, and L4 on transforming from passive service to active resource control [5][50]. Market Examples - Case studies illustrate the application of the AI Contractor Model in various sectors, such as autonomous mining operations and AI customer service, showcasing how companies like Sierra and KoBold are leveraging this model to achieve significant operational efficiencies and profit margins [16][19][21][24]. Challenges for Traditional SaaS - Traditional SaaS companies face significant challenges, including high R&D and sales costs, low customer retention rates, and a lack of recognition in the Chinese market, which has led to a high rate of losses [14][27]. Profit Pool Analysis - The article outlines five major profit pools for enterprises, highlighting the potential for the AI Contractor Model to tap into these pools more effectively than traditional models, thus enhancing overall profitability [32][34]. High Capital Value Factors - The AI Contractor Model can overcome traditional barriers to capital value by achieving high technological content, systematic optimization, controllability, customer stickiness, and financial predictability, collectively referred to as the "Five Highs" [43][44][49]. Required Cognitive Upgrades - Successful implementation of the AI Contractor Model necessitates a focus on vertical specialization, human-machine collaboration, and a deep understanding of industry-specific needs to avoid pitfalls associated with broad, unfocused strategies [58][59][60].
2025 中国最具价值 AGI 创新机构 TOP 50 调研启动征集!
Founder Park· 2025-05-17 02:28
Core Insights - The article discusses the transformative impact of AI technologies on industries and society, highlighting the emergence of AI Agent products and their integration into business operations [1] - It emphasizes the importance of foundational technology advancements, such as the launch of the DeepSeek R1 model, which has significantly enhanced the capabilities of AI models in China [1] - The article introduces a survey initiated by Founder Park to identify key players that are innovating at the intersection of technology, business, and application [2] Group 1: AI Innovations - AI Agent products are creating new human-computer interaction experiences, functioning as "digital employees" within enterprises [1] - AI Coding products are evolving towards full automation, shifting developers' focus from specific code lines to expected outcomes [1] - The release of AlphaFold3 has sparked a commercialization wave in the fields of protein prediction, drug discovery, and bio-AI models [1] Group 2: Evaluation Criteria - The evaluation focuses on companies that demonstrate innovation in business value creation, including new operational processes and value distribution methods [4] - Companies are assessed on their ability to enhance user interaction experiences through intelligent design and improved workflows [4] - The criteria also include breakthroughs in AI algorithms, models, and data processing capabilities that can influence industry ecosystems [4] Group 3: Target Companies - The survey targets both startups and publicly listed companies primarily in the AI sector, focusing on infrastructure, model, and application layers [5] - The infrastructure layer includes companies providing data, computing power, and platforms essential for AI development [5] - The model layer focuses on general large models and deep learning frameworks, while the application layer encompasses a wide range of AI applications, including image, text, and code generation [5]
怎么回事?刚被OpenAI收购,Windsurf就发了个自己的模型
Founder Park· 2025-05-16 09:22
Core Viewpoint - OpenAI has agreed to acquire Windsurf for $3 billion, highlighting the growing importance of AI programming tools in the software development industry [1] Group 1: SWE-1 Model Overview - Windsurf has launched its AI programming model, SWE-1, which focuses on the entire software engineering process rather than just coding tasks [1] - SWE-1 features "Flow Awareness," allowing for a seamless collaboration between AI and users, where AI performs tasks, users provide corrections, and AI continues the process [1][34] - The SWE-1 series includes three models: SWE-1, SWE-1-lite, and SWE-1-mini, catering to different user needs and performance requirements [5][28] Group 2: Development and Evaluation of SWE-1 - The development of SWE-1 was inspired by the popular Windsurf editor, utilizing a new data structure and training method to understand incomplete states and long-term tasks [13][14] - SWE-1 has been evaluated against leading models like Anthropic's series and has shown competitive performance in both offline assessments and real-world usage [21][22] - The model's performance metrics include the number of code lines accepted by users and the contribution rate of code changes made by the model [24][26] Group 3: Importance of Flow Awareness - Flow Awareness is a key design principle for Windsurf, enabling a shared timeline between AI and users, which enhances collaboration and task management [33][34] - This system allows for continuous tracking of the model's capabilities and user interventions, facilitating a more effective development process [37][41] - The evolution of the shared timeline concept is central to Windsurf's goal of creating a comprehensive software engineering timeline [39] Group 4: Future Prospects - SWE-1 is just the beginning, with Windsurf aiming to continuously improve the SWE series models while maintaining low costs and enhancing performance [42] - The acquisition by OpenAI marks a new era for AI programming tools, transforming the software development landscape [42][43]
独家对话Lovart创始人陈冕:我们没有产品经理,只有设计师
Founder Park· 2025-05-16 09:22
Lovart 值得关注,它是 AI 应用层团队产品创新能力的印证和延续,这是 Manus 之后最火的 Agent,从通用领域,成功地向垂直赛道落地了 Agent 产品形 态。 据了解,Lovart 发布后,推特上出现近 5000 条讨论帖,官方视频播放近百万,获得马斯克点赞、Grok 官方发帖讨论。24 小时内,waitlist 申请人数超过 2 万。 (一个使用 Lovart 制作的 Lovart 宣传片) 基础形态上,Lovart 看起来与 Manus 很像,一个能够调用工具的 Agent,替用户完成任务。 但 Lovart 在垂直领域更进一步,它把一种需要多模态的「职业」变成了工作流再内化成 Agent,配以适合设计师使用的产品形态——画布。「画布」就是 「桌子」,还原到设计的原始状态,没有电脑,只有笔和纸,一个人有需求,一个人有能力,好的设计作品在这个场景中诞生。 「Lovart 现在当然是一个工具,但以后呢?它会是一个有职业属性的人,直接交付服务的结果。」 Lovart 创始人 & CEO 陈冕在 AI 设计领域有很多实践经验,他认为图像生成的 AI 产品,其实已经走到了第三个阶段。 1.0 阶段, ...
2025 中国最具价值 AGI 创新机构 TOP 50 调研启动征集!
Founder Park· 2025-05-15 11:34
Core Insights - The article discusses the transformative impact of AI technologies on various industries, highlighting the emergence of AI products that enhance human-computer interaction and automate processes [1] - It emphasizes the importance of foundational technologies in driving the commercialization of AI applications, particularly in the fields of drug discovery and biological AI models [1][4] - A survey initiated by Founder Park aims to identify key players that are innovating at the intersection of technology, business, and application [2][3] Group 1: AI Product Development - AI Agent products are being integrated into business operations, enhancing the role of digital employees [1] - AI coding products are evolving towards full automation, shifting developers' focus from specific code lines to expected outcomes [1] - The launch of the DeepSeek R1 model marks a significant advancement in China's AI capabilities, fostering a new wave of entrepreneurial innovation [1] Group 2: Evaluation Criteria for Candidates - Candidates are evaluated based on their ability to innovate in business value creation, including operational processes and value distribution [4] - Innovations in user interaction and experience are crucial, focusing on natural, fluid, and sustainable interactions that improve workflows [4] - Breakthroughs in AI algorithms, models, and data processing are essential for candidates, showcasing their potential to influence industry ecosystems [4] Group 3: Focus Areas for Evaluation - The evaluation will cover startups and public companies primarily in China, focusing on the infrastructure, model, and application layers of the AGI industry [5] - The infrastructure layer includes companies providing data, computing power, and platforms essential for AI development [5] - The application layer encompasses a wide range of sectors, including image, text, audio, video generation, and enterprise applications [5] Group 4: Application Process - The application period runs from now until May 31, with evaluations taking place from June 2 to June 21 [8] - The evaluation process includes initial screening, secondary screening, and final assessments to determine the most valuable players in the AI space [8]
2025年哪款模型最受欢迎?Poe最新报告:DeepSeek降温、可灵成黑马
Founder Park· 2025-05-15 11:34
Core Insights - Poe's latest report analyzes AI model usage trends from January to May 2025, focusing on user engagement across text, reasoning, image, video, and audio domains [1][2] Group 1: Model Performance and Market Trends - The popularity of the DeepSeek model has declined, with its market share dropping from a peak of 7% in mid-February to 3% by the end of April [4][7] - New flagship models from the same provider tend to capture market share from their predecessors, leading to a rapid shift in user preferences towards newer models [4][7] - The share of text messages sent to reasoning models increased from approximately 2% to about 10%, peaking during DeepSeek's popularity [9][11] Group 2: Reasoning Models - The number of reasoning models has significantly increased, reflecting a growing trend towards more precise and reliable handling of complex tasks [8] - Gemini 2.5 Pro gained approximately 30% of reasoning message share within six weeks of its release [11] - Users are quickly transitioning to OpenAI's latest reasoning models, indicating a strong preference for newer, more powerful options [12] Group 3: Image Generation Models - The GPT image generation model, GPT-Image-1, achieved a usage rate of 17% within two weeks of its API launch [17] - Google's Imagen 3 series saw its usage grow from about 10% to 30%, while Black Forest Labs' FLUX series maintained a market share of approximately 35% [17][18] Group 4: Video Generation Models - Kuaishou's Kling video generation model rapidly captured about 30% of the market share, with Kling-2.0-Master accounting for 21% of all video generation requests within three weeks of its release [21][22] - Runway, a pioneer in video generation, experienced a 40% decline in usage share, dropping to around 20% [23] Group 5: Audio Generation Models - ElevenLabs dominated the audio generation space, handling about 80% of TTS requests from subscribers [24] - The audio generation market is becoming increasingly competitive, with new players offering unique voice options and performance features [24]
GPT-4.1正式在ChatGPT中上线,暂时没有1M上下文
Founder Park· 2025-05-15 03:58
文章转 载自 「 新智元」 刚刚,OpenAI 官宣:GPT-4.1 在 ChatGPT 中上线,用户可以直接使用。GPT-4.1 模型擅长编码任务和 遵循指令,生成速度更快,是 o3 和 o4-mini 的绝佳替代品。 一个月前,OpenAI 推出了新系列模型 GPT-4.1,在编程、指令遵循、长上下文方面表现优异。 GPT-4.1 此前仅通过 API 向开发者开放,在 ChatGPT 上线后,Plus、Pro 和 Team 用户可通过模型选择 器中的"更多模型"下拉菜单访问 GPT-4.1。企业版和教育版用户将在未来几周内获得访问权限。同时, OpenAI 还计划在 ChatGPT 中引入 GPT-4.1 mini 取代 GPT-4o mini。 Founder Park 正在搭建「 AI 产品市集」社群,邀请从业者、开发人员和创业者,扫码加群: 进群后,你有机会得到: 01 GPT-4.1, 比GPT-4.5还好? 这次放出GPT-4.1,也算响应群众的呼声。 早在4月底,就有用户抱怨说:GPT-4.1简直是自己最喜欢的OpenAI模型,可惜在ChatGPT中并 不能使用。 最新、最值得关注的 AI 新 ...
付费用户突破 1000 万,All in AI 的多邻国,是怎么用 AI 的?
Founder Park· 2025-05-14 12:28
Core Insights - Duolingo's Q1 financial report shows impressive growth with DAU at 46.6 million, up 49% year-over-year; MAU at 130.2 million, up 33%; and paid users surpassing 10.3 million, up 40% [1][2] Group 1: Financial Performance - Total revenue reached $230.7 million, representing a 38% year-over-year increase [2] Group 2: AI Integration - Duolingo has not been negatively impacted by AI; instead, it has leveraged AI to create 148 courses in one year, a process that would have taken 12 years using traditional methods [3] - The CEO announced a strategic shift to "AI-first," emphasizing the need to adapt to AI's transformative impact on the company [4][6] - AI is expected to enhance productivity and help achieve high-quality teaching by automating content creation, which previously required extensive manual effort [8] Group 3: AI Applications in Education - AI has revolutionized Duolingo's content creation process, allowing for a fully automated system that significantly reduces human labor and increases course offerings [11] - The use of AI in conversation practice has improved user engagement, as learners can practice without the fear of judgment from peers [12] Group 4: User Engagement Strategies - Duolingo has successfully gamified learning by shortening lesson durations from 30 minutes to 2 minutes, making it more appealing for users [15] - Features like "streaks" that track consecutive learning days have proven to be highly effective in maintaining user motivation [16] Group 5: Future of Education - The CEO believes that while AI will enhance educational scalability, traditional teachers and schools will still play a crucial role in student care [34] - The company anticipates a gradual transformation in education, with AI tools becoming more integrated into learning environments over the next 20 years [35][36]