Founder Park
Search documents
加量不加价,一篇说明白 Claude Sonnet 4.5 强在哪
Founder Park· 2025-09-30 03:46
Core Viewpoint - Anthropic has launched the Claude Sonnet 4.5 model, claiming it to be the best coding model in the world, with a focus duration of over 30 hours for complex multi-step tasks, surpassing OpenAI's GPT-5 Codex [2][9]. Pricing and Cost Efficiency - The pricing for Claude Sonnet 4.5 remains the same as its predecessor, at $3 per million tokens for input and $15 per million tokens for output. Cost savings of up to 90% can be achieved through prompt caching, and batch processing can save 50% [2]. Developer Tools and Integration - Anthropic has introduced the Claude Agent SDK and an experimental feature called "Imagine with Claude" for developers, allowing integration with platforms like Amazon Bedrock and Google Cloud's Vertex AI [3][26]. Performance Metrics - In the SWE-bench Verified evaluation, Claude Sonnet 4.5 achieved industry-leading scores, with a 61.4% score in the OSWorld benchmark, significantly improving from the previous model's 42.2% [10][12]. Enhanced Features - The model includes new features such as a checkpoint function in Claude Code, context editing, and memory tools, enabling it to handle longer tasks and more complex operations [4][24]. Application and Usability - Users can interact with Claude Sonnet 4.5 through the Claude.ai website and mobile applications, with integrated functionalities for code execution and file creation directly within conversations [5][6]. Safety and Alignment - Claude Sonnet 4.5 is noted for its improved alignment and safety features, reducing undesirable behaviors such as deception and flattery, and making significant progress in defending against prompt injection attacks [24][25]. Experimental Features - The "Imagine with Claude" feature allows real-time software generation, showcasing the model's capabilities in adapting to user requests without pre-written code [31][33]. Recommendations - Anthropic recommends all users upgrade to Claude Sonnet 4.5 for enhanced performance across all applications, with updates available for both the Claude Code and developer platform [34].
DeepSeek V3.2 发布:长文本能力新突破,API 价格砍半
Founder Park· 2025-09-29 10:55
Core Insights - DeepSeek has launched its latest experimental model, DeepSeek-V3.2-Exp, which incorporates the revolutionary DeepSeek Sparse Attention (DSA) technology aimed at significantly enhancing long text processing efficiency [2][6][7]. Group 1: Technical Innovations - The introduction of the DeepSeek Sparse Attention (DSA) mechanism allows for fine-grained sparse attention, achieving a substantial increase in long text training and inference speed with minimal impact on model output quality [6][7]. - A rigorous evaluation was conducted to align the training settings of DeepSeek-V3.2-Exp with V3.1-Terminus, showing that the performance of DeepSeek-V3.2-Exp is comparable to V3.1-Terminus across various public benchmarks [10]. Group 2: Cost Reduction - The efficiency improvements have led to a significant reduction in API call costs, with a decrease of over 50%, benefiting developers by allowing them to build more powerful applications at a lower cost [4][12]. Group 3: User Engagement and Testing - DeepSeek has retained access to the V3.1 model's API for a limited time until October 15, 2025, allowing users to compare the new and old versions while enjoying the same pricing for both [15][16]. - Users are encouraged to participate in testing the experimental version and provide feedback, which is crucial for further refinement [15][18].
扒完全网最强 AI 团队的 Context Engineering 攻略,我们总结出了这 5 大方法
Founder Park· 2025-09-28 12:58
Core Insights - The article discusses the emerging field of "context engineering" in AI agent development, emphasizing its importance in managing the vast amounts of context generated during tool calls and long-horizon reasoning [4][8][20]. - It outlines five key strategies for effective context management: Offload, Reduce, Retrieve, Isolate, and Cache, which are essential for enhancing the performance and efficiency of AI agents [5][20][21]. Group 1: Context Engineering Overview - Context engineering aims to provide the right information at the right time for AI agents, addressing the challenges posed by extensive context management [5][8]. - The concept was popularized by Karpathy, highlighting the need to fill a language model's context window with relevant information for optimal performance [8][10]. Group 2: Importance of Context Engineering - Context management is identified as a critical bottleneck in the efficient operation of AI agents, with many developers finding the process more complex than anticipated [8][11]. - A typical task may require around 50 tool calls, leading to significant token consumption and potential cost implications if not optimized [11][14]. Group 3: Strategies for Context Management - **Offload**: This strategy involves transferring context information to external storage, such as file systems, rather than sending complete context back to the model, thus optimizing resource utilization [21][23][26]. - **Reduce**: This method focuses on summarizing or pruning context to eliminate irrelevant information while being cautious of potential information loss [32][35][38]. - **Retrieve**: This involves sourcing relevant information from external resources to enhance the model's context, which has become a vital part of context engineering [45][46][48]. - **Isolate**: This strategy entails separating context for different agents to prevent interference, particularly in multi-agent architectures [55][59][62]. - **Cache**: Caching context can significantly reduce costs and improve efficiency by storing previously computed results for reuse [67][68][70]. Group 4: The Bitter Lesson - The article references "The Bitter Lesson," which emphasizes that algorithms relying on large amounts of data and computation tend to outperform those with manual feature design, suggesting a shift towards more flexible and less structured approaches in AI development [71][72][74].
泡泡玛特的玩具收入,超过迪士尼了,成年人才是玩具的最佳消费者
Founder Park· 2025-09-27 02:37
Core Insights - The article discusses the significant changes in the global toy industry, highlighting the revenue rankings of toy companies for the first half of 2025, which reflect evolving consumer trends and business models in the post-pandemic era [5][6]. Group 1: Market Overview - The global toy market showed a notable recovery in the first half of 2025, with an average year-on-year sales growth of 7% across 12 major markets excluding China [6]. - Specific categories such as "games and puzzles" and "collectibles" experienced explosive growth, with increases of 36% and 35% respectively [7]. Group 2: Revenue Rankings - The top toy companies by revenue for the first half of 2025 include: - LEGO Group: 38.45 billion RMB - Pop Mart: 13.88 billion RMB - Disney: 13.86 billion RMB - Bandai Namco: 14.44 billion RMB - Hasbro: 13.34 billion RMB - Mattel: 13.18 billion RMB - Sega Sammy: 6.64 billion RMB - Asmodee: 5.77 billion RMB - Tomy: 5.55 billion RMB - Pokémon: 5.50 billion RMB - Spin Master: 5.21 billion RMB - MGA Entertainment: 3.93 billion RMB - Sanrio: 3.91 billion RMB - Ravensburger: 3.04 billion RMB - VTech: 2.89 billion RMB - Funko: 2.74 billion RMB - Simba Dickie Group: 2.71 billion RMB - Moose Toys: 2.68 billion RMB - JAKKS Pacific: 1.66 billion RMB - Blokees: 1.34 billion RMB - Dream International Limited: 1.21 billion RMB [12][11]. Group 3: Key Trends - The article identifies three major trends driving profitability and growth in the toy industry: 1. The rise of IP collectible toys and trading card games. 2. The increasing importance of adult consumers in the toy market. 3. The necessity for brands to excel in IP development and cross-platform value amplification [15][19]. Group 4: Company Strategies - Disney continues to leverage its strong content ecosystem to drive sales, with its consumer products division generating 13.86 billion RMB in revenue, a 3.5% increase year-on-year [21][26]. - Bandai Namco's toy sales are closely tied to its content, with significant contributions from popular franchises like "One Piece" and "Dragon Ball" [27][30]. - Mattel is transitioning from a traditional toy company to a content-driven entity, establishing Mattel Studios to enhance its IP narrative capabilities [39][42]. - Pop Mart has emerged as a leading player in the global trend toy market, achieving 13.88 billion RMB in revenue, with its core IP "THE MONSTERS" contributing significantly to its success [48][50]. Group 5: Trading Card Games - Trading card games (TCGs) have become one of the fastest-growing and most profitable segments in the toy market, with the global TCG market projected to reach $7.8 billion (approximately 55.5 billion RMB) in 2025 [56][59]. - Hasbro's "Magic: The Gathering: Final Fantasy" set a record for single-day sales, highlighting the potential of TCGs in driving revenue growth [61][66]. Group 6: Distribution and Market Dynamics - Asmodee has established itself as a major distributor in the TCG market, with approximately 64% of its revenue coming from card games [69][76]. - Bandai Namco has also made significant strides in the TCG space, with multiple titles dominating sales charts in Japan [77][80].
Sam Altman:到目前为止,这绝对是我最喜欢的 ChatGPT 新功能
Founder Park· 2025-09-26 03:30
Core Viewpoint - OpenAI has launched a preview version of the new ChatGPT feature "Pulse," which acts as a personalized assistant that provides daily updates based on user interactions and preferences [2][10][14]. Group 1: Functionality of Pulse - Pulse operates as an asynchronous search tool, compiling user memories, chat history, and direct feedback to deliver personalized updates the next day [5][10]. - Users can manage the research content provided by ChatGPT, indicating what is useful or not, with results presented in visual card format for easy browsing [5][10]. - The feature allows integration with Gmail and Google Calendar to enhance context and relevance of suggestions, such as drafting meeting agendas or reminding users of important dates [5][10]. Group 2: User Experience and Feedback - Users have reported that the content presented by Pulse is not only broad but also highly specific to previous discussions with ChatGPT, enhancing the personalization aspect [10][12]. - The interface includes options for users to provide quick feedback through likes or dislikes, which will help refine the personalization of Pulse over time [8][10]. Group 3: Future Implications - OpenAI views Pulse as a significant step towards making ChatGPT more practical, with plans to extend the feature to Plus subscribers in the future [14]. - The proactive nature of Pulse may influence how users consume news and social media, potentially paving the way for future advertising opportunities and social network development [12][14].
对话 Plaud 莫子皓:你还记得 PMF 的感觉吗?
Founder Park· 2025-09-25 01:03
Core Insights - Plaud is aggressively hiring and aims to expand its team to enhance its AI hardware capabilities, reflecting its growth trajectory and market potential [2][9] - The company reported over $100 million in earnings last year, with projections to exceed $200 million this year, indicating strong financial performance and market demand [3][4] - Plaud's product, a $150 recording card, has sold to over 1 million users globally, showcasing its success in the AI hardware startup space [4] Group 1: Business Model and Market Position - Plaud's business model is not heavily reliant on external financing, as it has established itself as a leading AI hardware startup [4] - The company emphasizes the importance of product-market fit (PMF), which has driven its rapid growth, achieving a fourfold increase in sales within a year [5][18] - The competitive landscape is evolving, but Plaud remains focused on delivering cutting-edge intelligence to its users, rather than being distracted by slower competitors [6][9] Group 2: Product Development and User Engagement - The company is iterating on its product offerings, moving from a simple recording device to a more comprehensive work companion that integrates various functionalities [58][70] - New features like "Press to Highlight" allow users to mark important moments during recordings, enhancing the value of the captured information [44][46] - Plaud aims to align AI capabilities with user intentions, ensuring that the technology not only records but also understands and processes user needs effectively [47][56] Group 3: Future Directions and Market Strategy - The company plans to expand its presence in the Chinese market, recognizing the significant opportunity presented by a large user base [68] - Future product iterations will focus on integrating advanced AI capabilities, with an emphasis on context and user interaction [70][74] - Plaud is committed to maintaining a strong engineering team to support its ambitious goals in the AI hardware space, prioritizing talent that can drive innovation [78][79]
a16z:AI 产品初期用户流失高很正常,M3 留存才是评估 PMF 的关键
Founder Park· 2025-09-24 08:16
Core Insights - The leading AI companies do not necessarily face retention issues, but they struggle with measurement [2][4] - Shifting the benchmark for measuring user retention from month 0 (M0) to month 3 (M3) provides clearer insights into product-market fit (PMF) and go-to-market (GTM) strategies [4][8] - The retention curve for AI products can be divided into three phases: acquisition phase (M0-M3), retention phase (M3-M6/M9), and expansion phase (M9+) [8][10] Retention Curve Dynamics - During the acquisition phase (M0-M3), the retention curve often experiences an initial decline due to the influx of non-core users [10][11] - The retention curve typically stabilizes around M3, indicating that core users who find high-value use cases remain [11][12] - In the retention and expansion phases (M3-M12+), core users may integrate the product into new workflows, leading to revenue growth [12][21] Key Metrics - The M12/M3 ratio serves as an early indicator of long-term retention quality, with a ratio close to or exceeding 100% signaling potential for long-term net dollar retention (NDR) above 100% [18][25] - High retention rates are crucial for assessing PMF, and tracking the unit acquisition cost of M3 retained customers can indicate the efficiency of GTM investments [22][23] Future Outlook - The long-term retention potential of AI companies may surpass that of traditional SaaS companies, with expectations of achieving over 150% NDR during the scaling phase [25][24]
Google Cloud 最新 AI 创业者报告:应用公司不用做自己的模型,速度和认知才是壁垒
Founder Park· 2025-09-24 08:16
Core Insights - The article discusses a trend report from Google Cloud aimed at AI entrepreneurs, featuring insights from prominent entrepreneurs and investors on AI trends, advice for startups, and predictions for AI development [2][4]. Group 1: Advice for Entrepreneurs - Startups should prioritize seizing market opportunities, as this is a critical time for growth [6]. - Pricing should be based on the value delivered rather than a per-user model, considering usage or value-based pricing [6]. - Immediate assessment is essential to define problem scopes accurately, with a clear metrics system and performance evaluation methods established early on [6]. - Focusing on niche areas to solve specific problems is more beneficial than pursuing general AI [6]. - Founders should prioritize hiring quality talent, be adaptable, assertive, and maintain close financial ties [18]. Group 2: Market Opportunities and Challenges - AI presents opportunities for billion-dollar companies, but trillion-dollar opportunities will take time to materialize [7]. - There is currently no consensus on trillion-dollar opportunities in AI, as large companies control traffic and respond quickly to market changes [9]. - Achieving a billion-dollar valuation requires a path to $500 million in annual recurring revenue (ARR), with several companies already reaching $100 million ARR [9][10]. - Companies should find differentiated approaches within a concentrated infrastructure landscape to develop consumer-grade AI products [10]. Group 3: Barriers and Growth Strategies - Speed and cognitive understanding are the primary barriers in the AI space, with a focus on vertical domains for sustainable profitability [13][14]. - AI applications are evolving, requiring a combination of model capabilities, contextual understanding, and environmental interaction to enhance product value [15]. - Growth in AI applications should rely on innovation rather than advertising, with a focus on demonstrating new capabilities to users [17]. Group 4: Global Expansion and Market Understanding - Successful entrepreneurs in global markets need to identify their comparative advantages and understand local demands [23][25]. - Companies should leverage their strengths in execution and product quality to capture user attention in unfamiliar environments [26]. Group 5: Investment Opportunities - Four categories of AI products are highlighted as worthy of investment: products with bilateral network effects, non-consensus paths, data and scenario advantages, and complex products that combine technology and business models [27][29][30]. - Investors should focus on companies that demonstrate foresight, identify valuable data paths, and adhere to first principles in their approach [32][34].
18 年 SEO 增长经验专家:别再收藏各种 AEO 最佳攻略了,自己动手实验才是做好的关键
Founder Park· 2025-09-23 14:19
Core Insights - The article emphasizes the importance of verifying information about Answer Engine Optimization (AEO) through personal experimentation rather than relying on potentially inaccurate online best practices [2][3] - AEO is closely related to traditional SEO but requires a focus on citation optimization and long-tail questions to be effective [5][8] - The rise of AEO is attributed to the increasing adoption of AI models like ChatGPT, which have changed how users seek information [10][52] Group 1 - AEO is fundamentally about optimizing content to appear as answers in large language models [9][10] - High-quality, authentic comments on platforms like Reddit are more effective than numerous low-quality comments for AEO [3][24] - The distinction between AEO and SEO lies in the need for citation optimization and addressing long-tail questions [5][14] Group 2 - AEO strategies should include both on-site optimization (like improving help center content) and off-site optimization (like increasing mentions across various platforms) [22][58] - The average length of user queries in chat scenarios is significantly longer than traditional search queries, indicating a shift in user behavior [19][20] - Companies can quickly gain visibility in AEO by being mentioned in relevant discussions or content, unlike the longer timeline required for SEO [19][45] Group 3 - The effectiveness of AEO can be measured through experiments that compare the impact of different strategies on visibility and traffic [36][44] - AEO is not a replacement for Google but rather a new channel that complements existing search methods [50][51] - The quality of leads generated through AEO is significantly higher than those from traditional search, with conversion rates being six times greater [16][47] Group 4 - Companies should focus on creating original, high-quality content that provides unique insights to stand out in AEO [32][33] - The optimization of help center content is crucial, as many user queries are related to specific product functionalities and support [58][60] - AEO requires continuous adaptation and validation of strategies to ensure effectiveness in a rapidly changing digital landscape [36][46]
Nano Banana核心团队:图像生成质量几乎到顶了,下一步是让模型读懂用户的intention
Founder Park· 2025-09-22 11:39
Core Insights - The future of image models is expected to evolve similarly to LLMs, transitioning from creative tools to information retrieval tools [4] - Multi-modal interaction will be crucial, focusing on understanding user intent and adapting to various interaction modes [4][20] - The integration of "world knowledge" from LLMs into image models is a significant application direction for enhancing user assistance [14] Group 1: Trends and Developments - Image models are anticipated to become more proactive and intelligent, capable of using text and images flexibly based on user queries [4][14] - Users' expectations for instant, high-quality outputs from models are often unrealistic, highlighting the need for iterative processes [18] - The design of user interfaces (UI) for model products is currently undervalued, with a need for better integration of various modalities to enhance usability [4][18] Group 2: User Interaction and Experience - The "blank canvas dilemma" is a significant challenge, necessitating clear communication of what actions are possible within the interface [5][20] - Simplifying operations for ordinary users is essential, with a focus on visual guidance and examples to facilitate understanding [17] - Social sharing plays a key role in overcoming the "blank canvas dilemma," as users are inspired by others' creations [17] Group 3: Model Evaluation and Aesthetics - User feedback is critical for evaluating model performance, with a focus on aesthetic quality and meeting user needs [21][22] - Meeting aesthetic demands is challenging and requires deep personalization to provide useful suggestions [26] - The future may see a shift towards more personalized models, but current expectations are likely to remain at the prompt level [27] Group 4: Future Directions and Integration - The development of "Omni Models" that can handle multiple tasks is a likely trend, with shared technologies between image and video models [40] - Traditional tools and AI models are expected to coexist, with each serving different user needs based on the complexity of tasks [35][37] - The integration of AI into existing workflows, such as enhancing presentation tools, is a promising area for future development [38]