Founder Park
Doubao Creative Challenge, Face-to-Face AI Talks with a16z Partners: Recent High-Quality AI Events Are All Here
Founder Park· 2025-11-18 11:06
Group 1
- The "Not Technical at All" creative challenge, initiated by Doubao and Intel, runs from November 10 to December 8, 2025. Entries require no programming skills [4][5].
- The challenge has a prize pool of 200,000, with cash prizes and exclusive experiences such as witnessing a rocket launch [5].
- The event aims to foster high-value exchanges among investors, media, and AI experts [5].
Group 2
- The "From Lab to Market" event, hosted by a16z and SSG, takes place on November 20 and focuses on navigating technology trends and market demands [6][7].
- The event targets researchers in AI and embodied intelligence, with limited seating for doctoral students and researchers [7].
- Founder Park's event on November 25 addresses the challenges of building global teams in the AI sector, including recruitment, compliance, and management [7][8].
Group 3
- The Geek Park Innovation Conference 2026 is scheduled for December 6-7 in Beijing and is aimed at technology enthusiasts and entrepreneurs [8].
- The 2026 Geek Camp takes place February 6-10, 2026, in Shenzhen, offering a hands-on experience of building prototypes in a collaborative environment [8][9].
- Geek Camp participants receive transportation and accommodation subsidies, along with access to over 265 enterprise ecosystems and investment resources [9].
A Post-95 Team Building a 3D Foundation Model Lands a Major Partnership with a Top Game and Is Defining New Rules for 3D Generation
Founder Park· 2025-11-18 11:06
Core Insights
- The article highlights Yingmou Technology's advances in 3D generation, particularly its Rodin model and the latest iteration, Rodin Gen-2, which substantially improves generation quality and controllability [2][6][9].
Group 1: Company Achievements
- Yingmou Technology showcased Rodin at GDC, where it drew the attention of top game developers and led to 3D generation technology being applied in mobile gaming [2].
- The company recently completed a multi-million dollar funding round led by BlueRun Ventures, with participation from ByteDance and Sequoia China, positioning it as a leading startup in the 3D large model sector [2].
- The research paper "CLAY" was nominated for a best paper award at SIGGRAPH, a milestone for a young team that has focused on 3D research since its founding [2][3].
Group 2: Technological Innovations
- Rodin Gen-2 has been scaled up to a dataset in the millions and a parameter count in the billions, producing a qualitative leap in generation quality, including smoother geometric surfaces and lower post-processing costs [6][9].
- The "Bang to Parts" feature lets users decompose generated models into smaller components, improving the controllability of 3D models and streamlining workflows across applications [9][12].
- The model generates clean 3D meshes, which reduces the need for extensive repairs in software like Blender and Unity and makes the output more production-ready [8].
Group 3: Industry Trends
- Major companies are increasingly investing in 3D generation: Roblox has open-sourced CUBE 3D and ByteDance has released Seed3D 1.0 [6].
- Demand for fast, accurate 3D model generation is driving innovation. Yingmou's technology generates models in under 10 seconds, serving diverse industry needs [24].
- The team believes 3D generation will be a foundational technology for sectors including digital content creation, industrial design, and AR/VR interaction [29].
Understanding Agent Context Engineering Through The Legend of Zelda: Claude Skills Is Still Underrated
Founder Park· 2025-11-18 07:59
Core Insights
- Claude Skills represents a significant advance in AI Agent capabilities: it allows specialized knowledge to be discovered and loaded dynamically, turning general agents into task-specific experts [8][4].
- The underlying design philosophy of information layering is the key breakthrough. It improves token efficiency by up to 95%, along with decision quality and response speed [6][9].
Information Layering Design
- Information layering lets agents handle complex tasks efficiently by first accessing an index, then a summary, and retrieving the original content only when necessary [5][6].
- This design philosophy is akin to techniques used in 3D game development, such as Level of Detail (LOD) and on-demand loading, which optimize resource usage [12][20].
Three-Layer Architecture
- The three-layer architecture consists of:
  - LOD-0: Summary Layer, providing minimal metadata for quick browsing [29]
  - LOD-1: Core Layer, offering essential information sufficient for 80-90% of routine tasks [30]
  - LOD-2: Raw Layer, containing complete information for in-depth analysis when needed [31][32]
- This structure enables agents to navigate vast information landscapes efficiently, reducing token consumption and improving operational speed [60].
Practical Application
- In a case study analyzing quarterly performance, agents use LOD-0 to identify relevant data assets, LOD-1 to generate high-quality summaries, and LOD-2 for detailed queries [51][56].
- Token consumption falls from approximately 150,000 to 5,000, and response time falls from 45 seconds to 5 seconds [60].
Challenges and Considerations
- Implementing an information layering architecture requires substantial initial investment in creating high-quality LOD-1 summaries and keeping the layers synchronized [63][64].
- Designing a layered system requires careful consideration of information scale, update frequency, and access patterns to avoid over-engineering [66].
Universal Principles
- The core principles derived from Claude Skills are to use metadata instead of complete information and to load content on demand [67][71].
- These principles apply across information-intensive systems, improving efficiency and intelligence in agent design [85].
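The three-layer lookup summarized above can be illustrated with a small Python sketch. It is not the article's or Anthropic's implementation: the `Asset` schema, the `build_context` helper, the keyword matching, and the word-count tokenizer are all hypothetical stand-ins chosen to keep the example self-contained. The last two lines simply redo the arithmetic on the figures reported in the summary (150,000 to 5,000 tokens).

```python
from dataclasses import dataclass


@dataclass
class Asset:
    """One knowledge asset stored at three levels of detail (hypothetical schema)."""
    name: str
    lod0: str  # summary layer: minimal metadata for browsing
    lod1: str  # core layer: enough for most routine tasks
    lod2: str  # raw layer: complete content


def count_tokens(text: str) -> int:
    # Crude stand-in for a real tokenizer: one token per whitespace-separated word.
    return len(text.split())


def build_context(assets, query_terms, need_raw=False):
    """Scan LOD-0 for every asset, then load a deeper layer only for matches."""
    context = []
    for asset in assets:
        context.append(asset.lod0)  # the index entry is always cheap to include
        if any(term in asset.lod0.lower() for term in query_terms):
            # Escalate to LOD-1 by default, and to LOD-2 only on request.
            context.append(asset.lod2 if need_raw else asset.lod1)
    return context


assets = [
    Asset("q3_sales", "Q3 sales report", "Revenue grew; margin flat. " * 5, "row " * 2000),
    Asset("hr_policy", "HR leave policy", "Leave rules in brief. " * 5, "clause " * 2000),
]

layered = build_context(assets, ["sales"])
naive = [asset.lod2 for asset in assets]  # baseline: load every raw document

layered_tokens = sum(count_tokens(chunk) for chunk in layered)
naive_tokens = sum(count_tokens(chunk) for chunk in naive)
reduction = 1 - layered_tokens / naive_tokens

# Arithmetic on the figures quoted in the summary: 150,000 -> 5,000 tokens.
reported_reduction = 1 - 5_000 / 150_000
print(f"toy reduction: {reduction:.1%}, reported reduction: {reported_reduction:.1%}")
```

In this toy setup only the matching asset is expanded beyond its one-line index entry, which is where the saving comes from. A real system would replace the keyword match with whatever relevance test the agent uses.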
How Do Agents Use Search? The Team That Knows AI Search Best Shares Every Pitfall It Hit
Founder Park· 2025-11-17 10:08
Core Insights
- The article emphasizes the fundamental differences between human search behavior and AI search requirements: AI searches are dynamic and iterative, and often involve multiple queries to address complex tasks [1][6][9].
Group 1: AI Search vs. Traditional Search
- AI search needs multi-turn, iterative queries, in contrast to the static, one-time queries typical of human search [1][6].
- AI search prioritizes accuracy over speed, and comprehensive information coverage over just the top results [8][9].
- AI agents need longer, more detailed content to understand context, whereas traditional search engines provide short summaries [7][8].
Group 2: Challenges in AI Search Integration
- Different AI applications face different challenges when integrating search, such as task decomposition in office applications and low-latency responses in AI hardware [10][15][28].
- Authoritative content matters far more than before, because AI agents generate answers directly from search results, which demands strict standards for content quality [7][24].
Group 3: Search Infrastructure and Technology
- The search infrastructure provided by companies like Xiaosu Technology includes intelligent search and content reading capabilities, which AI agents need to access reliable information [10][11].
- The article discusses the need for a large-scale data index and advanced algorithms to ensure timely, accurate search results and to address the limitations of traditional search methods [29][31].
Group 4: Future of AI Search
- The future of search is expected to be closely tied to AI agents, with token consumption projected to grow exponentially as AI applications become more prevalent [41].
- Companies are focusing on search quality to reduce reliance on costly AI models, suggesting that effective search can significantly lower operational costs [35][36].
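The multi-turn, iterative query pattern described in this summary can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `search` is a hypothetical in-memory backend, not any vendor's API, and the "drop the last word" broadening rule is an arbitrary placeholder for whatever query rewriting a real agent would perform.

```python
def search(query, index):
    """Hypothetical backend: return documents containing every word of the query."""
    words = query.lower().split()
    return [doc for doc in index if all(word in doc.lower() for word in words)]


def iterative_search(sub_queries, index, max_rounds=3):
    """Run each sub-query, retrying with a broader query when nothing comes back."""
    evidence = {}
    for query in sub_queries:
        current = query
        for _ in range(max_rounds):
            hits = search(current, index)
            if hits:
                evidence[query] = hits
                break
            # Broaden by dropping the last word; give up once nothing is left.
            current = " ".join(current.split()[:-1])
            if not current:
                break
    return evidence


index = ["Q3 revenue by region", "Q3 headcount by team"]

# A complex task decomposed (by hand here) into two sub-queries.
result = iterative_search(["Q3 revenue forecast", "headcount"], index)
print(result)
```

The first sub-query finds nothing on its first attempt and succeeds only after being broadened, which is the kind of follow-up query a one-shot human search would not issue automatically.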
Why Is It So Hard to Hire the "Right People" Overseas?
Founder Park· 2025-11-17 10:08
Group 1
- The core challenge for companies expanding overseas is the difficulty of recruiting suitable talent through traditional channels [4].
- Many AI product teams keep their development teams in China and locate their growth teams primarily overseas [3].
- The workshop addresses the challenges of identifying, recruiting, and managing global teams, with insights from Deel and Vorka.AI [4][7].
Group 2
- Key discussion topics include how to accurately identify candidates who fit the team's culture and core competencies in unfamiliar overseas markets [7].
- The workshop highlights the need to adjust traditional recruitment funnels and evaluation systems [7].
- It also discusses strategies for using social media platforms like Xiaohongshu and X to build an employer brand on a limited budget [7][8].
Group 3
- The workshop also covers cross-border payroll compliance, hiring policies, and remote team collaboration challenges [7][8].
- The event targets founders and business leaders of tech companies that have overseas operations or plan to build global teams [8].
AI-Native Imaging Companies: The Chance to Disrupt the Field Has Arrived!
Founder Park· 2025-11-16 03:05
Core Insights
- The article discusses the transformative impact of AI on the imaging equipment industry, highlighting a shift from optics to computation as the primary value driver [4][5][6].
Group 1: Historical Context and Evolution
- Over the past 50 years, the balance between optical and computational contributions in imaging has shifted, pointing to a new logic for how successful imaging companies emerge [5][6].
- In the film era, the value of photography was determined almost entirely by optical and mechanical factors, with companies like Leica and Zeiss leading the market [8].
- Digital technology marked the first significant intervention of computation, allowing companies like Canon and Sony to disrupt traditional optical firms by integrating computational elements into their products [8][9].
Group 2: The Role of Computation
- Initially, computation served to optimize optical performance and simplify operation, improving the overall user experience and expanding market demand [8][9].
- The true disruption began when computation moved from merely optimizing optics to defining scenarios and reshaping reality, as seen with companies like GoPro and DJI [9][10].
- DJI's drones exemplify this shift, functioning as advanced computational platforms that deliver unprecedented aerial imaging capabilities [10].
Group 3: New Computational Paradigms
- A new computational architecture is emerging that combines on-device processing, lightweight local models, and powerful cloud-based models, enabling unprecedented capabilities in imaging devices [11].
- This evolution allows for real-time AI functionality and opens up new possibilities for understanding, enhancing, and generating images [11][13][15].
Group 4: Layers of Value Creation
- The first layer of value is "understanding reality," where AI enhances the camera's ability to interpret images and give them context, moving beyond mere recording [13].
- The second layer is "augmented reality," where AI contributes to creative expression and emotional resonance in photography [15].
- The third layer, "generating reality," is a paradigm shift in which images are created through computation rather than traditional optical means, as demonstrated by products like the Paragraphica camera [23][29].
Group 5: Market Opportunities and Future Directions
- The opportunity for new imaging companies lies in using high computational capability to unlock previously suppressed market demand, particularly in niche segments [29][30].
- Successful companies will focus on providing exceptional user experiences in specific scenarios, thereby expanding overall market demand [30][33].
- The future of the imaging industry is expected to be shaped by AI-native companies that prioritize innovative product thinking and a deep understanding of user needs [34].
A New Unicorn with $100M ARR and a $2.1B Valuation. Gamma's Founder: Being Only Slightly Better Than PowerPoint Won't Keep You Alive
Founder Park· 2025-11-15 03:04
Core Insights
- Gamma aims to reconstruct PowerPoint rather than create another version of it, taking a content-first approach instead of a design-first one [8][10][25].
- The company has grown significantly, raising $68 million in a round led by a16z at a valuation of $2.1 billion, despite initial skepticism from investors [3][5].
- Gamma has integrated AI into its product, improving user experience and engagement and driving a rapid increase in its user base [14][15][16].
Group 1: Company Overview
- Gamma started with a team of fewer than 10 people and has become a new unicorn in the presentation space, reaching profitability within two years [5][6].
- The founders identified a gap in the market where existing tools were not meeting user needs, which led them to build a more intuitive, user-friendly platform [8][10].
- The company has 70 million users and annual revenue exceeding $100 million, indicating strong market demand and product-market fit [16].
Group 2: Product Development and AI Integration
- The first version of Gamma's AI product focused on helping users generate draft content and find suitable images, which significantly improved user engagement [14][15].
- The company emphasizes a "human in the loop" approach, balancing AI capabilities with user control to support the creative process [16][25].
- AI is used to solve common design problems, letting users generate multiple design options far faster than they could manually [19][20].
Group 3: Growth Strategy
- From the outset, Gamma prioritized growth, embedding it into the company's DNA to ensure long-term success [28][29].
- The company has used influencer marketing effectively, and over 50% of new users come from word-of-mouth referrals [36][37].
- Gamma's brand has become synonymous with AI presentations, and the company aims to establish itself as an industry standard [29][33].
Group 4: Team and Culture
- The company keeps its team small and efficient, hiring carefully to ensure alignment with its core values and principles [38][39].
- The founders believe in a slow hiring process to build a strong foundational team that can adapt quickly to changes in strategy [39][40].
- A high proportion of designers on the team contributes to a superior user experience, which is crucial for product success [41][42].
One Year After Founding, Fei-Fei Li Launches Marble, Her First Commercially Available World Model, Generating 3D Worlds from Any Modality
Founder Park· 2025-11-13 14:06
Core Insights
- World Labs has officially launched Marble, its first commercial generative multimodal world model product, which supports a wider range of input modalities than the earlier preview version [2][4].
- Fei-Fei Li presents "spatial intelligence" as the next frontier in AI, emphasizing its importance in transforming how humans create and interact with both real and virtual worlds [25][29].
Product Features
- Marble lets users generate a complete, richly detailed 3D world from inputs such as images, text, or videos [5][10].
- The platform offers a free version and a Pro version tailored to professionals in fields like game development, film effects, architecture, and robotics [8].
- Key capabilities of Marble include:
  - Multimodal 3D world generation based on user inputs [10]
  - Support for multiple image inputs to create cohesive 3D spaces [13]
  - Advanced editing tools for fine-tuning generated worlds, including local adjustments and global style changes [18][20]
  - An experimental tool called Chisel that lets advanced users manipulate the spatial layout of generated worlds [21]
  - Options to expand and combine worlds into larger, more complex environments [22][26]
  - Export in various formats for use in professional software and platforms [23][24]
Importance of Spatial Intelligence
- Spatial intelligence is considered crucial to AI's evolution, as it will reshape fields such as storytelling, creativity, robotics, and scientific discovery [29][40].
- Current AI models are strong at language processing but lack a robust understanding of the physical world, which limits their practical applications [30][38].
- Spatial intelligence can lower the barrier to creating 3D environments, enabling non-professionals to build and experience virtual worlds [41].
- It is also essential for advancing embodied intelligence in robotics, allowing machines to interact safely and effectively with the physical world [41].
- Potential applications extend to scientific research, healthcare, and education, enabling simulations and explorations beyond human perceptual capabilities [42][43].
Better at Conversation and Focused on Emotional Value: OpenAI Releases GPT-5.1
Founder Park· 2025-11-13 02:35
Core Insights
- The article discusses the sudden update of ChatGPT to GPT-5.1, emphasizing its improved intelligence and conversational abilities [2][5].
- The new model is designed to answer simple queries faster and to handle complex problems more intelligently [5][24].
Model Features
- GPT-5.1 Instant focuses on everyday conversation and quick responses, while GPT-5.1 Thinking is tailored for complex reasoning and in-depth inquiries [5][23].
- The upgrade follows user instructions more closely, with a noticeable difference in response style and tone compared with the previous version [20][21].
User Experience
- Early tests indicate that GPT-5.1 Instant is friendlier and more engaging, often surprising users with light-hearted responses [11][13].
- Instruction following is notably better: GPT-5.1 Instant successfully adheres to constraints such as responding in a limited number of words [21].
Technical Advancements
- Adaptive reasoning allows GPT-5.1 Instant to decide when to think before responding, balancing speed and accuracy [21][22].
- GPT-5.1 Thinking is reported to be twice as fast as its predecessor on typical tasks, while giving clearer explanations with less jargon [25][28].
Personalization Features
- OpenAI has made it easier for users to customize ChatGPT's tone and style, offering eight predefined personality traits and the ability to adjust various response characteristics [32][33].
- The model can also proactively ask users about their preferred tone during a conversation [34].
User Feedback
- Initial user experiences highlight the model's personality, with humorous and relatable responses to absurd queries that make it feel more human [36][38][42].
A Rare In-Depth Interview with Duan Yongping: Buying Stocks Is Buying Companies, and Perhaps Fewer Than 1% Truly Understand That
Founder Park· 2025-11-12 11:51
Core Insights
- The article centers on Duan Yongping's investment philosophy: "buying stocks means buying companies," and what matters is truly understanding the business behind the stock [5][19].
Group 1: Investment Philosophy
- Duan Yongping believes that understanding a company is crucial for successful investing, and that most companies are difficult to understand [19][31].
- He notes that many investors can make money without fully understanding the companies they invest in, which shows how complex investing is [9][31].
- Duan redefines "safety margin" as the depth of one's understanding of a company, rather than just its price [12][34].
Group 2: Company Analysis
- Duan's investment in NetEase was based on his background in gaming and his belief in the company's potential, and it produced a significant return [32][34].
- He is confident in Apple because of its strong corporate culture and user-oriented approach, which he believes will sustain its success [39][46].
- Duan acknowledges that companies like Tesla and Nvidia are hard to understand, but recognizes their innovative potential and market position [61][56].
Group 3: Market Behavior and Company Culture
- Duan stresses the importance of company culture, stating that a good culture helps a company correct its mistakes and stay on the right path [15][43].
- He believes that staying rational in investment decisions is hard, especially in volatile markets [13][33].
- The article describes Duan's investment strategy as cautious, focused on companies with strong fundamentals and avoiding speculation [70][72].
Group 4: Specific Company Insights
- Duan holds a significant position in Moutai, which he views as a unique brand with a loyal consumer base, and he believes its corporate culture will help maintain its quality [72][74].
- He is skeptical about the long-term viability of electric vehicle companies, suggesting that the market may become oversaturated [63][64].
- Duan's investment in Pinduoduo reflects cautious optimism: he acknowledges the risks while recognizing the company's growth potential [70][71].