Workflow
歸藏的AI工具箱
icon
Search documents
四大顶尖模型对决!6000 字测评带你看Deepseek R1有多强
歸藏的AI工具箱· 2025-05-29 14:54
Core Viewpoint - Deepseek-R1 0528 demonstrates strong performance in front-end development tasks, comparable to OpenAI's Opus 4 and surpassing Sonnet 4 and Gemini 2.5 Pro, especially considering the price difference [3][4][51]. Group 1: Model Performance Comparison - In front-end capabilities, Deepseek-R1 0528 slightly lags behind Opus 4 but outperforms Sonnet 4 and Gemini 2.5 Pro [3]. - Deepseek-R1 0528 successfully completed complex tasks that Opus 4 struggled with, although the quality and completion rate were slightly lower [3][4]. - The price of Deepseek-R1 0528 is significantly lower than Opus 4, making its performance even more impressive [4][51]. Group 2: Testing Results - In the warehouse management system test, Deepseek-R1 0528 produced a professional interface with complete functionality, while other models failed to deliver usable outputs [11]. - For the dot animation editor, Deepseek-R1 0528 excelled, providing a fully functional interface, while other models either failed to animate or had significant issues [17]. - In the gradient color extraction tool test, Deepseek-R1 0528 showcased excellent aesthetic design but failed to implement the color extraction logic, while Opus 4 and Sonnet 4 managed to complete the functionality albeit with simpler designs [20][21]. Group 3: Overall Implications - The advancements in Deepseek-R1 0528 suggest a shift in the AI programming model landscape, where high-quality outputs can be achieved at a fraction of the cost compared to leading competitors [51]. - The performance of Deepseek-R1 0528 indicates a potential democratization of access to advanced AI tools, allowing more users to leverage powerful models without prohibitive costs [51].
搜攻略到凌晨3点?飞猪AI“问一问”用1张表谋杀废话
歸藏的AI工具箱· 2025-05-29 06:10
之前测试各类Agent产品的时候老是会用一些旅游方案生成的提示词去测试,但是基本上都是输出一些废话。 关于景点的详细信息,打卡点,最重要的机票、酒店路程消耗时间等详细信息基本没有。 听说飞猪上了一个旅行 Agent "问一问",于是找朋友要了一个邀请码试了一下,确实厉害。 现在 Agent 产品最大的壁垒确实还是独家的上下文, 这是唯一一次 AI 给我生成的真正能用的旅行规划 。 刚好六月想跟朋友去一趟雨崩村,顺便去看看梅里雪山,但是我又不太想走比较困难的徒步游玩路线。 就让飞猪帮我规划一下丽江-梅里雪山-雨崩村的详细行程。 你可以在首页左上角找到问一问的入口,目前还需要邀请码,如果是飞猪 F5 和 F6 的会员可以直接使用。 提问之后模型会进行深度思考理解用户需求和进行任务拆分。 如果你的提问不是很细的话,比如只是规划行程,他会先生成几个简略的方案给你选择,列出每一天要去的地 方以及当前行程预计的花费。 在规划行程的时候也是有思考的,会根据难度、花销程度给出不同的选择。 当然不像其他的 Agent 只有文字, 飞猪还给出了对应的地图,你可以非常清晰直观的看到每个方案地点的距 离以及线路,还有对应景点的图片, ...
文旅新玩法!藏师傅教你做食物微缩景观宣传海报&视频
歸藏的AI工具箱· 2025-05-28 08:06
Core Viewpoint - The article discusses the creative use of AI tools like GPT-4o and Veo3 to generate visually appealing food-themed images and miniature scenes, highlighting their potential for tourism promotion and artistic expression [1][4][9]. Group 1: Image Generation Ideas - The article presents a concept for a surreal keyboard where each key is represented by a miniature dessert, emphasizing vibrant colors and realistic textures [2][5]. - A new idea combines food and cityscapes, suggesting the creation of miniature scenes made from representative foods of different cities, which could serve as promotional material [4][6]. - The use of Veo3 for creating time-lapse animations of culinary scenes is explored, showcasing the gradual assembly of ingredients into a complete miniature landscape [6][7]. Group 2: Specific Scene Descriptions - A detailed description of a "Chengdu" themed scene is provided, featuring a hot pot and playful panda elements, with ingredients creatively arranged to form landscapes and rivers [5][8]. - The scene captures the essence of Chengdu's culinary culture, with a playful and vibrant atmosphere, making it suitable for tourism marketing [5][8]. Group 3: Tools and Techniques - The article mentions the use of Veo3 and Gemini Pro membership for enhanced video creation capabilities, encouraging users to experiment with these tools [9]. - It highlights the potential of using Flow's capabilities for creating seamless video transitions, although it notes the higher costs associated with this option [6][9].
终于不用羡慕老外了!美团竟然做出了类似V0&Bolt的AI编程神器
歸藏的AI工具箱· 2025-05-27 07:24
Core Viewpoint - The article highlights the launch of Meituan's NoCode tool, which fills a gap in the domestic market for zero-code application development, showcasing its capabilities and ease of use [1][33]. Product Capabilities - The NoCode tool can generate complex multi-page web applications without extensive coding, allowing users to create functional products with minimal input [1][3]. - It supports dynamic web page generation based on user prompts, automatic bug detection and resolution, and optimization of user requests [3][14]. - The tool includes features for database management, enabling users to store and manage large amounts of information [32]. Design and User Interface - The design employs a Bento Grid style with specific color schemes and emphasizes large fonts and visual elements to highlight key points [4][5]. - It supports responsive design, ensuring compatibility across desktop and mobile devices [13]. - The interface includes interactive elements such as drag-and-drop functionality for managing items and real-time updates [28]. Technical Requirements - The tool utilizes modern web technologies including HTML5, TailwindCSS, and JavaScript, along with professional icon libraries [7][13]. - It incorporates features for data persistence using localStorage and supports integration with external APIs, such as Google Maps [27][28]. Additional Features - The NoCode tool includes a Dev Mode for users who wish to edit code directly, providing a coding environment alongside AI assistance [31][32]. - It offers a Database feature that allows users to store data in the cloud, facilitating access across different devices [32]. Market Impact - The launch of Meituan's NoCode tool is seen as a significant development for domestic users who have been seeking effective zero-code solutions, potentially fostering a better coding environment in China [33][34].
V0做不到、Bolt搞不定,Youware用MCP一键解决网页生成最大难题
歸藏的AI工具箱· 2025-05-26 03:02
Core Viewpoint - Youware has significantly enhanced its capabilities by integrating MCP (Multi-Channel Processing) services, allowing users to easily create and deploy AI-generated web pages with minimal technical skills [2][3][12]. Group 1: Product Features - Youware now supports MCP, which addresses the challenge of sourcing materials for web page creation [3][4]. - The platform has introduced a points system where users can earn money based on their website's traffic [21]. - The homepage has been updated to include more detailed categorization of works, making it easier for users to find community websites they like [8][22]. Group 2: MCP Integration - Youware's deep integration with mainstream MCP services has lowered the entry barrier for users, resulting in high-quality outputs [8]. - The platform allows users to pull content from various MCPs like Figma, Unsplash, Hugeicons, and Google Maps with minimal setup [14][15]. - Users can generate complex web layouts and responsive designs by simply providing links to Figma design files [6][7][16]. Group 3: User Experience - The platform's ability to generate web pages from complex design layouts has been praised, with results often exceeding expectations [10][18]. - Youware includes a "Boost" feature that enhances the visual appeal of generated pages while maintaining the original layout and content [17][18]. - The overall user experience is streamlined, allowing for quick adjustments and high-quality outputs without extensive configuration [19]. Group 4: Community Engagement - Youware is actively promoting community engagement by showcasing outstanding works and encouraging users to share their creations [20][22]. - The platform is hosting a retro-style website generation challenge with a prize of up to $1,000, incentivizing participation and social media sharing [22].
AI编码新神登基,藏师傅一手Claude 4实测
歸藏的AI工具箱· 2025-05-22 18:00
Claude 4 就这么低调的发布了,之前他们 CEO 说27年所有的代码都会由AI生成,现在看来应该就是看到了 Claude 4的潜力。 根据 Anthropic 所说 Claude Opus 4 是全球最佳编码模型,在复杂、长期运行的任务和代理工作流中表现持 续优异。 基础介绍 还有一些其他的发布内容,包括: 最重要的定价: Claude Sonnet 4 会向免费用户开放,这太好了。 API上定价与之前的 Opus 和 Sonnet 模型保持一致:Opus 4 每百万 token 输入/输出价格为 15/75 美元, Sonnet 4 为 3/15 美元。 模型能力 Claude Opus 4 的编码能力在 SWE-bench(72.5%)和 Terminal-bench(43.2%)上大幅领先其他模型, 而且它在需要集中精力和数千步操作的长时间任务中表现出持续稳定的性能,能够连续工作数小时,这个对于 Agent产品非常重要。 扩展思维与工具使用(测试版):两款模型在扩展思维过程中均可使用工具。 新模型能力:两款模型均可并行使用工具,更精准地遵循指令,并且在开发者授予本地文件访问权限时, 展现出显著增强 ...
我用这个产品做了小米5.22发布会官网,同事:这不是官方做的?
歸藏的AI工具箱· 2025-05-22 09:24
前几天受邀参加了天工超级智能体(Skywork Super Agents)的提前测试。 试了一下我发现,相较于各种大包大揽的所谓通用智能体,天工非常的务实,专注于帮助打工人优化我们每天 接触最多也是最繁琐的三个交付物,也就是所谓的 Office 三件套,文档、表格、PPT。 天工超级智能体 不是简单的生成一个交付物就结束了,而是考虑到了整个内容的生命周期 ,从意图判断到内 容检索到高品质生成到编辑和修改都做了非常多的优化,最大限度的保证内容的可用性。 先介绍一下天工超级智能体的主要能力: 网页生成 我发现他们有网页生成模式,那是时候掏出藏师傅的老测试项目了。 今晚不是有小米发布会吗,我想了一个很好的测试方式, 直接让他给小米做一个发布会预热网页 。 这个除了考验对藏师傅网页生成提示词的还原以外,也非常考验对于最新信息的检索能力,因为很多都是预测 信息,而且都是最近几天发布的,我们很容易就能看到检索的质量。 我也根据小米的设计风格改了一下网页生成提示词,大家有类似场景可以直接用。 这里可以看案例回放: https://www.skywork.ai/share/project/192542753810075238 ...
CEO的智囊团,实习生的救命稻草:这个飞书功能如何让所有人都变高效
歸藏的AI工具箱· 2025-05-21 07:18
Core Viewpoint - Feishu's Knowledge Q&A feature significantly enhances workplace efficiency by providing tailored AI responses based on organizational data and internet knowledge, proving to be a valuable tool for employees at all levels [1][2][22]. Group 1: Product Overview - Feishu Knowledge Q&A is a proprietary AI tool designed for enterprises, allowing users to ask questions and receive answers based on accessible organizational data, documents, and internet knowledge [2]. - The tool aids in content creation and enhances business understanding, making it versatile for various tasks [3]. Group 2: Practical Applications - The feature allows users to quickly gather information about ongoing projects, reducing the time spent sifting through numerous documents [4]. - Users can perform targeted inquiries to understand specific aspects of their responsibilities, such as categorizing guest speakers and their topics for events [5]. - It can retrieve not only text but also relevant images, aiding in event preparation [7]. - The AI can provide comprehensive suggestions for event planning, covering aspects from venue selection to promotional strategies [9]. - It can generate progress report documents based on user queries, significantly reducing the time required for such tasks [12]. - The tool is particularly beneficial for middle and upper management, enabling them to access real-time data and updates without waiting for subordinate reports [17]. Group 3: Personal Development Support - For individual users, Feishu Knowledge Q&A serves as a powerful AI knowledge base, helping to organize and optimize personal content and writing tasks [18][19]. - The tool can efficiently retrieve and analyze existing documents, providing structured insights and suggestions for improvement [19]. - It allows users to search specific knowledge bases for relevant information, streamlining the process of finding and organizing content [21]. Group 4: Competitive Advantage - The effectiveness of Feishu Knowledge Q&A lies in its ability to leverage contextual information from organizational documents, which enhances the AI's understanding and response accuracy [22]. - The integration of rich organizational context is seen as a key differentiator compared to other AI products, making it a cost-effective solution for enterprise AI implementation [22].
Veo3和FLOW一手实测:谷歌这次成了,这次视频创作可能彻底变天
歸藏的AI工具箱· 2025-05-21 07:18
Core Viewpoint - Google's new video model Veo3 and AI video creation product FLOW represent a significant advancement in video generation technology, enhancing usability and application scenarios for video editing and digital content creation [1][29]. Group 1: Features of Veo3 and FLOW - Veo3 can generate videos with corresponding ambient sounds and synchronized speech, greatly improving the usability for video editing software and digital avatars [2][29]. - FLOW allows for the generation of both images and videos, supports video extension and trimming, and enables users to compile selected clips into a complete video [2][15]. Group 2: Testing and Applications - Testing of Veo3 demonstrated accurate lip-syncing and sound effects, even with complex animations, showcasing its potential for various applications [4][6]. - The model can generate diverse scenes, such as a character explaining gravity under an apple tree, indicating its capability for educational content [7]. - Veo3 can also create ASMR videos by generating realistic environmental sounds, expanding its application in content creation [8][9]. Group 3: FLOW Usage Tutorial - FLOW provides a user-friendly interface for creating projects, where users can input prompts to generate videos [15][16]. - The platform supports three main video generation methods: text-to-video, image-to-video, and material-to-video, although it currently does not allow for external image uploads [20]. - Users can edit and arrange scenes, with the ability to download videos in high definition, although sound may require specific steps to be included [21][26]. Group 4: Conclusion and Future Implications - The integration of sound generation, speech synthesis, and lip-syncing in Veo3 marks a significant upgrade in video modeling, similar to the advancements seen with the release of the 4o image model [29]. - The potential for new applications and products in various industries is vast, as demonstrated by the capabilities of Veo3 and FLOW [29].
这宣传图也太上流了!藏师傅教大家如何用4o搭配提示词生成
歸藏的AI工具箱· 2025-05-19 08:58
今天橘子的新产品可以一分钟将任何内容变成播客的 ListenHub发布了,照例想用提示词为他做一张长图。 刚好这几天 Airbnb 的新拟物风格图标特别火,我就想能不能把拟物图标融合到长图网页里面去。 搞了一下结果真没问题,效果意外的非常好,整个图片的表现力高了非常多。 所以这篇内容教大家 如何用 4o 生成拟物图标搭配藏师傅网页提示词制作上流宣传图。 生成图标 首先我们需要生成对应的图标,这里模仿的是 Airbnb 的风格。 我们需要根据文章内容生成跟产品宣传内容搭配的图标这时候可以将 整篇文章 搭配下面的提示词都扔给 GP T,让他帮你分析出每部分用什么图标表示。 下面这是一篇产品介绍文章,如果我想要为他生成一个宣传图,上面主要介绍功能,我需要在卡片上生 成一些图标,帮我分析一下我应该生成哪些图标: 然后把 GPT 给出的图标对应物品填写到下面的提示词里面就行,右边就是我为ListenHub生成的九个图标。 然后将 GPT 推出来的图标词语放到提示词的[ ]里面,都是搭配左边第一张图片垫图使用。 几天我推了一个提示词出来,然后海外的一个设计师( x.com/hemeon/status/1923060589 ...