Workflow
Nano banana
icon
Search documents
我用AI生成流浪汉骗我爸,结果他摇来了特警
3 6 Ke· 2025-11-27 06:18
在恶作剧之前考虑一下后果 流浪汉进家门 "爸,门口有个流浪汉,说认识你。" Joe给正在上班的父亲发去一张图片,是一个胡子拉碴的陌生男人站在门口,父亲称并不认识这个人,"他想做什么?" "他说你们俩曾经一起上学,我请他进来了。"之后,Joe又陆续给父亲发去了这个貌似流浪汉的陌生人在翻家里的冰箱、用父亲的牙刷刷牙、甚至在父亲 床上睡觉的图片…… 父亲的电话就像夺命连环call,不停打来,但Joe并不打算接。那头的父亲显然已经焦急万分,"快接我电话!我不认识他!" 在这段将短信往来做成录屏的视频中显示,短短3分钟内,Joe的父亲已经给他打了21个电话,甚至要报警。 之后Joe把这段记录发到TikTok,视频迅速爆火,目前已经接近92万播放量。 ●TikTok用户mmmjoemele的视频 从这个10月开始,一股"AI流浪汉进家门"的恶作剧风潮在TikTok等社交媒体上迅速蔓延。 这股趋势随后蔓延到其他社交平台,如Snapchat、Instagram等,甚至国内的抖音小红书等中文社区,也开始出现类似的搬运视频与模仿拍摄。AI恶作剧正 从海外走向本地。 "harmless prank" 最初网友把这当成一个harml ...
我们用21款AI修图工具修了100张图:谁才是真正的“修图神器”?|Jinqiu Scan
锦秋集· 2025-11-10 11:38
Core Viewpoint - The article focuses on evaluating 21 AI image editing tools across six real-life scenarios to determine their effectiveness in understanding and executing user requests for image modifications [4][11][141]. Group 1: Evaluation Methodology - The evaluation consists of six rounds, each using the same prompt for image editing, with all models set to their latest default configurations [11][12]. - Three general evaluation dimensions are used: visual consistency, local quality, and content consistency [12][13][14]. Group 2: Performance Results - Top performers include Tencent Yuanbao, Meitu Xiu Xiu, and Qwen Image Edit, scoring 15 points for effectively meeting user prompts without noticeable discrepancies [23]. - Nano Banana, Sora, Lovart, Manus, and Runway scored 14 points, with minor issues in image retrieval capabilities [28]. - Tools like Jiemeng 4.0, Wake Map, and Pixel Cake scored around 10 points, showing significant errors despite being dedicated image editing software [30]. Group 3: Specific Findings - In the first round, Tencent Yuanbao and Meitu Xiu Xiu excelled in removing unwanted elements while enhancing image clarity [23]. - The second round highlighted Qwen Image Edit and Genspark as top performers in foreground extraction, maintaining original details [41]. - The third round saw Jiemeng 4.0 and Tencent Yuanbao achieving high scores for effectively replacing elements while preserving the original image's integrity [65]. Group 4: Future Directions - The article indicates plans for future evaluations of AI tools in areas such as game development, knowledge bases, and companionship products [7].
Nano-Banana核心团队首次揭秘,全球最火的AI生图工具是怎么打造的
创业邦· 2025-09-03 10:10
Core Insights - The article discusses the advancements of the "Nano Banana" model, highlighting its significant improvements in image generation and editing capabilities, which include faster generation speeds and better understanding of complex instructions [5][6][9]. Group 1: Model Capabilities - Nano Banana has achieved a substantial quality leap in image generation and editing, with faster speeds and the ability to understand vague and conversational instructions while maintaining consistency in multi-step edits [5][6]. - The model's key enhancement lies in its "native multimodal" capabilities, particularly "interleaved generation," allowing it to process complex instructions step-by-step and maintain context [5][29]. - For high-quality text-to-image generation, the Imagen model remains the preferred choice, while Nano Banana is better suited for multi-round editing and creative exploration [5][37]. Group 2: Future Goals - The future objective of Nano Banana is not only to enhance visual quality but also to pursue "intelligence" and "fact accuracy," aiming to create a model that understands user intent deeply and generates creative outputs beyond user prompts [6][50][53]. - The team envisions a model that can accurately generate charts and other work-related content, emphasizing the importance of both aesthetic appeal and functional accuracy [53][57]. Group 3: User Interaction and Feedback - User feedback has been instrumental in shaping the model's development, with the team continuously collecting data on common failure modes to improve future iterations [42][44]. - The model's ability to maintain character consistency across multiple images has improved, allowing for more complex scene reconstructions and edits [45][48]. Group 4: Comparison with Other Models - While Imagen excels in generating high-quality images from text prompts, Nano Banana is positioned as a more versatile creative partner capable of handling complex workflows and understanding broader contextual cues [37][39]. - The integration of insights from different teams has led to significant improvements in the model's natural aesthetics and overall performance [46][48].
Nano Banana官方提示词来了,附完整代码示例
量子位· 2025-09-03 05:49
Core Viewpoint - The article discusses the rising popularity of the Nano-banana tool, highlighting its innovative features and the official guidelines released by Google to help users effectively utilize this technology [1][8]. Group 1: Features of Nano-banana - Nano-banana allows users to generate high-quality images from text descriptions, edit existing images with text prompts, and create new scenes using multiple images [15]. - The tool supports iterative refinement, enabling users to gradually adjust images until they achieve the desired outcome [15]. - It can accurately render text in images, making it suitable for logos, charts, and posters [15]. Group 2: Guidelines for Effective Use - Google emphasizes the importance of providing detailed scene descriptions rather than just listing keywords to generate better and more coherent images [9][10]. - Users are encouraged to think like photographers by considering camera angles, lighting, and fine details to achieve realistic images [19][20]. - The article provides specific prompt structures for various types of images, including photorealistic shots, stylized illustrations, product photography, and comic panels [20][24][35][43]. Group 3: Examples and Applications - The article showcases examples of images generated by Nano-banana, such as a cat dining in a luxurious restaurant under a starry sky, demonstrating the tool's capability to create detailed and imaginative scenes [14][17]. - It also includes code snippets for developers to integrate the image generation capabilities into their applications, highlighting the accessibility of the technology [21][29][35].
Nano-Banana核心团队首次揭秘,全球最火的 AI 生图工具是怎么打造的
3 6 Ke· 2025-09-02 01:29
Core Insights - The article discusses the advancements and features of the "Nano Banana" model developed by Google, highlighting its capabilities in image generation and editing, as well as its integration of various technologies from Google's teams [3][6][36]. Group 1: Model Features and Improvements - Nano Banana has achieved a significant leap in image generation and editing quality, with faster generation speeds and improved understanding of vague and conversational prompts [6][10]. - The model's "interleaved generation" capability allows it to process complex instructions step-by-step, maintaining consistency in characters and scenes across multiple edits [6][35]. - The integration of text rendering improvements enhances the model's ability to generate structured images, as it learns better from images with clear textual elements [6][13][18]. Group 2: Comparison with Other Models - For high-quality text-to-image generation, Google's Imagen model remains the preferred choice, while Nano Banana is better suited for multi-round editing and creative exploration [6][36][39]. - The article emphasizes that Nano Banana serves as a multi-modal creative partner, capable of understanding user intent and generating creative outputs beyond simple prompts [39][40]. Group 3: Future Developments - Future goals for Nano Banana include enhancing its intelligence and factual accuracy, aiming to create a model that can understand deeper user intentions and generate more creative outputs [7][51][54]. - The team is focused on improving the model's ability to generate accurate visual content for practical applications, such as creating charts and infographics [57].