AI生图
Search documents
中文版Nano Banana来了?Qwen-Image-2.0炸场:1K长文本硬吃,中文生图彻底不拧巴了
量子位· 2026-02-10 11:59
梦瑶 发自 凹非寺 量子位 | 公众号 QbitAI 文本一长就糊、指令一杂就撂挑子、遇到中文更是一整个变形freestyle…… 「AI生图」的这点苦,到底有谁懂啊!!! 停,不用拧巴了,因为现在的AI,已经能稳稳吃下 1K token 的超长文字指令了: 复杂指令 也不在怕的,最近OpenClaw贼火,我索性让AI直接帮roll出一个赛博信息图海报(你就说牛不牛吧): 中文渲染 表现也不孬,《兰亭集序》这种公认的高难度文本,这AI居然能做到文字1:1还原,排版、笔锋都在线: 你以为到这儿就结束了,NONONO!因为它还能—— 多图编辑 。 随手丢给了它一张照片,人家直接给我甩出一组影棚级的9宫格写真!!(诶,突然感觉怒省一笔钱… 刚才帮我干活的这位,正是阿里刚刚发布的新一代图像生成及编辑模型—— Qwen-Image-2.0 。 1K token长文本、复杂指令、中文渲染、图片编辑、2K分辨率一次 性梭哈,连国际评测里的表现都已经冲到了仅次于Nano Banana Pro的 位置。 在AI生图界,最让人崩溃的倒不是写Prompt词,而是写了太多,AI根本不吃消,好的提示词真无!处!施!展! 不知道千问团队 ...
腾讯宣布春节红包新玩法 元宝派将启动红包掉落活动
Huan Qiu Wang· 2026-02-07 08:09
【环球网科技综合报道】2月6日,腾讯宣布"元宝派"将于近期启动春节红包新玩法,用户在派内与元宝互动将有机会获得现金红包。 自2月1日元宝APP春节主会场启动以来,元宝AI生图功能使用率飙升30倍,新用户平均每天跟元宝的互动问答超过8轮,用户单日使用元宝时长增长超 80%。 不久前,元宝宣布正式接入混元图生图3.0模型,用户通过元宝APP"创作"入口,还可体验超400套新春创作模版。(勃潺) ...
电商人实测:真正能批量出产品效果图的AI软件,到底解决了我哪些工作难题?
Sou Hu Cai Jing· 2026-02-07 00:59
一、每天在电商后台,我最怕的不是数据,而是"图不够用" 如果你做过电商运营,应该能理解我这句话。 每天一打开后台,迎面而来的不是 GMV,而是一连串看起来很"细碎"、却极度消耗精力的事情: 以前我们靠的是三件套: 摄影棚 + 设计师 + 不断返工 但现实是: 图好看,但不"像商品图" 很多工具生成的图,更像是"概念海报": 对电商来说,这是致命的。 商品图不是艺术创作,而是转化工具。 久而久之,我开始意识到一个问题: 在电商这种高频试错、高频更新的环境里,"慢"和"贵",本身就是最大的风险。 也正是从这个阶段开始,我开始系统性地研究一件事—— 有没有真正"能批量出产品效果图的AI软件",不是玩票,而是能落地到电商工作流里的那种。 二、市面上的AI生图工具很多,但"电商可用"和"好看"是两回事 一开始,我也像大多数人一样,随便试了不少 AI 生图工具。 说实话,第一眼的感觉都挺惊艳: 但真正拿去用的时候,问题很快就暴露出来了: 无法批量,效率反而更低 有些工具单张图效果不错,但: 当你要 10 张、20 张同风格的产品效果图时,效率甚至不如人工。 场景不懂电商,沟通成本极高 很多 AI 工具本身并不是为电商设计 ...
火爆全网的AI片场探班玩法,手把手教会你。
数字生命卡兹克· 2025-12-25 01:20
Core Viewpoint - The article discusses the evolution of AI video technology, particularly focusing on the use of AI tools to create personalized video experiences with characters from popular media, highlighting the ease of use and creative potential of these tools [1][35]. Group 1: AI Tools and Techniques - The process of creating AI-generated images and videos involves three main steps: generating images using prompts, creating videos from key frames, and editing the final product with software [4]. - The author emphasizes the simplicity of the process, suggesting that users do not need to purchase prompts but can utilize AI tools like Gemini to generate effective prompts based on their needs [16][21]. - The article mentions the challenges faced when using different AI models, particularly the inconsistency in generating images of Asian faces with the Nano Banana Pro model, leading to a switch to another model, Seedream 4.5, which performed better for this demographic [11][13]. Group 2: Creative Applications - The author shares various creative applications of the AI tools, including generating images and videos featuring characters from popular franchises like "Stranger Things" and "Avatar," as well as nostalgic shows like "Wu Lin Wai Zhuan" [28][34]. - The article highlights the ability to create engaging content by combining AI-generated visuals with editing software, allowing for the addition of effects and sound to enhance the final product [30]. - The narrative reflects on the emotional connection to childhood memories and the excitement of interacting with beloved characters through AI technology, showcasing the potential for personal storytelling [35].
你还在晒AI图,有人已经在靠“提示词”收款了
3 6 Ke· 2025-11-27 09:40
Core Insights - The article discusses the rise of the AI image generation tool "Jimeng 4.0," which allows users to create realistic images, including virtual selfies with celebrities, leading to a new trend in social media sharing [8][12][17] Group 1: Technology and Features - Jimeng 4.0 has significantly improved over its predecessor, Jimeng 3.x, by enhancing the realism of generated images, particularly in capturing lifelike expressions and eye details [8][11] - The tool utilizes a multi-modal architecture that allows for quick and accurate image generation, maintaining consistency in character features across different angles [11][12] - The output quality is high, with 4K resolution that captures intricate details such as fabric texture and skin quality, making the images appear more authentic [11][12] Group 2: Market Trends and User Behavior - The popularity of Jimeng 4.0 has led to a surge in social media posts featuring AI-generated images, similar to the earlier trend seen with "Nano Banana" [7][12] - The accessibility of Jimeng 4.0 through platforms like Doubao, which has a monthly active user base of 157 million, has lowered the barrier for users to engage with AI-generated content [12] - The phenomenon of trading prompt templates for image generation has emerged as a small business, with users willing to pay for effective prompts that yield high-quality results [16][17] Group 3: Cultural Implications - The article suggests that we are entering an era where the line between reality and digital creation is increasingly blurred, allowing individuals to visualize alternate life scenarios through AI-generated images [17][18] - The ability to create images that reflect personal aspirations or past choices signifies a shift in how people perceive and interact with their identities in the digital space [17][18] - Jimeng 4.0 is positioned not just as a tool for image creation but as a "life generator," enabling users to explore various facets of their lives through visual representation [17][18]
开源模型叫板Nano Banana Pro!Stable Diffusion原班人马杀回来了
量子位· 2025-11-26 09:33
Core Insights - The article discusses the launch of Flux.2, a new AI image generation model from Black Forest Lab, which aims to compete with Google's Nano Banana Pro by offering similar image quality at a lower cost [1][42]. Group 1: Product Features - Flux.2 is designed to be a productivity tool, enhancing the capabilities of users in generating images [2]. - The model supports multiple reference images, allowing for complex image generation tasks, such as creating fashion editorial images with consistent characters [3]. - Flux.2 offers various versions, including Flux.2 [pro], [flex], [dev], and an upcoming [klein], each tailored for different user needs and performance requirements [16][17]. Group 2: Performance Comparison - Initial tests show that Flux.2's image generation speed is under 10 seconds for the [pro] version, with the ability to handle up to 10 reference images [17]. - While Flux.2 demonstrates significant improvements in instruction adherence and fine control, it still lags behind Nano Banana Pro in overall image quality [39][40]. - Users have reported that Flux.2 performs well in tasks like photo restoration and image editing, often producing results that are more natural compared to Nano Banana 2 [46][48]. Group 3: Market Positioning - Flux.2 is positioned as a cost-effective alternative to Google's models, providing high-quality outputs at a lower price point, which is appealing for users who typically face high costs with Nano Banana Pro [42]. - The model supports high-resolution image editing up to 4MP, catering to users looking for detailed outputs [44]. - The article highlights the historical context of Flux models, noting that Flux.1 was a benchmark in the AI image generation space before the introduction of Flux.2 [56][59].
太炸裂了!全网实测Nano Banana Pro,网友:这模型里到底装了什么鬼东西!
量子位· 2025-11-21 06:29
Core Insights - Google has launched the Nano Banana Pro, a powerful image generation model that has garnered significant attention and excitement across the internet [11][10]. - The model integrates multi-modal understanding capabilities from Gemini 3 Pro and Google's extensive knowledge base, allowing it to comprehend real-world semantics and physical logic [12]. Features and Capabilities - Users can access the Nano Banana Pro for free through the Gemini application, although there are usage limits for free accounts, while subscribers to Google AI Plus, Pro, and Ultra enjoy higher quotas [13]. - The model supports high-resolution outputs, including 2K and 4K, and can generate complex professional charts, enhancing its utility for various applications [15][46]. - It has improved text rendering capabilities, allowing for multi-language support and direct translation of text within images [15]. User Experience and Performance - Initial tests demonstrated the model's ability to create detailed and aesthetically pleasing visual outputs, such as exploded views of bicycle components and scenes with dolls [14][20]. - The model's performance is influenced by the specificity of user prompts, with clearer instructions leading to better results [23]. - Users have reported a surge in creative applications of the Nano Banana Pro, showcasing its versatility in generating illustrations, infographics, and even comic strips [28][34][42]. Industry Impact - The launch of Nano Banana Pro is seen as a significant advancement in AI-generated imagery, pushing the boundaries of what is possible in this field [26]. - Google CEO Sundar Pichai has endorsed the model, highlighting its advanced image generation and editing capabilities, which are designed to meet the needs of professionals in various industries [46].
AI技术滥用调查:“擦边”内容成流量密码,平台能拦却不拦?
Hu Xiu· 2025-10-12 10:08
Group 1 - The article highlights the misuse of AI technology, particularly in creating inappropriate content, leading to significant concerns for both ordinary individuals and public figures [1][6][10] - A surge in AI-generated content, such as "AI dressing" and "AI borderline" images, has become prevalent on social media platforms, attracting large audiences and followers [2][10][11] - The Central Cyberspace Affairs Commission has initiated actions to address the misuse of AI technology, focusing on seven key issues, including the production of pornographic content and impersonation [4][5] Group 2 - Ordinary individuals and public figures alike are victims of AI misuse, with cases of identity theft and defamation emerging from AI-generated content [6][8][9] - The prevalence of AI-generated "borderline" content on social media platforms raises concerns about copyright infringement and the potential for exploitation [10][12][22] - Various tutorials and guides are available on social media, instructing users on how to create and monetize AI-generated borderline content, indicating a growing trend in this area [13][16][22] Group 3 - Testing of 12 popular AI applications revealed that 5 could easily perform "one-click dressing" on celebrity images, raising concerns about copyright infringement [31][32][39] - Nine of the tested AI applications were capable of generating borderline images, with the ability to bypass content restrictions through subtle wording changes [40][41][42] - The article discusses the challenges faced by platforms in regulating AI-generated content, highlighting the need for improved detection and compliance measures [54][56][60] Group 4 - The article emphasizes the need for clearer legal standards and increased penalties for violations related to AI-generated content to deter misuse [57][59][60] - Recommendations for individuals facing AI-related infringements include documenting evidence and reporting to relevant authorities, underscoring the importance of legal recourse [61] - The article concludes that addressing the misuse of AI technology requires a multifaceted approach, including technological improvements and regulatory clarity [62]
登顶苹果应用榜!谷歌火遍全网的“纳米香蕉”,凭啥击败ChatGPT?
证券时报· 2025-09-16 07:51
Core Viewpoint - Google's market capitalization has reached $3 trillion, and its AI application Gemini has surpassed ChatGPT to become the top app on the Apple App Store [1][2]. Group 1: Gemini's Performance - Gemini has achieved over 2 million downloads in the US App Store, surpassing ChatGPT, and has also topped the charts in Canada, India, and Morocco [2]. - The success of Gemini is attributed to the launch of the image editing product Nano Banana, which has significantly improved image quality and editing control [4]. Group 2: Nano Banana Features - Nano Banana allows users to edit images using simple natural language commands, eliminating the need for traditional editing tools [4]. - The model maintains character consistency across different scenes and actions, which is crucial for brand character creation and script generation [4]. - It supports the fusion of multiple images and incorporates world knowledge to understand complex scenes for editing tasks [5]. - Nano Banana reduces the barriers to 3D modeling by generating 2D designs that include essential structural and material information [5]. Group 3: Market Impact and Competitors - The popularity of Nano Banana has sparked competition in the image generation space, with other companies like ByteDance and Shengshu Technology launching similar models [10]. - Analysts believe that the native multimodal model architecture is gaining industry recognition, with OpenAI and Google's models showing advantages in performance and deployment [10]. - The demand for computational power is expected to increase due to the higher requirements of native multimodal models compared to non-native ones [11].
“AI生图”做题家大赛,谁赢了?
Zhong Guo Jing Ying Bao· 2025-09-13 01:46
Core Viewpoint - The emergence of AI-generated figurine images has been significantly influenced by Google's recent release of the Gemini 2.5 Flash Image model, dubbed "Nano Banana," which has been praised for its user-friendly operation and high-quality output [2][5]. Group 1: AI Model Comparisons - Following the launch of "Nano Banana," competitors such as ByteDance's Seedream 4.0 and Shenshu Technology's Vidu Q1 quickly entered the market, indicating a rapid escalation in the AI image generation sector [5][8]. - Seedream 4.0 has reportedly topped the rankings in text-to-image and image editing categories, surpassing Google's Nano Banana in both fields [8]. - In a comparative test, Nano Banana produced a more realistic figurine image of a long-haired kitten, demonstrating superior understanding of figurine aesthetics compared to Seedream 4.0 and Vidu Q1, which struggled with material representation [11][14]. Group 2: Performance Insights - Seedream 4.0 excelled in generating a stunning final image from a complex prompt involving a figurine in a realistic setting, while Nano Banana required additional prompts to improve its output [14]. - In a test involving family dynamics, Seedream 4.0 interpreted the prompt favorably, while Nano Banana added unexpected elements, showcasing differences in understanding user intent [18]. - All three AI models displayed unique strengths and weaknesses, with Nano Banana achieving extreme realism, Seedream 4.0 demonstrating good comprehension, and Vidu Q1 providing balanced performance across tasks [20]. Group 3: Industry Implications - The advancements in these AI models represent a significant leap in capabilities, including improved understanding, faster output times, and higher image quality, moving closer to the ideal of a productivity tool [23].