Jimeng (即梦)

A new way to run AI projects: a step-by-step process breakdown, and another project suited to beginners!
Sou Hu Cai Jing· 2025-10-07 06:08
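In the half year or so since AI tools went open source, we have continuously shared many tips on using them. Just last week, the community also gave a systematic walkthrough of how to use Doubao, Jimeng, Keling, Recraft, and other tools, which have solved many real problems for people working in the internet industry. Everyone knows how powerful AI tools are, but a tool you cannot use is as good as a tool you do not have.

Shared folder: "How to use the four major AI tools" (shared by 国学报, 2025-09-17 14:08; expiry: permanently valid)

| File name |
| --- |
| 1.1 How to prompt Doubao to generate the article we want_batch.mp4 |
| 1.2 How to have Doubao rewrite someone else's viral article into your own_batch.mp4 |
| 1.3 How to have AI generate realistic images to your requirements_batch.mp4 |
| 1.4 Doubao's reverse-prompt feature_batch.mp4 |
| 2.1 How Jimeng generates high-definition images from prompts_batch.mp4 |
| 2.2 Using Jimeng to bring people in photos to life_batch.mp4 |
| 3.1 Using prompts to have Keling generate ... |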
Exclusive | Sora 2 is first to launch the TikTok of the AI era! Latest hands-on review: Sora 2 is too strong, Jimeng is too weak
Z Potentials· 2025-10-01 02:13
Core Viewpoint
- The release of Sora 2 by OpenAI signifies a transformative shift in content production methods, contrasting with platforms like TikTok by automating the creative process through generative AI, thus lowering content creation barriers and challenging traditional user-generated content logic [2].
Group 1: Comparison with TikTok
- Sora 2 differs from TikTok by emphasizing algorithm-driven content generation rather than user-generated short videos, representing a new paradigm in content creation [2].
- TikTok relies on a "human + algorithm" collaboration, while Sora 2 automates the creative process, allowing users to generate high-quality videos simply by inputting text prompts [2].
Group 2: Features of Sora 2
- Sora 2 enhances the storytelling aspect of video generation, integrating sound effects, voiceovers, and dialogues, moving beyond merely generating a sequence of image frames [2].
- User testing indicates that Sora 2 performs well in understanding text prompts and generating corresponding characters, significantly improving upon previous video generation tools [13].
Group 3: User Testing Results
- In user tests, prompts like "Ronald McDonald and Colonel Sanders are dancing Latin dance together by the seaside" yielded satisfactory results, although some elements were not perfectly represented [3][6].
- Another prompt involving "Pikachu battles Ultraman" showed that while the overall performance was good, there were minor issues with consistency, such as Pikachu turning into a Pokémon egg [7][11].
- The integration of story elements, sound, and voice in the generated videos was noted to be much better in Sora 2 compared to previous tools, indicating a significant advancement in video generation capabilities [13].
Lessons from the explosive popularity of Google's "Banana": crisis or turning point for China's vertical AI?
36Ke· 2025-09-26 10:44
Core Insights
- The rapid rise of Nano Banana, a product from Google, has led to the generation of over 200 million images globally within two weeks, with significant user engagement in the Asia-Pacific region [1]
- Nano Banana has contributed to the growth of the Gemini App, adding over 10 million new users and surpassing ChatGPT in the Apple App Store rankings [1]
- OpenAI has responded to the competition posed by Nano Banana by acquiring Statsig for approximately $1.1 billion in an all-stock deal, indicating a strategic move to enhance its product offerings [3]
Industry Impact
- The emergence of Nano Banana has prompted ByteDance to launch Seedream 4.0 to strengthen its user base, while Meitu faces challenges as general models threaten its market position, leading to significant stock price volatility [5]
- Analysts suggest that while Meitu's stock has been supported by foreign investment banks, the potential of general models like Nano Banana looms as a significant threat [5]
- The debate continues on whether general models will replace niche AI applications, with some experts arguing that niche applications have a better understanding of user needs and specific market scenarios [5][19]
Technological Advancements
- Nano Banana has transformed image creation by allowing users to interact in a more conversational manner, eliminating the need for structured prompts [9][11]
- The cost of using Nano Banana is approximately $0.039 per image, with a pricing model of $30 per million tokens, making it a cost-effective solution for image generation [11]
- The technology behind Nano Banana includes advanced capabilities such as text rendering and world knowledge integration, which enhances its performance in generating images with deep semantic accuracy [12][9]
Competitive Landscape
- Meitu's strategy involves integrating new technologies like Nano Banana into its products while maintaining a focus on its core competencies in the beauty and aesthetics sector [14][19]
- Meitu's partnership with Alibaba, involving a $250 million investment, aims to enhance e-commerce experiences through AI-driven solutions like "AI fitting" and "AI product image generation" [17]
- The competition between large model companies and niche AI firms is intensifying, with the need for niche players to adapt and leverage large models to remain relevant in the market [22][25]
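The two Nano Banana pricing figures quoted above imply a per-image token count that the summary does not state. The following is a back-of-the-envelope check, assuming that the $30 rate applies to the tokens billed for image output and that every image is billed the same number of tokens; both are assumptions, not facts taken from the article.

```python
from decimal import Decimal

# Figures quoted in the summary.
PRICE_PER_MILLION_TOKENS = Decimal("30")   # USD per 1,000,000 tokens
PRICE_PER_IMAGE = Decimal("0.039")         # USD per generated image

# Price of a single token under the quoted rate.
price_per_token = PRICE_PER_MILLION_TOKENS / Decimal(1_000_000)

# Assumption: one image is billed as a fixed number of tokens,
# so dividing the two quoted prices recovers that number.
implied_tokens_per_image = PRICE_PER_IMAGE / price_per_token


def image_cost(n_images: int) -> Decimal:
    """Cost in USD of generating n_images at the quoted per-image price."""
    return PRICE_PER_IMAGE * n_images


if __name__ == "__main__":
    print(f"Implied tokens per image: {implied_tokens_per_image}")
    print(f"Cost of 1,000 images: ${image_cost(1000)}")
```

Under these assumptions the two figures are consistent with roughly 1,300 billed tokens per image, and 1,000 images cost about $39.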
How should we correctly understand Token economics?
36Ke· 2025-09-23 11:04
Core Insights
- The article emphasizes the significance of Tokens in measuring the performance and commercial viability of AI models, shifting the focus from what AI can do to quantifying its efficiency, cost, and value [1][14][16]
Group 1: Token Consumption and Revenue
- Token consumption is closely linked to computational power, which in turn correlates with revenue for model providers [2]
- OpenAI's token usage on Microsoft Azure is projected to increase from 0.55 trillion to 4.40 trillion daily tokens between June 2024 and June 2025, with annual revenue expected to rise from $5.5 billion to over $10 billion [3]
Group 2: Consumer and Business Applications
- Major contributors to consumer token consumption include AI features in high-traffic applications like Google Search and Douyin, with Google’s AI Overview feature projected to consume between 1.6 trillion and 9.6 trillion tokens daily [4][5]
- ChatGPT remains a significant driver of token consumption, with a combined monthly active user base of 1.015 billion across app and web platforms as of July 2025 [7]
Group 3: Business Applications and Market Penetration
- Business applications are seeing high penetration rates, with OpenAI's B2B revenue expected to account for 54% of its annual recurring revenue by 2025 [9]
- Google has reported over 85,000 enterprise customers for its Gemini model, leading to a 35-fold increase in token consumption [9]
Group 4: Technological Advancements
- The increase in token consumption is attributed to advancements in reasoning capabilities, multi-modality, agent-based systems, and longer context lengths, which enhance the practical application of AI [10][12]
- New models like GPT-5 and Grok4 are designed to improve AI's usability in complex scenarios, thereby increasing token consumption [11]
Group 5: Pricing Dynamics
- Despite the increase in token consumption, the pricing for tokens is decreasing due to competitive pricing strategies and optimization of computational costs by model providers [13]
- The introduction of tiered pricing models allows smaller clients to access AI capabilities, further driving token consumption [13]
Group 6: Economic Implications
- Understanding token economics provides insights into cost-effectiveness, technological efficiency, and the evolution of application scenarios, marking a shift towards a more mature and industrialized AI sector [14][16]
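The OpenAI figures in Group 1 can be compared against the pricing claim in Group 5 with a little arithmetic. This is a rough sketch only: it treats "over $10 billion" as a floor of exactly $10 billion, and it divides company-wide revenue by Azure token volume, which is a crude proxy chosen here for illustration and not a method taken from the article.

```python
from fractions import Fraction

# Figures quoted in the summary (June 2024 -> June 2025).
DAILY_TOKENS_START_TRILLION = Fraction("0.55")
DAILY_TOKENS_END_TRILLION = Fraction("4.40")
REVENUE_START_BILLION = Fraction("5.5")
REVENUE_END_FLOOR_BILLION = Fraction("10")  # "over $10 billion", taken as a floor

# How many times daily token volume grew.
token_growth = DAILY_TOKENS_END_TRILLION / DAILY_TOKENS_START_TRILLION

# Minimum revenue growth implied by the floor value.
revenue_growth_floor = REVENUE_END_FLOOR_BILLION / REVENUE_START_BILLION

# Crude proxy: revenue per token at the end relative to the start,
# evaluated at the revenue floor.
revenue_per_token_ratio = revenue_growth_floor / token_growth

if __name__ == "__main__":
    print(f"Token volume growth: {float(token_growth):.1f}x")
    print(f"Revenue growth (at the floor): {float(revenue_growth_floor):.2f}x")
    print(f"Revenue per token vs. start: {float(revenue_per_token_ratio):.3f}")
```

Token volume grows 8x while revenue grows about 1.8x at the floor, so revenue per token under this proxy falls to roughly 0.23 of its starting level. That is in line with the summary's point that token prices are decreasing while consumption rises.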
Compared with nano-banana, is domestic AI better value for money?
Hu Xiu· 2025-09-12 06:03
Group 1
- The popularity of AI-generated figurines is surging across various platforms, with users eager to create "realistic figurines" from selfies or anime screenshots [1]
- Some individuals are willing to pay 6 yuan on platforms like Xianyu for someone to create these AI-generated figurines for them [1]
- Domestic AI tools such as Keling and Jimeng offer simple and free one-click generation options, contrasting with Google's more complex nano-banana tool [1]
Keling vs. Jimeng: a first look at "multimodal"
Tai Mei Ti APP· 2025-09-11 05:33
Core Insights
- The article discusses the current state of AI-generated video platforms in China, specifically focusing on two leading platforms: Keling and Jimeng [1]
- It explores the process of creating a film using AI, highlighting the roles of AI in scriptwriting, storyboarding, and directing [5][10][18]
- The article emphasizes the strengths and weaknesses of the AI platforms in generating videos, particularly in terms of creativity and fidelity [35][42]
Group 1: AI Video Generation Process
- The first step involves using AI as a screenwriter to create scripts, demonstrating that AI can effectively handle text-based tasks [7][8]
- The second step is utilizing AI as an artist to create storyboards, where the quality of images generated can vary, with some instances of misunderstanding instructions [12][14]
- The third step involves AI directing the video, where initial results may be impressive, but inconsistencies and logical errors become apparent in later outputs [18][20][24]
Group 2: Performance of AI Platforms
- Keling shows better performance in understanding abstract concepts and artistic interpretation, often producing videos that reflect the intended themes [36][38]
- Jimeng excels in image fidelity and stability, ensuring that the generated videos maintain a consistent visual quality [43][44]
- Both platforms face challenges in simulating physical realism and maintaining narrative coherence, leading to issues such as "memory loss" within short video segments [31][50]
Group 3: Technical and Cost Considerations
- The article notes that the current technology in AI video generation struggles to balance fidelity and creativity, with limitations on video length impacting content expression [50][52]
- The cost of using these platforms can be significant, with basic configurations priced at 1 yuan per video for Jimeng and 2 yuan for Keling, indicating that achieving high-quality outputs may require additional investment [59][60]
- The need for patience is emphasized, as generating visually appealing films with AI may take time and repeated adjustments [62]
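The per-video prices in Group 3 only become a budget once repeated attempts are counted, which is the point behind the note on patience. The sketch below makes that explicit. The shot count and the number of attempts per shot are hypothetical inputs chosen for illustration; only the two unit prices come from the summary.

```python
# Basic-configuration prices quoted in the summary, in yuan per generated video.
PRICE_PER_VIDEO_YUAN = {"Jimeng": 1, "Keling": 2}


def estimate_cost(platform: str, shots: int, attempts_per_shot: int) -> int:
    """Estimated spend in yuan for a film made of `shots` clips,
    where each clip is regenerated `attempts_per_shot` times on average.

    `shots` and `attempts_per_shot` are hypothetical planning inputs,
    not figures reported in the article.
    """
    if shots < 0 or attempts_per_shot < 0:
        raise ValueError("shots and attempts_per_shot must be non-negative")
    return PRICE_PER_VIDEO_YUAN[platform] * shots * attempts_per_shot


if __name__ == "__main__":
    # Hypothetical project: 20 shots, 3 attempts each.
    for name in PRICE_PER_VIDEO_YUAN:
        print(name, estimate_cost(name, shots=20, attempts_per_shot=3), "yuan")
```

With these illustrative inputs, the same 20-shot film costs 60 yuan on Jimeng and 120 yuan on Keling, so the retry rate matters as much as the unit price.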
Another AI gadget for entertaining kids: a scrappy doodle turns into a Disney-style animation in seconds
机器之心· 2025-09-04 09:33
Core Viewpoint
- The article discusses the innovative use of AI tools to transform children's drawings into animated videos, highlighting the ease of use and creative potential of these technologies [2][4][18].
Group 1: AI Tools for Animation
- The AI tool Jimeng allows users to upload childhood drawings and generate animations with cinematic effects, capturing the whimsical nature of children's imagination [2][4][7].
- Veo 3 from Google offers a comprehensive solution for generating synchronized audio and video content, enhancing the overall production quality [10][13][17].
- Keling provides similar capabilities, automatically generating audio effects that sync with the animated visuals and streamlining the video creation process [16][17].
Group 2: User Experience and Functionality
- Users can input specific prompts to create immersive scenes, such as a child walking with a lotus leaf while a snail follows, showcasing the tool's ability to accurately animate character movements [14].
- The tools allow for the addition of AI-generated music and sound effects, enhancing the storytelling aspect of the animations [8][15].
- The article emphasizes the simplicity of the process, where users can easily upload images and receive animated outputs without extensive technical knowledge [21][24].
Group 3: Additional Features and Recommendations
- The article mentions "Animated Drawings" by Meta, which also converts drawings into animations, providing another option for users interested in this technology [18].
- For optimal results, the article provides guidelines on how to prepare images for animation, ensuring clarity and proper character separation [22][24].
- The tools are designed to be user-friendly, encouraging parents and children to engage creatively with their drawings [31].
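ByteDance's quarterly revenue reaches $48 billion, surpassing Meta for two consecutive quarters and taking the top spot in global social media revenue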
Sou Hu Cai Jing· 2025-08-29 13:43
Core Insights
- ByteDance's strong growth is driven by its solid foundation in the domestic market and expansion in overseas markets, with Douyin becoming a super app ecosystem [2]
- Meta's Q2 financial report exceeded Wall Street expectations, with a net profit of $18.34 billion, a 36% year-on-year increase, and advertising revenue reaching $46.56 billion [3]
- Despite ByteDance surpassing Meta in revenue scale, there remains a significant valuation gap, with ByteDance valued at over $330 billion, less than one-fifth of Meta's approximately $1.9 trillion market value [3]
Company Performance
- ByteDance's revenue is primarily generated from the domestic market, with TikTok's global commercialization efforts ongoing [2]
- Meta's CEO attributes the company's strong profit performance to the effectiveness of AI technology in enhancing advertising system efficiency and ROI [3]
- ByteDance announced a new round of employee stock buybacks, increasing the buyback price from $189.90 to $200.41 per share, reflecting its financial independence and healthy cash flow [2]
Market Dynamics
- The global social media advertising market is expected to reach $276.7 billion by 2025, maintaining a compound annual growth rate of about 10% [4]
- User behavior is shifting towards short videos as the mainstream form of digital content consumption, with TikTok benefiting from high user engagement, averaging 35 hours of viewing time per month [4]
Competitive Landscape
- Meta's strategy focuses on long-term investments in AI infrastructure, with capital expenditures projected to reach $66 to $72 billion by 2025 and over $100 billion by 2026 [3][5]
- ByteDance's AI strategy is characterized by a dual approach, investing in foundational model development while rapidly productizing AI technologies [7]
- The competition between the two companies is intensifying, with Meta holding advantages in user base, profitability, and market recognition, while ByteDance must navigate political and regulatory risks in the U.S. [8]
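Two of the figures above can be sanity-checked directly: the "less than one-fifth" valuation claim and the size of the buyback price increase. The sketch below uses only numbers quoted in the summary, treating "over $330 billion" and "approximately $1.9 trillion" as point values, which is a simplification.

```python
from decimal import Decimal, ROUND_HALF_UP

# Valuation figures quoted in the summary, in billions of USD (point values assumed).
BYTEDANCE_VALUATION_BILLION = Decimal("330")
META_MARKET_VALUE_BILLION = Decimal("1900")

# Buyback prices quoted in the summary, in USD per share.
OLD_BUYBACK_PRICE = Decimal("189.90")
NEW_BUYBACK_PRICE = Decimal("200.41")

# ByteDance's valuation as a fraction of Meta's market value.
valuation_ratio = BYTEDANCE_VALUATION_BILLION / META_MARKET_VALUE_BILLION

# Percentage increase in the buyback price, rounded to two decimals.
buyback_increase_pct = (
    (NEW_BUYBACK_PRICE - OLD_BUYBACK_PRICE) / OLD_BUYBACK_PRICE * 100
).quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)

if __name__ == "__main__":
    print(f"Valuation ratio: {valuation_ratio:.3f}")
    print(f"Below one-fifth: {valuation_ratio < Decimal('0.2')}")
    print(f"Buyback price increase: {buyback_increase_pct}%")
```

At these point values ByteDance's valuation is about 17% of Meta's, which supports the "less than one-fifth" statement, and the buyback price rises by about 5.5%.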
AI image generation: which one is best?
36Ke· 2025-08-29 06:26
Group 1
- The article discusses the rapid growth of AI-generated images and their increasing integration into various platforms, highlighting their efficiency in work and study despite ongoing artistic controversies [1]
- The evaluation covers six AI models (Tencent's Hunyuan, Zhipu CogView-4, Tongyi Qianwen, Jimeng, Keling, and Gemini 2.5 Flash Image) to assess their performance in generating images from text prompts [2][3]
- Gemini 2.5 Flash Image, previously known as nano-banana, has gained significant attention for its superior performance in generating images [4][5]
Group 2
- The evaluation criteria include basic aesthetics and realism, imagination and creativity, instruction understanding and execution, style imitation and mastery, and cultural understanding and concept expression [9][26][40][48]
- In the first dimension, various models showed differing levels of realism, with some generating images that were too smooth or lacked natural proportions, while others performed exceptionally well [16][18]
- The second dimension revealed challenges for AI in understanding abstract concepts, with models struggling to accurately depict a lion made of star clouds, indicating limitations in their imaginative capabilities [25]
Group 3
- The third dimension highlighted that only a few models correctly executed simple instructions, suggesting that AI does not process numerical instructions in the same way humans do, but rather interprets them based on learned patterns [30][39]
- In the fourth dimension, Gemini excelled in mimicking traditional Chinese ink painting styles, while other models struggled to meet the artistic requirements, indicating a lack of mastery in specific artistic styles [44]
- The fifth dimension showed that Gemini and Keling demonstrated a strong understanding of cultural elements, effectively incorporating traditional features into their generated images, while others fell short [57]
Group 4
- The overall scores from the evaluation ranked Gemini highest with 44 points, followed by Keling and Jimeng, indicating that these models produced the most visually appealing results [58][59]
- The article emphasizes that while AI can produce impressive images, it does not create art in the same way humans do, as it relies on probabilistic models rather than creative inspiration [61][62]
- The complexity of AI image generation processes is acknowledged, with the article noting that the exact sources of errors in image generation remain unclear [65][66]
Tacky yet addictive AI short dramas: have they taken over Douyin?
菜鸟教程· 2025-08-28 03:29
Core Viewpoint
- The article discusses the rising popularity of short dramas, particularly those produced using AI technology, highlighting their rapid production and high viewer engagement compared to traditional dramas [4][7][16].
Group 1: Popularity of Short Dramas
- The viewership for short dramas has significantly increased, with some achieving over 50 million views, while traditional dramas struggle to reach 40 million [6][7].
- A specific short drama titled "Nine-Tailed Fox Male Demon Falls in Love with Me" has garnered over 180 million views despite only having 28 episodes [9][11].
Group 2: Advantages of AI in Short Drama Production
- AI technology allows for the rapid creation of short dramas, with a single episode taking only about 2 hours to produce, compared to traditional dramas that often take months [15].
- The use of AI reduces costs associated with actors, sets, and production logistics, making it a more efficient option for content creation [15][16].
Group 3: Characteristics of AI-Produced Short Dramas
- These short dramas typically have a fast-paced format, with episodes lasting around one minute, catering to modern viewers' fragmented attention spans [22][23].
- The quick pacing and engaging content resonate well with audiences, leading to a phenomenon where viewers find themselves unable to stop watching, even if the content is perceived as low quality [23][24].
Group 4: Technical Aspects of AI Short Drama Production
- The production process involves several steps, including script generation using large language models, character and scene creation through text-to-image tools, and video generation from images [27][30][46].
- AI tools for voice dubbing and video editing are also utilized to enhance the final product, making the entire process accessible and cost-effective for creators [49][51].
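The production steps listed in Group 4 can be read as a linear pipeline in which each stage consumes the previous stage's output. The following is a minimal sketch of that shape only. Every function here is a hypothetical placeholder that returns a label in place of calling a real model or tool, and placing dubbing before editing is an assumption, since the summary does not give the order of those two steps.

```python
from dataclasses import dataclass, field


@dataclass
class Episode:
    """One short-drama episode moving through the pipeline."""
    premise: str
    artifacts: dict = field(default_factory=dict)


# Hypothetical stage stubs: each records a placeholder artifact.
def write_script(ep: Episode) -> Episode:
    ep.artifacts["script"] = f"script for: {ep.premise}"  # stands in for an LLM call
    return ep


def create_images(ep: Episode) -> Episode:
    ep.artifacts["images"] = "character and scene images"  # stands in for text-to-image
    return ep


def animate_images(ep: Episode) -> Episode:
    ep.artifacts["clips"] = "video clips from images"  # stands in for image-to-video
    return ep


def dub_voices(ep: Episode) -> Episode:
    ep.artifacts["audio"] = "dubbed dialogue"  # stands in for AI voice dubbing
    return ep


def edit_video(ep: Episode) -> Episode:
    ep.artifacts["final_cut"] = "edited episode"  # stands in for video editing
    return ep


PIPELINE = [write_script, create_images, animate_images, dub_voices, edit_video]

HOURS_PER_EPISODE = 2  # "about 2 hours" per episode, as quoted in the summary


def produce(premise: str) -> Episode:
    """Run one episode through every stage in order."""
    episode = Episode(premise)
    for stage in PIPELINE:
        episode = stage(episode)
    return episode


def total_hours(episodes: int) -> int:
    """Rough production time for a series at the quoted per-episode rate."""
    return episodes * HOURS_PER_EPISODE


if __name__ == "__main__":
    print(list(produce("a fox spirit falls in love").artifacts))
    print(total_hours(28), "hours for a 28-episode series")
```

At the quoted rate, a 28-episode series like the one mentioned in Group 1 would take on the order of 56 hours of production time, which illustrates the gap with traditional dramas that take months.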