Workflow
多模态生成式AI
icon
Search documents
单周涨37% 存储牛股闪迪再创新高
Bei Jing Shang Bao· 2026-01-11 15:14
随着AI浪潮从训练阶段向推理应用大规模迁移,被视为"AI工作记忆"的存储板块正迎来前所未有的价值 重估。2026年首个完整交易周,美股三大股指全线大幅上涨。道指周涨2.32%,报49504.07点;标普500 指数周涨1.57%,报6966.28点;而纳指周涨1.88%,报23671.35点。其中,道指、标普500指数均创历史 收盘新高。 存储概念股也走高,闪迪单周大涨37.12%,股价再创新高,美光科技涨9.41%,西部数据和希捷科技分 别涨6.8%和5.73%。野村证券的一份研报指出,企业级SSD使用的大容量3D NAND需求持续火爆,闪迪 本季度相关产品价格环比或暴涨逾100%。 消息面上,一份野村证券近期的研报引发了市场传播。分析师指出,面对未来几个季度服务器级存储需 求保持强劲的背景,闪迪可能在本季度将其面向企业级固态硬盘(SSD)的大容量3D NAND闪存价格 上调一倍。 市场对存储板块的重估,核心逻辑在于AI工作负载的性质转变。IDC数据显示,全球年度数据生成量预 计将从2024年的173 ZB飙升至2029年的527 ZB,五年内增长两倍以上,复合年增长率约为25%。而AI模 型数据爆炸对存储 ...
a16z对话Nano Banana团队:2亿次编辑背后的"工作流革命"
深思SenseAI· 2025-11-12 01:02
Core Viewpoint - The article discusses the transformative impact of multi-modal generative AI, specifically through the example of Google DeepMind's Nano Banana, which significantly reduces the time required for creative tasks like character design and storyboarding from weeks to minutes. This shift allows creators to focus more on storytelling and emotional depth rather than tedious tasks, marking a revolution in creative workflows [1]. Group 1: Nano Banana Development - The Nano Banana team, formed from various groups focusing on image generation, aims to create a model that excels in interactive and conversational editing, combining high-quality visuals with multi-modal dialogue capabilities [4][6]. - The initial release of Nano Banana exceeded expectations, leading to a rapid increase in user requests, indicating its value to a wide audience [6][8]. Group 2: Future of Creative Workflows - The future of creative processes is envisioned as a spectrum, where professional creators can spend less time on mundane tasks and more on creative work, potentially leading to a surge in creativity [8][9]. - For everyday consumers, the technology could facilitate both fun creative tasks and more structured tasks like presentations, depending on the user's engagement level with the creative process [9]. Group 3: Artistic Intent and Control - The definition of art in the context of AI is debated, with emphasis on the importance of intent over mere output quality. The models serve as tools for artists to express their creativity [10][11]. - Artists have expressed a need for greater control and consistency in character representation across multiple images, which has been a challenge in previous models [11][12]. Group 4: User Interface and Experience - The development of user interfaces for these models is crucial, balancing complexity for professional users with simplicity for casual users. Future interfaces may provide intelligent suggestions based on user context [14][16]. - The coexistence of multiple models is anticipated, as no single model can cover all use cases effectively. This diversity will cater to different user needs and preferences [16][19]. Group 5: Educational Applications - The potential for AI in education is highlighted, with models capable of providing visual aids alongside textual explanations, enhancing learning experiences for visual learners [18][19]. - The integration of 3D technology into world models is discussed, with a preference for focusing on 2D projections to solve most problems effectively [21]. Group 6: Challenges and Future Directions - The article identifies ongoing challenges in improving image quality and consistency, with a focus on enhancing the lower limits of model performance to expand application scenarios [39][40]. - The need for models to better utilize context and maintain coherence over longer interactions is emphasized, which could significantly improve user trust and satisfaction [40].
智象未来团队荣膺ACM MM 2025最佳演示奖:重新定义对话式视觉创作
Ge Long Hui· 2025-11-06 05:23
该智能体开创了可及性、交互式视觉叙事和多模态生成AI中协作内容创作的新方式,通过将生成和编 辑融合于一个对话驱动的体验中,降低了高质量视觉内容创作的门槛,并显著缩短了迭代周期,实现从 想法到优质产出的 "一次会话" 创意循环。目前,这一技术原型已成功迭代应用于智象未来旗舰产品 vivago.ai的对话生成功能中,为用户提供更自然、个性化的多模态对话交互体验。 此外,在本次ACM国际多媒体会议上,智象未来举办了Identity-Preserving Video Generation(IPVG)挑战 赛。赛事吸引了北大、上海交大、腾讯等国内外顶尖科研及企业团队参加。 ACM国际多媒体会议由国际计算机学会(ACM)主办,是全球多媒体领域最具权威性与影响力的学术盛 会之一。每年会议评选的最佳演示奖,具有极高含金量和行业认可度,代表着评审委员会和与会学者对 该技术创新性、实用性、成熟度和现场展示效果的最高肯定。 智象未来获奖的"灵感智能体"(Chat Generation)统一多模态智能体,以革命性技术将复杂的视觉内容创 作,转化为直观的对话体验。其核心优势在于突破碎片化多模态技术工具的局限,在单一界面内无缝整 合了文 ...