Sora 2
Reply 2025: 红杉汇's Five Keywords
红杉汇· 2025-12-31 00:07
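It has been another year of founders racing ahead. AI is gradually shifting from "tool" to "partner." Over the past 365 days, intelligence has become increasingly "embodied" in machines, the code of life has continued to be "decoded," and consumers have been searching for "resonance." For a moment, let us slow down and look back at what we paid attention to, thought about, and shared this year. We picked five keywords from 红杉汇's articles to review 2025 together.

01 AI: from a dazzling "tool" to a "partner" at your side

In January, DeepSeek burst onto the scene and quickly caught fire among developers worldwide; Kimi and other Chinese large models then kept open-sourcing their work, leading the global open-source ecosystem. In February, Claude 3.7 Sonnet, the industry's first "hybrid reasoning model," was released. In March, Manus officially launched and was widely regarded as the first "truly general-purpose AI Agent." In September, Sora 2 was officially released, achieving a qualitative leap in temporal consistency, physical realism, and control over cinematic language. In November, the Gemini 3 model family was released, unifying the understanding and generation of text, images, video, and audio in a natively multimodal architecture, so that multimodality moved from "stitched-together capabilities" to "native system integration" ... This year, AI grew ever "smarter" and ever more "practical."

[红杉汇 articles worth revisiting]
· Agents become the new paradigm, and the human-machine relationship is redefined: AI Agents are evolving from the simple Copilot into one able to independently complete complex tasks, the Collea ...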
The State of Artificial Intelligence in 2025 and an Analysis of Its Trends
3 6 Ke· 2025-12-30 09:18
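The year is almost over, so it is time to review the state of artificial intelligence this year and look ahead to next year's trends. This overview draws on data from many global institutions, including MIT, PwC, OpenAI, and OpenRouter.

I. The "high adoption, low conversion" paradox

Enterprise AI suffers from a huge disconnect: many organizations use AI tools widely, yet very few see measurable economic returns. In fact, despite enormous investment, 95% of enterprises have gotten nothing out of it. Oh, and you can finally breathe a sigh of relief: you are not the only one failing, nearly everyone is. Despite the endless tech conferences and the big companies boasting about AI miracles, anonymous surveys show that many enterprises talk about AI but few actually act on it. The MIT report calls this the "GenAI Divide."

The data strongly supports both sides of this paradox:

High adoption (ubiquity)
Only 39% of organizations report earnings growth attributable to AI.
More than 90% of enterprises are trying to adopt AI solutions to stay competitive.
95% of organizations get zero return on their generative AI (GenAI) investments and are stuck with no measurable real-world impact.
Global AI spending is expected to approach $1.5 trillion in 2025.
Only about one third of organizations have successfully begun scaling AI across the enterprise. ...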
Crushing Zuck! A Billionaire at 22: The Pace of AI Wealth Creation in 2025 Defies Belief
猿大侠· 2025-12-29 04:11
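Editor: 艾伦

[Overview] In 2025, AI not only took center stage but also became a super wealth-making machine, sending more than 50 founders into the billionaires' club. This article takes stock of this unprecedented AI wealth bonanza and the top winners behind it.

In 2025, AI was without question the absolute center of conversation. Empty talk gets nowhere and hard work builds prosperity, and AI company valuations are real money.

This year, AI became a super wealth-creation machine: from building large models and laying infrastructure to landing at the application layer, it has been seeping into daily life at astonishing speed, and along the way it sent more than 50 founders into the billionaires' club.

According to Crunchbase data, global investors poured more than $200 billion into AI this year, about half of all global venture funding and up more than 75% year over year.

The fundraising was fierce and the spending fiercer still, with the giants sparing no expense to build AI infrastructure. In January, Trump announced that OpenAI, SoftBank, and Oracle would spend $500 billion on a data center project code-named "Stargate."

The decisive battle began as the year opened. In January, China's DeepSeek released an open-source model, training a remarkably capable model with a fraction of the compute used by the US giants; this not only sent tremors through financial markets but also pushed the net worth of its founder, Liang Wenfeng, to about ...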
Crushing Zuck, a Billionaire at 22: The Pace of AI Wealth Creation in 2025 Defies Belief
3 6 Ke· 2025-12-29 02:03
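In 2025, AI not only took center stage but also became a super wealth-making machine, sending more than 50 founders into the billionaires' club. This article takes stock of this unprecedented AI wealth bonanza and the top winners behind it.

In 2025, AI was without question the absolute center of conversation. Empty talk gets nowhere and hard work builds prosperity, and AI company valuations are real money.

This year, AI became a super wealth-creation machine: from building large models and laying infrastructure to landing at the application layer, it has been seeping into daily life at astonishing speed, and along the way it sent more than 50 founders into the billionaires' club.

The decisive battle began as the year opened. In January, China's DeepSeek released an open-source model, training a remarkably capable model with a fraction of the compute used by the US giants; this not only sent tremors through financial markets but also pushed the net worth of its founder, Liang Wenfeng, to about $11.5 billion, vaulting him into the ranks of the super-rich.

Across the ocean, Anthropic was not to be outdone. The developer of Claude closed a $3.5 billion funding round at a $61.5 billion valuation early in the year, making all seven of its co-founders billionaires. By September, with a fresh injection totaling $13 billion, its valuation had soared to $183 billion.

Such mega-rounds are not isolated cases. According to Crunchbase data, global investors poured more than $200 billion into AI this year, a figure that ...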
Hot All Year Long: AI Now "Gets" People Better!
Sou Hu Cai Jing· 2025-12-27 09:43
Core Insights
- The AI industry is experiencing significant advancements, marked by the release of the DeepSeek AI model, which has sparked a wave of revaluation in the tech sector [2]
- AI applications are evolving from simple question-answering to executing complex multi-modal tasks, indicating a shift towards more sophisticated AI capabilities [3][4]
- The competition in the AI sector is increasingly focused on multi-modal capabilities, where models must understand and generate various types of information [4]

Group 1: AI Advancements
- The launch of DeepSeek's AI model R1 on January 20, 2025, ignited a revaluation of tech stocks in the A-share and Hong Kong markets, leading to a surge in AI-related companies [2]
- AI applications are now capable of processing multi-modal information, moving from mere intent understanding to executing services based on real-world data [2][3]
- The introduction of various AI applications, such as Sora 2 and Ant Group's AI health app, showcases the growing sophistication and understanding of AI in real-world scenarios [4][5]

Group 2: Market Dynamics
- The AI industry is transitioning from a phase reliant on capital investment to one that demands self-sustainability and rigorous scrutiny, as evidenced by companies like Zhipu and MiniMax seeking IPOs [7]
- The investment landscape for AI has been robust, with significant funding rounds and a total of 186 financing events in the AIGC sector from July to November 2025, amounting to 33.67 billion [7]
- Major tech companies are committing substantial resources to AI development, with Alibaba planning to invest at least 380 billion RMB over three years in cloud computing and AI infrastructure [7]

Group 3: Application Trends
- AI applications are becoming more specialized, with a notable increase in vertical applications in healthcare, as seen with the upgrade of Ant Group's AQ brand to Ant Aifu [5][6]
- The competitive edge in AI applications is shifting from model parameters to a deeper understanding of industry needs and the ability to create closed-loop solutions [6]
- The current landscape features a mix of general-purpose AI and specialized applications, with a notable presence of healthcare-focused AI apps among the top user engagement rankings [5][6]

Group 4: Future Outlook
- The AI industry is at a critical juncture, transitioning from a conceptual phase to a growth phase, with a need to enhance monetization strategies for AI applications [9]
- Predictions for 2026 indicate a focus on lightweight models and deeper integration of AI with the real economy, alongside the establishment of regulatory frameworks to guide industry development [9][10]
- The emergence of embodied intelligence and AI smartphones is expected to drive significant growth, with a competitive focus on application ecosystems among various AI platforms [10]
图数室 | Looking Back at 2025: AI's "Legendary" Moments
Xin Lang Cai Jing· 2025-12-26 09:28
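If there was one year in which artificial intelligence went from an exclamation mark in the lab to an ordinary full stop in everyone's life, it was surely 2025.

From the "technology shock" of DeepSeek at the start of the year to the within-reach "Doubao phone" at year's end, we lived through a "Year One of AI for everyone" that was being defined at an accelerating pace. Large models moved downmarket at unprecedented speed, and our laptops, phones, and even wearables were reshaped into intelligent terminals. This was no longer a carnival of concepts but the roar of applications landing at scale: the year AI first went from "looking smart" to "actually starting to take over the real world."

Behind it lies a quiet but profound revolution in daily life. Let us look back at 2025 and trace exactly how AI made its way, step by step, into the lives of hundreds of millions of people.

[Timeline graphic, 2025; entries in the order extracted]
- January 20: DeepSeek (深度求索) released its new-generation large model R1, sending an "earthquake" through the global AI community.
- January 28: Unitree (宇树科技) H1 "福兮" robots performed a yangko dance on the Spring Festival Gala stage, dressed in festive floral jackets.
- April: Beijing held the world's first humanoid robot half marathon; the humanoid robot "天工" won with a time of 2 hours 40 minutes 42 seconds.
- April 17: The judgment in the country's first case involving protection of an AI model's structure and parameters formally took effect.
- February 17: xAI released its latest AI model, Grok 3.
- February 19
- April ...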
2025 AI Roundup: The 10 Biggest "Hot Takes"
3 6 Ke· 2025-12-26 00:52
Group 1
- The concept of "Vibe Coding" has emerged, suggesting a new programming approach that emphasizes feeling and embracing exponential growth, leading to a broader trend of "Vibe Everything" in AI [2]
- There is a divide in perception regarding "Vibe," with some viewing it as a refreshing product philosophy while others criticize it as a superficial trend that obscures the true essence of AI products [2]
- The term "Vibe" reflects a strong narrative appeal, resonating with the desire for transformative change in the AI landscape, indicating its continued relevance in the future [2]

Group 2
- The humanoid robot sector is experiencing a valuation surge despite discussions about a potential bubble, with significant capital inflow and a shift towards more conservative funding strategies among companies [6]
- The focus on "scene" applications for humanoid robots has intensified, with education and performance being the most viable commercial scenarios, while the pursuit of commercial viability may not be the primary goal for the sector [6]

Group 3
- The phrase "Prompt Engineering is Dead" has gained traction, suggesting a shift towards "Context Engineering," which encompasses a broader range of information and tools necessary for AI tasks [8][9]
- Context Engineering is seen as a significant advancement, attracting investment and fostering the development of various AI tools, indicating a potential shift in the industry narrative [9]

Group 4
- Jensen Huang's assertion that "China will win the AI race" highlights the competitive landscape between China and the U.S., emphasizing China's advantages in developer scale, market size, and infrastructure [12][13]
- Huang's comments may also serve as a strategic move to influence U.S. policy regarding AI, aiming to maintain Nvidia's leadership position in the global market [12]

Group 5
- Elon Musk and Satya Nadella predict the disappearance of traditional smartphones and apps, suggesting a transition to intelligent agents that could replace conventional software applications [15][16]
- The emergence of new devices like the "Doubao phone" indicates a shift in how technology is being approached, with a focus on user interface and system control [16]

Group 6
- Sam Altman's response to skepticism about OpenAI reflects a broader divide in opinions regarding the AI bubble, with concerns about the company's ability to deliver on its ambitious revenue projections [19][20]
- OpenAI's projected revenue growth and the potential economic implications of AI's impact on employment and inflation are critical factors in assessing the sustainability of the AI market [21]

Group 7
- The U.S. faces a potential electricity shortage that could impact AI infrastructure, with projections indicating a significant power gap by 2028 if supply does not keep pace with demand [23][24]
- Major tech companies are exploring nuclear energy as a solution to their power needs, highlighting the intersection of AI development and energy infrastructure challenges [24]

Group 8
- The debate surrounding the limitations of large language models (LLMs) continues, with experts arguing that scaling may not yield significant advancements and calling for a return to foundational research [27][28]
- Despite criticisms, the push for larger models persists, indicating ongoing investment and interest in scaling within the AI community [28]

Group 9
- The term "Slop" has been designated as the word of the year, representing the proliferation of low-quality AI-generated content, which poses challenges for content ecosystems [31][32]
- AI-generated adult content is projected to grow significantly, raising questions about the implications for traditional content creation and quality standards [32]
ChatGPT Now Has a Personal Year-in-Review Too
3 6 Ke· 2025-12-23 10:46
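[Overview] Beyond shopping, food delivery, music, and video, even ChatGPT now has an annual report! OpenAI will summarize a unique year-in-review just for you, based on your conversations over the past year. If you used ChatGPT heavily this year, it may understand you better than any other app.

Over the past couple of days, has your social feed once again been flooded with year-end reports from every app? Everyone is pretending to casually show off their "niche yet refined" taste.

Just as you were about to mute all these "cyber humblebrags," the supposedly straight-laced OpenAI decided to join the fun too.

Just this Monday, ChatGPT quietly launched a new feature: "Your Year with ChatGPT."

That's right, even AI now wants to recap all the dumb things you did this year.

How do you pick up your "cyber diagnosis"?

Although OpenAI is so far testing this stunt only in the US, the UK, Canada, Australia, and New Zealand, that does not stop us from getting an early taste of this large-scale exercise in public embarrassment.

Getting it is simple. Type one incantation into the app's chat box: "Show me my year with ChatGPT."

You should see the experience on your screen L ...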
The World's Most Fully Featured Video Generation Model Is Here
量子位· 2025-12-17 10:00
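梦瑶, reporting from 凹非寺
量子位 | WeChat official account QbitAI

Uh-oh, Alibaba is coming straight for Sora 2 with this one!

Alibaba has just released its new-generation Tongyi Wanxiang 2.6 (Wan 2.6) model series, which covers text-to-video, image-to-video, and reference-to-video, as well as image generation and text-to-image, in one go, making it the most fully featured video generation model in the world today.

For video creation, Wan 2.6 not only introduces multi-audio-driven video generation, which Sora 2 does not yet have, but also brings in audio-visual synchronization, multi-shot storytelling, and other capabilities.

Take the wildly popular one-slice ASMR clip below, which was driven directly by text plus audio:

Or look at this immersive kitten mukbang driven by text + image + audio, where the chewing sounds and mouth movements land more or less on the beat, and the eating looks delicious:

The text-to-image line has been strengthened at the same time. Wan 2.6 is noticeably better at artistic style control, realistic portraits, image generation from long Chinese and English text, and semantic understanding of historical and cultural IP, with results like this:

In the spirit of testing everything, I also ran a round of hands-on tests with different prompts and reference materials. Overall:

Wan 2.6 does perform well in audio and video referencing, audio-visual synchronization, and style understanding, though small lapses in visual logic still appear in a few scenes. For everyday short videos and derivative works, however, it is already both usable and pleasant to use.

So how does the model actually perform? Let's chat and test as we go.

Hands-on test of video generation

Hands-on ...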
Taking On Sora 2 Head-On: Wan 2.6 Makes Character Customization and Shot Control Easy, So Anyone Can Be a Director
机器之心· 2025-12-17 05:28
Core Insights
- The article highlights the rapid advancements in video generation technology, particularly focusing on the release of Alibaba's Wan 2.6 model, which significantly enhances user capabilities in video creation and storytelling [1][36].

Group 1: Technological Advancements
- OpenAI's Sora 2 introduced a "Cameo" feature that addresses the "character consistency" issue in AI video generation, transforming the process from unpredictable to controllable [1].
- Alibaba's Wan 2.6 model is noted for its comprehensive capabilities, including voice and image synchronization, allowing users to create videos with a high degree of realism and narrative coherence [3][9].
- The new model supports a maximum video generation duration of 15 seconds, which is the highest in the domestic market, and includes a "shot control" feature for professional storytelling [3][4].

Group 2: User Experience and Accessibility
- The Wan 2.5 version of the model made video creation accessible on mobile devices, while the 2.6 version further democratizes professional video production, enabling anyone to take on roles like director or actor [2][4].
- Users can create videos with high fidelity in both visual and auditory aspects, showcasing the model's ability to replicate character traits and emotional expressions accurately [11][24].

Group 3: Practical Applications
- The model's capabilities extend to generating complete narrative short films, making it suitable for advertising design and short drama production [16].
- The article emphasizes the model's potential in various creative fields, including AI comic production, advertising design, and short video creation, with over ten visual creation capabilities supported [35][36].

Group 4: Conclusion and Future Implications
- The release of Wan 2.6 signifies a shift from a mere "lottery" approach in AI video generation to a new phase of precise and controllable cinematic creation [36].
- The technology effectively removes barriers to creativity, allowing users to leverage their imagination as their primary production tool [37].