Kling 2.1
One-click, film-grade transitions straight out of AI: I really can uninstall Premiere Pro now.
数字生命卡兹克· 2025-08-21 13:48
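Over the past couple of days I have come across several super cool one-take videos. For example, this one I saw on X yesterday is entirely AI-generated and plays as a single continuous shot. The whole video is fairly long, a bit over two minutes, and split into several segments; the most stunning part is the first half, so I clipped it for you. It really is cool. The camera movement, action, and texture are all extremely impressive, and several of the shots and transitions would not look out of place in a game. There is also a very slick piece by the creator @BOB二黑 that I saw on Xiaohongshu; its anime style is unbelievably smooth too. Still, cool as these two pieces are, you can make out a faint trace of a pause at the joins between segments. Another video that flooded my WeChat groups yesterday, on the other hand, is so smooth that you genuinely cannot spot the transitions at all. Absurdly cool. I was going to ask which model's first-and-last-frame feature they had used, because earlier video models could not achieve this kind of smoothness, but when I looked, both creators had already posted the answer openly. The first-and-last-frame feature of Kling 2.1. Whoa. I opened my own Kling account and, sure enough, Kling 2.1 finally supports first and last frames. Open the Kling website, go to video generation, select the 2.1 model, and you can add a first frame and a last frame right there. The long-awaited Kling 2.1 first-and-last-frame feature is finally here. It is an essential tool for making all kinds of videos and transitions and for pulling off difficult shots, and it is extremely controllable. Previously only Kling 1.6 could do first and last frames, and this time the first and last frames in 2.1 ...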
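Kling AI technical department changes leadership; Unitree robot "hit-and-run" trends on hot search; G.E.M. reveals a 10x return on an AI company investment | AI Weekly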
AI前线· 2025-08-17 05:33
Group 1
- The first humanoid robot sports event took place on August 14, featuring 280 teams from 16 countries and showcasing the capabilities of humanoid robots in various competitions [3][4]
- The Unitree H1 robot won the 1500 meters race with a time of 6:34.40, taking the first gold medal of the event [3]
- The TianGong robot team lost to Unitree in both the 1500 meters and 400 meters races, and the CTO of TianGong expressed a desire to learn from Unitree's performance [3][4]

Group 2
- A corruption scandal involving DeepSeek's parent company has emerged, revealing that over 1.18 billion yuan was illicitly obtained through a kickback scheme over six years [8][9]
- Reports indicate that DeepSeek's next-generation model, R2, will not be released in August as previously speculated; the focus is instead on iterative improvements to existing products [10]
- The company has faced supply chain issues related to AI chips, which have affected its development timeline [10]

Group 3
- Manus is facing the potential forced withdrawal of a $75 million investment from Benchmark due to regulatory scrutiny over compliance with U.S. investment restrictions on Chinese AI firms [11]
- The company has shifted its focus from domestic expansion to international markets, particularly Singapore, following the investment controversy [11][12]

Group 4
- Kuaishou announced a leadership change in its AI division, with Gai Kun taking over the technical department amid rumors that the previous head is departing [12][13]
- The CEO of Leifen publicly criticized a former employee over product performance comparisons, pointing to internal conflicts and challenges to the company's public image [14]

Group 5
- OpenAI employees are seeking to sell approximately $6 billion in stock at a valuation of $500 billion, indicating strong investor interest despite the company's current losses [15]
- The company is also exploring advertising as a revenue stream while maintaining a focus on subscription growth [38]

Group 6
- Alibaba's "sweeping monk" Cai Jingxian, the first programmer for Taobao, has reportedly left the company, a significant personnel change [17][18]
- G.E. has launched a new open-source platform for robotics, aiming to integrate various aspects of robot control and learning [36]

Group 7
- The National Data Bureau reported a dramatic increase in daily token consumption in AI applications, reflecting rapid growth in the sector [30]
- Alibaba's international platform has gained popularity with its AI agent, prompting plans for expansion to accommodate increased demand [31]
Tencent Research Institute AI Weekly Keywords Top 50
腾讯研究院· 2025-05-30 18:51
Group 1: Key Trends in AI
- The article highlights the emergence of various AI models and applications, indicating a rapid evolution in the AI landscape, with significant contributions from companies like Google, OpenAI, and Tencent [2][3].
- Notable advancements include the release of new models such as QwenLong-L1-32B by Alibaba and the introduction of the RLVR paradigm by Claude, showcasing the competitive nature of AI development [2][3].
- The article also emphasizes the importance of AI applications across different sectors, including updates to existing products and the launch of innovative tools like AI Scientist and real-time camera features [2][3].

Group 2: Corporate Activities and Acquisitions
- The acquisition of Informatica by Salesforce is mentioned, reflecting ongoing consolidation in the tech industry as companies seek to enhance their AI capabilities [3].
- The article notes the merger of Haiguang Information with Zhongke Shuguang, indicating strategic moves to bolster computational power and resources in the AI sector [2].

Group 3: Industry Perspectives
- Insights from industry leaders suggest a transformative shift in AI platforms, with Google and Anthropic providing perspectives on automation in white-collar jobs and the growth logic of AI products [3].
- The article discusses the implications of AI on employment, with NVIDIA offering recommendations for adapting to the changing job landscape due to AI advancements [3].
Tencent Research Institute AI Express 20250530
腾讯研究院· 2025-05-29 15:55
Group 1: DeepSeek-R1 and AI Developments
- The new version of DeepSeek-R1 has been officially open-sourced, surpassing Claude 4 Sonnet in programming capabilities and performing comparably to o4-mini (Medium) [1]
- DeepSeek-R1's core advantages include deep reasoning capabilities, natural text generation, and support for long-duration thinking of 30-60 minutes, allowing for the execution of complex code in a single run [1]
- Tencent has integrated multiple products with the latest DeepSeek R1 model within a day, offering users free and unlimited access to the model [3]

Group 2: Kling 2.1 Launch
- Kling 2.1 has been launched with a price reduction of 65%, featuring improved performance and speed, and comes in standard, high-quality, and master versions [2]
- The high-quality version (35 inspiration points) matches the old master version in quality and supports 1080P video, but only for image-to-video generation [2]
- The new version significantly enhances cost-effectiveness, making AI video creation more accessible for ordinary users [2]

Group 3: Opera Neon Browser
- Opera has introduced Opera Neon, the first "AI Agent" browser, aiming to redefine the role of browsers in the network [4]
- Opera Neon consists of three main features: Neon Chat (chatting), Neon Do (executing web tasks), and Neon Make (complex creation), which can understand user intent and convert it into actions [4]
- The Neon Make feature utilizes cloud technology to execute complex tasks, such as generating reports and designing game prototypes, even while the user is offline [4]

Group 4: VAST's Tripo Studio Upgrade
- VAST has upgraded Tripo Studio with four core functionalities: intelligent component segmentation, texture magic brush, intelligent low-poly generation, and automatic rigging for all objects [5]
- Intelligent component segmentation allows for one-click disassembly, accurately identifying different parts of a model [5]
- The automatic rigging feature can recognize various biomechanical characteristics and quickly allocate skeletal weights, enabling non-professionals to complete the entire 3D creation process with over a tenfold efficiency increase [5]

Group 5: Odyssey's World Model
- Odyssey, founded by autonomous driving experts, has launched a world model capable of real-time video generation at 40 milliseconds per frame, supporting real-time interaction [6]
- This technology differs from traditional video models by learning pixel and motion data from real-life videos, using a narrow distribution model architecture to address autoregressive modeling challenges [6]
- Odyssey has secured $27 million in funding, with the current preview version supported by H100 GPU clusters, outputting 30 FPS for 5-minute coherent interactive videos [6]

Group 6: AI Scientist Zochi
- The AI scientist Zochi's paper has been accepted by the top-tier conference ACL, marking it as the first AI system to independently pass peer review at an A* level conference [7]
- Zochi's paper demonstrates a multi-round attack method with a success rate of 100% on GPT-3.5 and 97% on GPT-4 [7]
- Zochi can autonomously complete the scientific research process from literature analysis to peer review, although its company has faced criticism regarding the misuse of the scientific peer review process [7]

Group 7: Wanda 2.0 Robot
- Youliqi has launched the Wanda 2.0 wheeled dual-arm robot, priced from 88,000 yuan, capable of autonomously completing complex long-sequence tasks [8]
- Wanda 2.0 is equipped with a pre-trained multimodal large model UniTouch and a long-sequence task planning model UniCortex, learning new actions with only 5-10 demonstrations [8]
- Youliqi has reduced costs by 70% through full-stack self-research, targeting the C-end and small B customer market, and has completed several hundred million yuan in financing [8]

Group 8: Boston Dynamics Atlas Robot
- Boston Dynamics has upgraded the Atlas robot, which now features 3D spatial perception and real-time object tracking capabilities, allowing it to perform complex industrial tasks in automotive factories [9]
- The core technology includes a 2D object detection system, 3D spatial positioning based on key points, and a SuperTracker object pose tracking system, capable of handling object occlusion and positional changes [9]
- The system integrates kinematic data, visual data, and force feedback to estimate poses accurately, with the team working on building a unified foundational model to enhance perception and action integration [9]

Group 9: Google CEO's Perspective on AI
- Google CEO Pichai believes AI represents a platform-level transformation larger than the internet, entering a phase where research is becoming reality [10]
- AI is transitioning into the second stage of building usable products, with search evolving into an agent that can execute tasks on behalf of users, potentially creating Web 2.0-level killer applications [10]
- The key transformation brought by AI lies in the change of interaction methods and the lowering of creative barriers, with the third stage involving the integration of AI with the physical world to form universal robotic systems [10]
Kling 2.1 just launched: the price is down 65%, and it is faster, better at following instructions, and stronger.
数字生命卡兹克· 2025-05-29 03:42
Core Insights
- The launch of Kling 2.1 introduces significant improvements in effectiveness, speed, and pricing, making it a compelling option for users [1][27].
- Kling 2.1 offers three distinct models: Standard, High Quality, and Master, catering to different user needs and budgets [10][28].

Pricing and Value
- The pricing structure has been adjusted, with the High Quality version of Kling 2.1 being 65% cheaper than the previous Master version, making it more accessible for everyday users [10][27].
- The Standard version is priced at 20 inspiration points for 720P, the High Quality version at 35 inspiration points for 1080P, and the Master version at 100 inspiration points for high-end cinematic effects [10][28].

Performance Comparison
- Kling 2.1 High Quality and Master versions outperform previous models in terms of visual quality and dynamic motion, with the Master version providing superior results for professional-grade projects [27][28].
- Speed tests indicate that Kling 2.1 performs comparably to Kling 1.6, with both completing tasks in under one minute, while the Master versions take over three minutes [18][27].

User Experience
- Users have reported that the Professional Mode of Kling 2.1 is sufficient for most casual video styles, while the Master version is better suited for action scenes and high-intensity projects [2][28].
- The updates have made it possible for a broader range of creators to access high-quality video generation tools, enhancing the overall user experience [27][28].

Market Positioning
- Kling 2.1 aims to fill the gap between affordability and quality, allowing users to choose models based on their specific creative needs and budget constraints [28].
- The differentiation between the three models allows for targeted marketing towards various segments, from casual creators to professional filmmakers [28].
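
The price figures in this summary can be checked with a few lines of arithmetic. The Python sketch below is not from the article. It assumes the previous Master version cost the same 100 inspiration points quoted for the current one, and the 700-point budget is a hypothetical figure chosen only for illustration.

```python
# Inspiration-point prices per generation, as quoted in the summary.
PRICES = {"Standard": 20, "High Quality": 35, "Master": 100}


def percent_cheaper(new_price: int, old_price: int) -> float:
    """Return how much cheaper new_price is than old_price, in percent."""
    return (old_price - new_price) * 100 / old_price


def clips_for_budget(budget: int) -> dict[str, int]:
    """Number of whole generations each tier allows for a given point budget."""
    return {tier: budget // price for tier, price in PRICES.items()}


# Assumption: the previous Master version also cost 100 points.
saving = percent_cheaper(PRICES["High Quality"], PRICES["Master"])
print(f"High Quality vs. Master: {saving:.0f}% cheaper")

# Hypothetical budget of 700 points, used only to compare the tiers.
print(clips_for_budget(700))
```

Under that assumption, the 35-point High Quality tier works out to the 65% reduction the article cites.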