Marey

Search documents
腾讯研究院AI速递 20250801
腾讯研究院· 2025-07-31 16:01
Group 1 - The article discusses the anticipated release of GPT-5, which is expected to unify the GPT series and the o series, enhancing multimodal and reasoning capabilities [1] - GPT-5 will feature a main model (codename "nectarine" or "o3-alpha"), a mini version (codename "lobster"), and a nano version (codename "starfish") [1] - Internal sources indicate that GPT-5 will support a context window of 1 million tokens and will include MCP protocol and parallel tool invocation, with the mini version particularly enhancing programming capabilities [1] Group 2 - DeepSeek's collaboration with Peking University resulted in a paper that won the ACL Best Paper Award, achieving an 11-fold speed increase in processing long texts [2] - The technology introduces a "native sparse attention" mechanism, enhancing efficiency without sacrificing performance [2] - The NSA technology has completed pre-training validation on a 27B MoE architecture, showcasing its potential as a core technology for the DeepSeek R2 model [2] Group 3 - Google DeepMind launched AlphaEarth Foundations, integrating multi-source Earth observation data for a unified digital representation with 10-meter precision [3] - The system combines satellite images, radar scans, and 3D laser mapping, requiring only 1/16 of the storage space compared to similar AI systems [3] - Innovations include adaptive decoding architecture and geographic text alignment, utilized by organizations like the UN Food and Agriculture Organization for custom map creation [3] Group 4 - Moonvalley announced its flagship model Marey now supports Sketch-to-Video functionality, allowing users to generate movie-quality videos from hand-drawn sketches [4][5] - This feature aligns with Marey's "mixed creation" concept, facilitating the definition of character movements and camera paths for coherent video generation [5] - The service currently supports 1080p at 24fps output, available to subscribers starting at $14.99 per month [5] Group 5 - Ollama released version 0.10.1 with a visual interface, making it easier for non-technical users to interact with the platform [6] - The new version includes a dialogue interface, model downloads, PDF interaction, and multi-modal capabilities [6] - A new multi-modal engine allows users to send images to large language models, provided the models support multi-modal inputs [6] Group 6 - Alibaba's 1688 platform launched an AI version app featuring a free enterprise query tool and a digital agent for merchants, focusing on AI-driven transformation [7] - The AI version integrates features like AI search, product selection, and enterprise checks, with plans for bi-weekly updates [7] - The CEO announced that AI products will be free, with 400,000 merchants already using the digital agent, contributing to an 18% increase in GMV and inquiries [7] Group 7 - Zhujidi Power introduced the LimX Oli humanoid robot, claiming it to be the most cost-effective general-purpose humanoid robot globally, priced at 158,000 yuan [8] - The robot features a modular design and an open SDK system, supporting secondary development and OTA upgrades [8] - Three versions are available: Lite, EDU, and Super, targeting research teams and AI/robotics companies [8] Group 8 - Meta CEO Mark Zuckerberg announced signs of self-improvement in AI systems, indicating the near development of superintelligence [9] - The company is changing its AI model release strategy, suggesting that not all models will be open-sourced [9] - Meta plans to invest up to $72 billion in AI infrastructure by 2025, with stock prices rising by 10% following the announcement [9] Group 9 - a16z partner Martin Casado stated that AI investment criteria are shifting from model performance to the platform's ability to deliver business results [10] - The three key factors for platform competition are organizational model, resource allocation, and product strategy, emphasizing governance efficiency and product capability [10] - AI valuation logic is returning to specific scenarios, focusing on clear catalysts like customer contract rhythms and infrastructure development speed [10]
特效成本下降90%,它用1.54亿美元,打造合规电影级AI视频
3 6 Ke· 2025-07-22 12:07
Core Insights - The article discusses the transformative impact of AI on the film industry, particularly highlighting Moonvalley's AI video model, Marey, which addresses copyright compliance and production efficiency [1][2]. Group 1: Company Overview - Moonvalley has raised $84 million in Series A+ funding, led by General Catalyst, bringing its total funding to $154 million, making it one of the highest-funded players in the AI video sector [2][20]. - Marey is designed specifically for the film industry, offering advanced features such as layered editing and 3D camera control, with scene rendering costs as low as $1-2, representing a cost reduction of over 90% compared to traditional VFX [2][6][20]. Group 2: Product Features - Marey allows for minute-long video generation, supports various artistic styles, and delivers film-quality visuals at 1080P and 24 frames per second [6][9]. - The model includes advanced capabilities such as physical simulation, background replacement, and layered editing, enabling significant creative flexibility for filmmakers [7][9][11]. Group 3: Market Position and Compliance - Moonvalley is positioned as a leader in compliance, using only authorized data for training, with 80% of its material sourced from independent filmmakers and YouTube users [16][17]. - The company emphasizes ethical practices, allowing creators to request data removal and compensation, thus avoiding copyright disputes [18][20]. Group 4: Industry Challenges - The AI film sector faces significant challenges, including resistance from industry professionals regarding the use of AI in creative processes and ongoing legal disputes over data usage [12][16]. - Traditional studios are developing their own AI tools to protect intellectual property, highlighting the need for compliance and ethical considerations in AI applications [15][20].
速递|Moonvalley发布首个公开数据训练的AI视频模型Marey:如何实现360度镜头控制与物理模拟
Z Potentials· 2025-07-09 05:56
Core Viewpoint - Moonvalley, an AI video generation startup, emphasizes that traditional text prompts are insufficient for film production, introducing a "3D perception" model that offers filmmakers greater control compared to standard text-to-video models [1] Group 1: Product Offering - Moonvalley launched its model Marey in March as a subscription service, allowing users to generate video clips up to 5 seconds long, with pricing tiers of $14.99 for 100 points, $34.99 for 250 points, and $149.99 for 1000 points [1] - Marey is one of the few models trained entirely on publicly licensed data, appealing to filmmakers concerned about potential copyright issues with AI-generated content [1] Group 2: Democratization of Filmmaking - Independent filmmaker Ángel Manuel Soto highlights Marey's ability to democratize access to top-tier AI narrative tools, reducing production costs by 20% to 40% and providing opportunities for those traditionally excluded from filmmaking [2] - Soto's experience illustrates how AI enables filmmakers to pursue their stories without needing external funding or approval [2] Group 3: Technological Capabilities - Marey possesses an understanding of the physical world, allowing for interactive storytelling and features like simulating motion while adhering to physical laws [3] - The model can transform scenes, such as converting a video of a bison running into a Cadillac speeding through the same environment, with realistic changes in grass and dust [4] Group 4: Advanced Features - Marey supports free camera movement, enabling users to adjust camera trajectories and create effects like panning and zooming with simple mouse actions [5] - Future updates are planned to include new control features such as lighting adjustments, depth object tracking, and a character library [5] - Marey's public release positions it in competition with other AI video generators like Runway Gen-3, Luma Dream Machine, Pika, and Haiper [5]