Avatar

Search documents
具身数采方案一览!遥操作和动捕的方式、难点和挑战(2w字干货分享)
自动驾驶之心· 2025-07-10 12:40
以下文章来源于具身智能之心 ,作者具身智能之心 具身智能之心 . 与世界交互,更进一步 点击下方 卡片 ,关注" 自动驾驶之心 "公众号 戳我-> 领取 自动驾驶近15个 方向 学习 路线 继具身本体未定论专场讨论后,几位嘉宾意犹未尽,决定再来一场圆桌,聚焦具身智能的"方向盘"--遥操作。 遥操作本身并非新概念,甚至在一二十年前效果就非常好了。那这一次,遥操作再次走进大家视野,是带来或准备带来哪些升级呢? 同时,希望本次圆桌,会给正在或准备进行遥操作相关学习和研究的同学,带来有关遥操作一些高屋建瓴的认知,同时为他们今后的学习研究之路带来一些 启发。 本期我们会深入聊到:遥操作是什么、各式各样的遥操作体验分享、遥操存在的意义只是为了采数据吗、动捕有什么难点、aloha的划时代意义、遥操终局 畅想、如果机器人有操作系统等。大家一起来体验这场火花四溅又若有所思的圆桌吧! 完整视频已经上传到国内首个具身智能全栈技术社区: 具身智能之心知识星球 内部,感兴趣的同学欢迎加入交流。 圆桌嘉宾:赵仲夏 格灵深瞳算法总监 北京大学和智源研究院访问-学者(小红书id:夏染) 圆桌嘉宾:智元机器人遥操负责人-王文灏 圆桌嘉宾:清华 ...
3 Dates for Disney Investors to Circle in July After the Stock Hit New Highs in June
The Motley Fool· 2025-07-01 15:45
A new and an updated attraction hope to keep turnstile clicks coming at both domestic resorts with a new Marvel movie hoping to keep Disney's stock upticks coming. culminates in riders in six-passenger cars circling around the attraction building at speeds up to nearly 65 miles per hour. It is introducing some features that will give a nod to the original and more tranquil World of Motion attraction that opened in the same space when the park opened in 1982. Shares of Walt Disney (DIS -0.79%) enter July wit ...
CBHH's Charles Cameron on Financing The Next Generation of Critical Infrastructure - On Navatar's A-Game Podcast: Sector Focus, Growth Infra, Cross-Border M&A Execution and CRM Value
GlobeNewswire News Room· 2025-07-01 05:30
Core Insights - CBHH focuses on sourcing and executing infrastructure financing and M&A opportunities across the UK and continental Europe, particularly in next-generation infrastructure businesses [1][2] - The firm operates in the "core+ or value-add infrastructure" space, which includes sectors like data centers, EV charging, energy generation, and smart city technologies [1][2] Core+ Infrastructure - CBHH targets "next-generation infrastructure" assets that are too small for large-cap investors but too capital-intensive for early-stage funds, emphasizing their importance in driving mission-critical infrastructure [2] Operational Insights - Companies in this sector are described as capital-hungry and operationally intense, but understanding unit economics allows for effective growth underwriting [3] Market Dynamics - The merger with Herbst Hilgenfeldt Partners enhances CBHH's coverage in two active European infrastructure markets, aligning with public priorities of decarbonization and digital infrastructure [4] Advisory Approach - CBHH maintains strong relationships with clients, advising them from early institutional rounds to large-scale exits, and has co-invested in past clients, blending traditional banking principles with modern M&A execution [5][6] Competitive Positioning - Despite being a boutique firm, CBHH competes effectively with global investment banks due to the senior team's banking heritage, deep sector knowledge, and agility in complex transactions [6] Institutional Knowledge - CBHH utilizes Navatar's CRM platform to enhance firmwide institutional knowledge, allowing for better relationship management and deal execution [7][8][9] Team Background - The firm is composed of former bankers from major institutions like Goldman Sachs and UBS, bringing a distinct discipline and empathy to client relationships [10][11]
如何做到在手机上实时跑3D真人数字人?MNN-TaoAvatar开源了!
机器之心· 2025-06-25 00:46
TaoAvatar 是由阿里巴巴淘宝 Meta 技术团队研发的 3D 真人数字人技术,这一技术能在手机或 XR 设备上实现 3D 数字人的实时渲染以及 AI 对话的强大 功能,为用户带来逼真的虚拟交互体验。 它是如何实现的呢?本文将为您揭秘 TaoAvatar 背后的黑科技!同时在今天,我们正式宣布开源了 3D 真人数字人应用:MNN-TaoAvatar!目前应用源 码已同步发布在 MNN 的 GitHub 仓库,开发者可自行下载安装和体验,欢迎大家和我们一起交流讨论,共同探索 AI 数字人技术的无限可能。 什么是 TaoAvatar? TaoAvatar 是淘宝在数字人技术领域取得的最新突破,更多详细的研究成果已经发表在相关论文。 论文标题:TaoAvatar: Real-Time Lifelike Full-Body Talking Avatars for Augmented Reality via 3D Gaussian Splatting 论文地址:https://arxiv.org/abs/2503.17032v1 开源地址: https://github.com/alibaba/MNN/blob/ ...
人工智能周报(25年第24周):Opetai上线o3-pro模型,字节跳动发布豆包大模型1.6-20250619
Guoxin Securities· 2025-06-19 12:39
证券研究报告 | 2025年06月19日 人工智能周报(25 年第 24 周) 优于大市 OpenAI 上线 o3-pro 模型,字节跳动发布豆包大模型 1.6 人工智能动态:1)产品应用:OpenAI 上线 o3-pro AI 模型,兼具高效 性能与精准输出能力;Meta 推出世界模型 V-JEPA 2,具备卓越环境理 解与预测能力;苹果发布 Xcode26 开发者工具,内置 ChatGPT 赋能开发; 字节跳动发布豆包大模型 1.6,采用统一定价模式;阿里开源 3D 数字人 应用,革新直播与虚拟互动体验;腾讯混元 3D 2.1 全链路开源,几何 生成与材质表现显著提升。2)底层技术:阿里通义实验室开源 Mask Search 预训练框架,提升 AI 复杂问题解决表现;DeepMind 与布朗大学 合作开发"力提示"技术,实现无 3D 模型逼真运动效果 3)行业政策: 工业和信息化部会议审议《2025 年两化融合工作要点》,部署推进策略。 投资建议:互联网一季报披露完毕,业绩整体稳健。电商行业竞争依旧 激烈,各平台选择继续向商家让利、或在外卖即时零售领域加大投入寻 找新增量。AI 方面,巨头的业务场景,如云 ...
人工智能周报(25年第24周):OpenAI上线o3-pro模型,字节跳动发布豆包大模型1.6-20250619
Guoxin Securities· 2025-06-19 09:33
证券研究报告 | 2025年06月19日 人工智能周报(25 年第 24 周) 优于大市 OpenAI 上线 o3-pro 模型,字节跳动发布豆包大模型 1.6 人工智能动态:1)产品应用:OpenAI 上线 o3-pro AI 模型,兼具高效 性能与精准输出能力;Meta 推出世界模型 V-JEPA 2,具备卓越环境理 解与预测能力;苹果发布 Xcode26 开发者工具,内置 ChatGPT 赋能开发; 字节跳动发布豆包大模型 1.6,采用统一定价模式;阿里开源 3D 数字人 应用,革新直播与虚拟互动体验;腾讯混元 3D 2.1 全链路开源,几何 生成与材质表现显著提升。2)底层技术:阿里通义实验室开源 Mask Search 预训练框架,提升 AI 复杂问题解决表现;DeepMind 与布朗大学 合作开发"力提示"技术,实现无 3D 模型逼真运动效果 3)行业政策: 工业和信息化部会议审议《2025 年两化融合工作要点》,部署推进策略。 投资建议:互联网一季报披露完毕,业绩整体稳健。电商行业竞争依旧 激烈,各平台选择继续向商家让利、或在外卖即时零售领域加大投入寻 找新增量。AI 方面,巨头的业务场景,如云 ...
2 Reasons AMC Stock Is Soaring in June
The Motley Fool· 2025-06-07 11:53
Core Viewpoint - AMC, the largest movie theater operator globally, is experiencing a resurgence due to recent box office successes, particularly during the Memorial Day weekend, despite ongoing challenges from streaming and pandemic-related fears [1] Group 1: Recent Successes - The company benefited from the success of two major films, contributing to a revitalized box office performance [2] - Memorial Day weekend set a record with $326.7 million in domestic ticket sales, driven by Disney's Lilo & Stitch and Paramount's Mission: Impossible -- The Final Reckoning [3] - AMC reported all-time records for admissions revenue, food and beverage revenue, and total revenue during this weekend, marking the highest-attended weekend and five-day period of the year [5] Group 2: Future Outlook - Management believes AMC has turned a corner, with expectations for a robust theatrical box office due to a slate of upcoming films from major studios [7] - Upcoming releases include Disney's Avatar sequel, the next Frozen film, and Warner Bros.' new Superman, which are anticipated to drive further attendance [7] Group 3: Financial Performance and Market Sentiment - Despite recent successes, AMC is still facing revenue declines and losses as of Q1 2025, indicating uncertainty about sustained performance [8] - Short interest in AMC has increased to nearly 15% of outstanding shares, reflecting skepticism among investors about the longevity of the recent stock price surge [9] - The company's future performance is heavily reliant on the film industry's ability to produce hit movies and manage their release timing relative to streaming [10]
腾讯开源的HunyuanVideo-Avatar上传一张图+一段音频,虚拟角色“活”过来
Sou Hu Cai Jing· 2025-06-04 02:48
Core Viewpoint - Tencent has launched an open-source video generation tool called HunyuanVideo-Avatar, which allows users to animate characters from a static image and audio, creating lifelike interactions and performances [3]. Group 1: Technology Features - HunyuanVideo-Avatar acts as a "digital director," interpreting a static image and animating it based on the emotional tone of the audio [3]. - The tool eliminates the "internet celebrity face" issue by embedding the user's photo into the model, preserving original details like clothing folds and background lighting [4]. - It can extract emotional features from audio, allowing for nuanced facial expressions beyond simple lip-syncing [5]. - The technology enables multiple characters to interact independently, with natural eye contact and gestures, enhancing realism in performances [6]. Group 2: Application Scenarios - In e-commerce, the tool can create AI hosts for live streaming, using product images and promotional text to engage customers and drive sales [6]. - In music platforms, it allows for real-time performances by AI avatars, such as singing new songs or narrating stories in children's voices [7]. - For film production, directors can generate storyboard animations from simple sketches and voice scripts, streamlining the creative process [8]. Group 3: Technical Requirements - The minimum configuration for smooth operation is an NVIDIA RTX 3090 GPU with 24GB memory, while the recommended setup includes an NVIDIA A100 GPU with 80GB memory [9]. - Additional requirements include 64GB DDR4 RAM (minimum) and 500GB NVMe SSD storage [9].
腾讯研究院AI速递 20250604
腾讯研究院· 2025-06-03 14:49
生成式AI 一、 微软发布Bing Video Creator , 由 OpenAI 的 Sora技术支 持 1. 微软发布Bing Video Creator,由OpenAI的Sora提供技术支持,可通过自然语言生成多 种类型视频; 2. 该服务免费使用,提供快速和标准两种生成模式,初始有10次快速生成机会,生成视频长 度为5秒; 3. 系统内置安全保障措施防止滥用,并为每个生成视频添加内容凭证和溯源信息,目前国区 尚未开放。 https://mp.weixin.qq.com/s/Wn1rdofeVHW1s-lJm4CszQ 二、 Manus推出全新的幻灯片功能, 一手实测!10分钟8页PPT 1. Manus新推出的幻灯片功能获好评,能在10分钟内生成8页专业PPT,并支持导出为 Google Slides; 2. 实测过程显示Manus能自动搜索资料、规划结构、生成内容,支持即时修改和多种导出格 式,但存在页面显示不完全问题; 3. 与Genspark对比,Manus速度更快(10分钟vs20分钟),功能更强,被网友评为当前PPT 制作最佳。 https://mp.weixin.qq.com/s/Zz3 ...
AI陪伴Top 1应用上线视频生成!图片人物能说话唱歌,多轮对话场景依然稳定
量子位· 2025-06-03 06:21
克雷西 发自 凹非寺 量子位 | 公众号 QbitAI 这项新功能名叫 AvatarFX ,主要用于图生视频,更具体说就是让静态图片中的人物"开口 说话"。 AvatarFX一个月之前面向订阅用户开放,现在所有用户都可以用了,同时c.ai也上新了多项 其他AI创作功能。 c.ai上新多项AI创作功能 在最新的公告中,c.ai宣布上新或即将上新一系列新功能,其中不少与AI创作相关。 首先就是 AvatarFX ,它主打图片动画化,而非从零开始的文本生成,可以让图片中的人物 说话、唱歌并和用户互动,也可以为角色生成自我介绍视频,同时支持宠物等非人类面孔。 c.ai介绍,AvatarFX基于DiT架构,自称达到了SOTA水准,技术亮点在于 高保真度和强时 间一致性 。 据介绍,即便面对多角色、长序列或多轮对话的复杂场景,AvatarFX生成的视频依然能够保 持稳定性。 AI陪伴应用的Top 1—— Character.ai (c.ai),也开始做起视频生成了。 在c.ai平台中,可以让AI扮演各种角色陪你对话,现在有了视频生成,这些角色可以动起来 了。 c.ai展示了用户的创作成果,还自嘲称之为"内部运作模式可视化 ...