Agent Skills

Search documents
腾讯研究院AI速递 20251020
腾讯研究院· 2025-10-19 16:01
Group 1: Nvidia and TSMC Collaboration - Nvidia and TSMC unveiled the first Blackwell chip wafer produced in the U.S., marking a significant milestone in domestic chip manufacturing [1] - The TSMC Arizona factory has a total investment of $165 billion and will produce advanced chips using 2nm, 3nm, and 4nm processes [1] - The Blackwell chip features 208 billion transistors and achieves a connection speed of 10TB/s between its two sub-chips through NV-HBI [1] Group 2: Anthropic's Agent Skills - Anthropic launched the Agent Skills feature, allowing users to load prompts and code packages as needed, enhancing the capabilities of AI [2] - Skills can be used across Claude apps, Claude Code, and API platforms, with a focus on minimal necessary information loading [2] - The official presets include nine skills for various document formats, and users can upload custom skills [2] Group 3: New 3D World Model by Fei-Fei Li - Fei-Fei Li's World Labs introduced a real-time generative world model, RTFM, which can render persistent 3D worlds using a single H100 GPU [3] - RTFM employs a self-regressive diffusion Transformer architecture to learn from large-scale video data without explicit 3D representations [3] - The model maintains spatial memory for persistent world geometry through pose-aware frames and context scheduling technology [3] Group 4: Manus 1.5 Update - Manus released version 1.5, introducing a built-in browser that allows AI to interact with web pages, test functions, and fix bugs [4] - A new Library file management system enables collaborative editing within the same Agent session, reducing average task completion time significantly [4] - The system allows for no-code music web application construction through natural language, supporting real-time updates [4] Group 5: Windows 11 Major Update - Windows 11's major update features "Hey Copilot" for voice activation and Copilot Vision for screen understanding, enhancing user interaction [5][6] - Copilot Actions can perform operations on local files, while Copilot Connectors integrate with OneDrive, Outlook, and Google services [5][6] - Manus AI operations are integrated into the file explorer, allowing for automatic website generation and video editing functionalities [6] Group 6: Baidu's PaddleOCR-VL Model - Baidu open-sourced the PaddleOCR-VL model, achieving a score of 92.6 on the OmniDocBench V1.5 leaderboard with only 0.9 billion parameters [7] - The model supports 109 languages and excels in text recognition, formula recognition, table understanding, and reading order prediction [7] - It utilizes a two-stage architecture combining dynamic resolution visual encoding and a language model, achieving high inference speed on A100 [7] Group 7: AI in Fusion Energy Development - Google DeepMind collaborates with CFS to accelerate the development of the SPARC fusion device using AI [8] - The partnership focuses on creating precise plasma simulation systems and optimizing fusion energy output [8] - The TORAX simulator is a key tool for CFS, enabling extensive virtual experiments and real-time control strategy exploration [8] Group 8: Harvard Study on AI's Impact on Employment - A Harvard study tracking 62 million workers found a significant decline in entry-level positions in companies using AI, primarily through slowed hiring [9] - The impact of AI is most pronounced among graduates from mid-tier universities, while top-tier and bottom-tier institutions are less affected [9] - The wholesale and retail sectors face the highest risk for entry-level jobs, with a trend towards skill polarization [9] Group 9: Concerns Over AI-Generated Content - Reddit co-founder Ohanian warned that much of the internet is "dead," overwhelmed by AI-generated content [10] - Reports indicate that automated traffic could reach 51% by 2024, with AI-generated articles surpassing human-written ones [10] - Research suggests that training models on AI-generated data may lead to a decline in model performance [10] Group 10: Andrej Karpathy on AGI Development - AI expert Andrej Karpathy expressed skepticism about the current state of AI agents, predicting that AGI is still a decade away [11] - He criticized the noise in reinforcement learning and the limitations of pre-training methods [11] - Karpathy anticipates that AGI will contribute modestly to GDP growth, emphasizing the importance of education in the AI era [11]
X @Anthropic
Anthropic· 2025-10-16 18:52
New on the Anthropic Engineering Blog: Our tips for developers on using Agent Skills, a new way to extend Claude's capabilities with instruction folders, scripts, and resources:https://t.co/2liRUo4AWO ...