Workflow
腾讯研究院
icon
Search documents
招聘|腾讯研究院法律研究中心实习生
腾讯研究院· 2025-11-06 08:33
Core Viewpoint - The article outlines an internship opportunity at Tencent Research Institute, focusing on legal research in the fields of digital economy and artificial intelligence, emphasizing the importance of understanding legal and ethical issues in these rapidly evolving sectors [1][3]. Group 1: Job Description - The internship involves tracking, analyzing, and interpreting global legal and ethical issues, legislative trends, and industry governance practices related to the digital economy and AI [3]. - Responsibilities include supporting research on AI governance and safety, conducting surveys and interviews, and assisting in writing research articles and reports [3]. Group 2: Requirements - Candidates should be graduate students (not recent graduates) with backgrounds in internet law, digital law, AI law, technology ethics, or AI governance, with interdisciplinary backgrounds preferred [4]. - Strong interest in the internet technology industry and familiarity with legal policies, safety, and governance issues in the internet and AI sectors are essential [4]. - Proficiency in using generative AI products, data analysis skills, and good PPT creation abilities are required [4]. - Candidates must possess strong logical thinking, theoretical knowledge, and writing skills, along with excellent English proficiency [4]. Group 3: Internship Details - The internship is a full-time position for at least six months, with an immediate start date [4]. - The work location is in Shenzhen or Beijing [5]. Group 4: Opportunities - Interns will participate in significant research projects related to AI safety and governance [7]. - The position offers exposure to the latest developments in the internet industry and access to research resources from both domestic and international academic and industry circles [7].
我们对AI认识远远不足,所以透明度才至关重要|腾研对话海外名家
腾讯研究院· 2025-11-06 08:33
Core Viewpoint - The article emphasizes the importance of AI transparency, arguing that understanding AI's operations is crucial for governance and trust in its applications [2][3][9]. Group 1: Importance of AI Transparency - The ability to "see" AI is essential in an era where AI influences social interactions, content creation, and consumer behavior, raising concerns about misinformation and identity fraud [7][8]. - AI Activity Labeling is becoming a global consensus, with regulatory bodies in China and the EU mandating clear identification of AI-generated content to help users discern authenticity and reduce deception risks [7][8]. - Transparency not only aids in identifying AI interactions but also provides critical data for assessing AI's societal impacts and risks, which are currently poorly understood [8][9]. Group 2: Mechanisms for AI Transparency - AI labeling is one of the fastest-advancing transparency mechanisms, with China implementing standards and the EU establishing identification obligations for AI system providers [12][14]. - Discussions are ongoing about what should be labeled, who embeds the labels, and how to verify them, highlighting the need for effective implementation standards [12][14][15]. - The distinction between labeling content and AI's autonomous actions is crucial, as current regulations primarily focus on content, leaving a gap regarding AI's behavioral transparency [13]. Group 3: Model Specifications - Model specifications serve as a self-regulatory mechanism for AI companies, outlining expected behaviors and ethical guidelines for their models [17][18]. - The challenge lies in ensuring compliance with these specifications, as companies can easily make promises that are difficult to verify without robust enforcement mechanisms [18][20]. - There is a need for a balance between transparency and protecting proprietary information, as not all operational details can be disclosed without risking competitive advantage [20]. Group 4: Governance and Trust - Transparency is vital for building trust in AI systems, allowing users to understand AI's capabilities and limitations, which is essential for responsible usage and innovation [9][23]. - The article argues that transparency mechanisms should not only focus on what AI can do but also on how it operates and interacts with humans, fostering a more informed public [10][23]. - Ultimately, achieving transparency in AI governance is seen as a foundational step towards establishing a reliable partnership between AI technologies and society [23].
腾讯研究院AI速递 20251106
腾讯研究院· 2025-11-05 16:01
Group 1: Generative AI Developments - Google announced Project Suncatcher, planning to launch two prototype satellites with Trillium TPU by early 2027, utilizing solar energy for AI computation [1] - Anthropic introduced a new paradigm called "code execution," reducing token consumption from 150,000 to 2,000, achieving a 98.7% efficiency improvement [2] - Open-Sora Plan company launched Uniworld V2, excelling in Chinese language processing and detail control, outperforming OpenAI's GPT-Image-1 in benchmarks [3] Group 2: Browser and AI Integration - QQ Browser's version 19.8.0 introduced an "AI+" floating window feature integrating 14 AI tools for various tasks, enhancing user experience [4] Group 3: Geographic AI Enhancements - Google upgraded Earth AI with new foundational models for remote sensing, demographic dynamics, and environmental analysis, significantly improving performance metrics [5][6] Group 4: Robotics Innovations - Xiaopeng showcased the next-generation IRON humanoid robot with 82 degrees of freedom and a total computing power of 2250 TOPS, setting a new standard in humanoid robotics [7] - Generalist launched a new embodied foundational model GEN-0, trained on over 270,000 hours of real-world data, demonstrating significant advancements in robotic capabilities [8] Group 5: Navigation and AI Collaboration - Galaxy Generalist collaborated with multiple universities to introduce the NavFoM model, unifying various navigation tasks and enhancing spatial understanding [9] Group 6: Startup Methodologies - ElevenLabs operates with 350 employees divided into 20 autonomous product teams, each required to achieve product-market fit within six months or face dissolution [10]
最危险的不平等,是理解的不平等|AI x 留守儿童测评发布
腾讯研究院· 2025-11-05 11:14
Core Viewpoint - The article discusses the role of AI in supporting children, particularly focusing on left-behind children, and emphasizes the need for a scientific evaluation framework to assess AI's suitability for children's welfare [5][59]. Group 1: AI Suitability Assessment - Tencent Research Institute and Beijing University of Science and Technology designed an "AI Suitability Assessment" to evaluate mainstream AI models in addressing children's diverse questions [7]. - A unique five-layer pyramid evaluation model was created, encompassing safety, understanding, empathy, relationship support, and autonomy [9][10]. - The assessment identified ten dimensions that collectively form a comprehensive profile of a child-friendly AI [16]. Group 2: Evaluation Findings - The assessment revealed that AI models scored higher in foundational dimensions like safety and reliability, with scores of 4.04 for confidentiality, 3.88 for accuracy, and 3.87 for non-harmfulness [24]. - However, higher-level dimensions such as empathy, relationship support, and autonomy scored below 3, indicating a significant gap in AI's ability to provide emotional support and foster social relationships [24][57]. - Deepseek was noted as the highest-scoring model in the assessment, although its performance was less pronounced compared to its results in youth sexual education topics [26][57]. Group 3: Emotional and Social Support - In addressing emotional issues, AI performed best with a score of 3.64, reflecting its ability to handle universal emotional patterns but lacking in contextual understanding [28][57]. - The AI's inability to provide deep empathy and effective empowerment was highlighted, particularly in the context of left-behind children's unique challenges [45][49]. - The assessment indicated that while AI can simulate emotional responses, it struggles with fostering genuine relationships and understanding the complexities of children's social environments [40][55]. Group 4: Implications for Education Equity - The article argues that AI can democratize access to educational resources but may also obscure deeper issues of inequality, particularly in understanding children's real-life challenges [59][60]. - The findings suggest that the most effective AI for children is not necessarily the most advanced but one that knows when to engage and when to allow children to take the lead [63]. - The ultimate goal of educational equity should be to nurture well-rounded individuals rather than merely providing advanced tools [63].
腾讯研究院AI速递 20251105
腾讯研究院· 2025-11-04 16:05
Group 1 - OpenAI has entered a strategic partnership with AWS, securing a $38 billion deal over seven years, which includes access to NVIDIA GPU-equipped Amazon EC2 UltraServers [1] - AWS will create a dedicated infrastructure for OpenAI, aiming to deploy all computing power by the end of 2026 [1] - This partnership is considered one of the largest cloud service transactions in history, positively impacting Amazon's stock price [1] Group 2 - Kunlun Wanwei's AI video creation platform SkyReels has officially launched on web and mobile, integrating top global AI multimodal models [2] - The platform features six core functionalities, including an infinite canvas and digital human capabilities, aimed at enhancing creative efficiency in marketing and education [2] - SkyReels is positioned as a zero-threshold creative generation tool, addressing the fragmented and inefficient tools in the creative industry [2] Group 3 - Tencent's ima now supports importing and exporting Tencent documents, enhancing workflow efficiency [3] - Users can import various document types into the knowledge base for analysis and export responses back to Tencent documents [3] - This integration allows for a seamless content management process, significantly improving user experience [3] Group 4 - MiniMax has released its latest music model, Music 2.0, which closely mimics real human vocal tones and supports various singing styles [4] - The model allows precise control over vocal tones and can generate cinematic-level soundtracks with emotional depth [4] - Music 2.0 can produce songs up to five minutes long, maintaining a coherent melody and structure [4] Group 5 - The first AI trading competition concluded, with six AI models trading cryptocurrency, where Qwen3 Max achieved a 22.3% return [5][6] - Two domestic models performed well, while other models, including Claude and GPT-5, experienced significant losses [5][6] - The competition showcased the trading capabilities and risk management of different AI models [6] Group 6 - The AI pendant Nuna, priced at $299, uses radar and AI sensors to monitor emotional changes without intrusive interactions [7] - It categorizes memory into six modules and processes sensitive data on the user's device to ensure privacy [7] - Nuna is designed to be a non-intrusive emotional recorder, helping users reflect on their feelings [7] Group 7 - NVIDIA has launched the first space-based AI server, deploying the H100 GPU in a satellite for advanced data processing [8] - The satellite will operate in low Earth orbit, processing earth observation data at a fraction of the energy cost of ground data centers [8] - Future plans include launching additional satellites equipped with advanced GPUs to expand space-based data processing capabilities [8] Group 8 - a16z partner David George argues that current AI investments differ from the 2000 internet bubble, with substantial cash flow backing [9] - Major corporations are leading AI investments, with a projected $3-4 trillion to be spent on data centers in the next five years [9] - The increase in global token processing volume indicates a sustainable growth trend rather than speculative investment [9] Group 9 - AI pioneer Hinton warns that to achieve significant returns on AI investments, companies may need to replace human labor with AI [10] - He criticizes the focus on profit over human welfare and suggests a coexistence model between humans and AI [10] - Data shows that 95% of enterprises using generative AI have failed, impacting various job sectors while some roles remain resilient [10]
HER来了吗:AI社交的热潮与沉思
腾讯研究院· 2025-11-04 11:16
Core Insights - The article discusses the transition of AI from a "tool" to a "companion," highlighting the growing demand for AI social interaction and the challenges faced by various applications in this space [2][4]. Market Trends - AI social companionship has rapidly gained traction since 2023, with predictions that by spring 2025, it will surpass short videos and gaming in user engagement, reaching an average of 167.9 interactions per month per user [4]. - Leading applications like Character.AI and Replika have surpassed 10 million monthly active users, with optimistic forecasts suggesting the global AI social companionship market could reach $150 billion by 2030 [4]. Market Dynamics - The market exhibits a significant head effect, where only 10% of applications contribute nearly 89% of the revenue, indicating a harsh selection process [5]. - Many well-known projects have failed in 2024, with user complaints about high costs and low retention rates, as evidenced by several top products having an average usage of less than 5 days per month [5][12]. Product Categories - The market features six main categories of AI applications based on emotional needs: emotional companionship, practice assistance, alternative expression, social co-creation, entertainment interaction, and general assistance [6]. User Experience and Memory - Long-term memory is identified as the soul of AI social interaction, with advancements in memory mechanisms allowing for more meaningful and continuous user engagement [14]. - Multi-modal interactions enhance the sense of presence in AI companionship, with new technologies enabling richer user experiences through video, sound, and interactive storytelling [15]. Challenges and Limitations - Despite advancements, AI still struggles with narrative development, often lacking the ability to create engaging and contextually relevant stories [16]. - The need for AI to possess situational awareness and narrative-driving capabilities is emphasized as crucial for enhancing user experience [18][20]. Business Models and Ecosystem - The industry is exploring various business models, including content-driven platforms, vertical scene-focused products, and AI companionship as an operating system [22][26]. - Subscription models remain prevalent, but there is a growing need for diverse revenue streams to ensure sustainability [27]. Ethical Considerations and Governance - The article highlights the dual nature of AI companionship, where it can provide emotional support but also pose risks of dependency and isolation [29]. - Regulatory measures are being implemented to ensure user safety, particularly for minors, with guidelines for age verification and content restrictions [30][31]. Future Directions - The evolution of AI social companionship is expected to follow a progression from expression to relationship and structure, emphasizing the importance of maintaining boundaries and enhancing user engagement [40]. - The balance between technology, business, and ethics is crucial for the positive impact of AI companionship, ensuring it complements rather than replaces real human interactions [41].
腾讯研究院AI速递 20251104
腾讯研究院· 2025-11-03 16:01
Group 1: Generative AI Developments - Cambricon has launched the Cambricon NeuWare foundational software platform, fully compatible with the latest version of PyTorch and Triton operator development language, enabling rapid migration of user models and custom operators [1] - OpenAI has tightened its usage policy, stating that ChatGPT will no longer assist in providing professional advice in high-risk fields such as healthcare, law, and finance, due to rising legal risks and global compliance pressures [2] - Meituan has open-sourced its multimodal model LongCat-Flash-Omni, which has a total parameter count of 560 billion and an active parameter count of 27 billion, achieving state-of-the-art results in multimodal benchmark tests [3] Group 2: AI Applications and Innovations - Baidu's Wenxin app has introduced a "Magic Comic" feature that allows users to generate multi-page AI comics from a single sentence or photo within two minutes, supporting custom character designs and various artistic styles [4] - Cartesia has launched the new Sonic-3 voice model, supported by a $100 million investment from Nvidia, which can generate voice in 42 languages and over 500 tones, with a response time of under 190 milliseconds [5][6] - Turbo AI, founded by two 20-year-old college dropouts, has seen its user base grow from 1 million to 5 million in six months, generating annual recurring revenue in the eight figures while serving clients like Goldman Sachs and McKinsey [7] Group 3: AI Tools and Market Trends - A review of mainstream AI browsers indicates a division between progressive browsers (Chrome/Edge) and radical browsers (ChatGPT Atlas/Perplexity Comet/Dia), each with unique strengths and weaknesses [8] - Rokid has partnered with BOLON to launch the BZ5000 AI smart glasses, which weigh only 38 grams and feature a 12-megapixel camera, emphasizing localized services through its YodaOS system [9] - AI expert Fei-Fei Li has called for universities and non-profit organizations to reclaim the mission of advancing AI as a public good, highlighting the shift from open research to closed commercial competition [10][11] Group 4: Data and Market Opportunities - a16z partners emphasize the importance of building "data moats" in fragmented, sensitive, or hard-to-access fields, with examples like VLex and OpenEvidence showcasing proprietary data systems as competitive advantages [12]
守护克孜尔
腾讯研究院· 2025-11-03 10:59
Core Viewpoint - The article emphasizes the historical and cultural significance of the Kizil Grottoes, highlighting their role as a starting point for Chinese cave art and the ongoing efforts for their preservation and digital restoration [1][2][3]. Group 1: Historical Significance - Kizil Grottoes, dating back to the late 3rd century, contain 349 caves and nearly 4,000 square meters of murals, marking the origin of many early Chinese cave art styles [2][3]. - The site was a crucial cultural hub along the ancient Silk Road, facilitating the exchange of diverse cultures and ideas, particularly Buddhism [1][2][3]. - Kizil was historically a prosperous area, contributing to its rich cultural and artistic heritage, evidenced by the exquisite murals and sculptures found in the caves [19][22]. Group 2: Preservation Efforts - The Kizil Grottoes have faced significant damage over time due to natural disasters, religious changes, and looting, with many murals now scattered across various countries [3][4]. - Recent preservation efforts include physical restoration and the application of digital technologies for documentation and reconstruction of damaged murals [3][4][5]. - The "Tao Yuan Plan 2024" supports the use of AI and advanced imaging techniques to restore and identify missing mural patterns, showcasing a modern approach to cultural heritage preservation [4][5][39]. Group 3: Cultural Exchange and Influence - The murals reflect a blend of Eastern and Western artistic influences, showcasing the creative exchanges that occurred along the Silk Road [16][24]. - The Kizil Grottoes are seen as a vital link in the transmission of Buddhist culture and knowledge into China, significantly impacting Chinese civilization [18][19]. - The unique artistic style of Kizil, characterized by its diamond-shaped murals, represents a distinct cultural expression that has influenced subsequent cave art in China [13][16]. Group 4: Future Directions - The establishment of a digital exhibition center aims to alleviate visitor pressure on the physical site while enhancing public engagement with Kizil's cultural heritage [64][65]. - Ongoing research and technological advancements in digital restoration are expected to improve the understanding and appreciation of Kizil's historical significance [66]. - The integration of AI in restoration efforts is anticipated to enhance the accuracy and efficiency of mural reconstruction, ensuring the preservation of this cultural treasure for future generations [39][46].
腾讯研究院AI速递 20251103
腾讯研究院· 2025-11-02 16:06
Group 1: AI Security Solutions - OpenAI has launched the "white hat" Agent Aardvark powered by GPT-5, capable of automatically identifying and fixing security vulnerabilities in codebases, having recognized 92% of known and artificially injected vulnerabilities [1] - Aardvark's workflow includes threat modeling, submission scanning, sandbox validation, and Codex repair, utilizing LLM reasoning capabilities to operate like human security researchers [1] - Major tech companies such as Google, Anthropic, and Microsoft have also released similar white hat agents in October to address the increasing number of vulnerabilities and the sophistication of attack methods in the AI era [1] Group 2: AI Programming Models - The AI programming application Cursor and Windsurf's newly released models, Composer-1 and SWE-1.5, are suspected to be based on Chinese models, with Cursor showing a tendency to respond in Chinese [2] - Users discovered that Cursor Composer-1 employs the same tokenizer as DeepSeek, while Windsurf's claims of being self-developed were contradicted by its ties to the GLM model developed by Zhiyu [2] - Chinese open-source models dominate performance rankings, filling the top 5 and even top 10, making them a rational choice for startups due to their cost-effectiveness [2] Group 3: Attention Mechanisms in AI Models - Linear attention mechanisms are making a comeback, with domestic models like MiniMax-M1, Qwen3-Next, and DeepSeek V3.2 adopting linear or sub-quadratic attention variants [3] - The new MiniMax model M2 has reverted to traditional attention, citing accuracy issues with linear attention in reasoning and multi-turn dialogue tasks [3] - Kimi Linear proposes a hybrid attention strategy, combining three linear attention blocks with one full attention block, achieving a 75% reduction in KV cache and up to a 6x increase in decoding throughput [3] Group 4: Canva's AI Innovations - Canva, valued at $42 billion, has introduced a self-training foundational model capable of producing complete design files with editable layers and has made the acquired Affinity tool permanently free [4] - The core feature, Ask @Canva, is deeply integrated into the design interface, allowing users to modify elements using natural language, with AI also providing suggestions for design improvements [4] - Canva's annual revenue is approximately $3 billion, with over 240 million monthly active users, and it is expected to go public in 2026, directly competing with Adobe for a 70% market share [4] Group 5: Neuralink's Ambitions - Elon Musk announced that the first Neuralink recipient, Noland Arbaugh, may be the first to receive upgrades or dual chip implants, predicting that Neuralink users could eventually outperform others in gaming [5] - Neuralink has had 12 users with a cumulative usage of over 2,000 days and a total active time exceeding 15,000 hours, with research results from the first three trial participants submitted to the New England Journal of Medicine [5] - The company has initiated a new clinical trial called "thought-to-text," aiming to implant 20,000 individuals annually by 2031, targeting annual revenue exceeding $1 billion and applications for healthy individuals starting in 2030 [5] Group 6: AI in Speech Therapy - A research team from Stanford University tested 15 mainstream models for speech disorder recognition, with the best-performing model achieving only 55% accuracy, below the FDA's clinical standard of 80-85% [6] - The study revealed biases in the models, with better performance on male voices compared to female, and English speakers outperforming those using other languages, as well as older children over younger ones [6] - Fine-tuning techniques have shown promise, with performance accuracy improving by 10% after utilizing a small dataset of children's speech for fine-tuning, indicating the potential of multimodal language models in speech pathology applications [6] Group 7: AI Workflow Transformation - Brex, valued at $12.3 billion, is transforming its internal AI platform into a product, built on Retool and reusing external AI capabilities, maintained by a 25-person systems engineering team [7] - The COO is restructuring the operational workflow, delegating L1 tasks to AI, shifting L2 roles from managers to managing agents, and evolving L3 responsibilities from problem-solving to system design, predicting a 5 to 10 times increase in operational efficiency [7] - Recruitment strategies are shifting from favoring specialists to generalists, with interviews focusing on AI usage habits, requiring AI case studies, and assessing AI application capabilities through real business challenges [7] Group 8: OpenAI's Restructuring - OpenAI has completed a restructuring, with a non-profit foundation holding shares valued at $130 billion, becoming one of the largest charitable foundations globally, with an initial investment of $25 billion for healthcare and AI safety [8] - A new agreement stipulates that OpenAI's current and future AGI model APIs will be exclusively deployed on Azure for seven years, with Microsoft holding approximately 32.5% of OpenAI's shares valued at around $135 billion [8] - Both parties have signed a $250 billion pre-purchase contract for Azure, with Microsoft's capital expenditure reaching $34.9 billion last quarter, a 40% increase from the previous quarter, primarily directed towards new data centers and AI chip procurement [8] Group 9: Legal Issues Surrounding OpenAI - Ilya Sutskever testified for nearly 10 hours in the lawsuit filed by Elon Musk against OpenAI [9] - Ilya submitted a 52-page memorandum detailing allegations against Altman, including accusations of deceiving the board, sowing discord, creating chaos, and enabling the growth of Anthropic [9] - Following Altman's dismissal, the board seriously considered the possibility of merging with Anthropic and appointing Dario Amodei as CEO, but this plan fell through due to operational challenges and a revolt from 700 employees [10]
腾讯研究院AI每周关键词Top50
腾讯研究院· 2025-11-01 02:33
Core Insights - The article presents a weekly roundup of the top 50 keywords related to AI developments, highlighting significant trends and innovations in the industry [2]. Group 1: Chips - Vera Rubin is a notable keyword associated with NVIDIA, indicating advancements in chip technology [3]. - Qualcomm has introduced a new AI inference solution, showcasing its commitment to enhancing AI capabilities [3]. Group 2: Models - OpenAI has developed a safety classification model, emphasizing the importance of security in AI applications [3]. - Cursor has launched its self-developed Composer model, reflecting the trend of companies creating proprietary AI models [3]. - NVIDIA's OmniVinci model and MiniMax's M2 model are also highlighted, indicating ongoing innovation in AI modeling [3][4]. Group 3: Applications - Sora has introduced a role cameo feature, enhancing user interaction with AI [3]. - MiniMax Speech 2.6 and Beijing Zhiyuan's WuJie·Emu3.5 are examples of new AI applications aimed at improving communication [3]. - Adobe's Firefly Image 5 and Tencent's interactive AI podcast demonstrate the growing integration of AI in creative and media sectors [3][4]. Group 4: Technology - The NEO home robot by 1X Technologies and the LeRobot v0.4.0 by Hugging Face represent advancements in consumer robotics [4]. - Neuralink's PRIMA artificial vision and Merge Labs' ultrasound brain-machine interface highlight significant technological innovations in AI and neuroscience [4]. Group 5: Capital - OpenAI is undergoing a capital structure reorganization and has plans for an IPO, indicating its growth and potential market impact [4]. Group 6: Events and Opinions - There is a call for copyright protection in Japan, reflecting ongoing discussions about intellectual property in the AI space [4]. - Yoshua Bengio's new definitions of AGI and insights on mental health data from OpenAI indicate evolving perspectives on AI's role in society [4].