Workflow
icon
Search documents
大模型安全研究报告(2024年)
阿里巴巴· 2024-09-30 03:55
Industry Investment Rating - The report does not explicitly provide an investment rating for the industry [1][2][3] Core Viewpoints - The rapid development of large models (LLMs) is driving the transition from specialized weak AI to general strong AI, marking a significant leap in intelligence and transforming human-computer interaction and application development models [3] - The commercialization and industrialization of large models have introduced new risks such as model "hallucinations," prompt injection attacks, and the democratization of cyberattacks, alongside exacerbating existing AI security risks [3] - Large models also present new opportunities for solving cybersecurity bottlenecks, leveraging their capabilities in information understanding, knowledge extraction, and task orchestration [3][24] - The report proposes a comprehensive framework for large model safety, focusing on both the safety of the models themselves and their application in enhancing cybersecurity [3][25] Summary by Sections Large Model Technology Evolution - The evolution of large models can be divided into three phases: the exploration phase (2017-2021) with pre-trained language models like GPT-1 and BERT, the explosion phase (2022-2023) with language models like ChatGPT, and the enhancement phase (2024-present) with multimodal models like Sora and GPT-4o [14][15][16][17] Security Challenges of Large Models - Large models face significant security risks across four key components: training data, algorithm models, system platforms, and business applications [18] - Training data risks include data leakage, data poisoning, and low-quality data [19] - Algorithm model risks include insufficient robustness, model "hallucinations," bias, and poor interpretability [20] - System platform risks include vulnerabilities in machine learning frameworks and development toolchains [21] - Business application risks include the generation of illegal or harmful content and data leakage [22][23] New Security Opportunities with Large Models - Large models can significantly enhance the precision and timeliness of threat identification, defense, detection, response, and recovery in cybersecurity [24] - They improve the universality and usability of data security technologies by automating data classification and reducing reliance on manual analysis [24] - Large models also enhance the robustness and accuracy of content security technologies, particularly in detecting deepfakes and other malicious content [24] Large Model Safety Framework - The safety framework for large models includes four dimensions: safety goals, safety attributes, protection objects, and safety measures [25][26] - Safety goals focus on ensuring the credibility of training data, the reliability of algorithm models, the stability of system platforms, and the controllability of business applications [32][33] - Safety attributes include authenticity, diversity, accuracy, confidentiality, accountability, predictability, fairness, transparency, explainability, compliance, reliability, controllability, and robustness [34] - Protection objects include the system, data, users, and behavior [35][36] - Safety measures cover training data protection, algorithm model protection, system platform protection, and business application protection [37][38][39][40][41][42][43][44][45][46][47][48][49][50][51][52][53][54][55] Large Model Empowering Security - Large models can empower cybersecurity, data security, and content security by leveraging their capabilities in natural language understanding, knowledge extraction, and task orchestration [60] - In cybersecurity, large models are applied in threat identification, protection, detection, response, and recovery, with applications in threat intelligence generation, vulnerability mining, code auditing, and network attack tracing [61][62][63][64][65][66][67][68][69] - In data security, large models are used for automated data classification and detection of personal information violations in apps and SDKs [60] - In content security, large models enhance the detection of text, image, video, and audio content for illegal or harmful information [60]
2024年阿里妈妈热点指南VOL.16
阿里巴巴· 2024-09-29 07:15
alimama 热点趋势 VOL.16 alimama . 16. 16. the first 10 g C の 1 雅丹风、购金热与携宠游 秋潮涌动 扫码下载完整报告 开启多元消费季 金秋时节新趋势涌现,阿里妈妈精准洞察!雅丹风穿搭掀动秋冬自然潮流,年轻人购金热引领黄金 消费变革,宠物出行潮拉动宠物装备需求增长。阿里妈妈深度剖析,导航金秋消费热点,与商家共同探 索金秋市场潜能! 雅丹风穿搭,焕发大地生机 服饰鞋帽 秋冬"雅丹风"盛行,大地色系彰显荒野气质,层次剪 裁让传统材质焕发新生。牛仔裤、风衣、毛衣等单品大热, 层次与质感并存,商家可借势聚焦热门材质,开发叠穿单 品,迎合都市女性需求,引领秋冬衣橱升级。 年轻人购金热,性价颜值兼顾 钟表配饰 珠宝文玩 年轻人卷起"购金"热潮,小而美、性价比高的金饰成 为都市青年攒财扮靓首选。从快速消费的小金饰,到复古 打金工艺的独特魅力,再到跨界联名引发的文化共鸣,商 家可融合古法工艺与当代审美趋势,打造平价轻奢新金饰。 携宠看世界,宠物出行新装 宠物生活 宠物友好服务日盛,假期携宠出游热度高,推动宠物 出行装备需求多元发展。从出行保障到食宿娱乐,从实用 产品到品质需求 ...
2024年环境、社会和治理(ESG)报告阿里巴巴
阿里巴巴· 2024-08-05 09:10
Investment Rating - The report does not explicitly state an investment rating for the industry or company Core Insights - The report emphasizes the importance of integrating ESG (Environmental, Social, and Governance) strategies into the core business model, highlighting that ESG is a foundational strategy for the company [13][22] - The company aims to achieve carbon neutrality in its operations by 2030 and has set ambitious targets for reducing emissions across its value chain [26][27] - The report outlines the company's commitment to supporting small and micro enterprises through technology and platform innovations, enhancing their market access and operational capabilities [42][44] Summary by Relevant Sections Environmental Initiatives - The company has made significant progress in reducing its carbon footprint, achieving a reduction of 5.0% in net emissions to 4.449 million tons and increasing the use of clean energy to 39.0% [27] - The company has implemented a range of strategies to promote a circular economy, including reducing packaging waste and enhancing water management efficiency [29] Social Contributions - The company has focused on enhancing social inclusion and resilience, particularly through initiatives aimed at rural revitalization and emergency response [47] - The report highlights the company's efforts in providing digital services and training to rural areas, contributing to local employment opportunities [47] Governance and Management - The company has established a robust ESG governance framework, including a sustainable development committee and specialized working groups to oversee ESG initiatives [24][22] - The report indicates that the performance of business unit CEOs will be evaluated based on their achievement of ESG targets, reinforcing accountability [24] Employee Development - The company is committed to fostering a diverse and inclusive workplace, with 47.2% of employees being women and 41.4% of management positions held by women [32] - The report outlines various programs aimed at employee health and well-being, including comprehensive health services and a supportive work environment [36][31] Sustainable Consumption - The company promotes responsible consumption through initiatives that encourage low-carbon lifestyles and the availability of eco-friendly products [38][39] - The report details efforts to enhance accessibility for disabled and elderly consumers, ensuring that digital platforms cater to their needs [39]
2024年9月采购节「核心活动会场」解读报告
阿里巴巴· 2024-07-20 04:52
E SUPER SEPTEMBER Alibaba.com 九月采购节「核心会场」详解 | --- | --- | |---------------------------|------------------| | | | | | | | CONTENTS | | | 01 市场背景及大促策略 | 02 买家限时权益 | | 03 核心会场介绍及Demo展示 | 04 FAQ及报名指引 | EL SUPER SEPTEMBER Alibaba.com 市场背景及大促策略 Part 01 市场分析 - 性价比依然是采购的第一决策因素 美国消费者信心指数(CCI)连续三个月下降,从年初的114.8回落至4月的97.5,为2022年7月 · 欧洲市场: 通货膨胀逐步缓和/经济恢复 以来的最低水平,5月虽回升至102.0,但较年初尚有差距。 在2024年的出口趋势中尤为突出。海关数据显示,前五个月累计出口货运量7.65 美国市场:性价比仍是最大的决策依据( The conference 亿吨,同比增长6%;承运外贸货物的进出境船舶量也大幅增长,前五个月累计达16.2万艘,同 ● 比大增12.4%。但同期中国出口总值 ...
2024百炼成金大金融模型新篇章
阿里巴巴· 2024-07-02 09:15
百炼成金 大金融模型新篇章 New Chapter of Financial LLM New Finance New Future 作者 张 翅 阿里云智能集团副总裁 新金融行业总经理 简介 张翅先生曾经在蚂蚁金服智能科技团队负责蚂蚁金服 科技产品的开放合作,推动内部技术产品化和金融行 业数字化转型。从 17 年开始先后负责了从银行、保 险、证券到金融服务等多个重要客户的数字化项目, 深耕金融科技、云原生分布式架构、移动平台、大数 据、人工智能、区块链等数字金融技术领域。在加入 阿里云和蚂蚁金服前,张翅先生先后供职于甲骨文、 Pivotal,领导参与了多个重大项目的建设,拥有丰富 的企业架构设计、IT 战略规划、产品研发及团队管理 等专业经验。 前言 山不让尘,川不辞盈。2024 年是互联网进入中国的第 30 个年头,中国金融行业也走过了金融科技和数字 化的 10 个年头。科技金融这篇大文章正方兴未艾,智能金融随着大模型日新月异发展突然按下了加速键。如果 将过去一年大模型的发展比作《三体》中描述的"技术爆炸",正形象地展现出了 AI 领域前所未有的快速变革。 这种爆炸式增长不仅仅是技术参数的简单膨胀,更是整个 ...
2024年GenAI技术落地白皮书
阿里巴巴· 2024-06-19 09:15
[]阿里云 ...
经营指南关键词BEST方法论BEST品类热趋关键词方法论阿里妈妈
阿里巴巴· 2024-06-04 03:05
天猫618 妈耶!真好搜 滋阿里妈妈 6.18特辑 年中狂欢节 阿里妈妈经营指南 关键词BEST方法论 ...
2024大模型训练数据白皮书
阿里巴巴· 2024-05-28 09:55
Industry Overview - The report emphasizes the importance of high-quality and diverse training data for the success of large language models (LLMs) like GPT, highlighting that data quality and scale are key drivers of model performance [11][12] - The training data for LLMs is categorized into three stages: pre-training, supervised fine-tuning (SFT), and reinforcement learning with human feedback (RLHF), each requiring different types of data [14][16] - For multimodal models, training data includes image-text pairs and video-text pairs, enabling models to understand and generate information across multiple modalities [17] Data Types and Quality - High-quality data is crucial for model performance, as it enhances accuracy, stability, and generalization capabilities [23] - The report identifies three uncertainties in defining high-quality data: the type of data required, the evolution of data forms, and the effective combination of different data types [25][26] - Data quality is evaluated based on quality, scale, and diversity, with no universal standard for high-quality data [27][28] Synthetic Data - Synthetic data is proposed as a solution to address the shortage of training data, offering benefits such as cost efficiency, privacy protection, and the ability to simulate rare events [31][33] - Synthetic data can be generated through algorithms and mathematical models, either based on real datasets or created from scratch using existing models or domain knowledge [34][35] - It is particularly useful in multimodal data generation and domain-specific knowledge creation, enhancing model performance in specialized fields [37][39] Data Governance and Compliance - The report discusses the importance of data governance, emphasizing that model training does not rely on personal information and that the use of copyrighted data for training is considered transformative and falls under fair use [50][51] - It suggests that data governance should focus on output control and post-event remedies rather than imposing strict pre-use restrictions, allowing for more flexibility in data utilization [52] Government and Social Collaboration - The report compares the data ecosystems in the US and China, highlighting that the US government promotes open access to public data, while China faces challenges in data openness and integration [54][55][60] - Social forces in the US play a significant role in integrating public and open data to create high-quality training datasets, whereas in China, the reliance on overseas datasets and limited public data access hinders the development of a robust data ecosystem [56][60] Alibaba's Exploration in LLM Training - Alibaba integrates high-quality Chinese datasets with overseas open-source data, ensuring data compliance and optimizing training data quality [63] - The company employs synthetic data in e-commerce scenarios to enhance recommendation systems, improving both performance and privacy protection [64]