蛋白质折叠
Search documents
苹果掀桌,扔掉AlphaFold核心模块,开启蛋白折叠「生成式AI」时代
3 6 Ke· 2025-09-27 23:59
Core Insights - SimpleFold is a novel protein folding model that utilizes a general Transformer architecture, differing from traditional models like AlphaFold2 by not relying on complex, specialized components such as triangular updates or multiple sequence alignments (MSA) [3][4][10] Model Architecture - The SimpleFold architecture consists of three main components: a lightweight atom encoder, a heavy residue backbone, and a lightweight atom decoder, which collectively balance speed and accuracy [8][10] - The model employs flow matching to treat the generation process as a time-evolving journey, integrating ordinary differential equations (ODE) to refine the output structure progressively [6][10] Training and Evaluation - SimpleFold was trained on various scales, including models with parameters ranging from 100 million to 3 billion, with performance improvements observed as model size increased [11][24] - The training strategy involved replicating the same protein across multiple GPUs to enhance gradient stability and model performance [12][13] - Performance evaluations were conducted on widely recognized benchmarks, CAMEO22 and CASP14, demonstrating SimpleFold's competitive accuracy compared to leading models [14][19][21] Performance Metrics - In CAMEO22, SimpleFold achieved TM-scores and GDT-TS scores comparable to state-of-the-art models, with the 3 billion parameter model reaching a TM-score of 0.837 [15][19] - SimpleFold consistently outperformed other flow-matching methods, such as ESMFlow, across various metrics, indicating its robustness and generalization capabilities [18][22][31] Structural Generation Capability - SimpleFold's generative approach allows it to model structural distributions, producing not only a single deterministic structure but also multiple conformations for the same amino acid sequence [28] - The model's performance in generating structural ensembles was validated against the ATLAS dataset, showcasing its ability to capture diverse protein conformations effectively [29][31] Scalability and Data Utilization - The scalability of SimpleFold was confirmed through experiments showing that larger models performed better with increased training resources and data [34][35] - The model benefits from a growing dataset, with performance improvements noted as the number of unique structures in the training data increased [35]
开创多元协同治理格局 促进人工智能安全有序发展
Ke Ji Ri Bao· 2025-08-29 06:37
Group 1 - The core viewpoint of the article emphasizes the strategic importance of AI as a key driver for high-quality development in China, as outlined in the recent government opinion document [1][3][10] - The document identifies six key actions and eight foundational supports to promote the dual empowerment of technology and application, aiming for deep integration of AI into various sectors including scientific research, industry, and public welfare [1][3] Group 2 - AI is positioned as a "key increment" for high-quality development, with its core value reflected in four dimensions: empowerment, burden reduction, quality improvement, and efficiency enhancement [3][10] - AI is expanding the cognitive boundaries of scientific research, acting as an accelerator for foundational studies, such as AlphaFold solving the protein folding problem [3] - The document highlights AI's role in reducing workload through automation, thereby creating better job opportunities and enhancing consumer satisfaction [3] - In manufacturing, AI has been shown to reduce equipment failure rates by 20%, while in education and healthcare, AI systems are customizing learning paths and assisting doctors, respectively [3] Group 3 - The document addresses the need for a "safety and controllability" principle, emphasizing the importance of preventing security risks associated with AI [6][10] - It outlines inherent risks of AI models, including their "black box" nature, which leads to challenges in understanding decision-making processes and vulnerabilities to adversarial attacks [6] - Ethical challenges are also highlighted, where biases in training data can amplify societal issues, potentially leading to the spread of negative sentiments [6] Group 4 - The document proposes a new governance framework that emphasizes multi-dimensional collaboration to ensure the safe development of AI [8][9] - It suggests a "four-in-one" collaborative governance system that includes improving legal frameworks, establishing a multi-faceted public safety system, creating a network governance system, and developing an intelligent emergency response system [8] - The document also emphasizes enhancing safety governance capabilities across four key areas: technical safety, ethical safety, application safety, and national security [9]