Workflow
Scaling Laws(标度律)
icon
Search documents
训练自2.67亿个单细胞数据的AI虚拟细胞模型——STATE,无需实验,预测细胞对药物或基因扰动的反应
生物世界· 2025-07-07 03:17
Core Viewpoint - The article discusses the development of a virtual cell model called STATE by Arc Institute, which aims to predict cellular responses to various drug and genetic interventions, thereby enhancing the success rate of clinical trials and drug discovery [3][12]. Group 1: Virtual Cell Model STATE - STATE is designed to predict the responses of various cell types, including stem cells, cancer cells, and immune cells, to drugs and genetic disturbances [3][12]. - The model is trained on data from 167 million cells and over 100 million disturbance data points, covering 70 different cell lines [3][7]. - STATE consists of two interconnected modules: State Embedding (SE) and State Transition (ST), which allow for the prediction of RNA expression changes based on initial transcriptomes and disturbances [6][7]. Group 2: Performance and Advantages - STATE significantly outperforms existing computational methods, showing a 50% improvement in distinguishing disturbance effects and double the accuracy in identifying differentially expressed genes [7][9]. - The model is the first to surpass simple linear baseline models in all tests conducted [7]. - It focuses on single-cell RNA sequencing data, which is currently the only unbiased data available at scale for researchers [7]. Group 3: Data Collection and Causality - The research team compensates for the limitations of single-cell RNA sequencing data by collecting large-scale disturbance data through experiments like CRISPR gene editing [8][9]. - Disturbance data captures causal relationships between genes, providing insights into biological mechanisms that observational data cannot [8][9]. Group 4: Future Developments and Applications - The ultimate goal of the virtual cell model is to help scientists explore a vast space of combinatorial possibilities for cellular changes, which is impractical to test experimentally [12]. - The team has introduced Cell_Eval, a comprehensive evaluation framework for virtual cell modeling, focusing on biologically relevant metrics [12]. - A virtual cell challenge has been launched, offering a $100,000 prize to encourage innovation in this field [12].