机器学习

Search documents
J.P. Morgan机器学习卓越中心高管亲述,华尔街AI实战心法
机器之心· 2025-09-04 07:04
Core Insights - The article discusses the growing importance of artificial intelligence (AI) and machine learning (ML) in the financial industry, highlighting their applications in quantitative trading and risk management, while also addressing the challenges faced when transitioning from academic research to practical implementation [1][2]. Group 1: AI and ML Applications in Finance - AI and ML are increasingly being utilized in various financial applications, but there are significant challenges when these models are applied in real-world scenarios [1][2]. - Financial institutions prioritize decision-making tools that support "What-if" analyses, such as assessing the impact of interest rate changes [5]. - The complexity of financial data, which includes time series, yield curves, and macroeconomic data, poses challenges for traditional models like LSTM [5]. Group 2: Challenges in Implementation - Many discussions around AI and ML remain theoretical, with practical issues often lacking systematic public discourse [2]. - The integration of tools like Jupyter Notebook can hinder engineering management, and compatibility issues between TensorFlow and PyTorch complicate the development of reusable components [5]. - There is a scarcity of professionals who possess expertise in finance, machine learning, and systems engineering, which is critical for successful implementation [5]. Group 3: Educational and Recruitment Initiatives - The article mentions a lecture by Professor Chak Wong from J.P. Morgan's Machine Learning Center of Excellence, focusing on the practical applications of AI/ML in financial institutions [10][11]. - The event also serves as a recruitment session for J.P. Morgan, inviting candidates from various academic backgrounds to engage with a leading international team [11].
G20举办“黑客马拉松” 聚焦灾害风险管理
Xin Hua Wang· 2025-09-04 03:35
Core Viewpoint - The G20 "Hackathon" competition focuses on collaborative efforts to address disaster risks associated with climate change, emphasizing the use of digital technology and cross-national cooperation [1] Group 1: Event Overview - The G20 Hackathon, hosted by South Africa's Department of Science and Innovation, commenced on September 2 and will last for four days [1] - The event is conducted online, featuring participants from various countries including China, Canada, Singapore, Italy, Spain, Kenya, Nigeria, and Saudi Arabia, who are experts in data science, urban studies, and disaster risk management [1] Group 2: Competition Theme and Objectives - The theme of the competition is "Reducing Disaster Risk through Open Innovation," aiming to enhance disaster resilience in climate-vulnerable and water-scarce regions [1] - Participants will utilize artificial intelligence, machine learning, and geospatial analysis to develop innovative digital solutions, particularly focusing on predicting informal urban expansion and its impact on flood risks [1] Group 3: Expected Outcomes - The event serves as a dynamic testing ground for evidence-based solutions that can inform urban policy-making and planning [1] - The final results will be showcased at the G20 Research and Innovation Ministerial Meeting on September 23, contributing to high-level discussions on climate adaptation and urban resilience [1]
Alumis (ALMS) 2025 Conference Transcript
2025-09-03 14:47
Summary of Alumis Inc. Conference Call Company Overview - **Company**: Alumis Inc. (Ticker: ALMS) - **Industry**: Precision Immunology - **Key Products**: Focus on TIK2 inhibitors for autoimmune diseases, specifically psoriasis and lupus Core Points and Arguments 1. **Clinical Assets**: Alumis has three clinical assets, with a strong research organization. Currently in Phase 3 for psoriasis and Phase 2b for lupus, with read-outs expected in early Q1 and Q3 of next year respectively [2][3] 2. **TIK2 Target**: TIK2 was selected as a target due to its significant role in autoimmune diseases, with 5% of the population having mutations that provide protection against such diseases [4][5] 3. **Efficacy of Envu**: The company's TIK2 inhibitor, now called Envutucitinib (Envu), has shown a clean safety profile and high efficacy, with PASI-75 scores being the highest seen with an oral drug [8][10] 4. **Market Positioning**: The company believes that the oral drug market is underutilized, with less than 10% of diagnosed psoriasis patients on biologics. There is a strong preference for oral treatments among patients [18][19] 5. **Phase 3 Data Benchmarking**: The company is focused on long-term efficacy data (24-week and 52-week) rather than short-term results, which are more relevant for dermatologists [10][11] 6. **Lupus Opportunity**: The Phase 2b trial for lupus is pivotal, with the potential for only one Phase 3 trial if successful. The genetic evidence supports TIK2's role in lupus treatment [30][32] 7. **Trial Design**: The lupus trial includes 408 patients with strict entry criteria to minimize placebo effects, focusing on active SLE patients [35][36] 8. **Market Expansion**: There is potential to expand the systemic treatment market with better-tolerated oral drugs, targeting patients who may currently be on topical therapies [21][22] 9. **Launch Strategy**: Alumis plans to learn from competitors' launches, focusing on drug positioning, pricing, and effective communication of benefits [22][23] 10. **Cash Position**: As of the end of Q2, Alumis had $486 million in cash, expected to last into 2027, with anticipated spikes in R&D spending due to Phase 3 trial enrollment [46] Additional Important Content - **BMI Considerations**: The company acknowledges that BMI can influence drug efficacy and is a factor in cross-trial comparisons [15][16] - **Formulation Development**: Multiple formulations of Envu are being developed, with plans for a once-daily dosing regimen [28] - **Collaboration Potential**: Alumis is unlikely to launch Envu globally on its own and is considering partnerships for market entry [26][27] - **Future Indications**: The company is exploring the potential of TIK2 inhibitors in other diseases driven by interferon pathways, such as Sjogren's syndrome [33] This summary encapsulates the key points discussed during the conference call, highlighting Alumis Inc.'s strategic focus, clinical developments, and market opportunities in the precision immunology sector.
以高水平监测更好服务“三个治污”
Zhong Guo Huan Jing Bao· 2025-09-02 02:03
Core Viewpoint - The article emphasizes the importance of ecological environment monitoring as a foundation for ecological protection and pollution prevention, advocating for improved monitoring data quality and the implementation of advanced technologies to enhance monitoring capabilities [1][2][3]. Group 1: Improving Monitoring Data Quality - The article suggests enhancing the accuracy, comprehensiveness, and timeliness of monitoring data to support precise pollution control. It highlights the need for the widespread application of Laboratory Information Management Systems (LIMS) and unified regulatory frameworks for monitoring institutions [1]. - It calls for a shift in focus for monitoring personnel from merely ensuring data quality to also emphasizing the application of monitoring data, thereby strengthening its role in precise pollution control [1]. Group 2: Accelerating Digital Transformation of Monitoring Systems - The article advocates for the digital transformation of ecological environment monitoring systems, leveraging technologies such as artificial intelligence and cloud platforms to modernize monitoring capabilities [2]. - It emphasizes the need to develop monitoring technologies with independent intellectual property rights and to enhance the automation and intelligence of monitoring processes [2]. - The establishment of a comprehensive ecological environment smart monitoring system is recommended, which would improve the ability to trace pollution sources and enhance environmental quality forecasting [2]. Group 3: Strengthening Legal and Regulatory Frameworks - The article stresses the necessity of a solid legal foundation for ecological environment monitoring, particularly in clarifying the legal status of automatic monitoring data from polluting entities [3]. - It points out that currently, only data from waste incineration power plants can be directly used for administrative enforcement, indicating a need for broader legal recognition of monitoring data [3]. - The role of social monitoring institutions is highlighted, with a call for clear legal definitions regarding the use of their data in environmental enforcement to enhance their contribution to ecological management [3].
OpenAI大神:人工智能导论课程停在15年前,本科首选该是机器学习导论
机器之心· 2025-09-01 08:46
Core Viewpoint - The article emphasizes the importance of selecting the right introductory course in artificial intelligence (AI) for beginners, suggesting that "Introduction to Machine Learning" should be prioritized over "Introduction to AI" due to the outdated content of the latter [2][3]. Group 1: Course Recommendations - Noam Brown, a researcher from OpenAI, advises undergraduate students interested in AI to be cautious and not to choose "Introduction to AI" as their first course [2]. - The article highlights that many universities' "Introduction to AI" courses have not evolved significantly over the past 15 years, often lacking comprehensive coverage of machine learning topics [3]. - A well-structured introductory course should ideally include topics such as linear regression, gradient descent, backpropagation, and reinforcement learning [3]. Group 2: Course Content Comparison - "Introduction to AI" often covers traditional topics like rule-based systems and expert systems, while "Introduction to Machine Learning" focuses on modern AI technologies, including linear regression, neural networks, and deep learning [6]. - The renowned course "CS229: Machine Learning" at Stanford, taught by Andrew Ng, covers supervised learning, unsupervised learning, generative models, and foundational deep learning concepts [6]. Group 3: Industry Relevance - The article notes that most breakthroughs in AI today stem from machine learning and deep learning, rather than the older topics covered in traditional AI courses [11]. - There is a growing sentiment that students should focus on practical skills like prompt engineering and programming to navigate the evolving AI landscape effectively [11].
中山大学发表最新Science论文
生物世界· 2025-09-01 00:00
Core Viewpoint - The article emphasizes the urgent need for global carbon dioxide reduction and enhancing ecosystems' carbon absorption capabilities, highlighting afforestation as a cost-effective natural climate solution [4]. Group 1: Research Findings - A study published in the journal Science quantifies the carbon sequestration potential of soil during global forest restoration, integrating ecological, climatic, and policy factors to redefine afforestation's role in climate change mitigation [4][6]. - The research developed a machine learning model to quantify soil carbon changes post-afforestation, revealing a coexistence of carbon increase and loss primarily in surface soil (0-30 cm) [6]. - If afforestation is limited to areas that avoid unintended warming effects and ensure water resources and biodiversity, approximately 389 million hectares could sequester 39.9 Pg of carbon by 2050, significantly lower than previous estimates [6]. Group 2: Policy Implications - If land is further restricted to existing policy commitments (120 million hectares), the carbon sequestration potential drops to 12.5 Pg [6]. - The study suggests that to achieve larger-scale climate mitigation, there is an urgent need to expand dedicated afforestation areas and enhance commitments from countries with significant undeveloped potential [6][8]. - The findings provide actionable insights for optimizing land use policies and afforestation strategies to maximize climate benefits [8].
机器学习因子选股月报(2025年9月)-20250831
Southwest Securities· 2025-08-31 04:12
Quantitative Models and Construction Methods - **Model Name**: GAN_GRU **Model Construction Idea**: The GAN_GRU model combines Generative Adversarial Networks (GAN) for processing volume-price time-series features and Gated Recurrent Unit (GRU) for encoding time-series features to create a stock selection factor[4][13][41] **Model Construction Process**: 1. **GRU Component**: - Input features include 18 volume-price features such as closing price, opening price, turnover, and turnover rate[14][17][19] - Training data consists of the past 400 days of these features, sampled every 5 trading days, forming a 40x18 matrix to predict cumulative returns over the next 20 trading days[18] - Data preprocessing includes outlier removal and normalization at both time-series and cross-sectional levels[18] - Model architecture: Two GRU layers (128, 128) followed by an MLP (256, 64, 64), with the final output being the predicted return (pRet), which serves as the stock selection factor[22] - Training method: Semi-annual rolling training, with training conducted on June 30 and December 31 each year[18] - Optimization: Adam optimizer, learning rate of 1e-4, IC loss function, early stopping after 10 epochs, and a maximum of 50 training epochs[18] 2. **GAN Component**: - GAN consists of a generator (G) and a discriminator (D)[23] - Generator: Uses LSTM to preserve the time-series nature of the input features, transforming random noise into realistic data samples[33][37] - Loss function: $$ L_{G} = -\mathbb{E}_{z\sim P_{z}(z)}[\log(D(G(z)))] $$ where \( z \) represents random noise, \( G(z) \) is the generated data, and \( D(G(z)) \) is the discriminator's output probability[24][25] - Discriminator: Uses CNN to process the two-dimensional volume-price time-series features, distinguishing between real and generated data[33][37] - Loss function: $$ L_{D} = -\mathbb{E}_{x\sim P_{data}(x)}[\log D(x)] - \mathbb{E}_{z\sim P_{z}(z)}[\log(1-D(G(z)))] $$ where \( x \) is real data, \( D(x) \) is the discriminator's output for real data, and \( D(G(z)) \) is the output for generated data[27][29] - Training: Alternating updates of the generator and discriminator parameters until convergence[30] **Model Evaluation**: The GAN_GRU model effectively captures both time-series and cross-sectional features, leveraging the strengths of GAN and GRU for stock selection[4][13][41] --- Model Backtesting Results - **GAN_GRU Model**: - **IC Mean**: 11.36%[41][42] - **ICIR (Non-Annualized)**: 0.88[42] - **Turnover Rate**: 0.83[42] - **Recent IC**: -2.56%[41][42] - **1-Year IC Mean**: 8.94%[41][42] - **Annualized Return**: 38.09%[42] - **Annualized Volatility**: 23.68%[42] - **IR**: 1.61[42] - **Maximum Drawdown**: 27.29%[42] - **Annualized Excess Return**: 23.52%[41][42] --- Quantitative Factors and Construction Methods - **Factor Name**: GAN_GRU Factor **Factor Construction Idea**: Derived from the GAN_GRU model, this factor encodes volume-price time-series features to predict stock returns[4][13][41] **Factor Construction Process**: - The factor is generated using the output of the GAN_GRU model, which combines GAN-based feature generation and GRU-based time-series encoding[4][13][41] - The factor undergoes industry and market capitalization neutralization, as well as standardization, before being used for testing[22] **Factor Evaluation**: The GAN_GRU factor demonstrates strong predictive power across various industries, with consistent outperformance in recent years[4][13][41] --- Factor Backtesting Results - **GAN_GRU Factor**: - **IC Mean**: 11.36%[41][42] - **ICIR (Non-Annualized)**: 0.88[42] - **Turnover Rate**: 0.83[42] - **Recent IC**: -2.56%[41][42] - **1-Year IC Mean**: 8.94%[41][42] - **Annualized Return**: 38.09%[42] - **Annualized Volatility**: 23.68%[42] - **IR**: 1.61[42] - **Maximum Drawdown**: 27.29%[42] - **Annualized Excess Return**: 23.52%[41][42]
德国耐驰:树脂基复材在线固化监测与智能化生产控制
DT新材料· 2025-08-27 16:04
Core Viewpoint - The article emphasizes the innovative solutions provided by NETZSCH in the polymer and polymer-based composite processing industry, particularly through the application of their sensXPERT technology in Airbus's manufacturing processes [2][6]. Group 1: Industry Challenges - Various industries, including automotive and aerospace, face similar challenges such as reducing production cycles, increasing yield, and dynamically controlling each product [3]. - The increasing use of thermosetting plastics and composites in high-performance parts production necessitates customized resin and formulation materials, which introduces significant production challenges [3]. - A critical issue in composite manufacturing is the lack of "data transparency," particularly in real-time curing process data, which hinders process optimization and efficiency improvements [3]. Group 2: NETZSCH Solutions - NETZSCH has been selected by Airbus to provide intelligent sensor solutions, integrating innovative sensors with advanced analytics and machine learning to enhance polymer and composite manufacturing methods [6]. - The sensors installed in molds can measure key material properties in real-time, such as curing degree and glass transition temperature, thereby improving production efficiency [6]. - The combination of material science with real-time data from the manufacturing environment allows for the application of AI on the production floor, creating dynamic processes based on historical and new data [6]. Group 3: Benefits of sensXPERT - The sensXPERT solution aims to reduce scrap rates and achieve operational excellence by optimizing processes in real-time [10]. - It provides maximum equipment efficiency and transparency in the manufacturing process through customizable dashboards that allow for product traceability [10]. - The solution also accounts for variations in data from different batches due to factors like transportation, storage, and environmental conditions, ensuring a reliable manufacturing process [6]. Group 4: Upcoming Events - The 2025 Polymer Industry Annual Conference will take place from September 10-12 in Hefei, where industry leaders will discuss new opportunities in emerging industries, including AI and aerospace [12][18]. - Zeng Zhiqiang, Vice President of Market and Applications at NETZSCH, will present on "Online Curing Monitoring and Intelligent Production Control of Resin-Based Composites" during the conference [8][20].
字节跳动再失大将,豆包大模型视觉研究负责人冯佳时离职
Sou Hu Cai Jing· 2025-08-27 05:06
Core Insights - ByteDance has lost a significant figure in the AI field, Feng Jiashi, who was the leader of the Doubao large model visual research team, raising concerns in the industry [1][3] - Feng Jiashi's departure follows rumors from June, which were initially denied by ByteDance, indicating a confirmed exit [1][3] Group 1: Impact of Departure - Feng Jiashi's exit is expected to impact ByteDance, as he brought extensive academic and practical experience to the company, having previously served as an assistant professor at the National University of Singapore [3][11] - He has published over 400 papers in deep learning and related fields, with over 69,000 citations on Google Scholar, highlighting his significant contributions to AI research [3][11] Group 2: Talent Loss Context - Feng Jiashi's departure is part of a broader trend of talent loss at ByteDance, with several key figures leaving since December, including leaders from various product lines [13] - Despite these challenges, ByteDance is actively recruiting globally to fill the talent gaps, having previously hired key members from Alibaba and Google DeepMind [13][19] Group 3: Competitive Landscape - The competition for AI talent is intensifying, and ByteDance is striving to maintain its leading position in the industry despite the ongoing talent exodus [19]
打磨7年,李航新书《机器学习方法(第2版)》发布,有了强化学习,赠书20本
机器之心· 2025-08-27 03:18
Core Viewpoint - The article discusses the release of the second edition of "Machine Learning Methods" by Li Hang, which expands on traditional machine learning to include deep learning and reinforcement learning, addressing the growing interest in these areas within the AI community [4][5][22]. Summary by Sections Overview of the Book - The new edition of "Machine Learning Methods" includes significant updates and additions, particularly in reinforcement learning, which has been gaining attention in AI applications [4][5]. - The book is structured into four main parts: supervised learning, unsupervised learning, deep learning, and reinforcement learning, providing a comprehensive framework for readers [5][22]. Supervised Learning - The first part covers key supervised learning methods such as linear regression, perceptron, support vector machines, maximum entropy models, logistic regression, boosting methods, hidden Markov models, and conditional random fields [7]. Unsupervised Learning - The second part focuses on unsupervised learning techniques, including clustering, singular value decomposition, principal component analysis, Markov chain Monte Carlo methods, EM algorithm, latent semantic analysis, and latent Dirichlet allocation [8]. Deep Learning - The third part introduces major deep learning methods, such as feedforward neural networks, convolutional neural networks, recurrent neural networks, Transformers, diffusion models, and generative adversarial networks [9]. Reinforcement Learning - The fourth part details reinforcement learning methods, including Markov decision processes, multi-armed bandit problems, proximal policy optimization, and deep Q networks [10]. - The book aims to provide a systematic introduction to reinforcement learning, which has been less covered in previous textbooks [4][10]. Learning Approach - Each chapter presents one or two machine learning methods, explaining models, strategies, and algorithms in a clear manner, supported by mathematical derivations to enhance understanding [12][19]. - The book is designed for university students and professionals, assuming a background in calculus, linear algebra, probability statistics, and computer science [22]. Author Background - Li Hang, the author, is a recognized expert in the field, with a background in natural language processing, information retrieval, machine learning, and data mining [24].