Quantitative Models and Construction Methods 1. Model Name: GAN_GRU - Model Construction Idea: The GAN_GRU model combines Generative Adversarial Networks (GAN) for feature generation and Gated Recurrent Unit (GRU) for time-series feature encoding to construct a stock selection factor[4][13][14] - Model Construction Process: 1. GAN Component: - The generator (G) learns the real data distribution and generates realistic samples from random noise (Gaussian or uniform distribution). The generator's loss function is: where represents the discriminator's probability of classifying generated data as real[24][25][26] - The discriminator (D) distinguishes real data from generated data. Its loss function is: where is the probability of real data being classified as real, and is the probability of generated data being classified as real[27][29][30] - GAN training alternates between optimizing and until convergence[30] 2. GRU Component: - Two GRU layers (GRU(128, 128)) are used to encode time-series features, followed by a Multi-Layer Perceptron (MLP) with layers (256, 64, 64) to predict returns. The final output is used as the stock selection factor[22] 3. Feature Input and Processing: - Input features include 18 price-volume characteristics (e.g., closing price, turnover, etc.) sampled over the past 400 days, with a shape of (40 days of features)[18][19][37] - Features undergo outlier removal, standardization, and cross-sectional normalization[18] 4. Training Details: - Training-validation split: 80%-20% - Semi-annual rolling training (June 30 and December 31 each year) - Hyperparameters: batch size equals the number of stocks, Adam optimizer, learning rate , IC loss function, early stopping (10 rounds), max training rounds (50)[18] 5. Stock Selection: - Stocks are filtered to exclude ST stocks and those listed for less than six months[18] - Model Evaluation: The GAN_GRU model effectively captures price-volume time-series features and demonstrates strong predictive power for stock returns[4][13][22] --- Model Backtesting Results 1. GAN_GRU Model - IC Mean: 0.1119*** (2019-2025)[4][41] - ICIR (non-annualized): 0.89[42] - Turnover Rate: 0.83X[42] - Recent IC: 0.0331*** (December 2025)[4][41] - 1-Year IC Mean: 0.0669***[4][41] - Annualized Return: 37.40%[42] - Annualized Volatility: 23.39%[42] - IR: 1.60[42] - Maximum Drawdown: 27.29%[42] - Annualized Excess Return: 22.42%[4][42] --- Quantitative Factors and Construction Methods 1. Factor Name: GAN_GRU Factor - Factor Construction Idea: The GAN_GRU factor is derived from the GAN_GRU model, leveraging GAN for price-volume feature generation and GRU for time-series encoding[4][13][14] - Factor Construction Process: - The GAN generator processes raw price-volume time-series features () and outputs transformed features with the same shape ()[37] - The GRU component encodes these features into a predictive factor for stock selection[22] - The factor undergoes industry and market capitalization neutralization and standardization[22] - Factor Evaluation: The GAN_GRU factor demonstrates robust performance across various industries and time periods, with significant IC values and excess returns[4][41] --- Factor Backtesting Results 1. GAN_GRU Factor - IC Mean: 0.1119*** (2019-2025)[4][41] - ICIR (non-annualized): 0.89[42] - Turnover Rate: 0.83X[42] - Recent IC: 0.0331*** (December 2025)[4][41] - 1-Year IC Mean: 0.0669***[4][41] - Annualized Return: 37.40%[42] - Annualized Volatility: 23.39%[42] - IR: 1.60[42] - Maximum Drawdown: 27.29%[42] - Annualized Excess Return: 22.42%[4][42] 2. Industry-Specific Performance - Top 5 Industries by Recent IC (October 2025): - Social Services: 0.4243*** - Coal: 0.2643*** - Environmental Protection: 0.2262*** - Retail: 0.1888*** - Steel: 0.1812***[4][41][42] - Top 5 Industries by 1-Year IC Mean: - Social Services: 0.1303*** - Steel: 0.1154*** - Non-Bank Financials: 0.1157*** - Retail: 0.1067*** - Building Materials: 0.1017***[4][41][42] 3. Industry-Specific Excess Returns - Top 5 Industries by December 2025 Excess Returns: - Banking: 4.30% - Real Estate: 3.51% - Environmental Protection: 2.18% - Retail: 1.76% - Machinery: 1.71%[2][45] - Top 5 Industries by 1-Year Average Excess Returns: - Banking: 2.12% - Real Estate: 1.93% - Environmental Protection: 1.50% - Retail: 1.46% - Machinery: 1.23%[2][46]
机器学习因子选股月报(2026年1月)-20251231