Research Article

Common Laws Driving the Success in Show Business

Table 1

The details of training data and test data for each model.

Model 1: MAO_ours

Training data (including validation data): 70% data of an actor (AM ≥ 5, L ≥ 20), the sampling rate of subsequence generation: n = 6; the total number of subsequences in the training set: 6 × 0.7 × 21994 = 92374

Test data: 30% data of an actor (AM ≥ 5, L ≥ 20), the sampling rate of subsequence generation: n = 6; 100% data of an actor (AM ≥ 5, 5 ≤ L < 20), the sampling rate of subsequence generation: n = 5; 100% data of an actress (AM ≥ 5, L ≥ 20), the sampling rate of subsequence generation: n = 12; 100% data of an actress (AM ≥ 5, 5 ≤ L < 20), the sampling rate of subsequence generation: n = 5; the total number of subsequences in the test set: 6 × 0.3 × 21994 + 5 × 1 × 15902 + 12 × 1 × 9034 + 5 × 1 × 12991 = 292462

Model 2: MAE_ours

Training data (including validation data): 70% data of an actress (AM ≥ 5, L ≥ 20), the sampling rate of subsequence generation: n = 12; the total number of subsequences in the training set: 12 × 0.7 × 9034 = 75885

Test data: 30% data of an actress (AM ≥ 5, L ≥ 20), the sampling rate of subsequence generation: n = 12; 100% data of an actress (AM ≥ 5, 5 ≤ L < 20), the sampling rate of subsequence generation: n = 5; 100% data of an actor (AM ≥ 5, L ≥ 20), the sampling rate of subsequence generation: n = 6; 100% data of an actor (AM ≥ 5, 5 ≤ L < 20), the sampling rate of subsequence generation: n = 5; the total number of subsequences in the test set: 12 × 0.3 × 9034 + 5 × 1 × 12991 + 6 × 1 × 21994 + 5 × 1 × 15902 = 308951

Model 3: MM_ours

Training data (including validation data): 70% mixed data of an actor and actress (AM ≥ 5, L ≥ 20), the sampling rate of subsequence generation: n = 3 for the data of an actor in the mixed data, n = 6 for the data of an actress in the mixed data; the total number of subsequences in the training set: 3 × 0.7 × 21994 + 6 × 0.7 × 9034 = 46187 + 37942 = 84129

Test data: 30% mixed data of an actor and actress (AM ≥ 5, L ≥ 20), the sampling rate of subsequence generation: n = 3 for the data of an actor in the mixed data, n = 6 for the data of an actress in the mixed data; 100% data of an actress (AM ≥ 5, 5 ≤ L < 20), the sampling rate of subsequence generation: n = 5; 100% data of an actor (AM ≥ 5, 5 ≤ L < 20), the sampling rate of subsequence generation: n = 5; the total number of subsequences in the test set: 3 × 0.3 × 21994 + 6 × 0.3 × 9034 + 5 × 1 × 12991 + 5 × 1 × 15902 = 180520

The validation data are included in the training data. Note: MM_ours denotes the prediction model trained on the mixed data of an actor and actress, MAO_ours denotes the prediction model trained on the data of an actor only, and MAE_ours denotes the prediction model trained on the data of an actress only.
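The subsequence totals in Table 1 can be rechecked with a few lines of arithmetic. The sketch below is not the authors' code. It assumes each term has the form n × (fraction of the data) × (base count), with the base counts 21994, 15902, 9034 and 12991 read from the table, and that each term is truncated to an integer before summing (the Model 3 training total, 46187 + 37942, suggests this). The names `BASE`, `SPLITS`, `term` and `total` are hypothetical.

```python
# Base counts as they appear in Table 1, keyed by (group, length class).
# "long" means L >= 20 and "short" means 5 <= L < 20, both with AM >= 5.
BASE = {
    ("actor", "long"): 21994,
    ("actor", "short"): 15902,
    ("actress", "long"): 9034,
    ("actress", "short"): 12991,
}


def term(n, percent, group):
    """One table term: n x (percent / 100) x base count, truncated to an integer.

    Integer arithmetic is used so that no floating-point rounding is involved.
    """
    return n * percent * BASE[group] // 100


def total(parts):
    """Sum of the truncated terms for one training or test set."""
    return sum(term(n, percent, group) for n, percent, group in parts)


# Each entry is (sampling rate n, percentage of the data used, data group).
SPLITS = {
    "MAO_ours": {
        "train": [(6, 70, ("actor", "long"))],
        "test": [
            (6, 30, ("actor", "long")),
            (5, 100, ("actor", "short")),
            (12, 100, ("actress", "long")),
            (5, 100, ("actress", "short")),
        ],
    },
    "MAE_ours": {
        "train": [(12, 70, ("actress", "long"))],
        "test": [
            (12, 30, ("actress", "long")),
            (5, 100, ("actress", "short")),
            (6, 100, ("actor", "long")),
            (5, 100, ("actor", "short")),
        ],
    },
    "MM_ours": {
        "train": [(3, 70, ("actor", "long")), (6, 70, ("actress", "long"))],
        "test": [
            (3, 30, ("actor", "long")),
            (6, 30, ("actress", "long")),
            (5, 100, ("actress", "short")),
            (5, 100, ("actor", "short")),
        ],
    },
}

totals = {
    model: {split: total(parts) for split, parts in splits.items()}
    for model, splits in SPLITS.items()
}

for model, split_totals in totals.items():
    print(model, split_totals["train"], split_totals["test"])
```

Under these assumptions the computed totals agree with the six totals reported in the table. Truncating each term separately matters only for the Model 3 training set, where truncating the untruncated sum instead would give 84130 rather than the reported 84129.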