Research Article

Identifying Incident Causal Factors to Improve Aviation Transportation Safety: Proposing a Deep Learning Approach

Table 4

Important statistics about the utilized ASRS data set.

Multilabel statisticsValue

Number of utilized factors6
Number of valid samples172,990
Factor cardinality1.47
Factor density0.245
Number of distinct label sets28
Most frequent label set{Human factor, aircraft}

After cleaning and preprocessing, we use the six most frequent labels from 172,990 reports. On average, every report has 1.47 labels (label density of 0.245).