Research Article

A Collaborative Deep and Shallow Semisupervised Learning Framework for Mobile App Classification

Table 1

The summary of the existing semisupervised learning techniques.

SchemePrincipleDrawbacksReferences

Generative semisupervised learningIt assumes that both labeled and unlabeled samples are generated from the same parametric model; then, it treats the labels of the unlabeled samples as missing values of the model parametersWhen the parametric model assumption is incorrect, fitting the model using unlabeled samples would result in performance degradation[21, 27, 28]

Graph-based semisupervised learningIt constructs a graph whose nodes are samples (including both labeled and unlabeled samples) and edges reflect relations between nodes (e.g., feature similarity); then, the labels are propagated by exploiting the connective characteristicsFirst, it suffers from poor scalability; second, it is difficult to build the relations between samples when the features are sophisticated[29ā€“31]

Disagreement-based semisupervised learningMultiple base learners are initially trained on labeled samples, and then they learn from each other by exploiting the disagreement among them on unlabeled samplesHow to guarantee the diversity between base learners is an open problem[25, 32, 33]