Research Article

Identifying Human Phenotype Terms by Combining Machine Learning and Validation Rules

Figure 1

Layout of IHP’s annotation pipeline. IHP requires three inputs: a Gold Standard Corpus, which serves as the training set for CRFSuite and is later used to evaluate IHP’s performance; a feature set for CRFSuite; and a list of rules and a dictionary to correct potential errors.
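The two non-corpus inputs named in the caption can be illustrated with a minimal sketch. It does not reproduce IHP's actual feature set or validation rules: the function names `token_features` and `validate`, the specific features, and both rules are hypothetical, chosen only to show the general shape of a per-token feature map for a CRF tagger and of a rule-and-dictionary pass over its candidate terms.

```python
from typing import Dict, List, Set


def token_features(tokens: List[str], i: int) -> Dict[str, object]:
    """Hypothetical per-token features of the kind a CRF tagger consumes."""
    tok = tokens[i]
    return {
        "lower": tok.lower(),
        "suffix3": tok[-3:].lower(),
        "is_title": tok.istitle(),
        # Neighbouring tokens give the tagger local context.
        "prev": tokens[i - 1].lower() if i > 0 else "<BOS>",
        "next": tokens[i + 1].lower() if i < len(tokens) - 1 else "<EOS>",
    }


def validate(candidates: List[str], dictionary: Set[str],
             stopwords: Set[str]) -> List[str]:
    """Filter candidate terms with two illustrative rules (assumptions)."""
    kept = []
    for cand in candidates:
        words = cand.lower().split()
        # Illustrative rule 1: trim stopwords from both ends of the span.
        while words and words[0] in stopwords:
            words.pop(0)
        while words and words[-1] in stopwords:
            words.pop()
        term = " ".join(words)
        # Illustrative rule 2: keep dictionary terms and multi-word spans.
        if term and (term in dictionary or len(words) > 1):
            kept.append(term)
    return kept


if __name__ == "__main__":
    print(token_features(["Short", "stature", "observed"], 0))
    print(validate(["the short stature", "of", "Seizures"],
                   {"short stature", "seizures"}, {"the", "of"}))
```

In this sketch the candidate list stands in for the output of a trained tagger; in a real pipeline the feature dictionaries would be passed to the CRF library for training and tagging before any validation step runs.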