Research Article

Research on the Key Technology of Web Data Extraction and Mining Based on the Probability Distribution

Table 2

Experimental results of the second group.

The domain nameThe number of words
TitleAuthorDateAffiliation

Correct-marked words170511821641356
Words in data set181614251751518
Words in extracted result set190712422321609
REC0.9388770.8294730.9371430.893281
PRE0.8940740.9516900.7068970.842759