Research Article

Research on the Key Technology of Web Data Extraction and Mining Based on the Probability Distribution

Table 1

Experimental results of the first group.

The domain nameThe number of words
TitleAuthorDateAffiliation

Correct-marked words74955785621
Words in data set80662193683
Words in extracted result set835575101691
REC0.9292800.8969400.9139780.909224
PRE0.8970010.9686960.8415820.898698