Research Article

A Method for Identifying Japanese Shop and Company Names by Spatiotemporal Cleaning of Eccentrically Located Frequently Appearing Words

Table 12

Processing accuracy of removal of noise words (Data consists of 1000 samples extracted randomly from the 2005 Tokyo prefecture telephone directory).

Number of samples1000 

Is it necessary to remove noise words from names, as determined by a manual check?Yes: 654No: 346 

Can we get the same result as manual processing using the FAW dictionary?Yes: 513No: 141   

Can we get the same result as manual processing using the dictionary of geographic names and station names? Yes: 70No: 71   

Can we get the same result as manual processing after LFAW removal?  Yes:11No:60   

Do pure names remain after all noise word removal processing?    Yes: 330No: 16Sum total

Number of data processed successfully513701103300924

Processing accuracy (%) 92.40