Research Article
A Method for Identifying Japanese Shop and Company Names by Spatiotemporal Cleaning of Eccentrically Located Frequently Appearing Words
Table 12
Processing accuracy of removal of noise words (Data consists of 1000 samples extracted randomly from the 2005 Tokyo prefecture telephone directory).
| Number of samples | 1000 | |
| Is it necessary to remove noise words from names, as determined by a manual check? | Yes: 654 | No: 346 | |
| Can we get the same result as manual processing using the FAW dictionary? | Yes: 513 | No: 141 | | | |
| Can we get the same result as manual processing using the dictionary of geographic names and station names? | | Yes: 70 | No: 71 | | | |
| Can we get the same result as manual processing after LFAW removal? | | | Yes:11 | No:60 | | | |
| Do pure names remain after all noise word removal processing? | | | | | Yes: 330 | No: 16 | Sum total |
| Number of data processed successfully | 513 | 70 | 11 | 0 | 330 | 0 | 924 |
| Processing accuracy (%) | | 92.40 |
|
|