Research Article
An Algorithm for Discretization of Real Value Attributes Based on Interval Similarity
Step1: Compute inconsistency rate of information system; | Step2: Sort data in ascending order for each attribute and calculate the similar | value SIM of each adjacent intervals according to (5) and (6); | Step3: Merge | While (merge-able cut point) | { | Search cut point that has the maximal similar value, then merging it; | If (many maximum values) | | Merge adjacent two intervals with the smallest number of classes; | If ( increases) | { | Withdraw merging; | Exit procedure; | } | Else {break; goto Step2;} | } | If (some several maximum values and the same class number of classes | among groups of adjacent intervals) | { | Merge the adjacent two intervals with the smallest number of samples | of adjacent intervals; | If ( increases) | { | withdraw merging; | exit procedure; | } | Else {break; goto step2;} | } | } |
|