Research Article
An Algorithm for Discretization of Real Value Attributes Based on Interval Similarity
Step1: Initialization. Set significance level . Calculate inconsistency rate of information | systems: . | Step2: Sort data in ascending order for each attribute and calculate value of | each adjacent two intervals according to (1), then using a table to obtain | the corresponding threshold. Calculate difference . | Step3: Merge. | While (mergeable cut point) | {Search cut point that has the maximal difference , then merging it; | If change | {Withdraw merging; | goto Step4;} | else goto Step2; | } | Step4: If can not be decreased | Exit procedure; | Else ; | Decreasing the significance level by one level; | goto Step2; } | Step5: Do until no attribute can be merged | {For each mergeable attribute | {Calculate difference ; | ; | sign flag=0; | While (flag= =0) | {While (mergeable cut point) | {Search cut point that has the maximal difference , then merging it; | If change | {Withdraw merging; | flag=1; | break;} | Else update difference ; | } | If can not be decreased | Break; | Else {Decreasing the significance level by one level; | Update difference ;} | } | } | } |
|