Research Article

An Algorithm for Discretization of Real Value Attributes Based on Interval Similarity

Algorithm 1

Step1: Initialization. Set significance level . Calculate inconsistency rate of information
   systems: .
Step2: Sort data in ascending order for each attribute and calculate value of
   each adjacent two intervals according to (1), then using a table to obtain
   the corresponding threshold. Calculate difference .
Step3: Merge.
   While (mergeable cut point)
   {Search cut point that has the maximal difference , then merging it;
    If change
     {Withdraw merging;
      goto Step4;}
    else goto Step2;
   }
Step4: If can not be decreased
       Exit procedure;
   Else ;
      Decreasing the significance level by one level;
      goto Step2; }
Step5: Do until no attribute can be merged
   {For each mergeable attribute
    {Calculate difference ;
      ;
     sign flag=0;
     While (flag= =0)
     {While (mergeable cut point)
        {Search cut point that has the maximal difference , then merging it;
        If change
         {Withdraw merging;
         flag=1;
         break;}
         Else update difference ;
        }
      If can not be decreased
        Break;
      Else {Decreasing the significance level by one level;
       Update difference ;}
     }
    }
   }