Research Article

Summarizing Online Movie Reviews: A Machine Learning Approach to Big Data Analytics

Algorithm 1

Pseudocode of the semantic K-means clustering algorithm.
(1)Pick the number of clusters, K
(2)Initialize cluster centroid by randomly choosing K random points from given data as cluster centroids
(3)Determine the summation of squared distance between data points (Pts) and all the other cluster centroids
(4)Allot each data point (Pt) to the nearby cluster centroid
(5)Recompute centroids of the new clusters by getting mean of the all data points belonging to each cluster
(6)Repeat steps 3 to 5 until centroids in new clusters do not change