Review Article

Complex Power System Status Monitoring and Evaluation Using Big Data Platform and Machine Learning Algorithms: A Review and a Case Study

Table 2

Comparisons of open-source machine learning tools/algorithms for Big Data.

CategoryAlgorithmOpen source/free software
WekaRShogunMahoutMLibOrangeOryx

ClassificationLogistic regression
(Complementary) naive Bayes
Decision tree
Neural networks
SVM
Random forest
Hidden Markov models

RegressionLinear regression
Generalized linear models
Lasso/ridge regression
Decision tree regression

Clustering-means
Fuzzy -means
Gaussian mixture model (GMM)
Streaming -means

Collaborative filteringAlternating least squares (ALS)
Matrix factorization-based

Dimensionality ReductionSingular value decomposition (SVD)
Principal component analysis

Optimization primitiveStochastic gradient descent)
Limited-memory BFGS (L-BFGS)

Feature extractionTF-IDF
Word2Vec

Frequent pattern miningFP growth
Association rules