Research Article

Cloud for Distributed Data Analysis Based on the Actor Model

Table 1

The solutions for building data analysis cloud services.

CapabilitiesBC-PDMAzure MLAmazon machine learningGoogle Cloud machine learningWatson AnalyticsDMCFApache Spark MLlibApache MahoutWeka 4 WS

Cloud service modelSaaSSaaSSaaSSaaSSaaSSaaS
User interfaceWebWebWebAPIWebWebDesktop
User’s levelDeveloperKnowledge ML algorithmsAnalyticDeveloperAnalytic/managerDeveloperDeveloperDeveloperDeveloper
API InterfaceNoRESTRESTRESTRESTJS4CloudYesYesYes
Scalable computingYesFor single modulesYesYesYesYesYesYesNo
Data source locationInside cloudInside cloudInside cloudInside cloudInside cloudInside cloudOutsideOutsideAny
Distributed computing platformApache HadoopApache HadoopApache HadoopSOAApache SparkApache HadoopWSRF
Full analysis cycleYesYesYesYesNoYesNoNoNo
Included data mining algorithmsClassification, clustering, and associationClassification, anomaly detection, regression, and clusteringClassification, regressionClassification, regressionClassification, regressionClassification, regression, and clusteringClassification, regression, Clustering, dimensionality reduction, and feature extractionCollaborative filtering, classification, clustering, and dimensionality reductionClassification, association, regression, and clustering
Adding new algorithmsNoFrom Machine Learning MarketplaceNoNoNoYesYesYesYes
UsingNoPaidPaidPaidPaidNoOpen SourceOpen SourceOpen Source