Abstract

The paper aims to propose a distributed method for machine learning models and its application for medical data analysis. The great challenge in the medicine field is to provide a scalable image processing model, which integrates the computing processing requirements and computing-aided medical decision making. The proposed Fuzzy logic method is based on a distributed approach of type-2 Fuzzy logic algorithm and merges the HPC (High Performance Computing) and cognitive aspect on one model. Accordingly, the method is assigned to be implemented on big data analysis and data science prediction models for healthcare applications. The paper focuses on the proposed distributed Type-2 Fuzzy Logic (DT2FL) method and its application for MRI data analysis under a massively parallel and distributed virtual mobile agent architecture. Indeed, the paper presents some experimental results which highlight the accuracy and efficiency of the proposed method.

1. Introduction

Today, computer science technologies apply artificial intelligence and data science models in order to design new intelligent applications, such as fraud detection and recommendation engine. These applications have to deal with big data sets, which need to be processed for extracting meaningful information and predict unknown patterns. To do so, they introduce the use of machine learning models based on complex algorithms such as classification and clustering algorithms.

Consider the great number of unstructured datasets and the complex algorithms that these applications have to deal with. Their efficiency relies on their capability to manage these challenges. For example, running a medical application for image analysis requires high processing power and a scalable clustering method. Clustering algorithms which are widely applied in the medicine field have been used by many researchers for MRI image classification. As an illustration, they proposed a parallel Fuzzy c-means method for image segmentation analysis in [1] and for clustering large data sets on a parallel SPMD architecture using MPI tools in [2]. For the type-2 Fuzzy, some parallel implementation are proposed such as in [3] over FPGA technology. The question of how distributed clustering methods can improve the big data analysis and provide an effective tool for decision making models then arises. Consider the considerable requirements of medical big data analysis and decision making models based on clustering methods. The paper focused on presenting a cooperative machine learning model based on the distributed type-2 Fuzzy method that combines computational processing requirements and the cognitive aspect. This paper is organized as follows:We will describe the machine learning model-based mobile agents and their main components, which are a team leader agent and AVPEs agents (Section 3)We will focus on presenting the proposed distributed DT2FL method (Section 4) and demonstrating its promising advantages through medical image analysis application (Section 5)

2. Background

To highlight the aim of this paper, we start with a brief overview of machine learning models [4, 5] and their ability to perform prediction, pattern recognition, and decision making. These models provide the computers by the teaching behavior to learn from datasets and lead humans to convey their field expertise to them in order to design and implement smart systems, which are not explicitly programmed. These, later, are able to collaborate with humans and bring relevant solutions. For example, in the banking sector, the machine learning model is implemented in order to provide a fraud-detection system able to detect the fraudulent transactions. It is also implemented in social media for face recognition and in commerce for product recommendation to the customers. Therefore, there are three types of machine learning methods, where each one deals with a specific kind of problems.Supervised learning: the algorithms of this type learn from training datasets in order to predict outcomes, where the output results are given in the training processUnsupervised learning: the algorithms of this type find patterns without using training datasets. Examples of these algorithms are clustering and anomaly detectionReinforcement learning: the algorithms of this type make decisions according to their past experience

The clustering algorithms are based on complex computing tasks and large datasets. They have a variety of applications such as image segmentation, medical imaging, and anomaly detection. Thus, it seems that these methods play a great role in designing and implementing effective machine learning models. This means that their performance depends on the clustering’s algorithm scalability. The distributed clustering method is a new paradigm that allows performing tasks in distributed nodes. Assume that the task T needs to be performed on large datasets D. The task T will be split into two global tasks, one executed by the host node and the second executed by all the slave nodes in the model. Furthermore, the data D will also be split into (me × ne) elementary data Dj {j = 1, … , (me × ne)} and distributed to the node, by the way that they perform collaborative clustering tasks.

The multiagent system (MAS) [6] is a distributed technology system composed of a set of distributed agents which live and cooperate between each other using their intelligence and skills in different environments in order to overcome complex challenges. For example, in [7], the authors proposed a platform to facilitate the provision of home-care services based on an agent three-layered architecture. In [8], the authors presented the improvement of the time efficiency of a medical reasoning system based on a multiagent architecture. Thus, how can the distributed clustering paradigm and mobile agents be both the promising solution for scalable clustering methods and machine learning models?

3. Autonomous Fuzzy Logic Model for Data Analysis

The distributed type-2 Fuzzy logic method is a new efficient and accurate machine learning model (Figure 1) for performing medical applications in a distributed system. The proposed method is based on cooperative mobile agents as Agent Virtual Processing Elements (AVPEs) in order to perform complex computational tasks and process intensive datasets over a distributed virtual architecture. Clustering techniques are one of the unsupervised machine learning methods which need intensive computational power. The proposed method provides the computational capabilities for these requirements. Although, the cognitive aspect of the computational team enhances the method features in order to provide accurate medical decision making. For example, in order to execute an MRI image segmentation-based type-2 Fuzzy clustering method, when the application is deployed, the cooperative DT2FL team works will be created. Its main components are the team leader agent and AVPEs (Agent Virtual Processing Elements) agents. The team leader agent is initialized by the input dataset, which is partitioned into (me × ne) elementary images and deploys NA = (me × ne) AVPEs agents. Each AVPE agent encapsulates its dataset and achieves tasks and sends the results to their team leader agent. At the end, the team leader agent gets the final results, which are used for accurate medical diagnosis. This is performed by the team leader agent under an embedded model, which enables autonomous detection of medical diseases and healthcare prediction (Figure 1).

4. Distributed Type-2 Fuzzy Logic Method

4.1. Standard Method

The Type-2 Fuzzy algorithm [9] is an extended version of Fuzzy c-means algorithm, and which is based on the method concepts for data clustering to c clusters. Beside, the Type-2 Fuzzy method integrates a new membership matrix aij assigned to the membership matrix. The method grants the convergence of the class centers to accurate values in the presence of noise compared to the Fuzzy c-means, and which minimize the objective function J given by the following equation:

In each iteration, the new class centers and the membership matrices are computed, respectively, by the following equations:

The clustering of datasets based on standard Type-2 Fuzzy algorithm (Figure 2) is achieved according to the following algorithm’s steps:Step 1: Initialization (i) Initialize the fuzzification parameter. (ii) Choose the number of clusters C. (iii) Initialization of the clusters centers and the threshold Eth.For each iteration t{Step 2: Determination of the membership matrices (i) Determine the membership matrix uij by (4). (ii) Determine the membership matrix aij by (3).Step 3: Compute the objective function J by (1).Step 4: Test of convergence if (|Jt − Jt−1| < Eth)  the algorithm converges (i.e. go to step 6). else  go to step 5.Step 5: Calculate the new class centers by (2), and repeat the loop from step 2.Step 6: End.}

4.2. Distributed Method

This section is investigated to present the proposed distributed type-2 Fuzzy algorithm (DT2FL) and its implementation in cooperative distributed virtual architecture based on mobiles agents. The input dataset of the machine learning model is loaded in a virtual architecture of size (me × ne). In the model, each agent virtual processing element (AVPE(g)) handles its assigned elementary dataset ed(e) and performs the different algorithm’s steps. The computational algorithm’s steps are achieved by the team leader agent in cooperation with the AVPEs agents in parallel, within distributed grid computing. The proposed DT2FL algorithm involves the 3 steps: initialization, grid construction, and distributed type-2 Fuzzy clustering, which are summarized in (Figure 3) and detailed as follows:

4.2.1. Step 1: Initialization

In this step, the team leader agent is initialized by the required algorithm’s inputs:DatasetNumber of clusters cThe value of column ne and line me needed for creating the AVPE grid of size (me, ne)

4.2.2. Step 2: Grid Construction

In this step, the team leader agent starts by partitioning the input dataset D into a set of (me × ne) elementary dataset. Then, it deploys NA = (me × ne) AVPE agents, which are initialized by the elementary dataset (Figure 4), one per AVPE(g). When it is performed, each AVPE(g) moves to its appropriate distributed node, where it is supposed to perform its tasks.

4.2.3. Step 3: Distributed Type-2 Fuzzy Clustering

In this step, the team leader agent chooses the initial class centers (i = 1, …, c) and the threshold Eth, while each AVPE(g) encapsulates the tasks and loads its elementary dataset e.

For each iteration t,{ 1: the team leader agent sends the class centers to all the AVPEs. 2: each agent AVPE(g) gets the class centers from its message and performs the distributed elementary clustering task. Besides, each one determinates the membership matrix uij, and aij, and sends the results DER1, DER2, and DER3 to the team leader agent. These results consist of the following components:where DER1() contains the results of the sum of () computed for each class center i. DER2() contains the results of the sum of (Am) computed for each class center i. DER3(g) contains the results of sum of () computed for all class centers. di is the number of data of the elementary dataset e of the AVPE(g).

3: the team leader agent performs the global clustering task, within the three following subtasks:

Assembling the elementary results: the team leader agent receives the AVPEs results (DER1, DER2, and DER3) and computes the global results GDER1, GDER2, and GDER3:where GDER1(i) is the global value of DER1() of all the AVPEs of the model. GDER2(i) is the global value of DER2() of all the AVPEs of the model. GDER3(i) is the global value of DER3() of all the AVPEs of the model.

Computing the class centers: the team leader agent computes the new class centers by getting the computed global values GDER1(i) and GDER2(i) by (11)

Computing the objective function Jt: the team leader agent computes the objective function by getting the computed global values GDER1(i) by (12)

5. Implementation and Results

The machine learning model-based distributed type-2 Fuzzy logic is implemented in a distributed computing virtual machine for running an MRI medical image analysis application. The input dataset (MRI image) is encapsulated on the team leader agent and partitioned into elementary images. After creating the grid computing, the team leader, in cooperation with the AVPEs agents, will perform distributed method tasks and provide the results, Figure 5. The effectiveness of this method is presented under the dynamic convergence and the speedup of the algorithm.

5.1. Dynamic Convergence

The proposed DT2FL method is executed with different initial class centers initializations. The convergence results are presented as follows:(c1, c2, c3, c4, c5)=(1.5, 2.2, 3.8, 5.2, 8.6): From Table 1 and Figure 6, it is clear that the dynamic convergence of the algorithm to the final class centers (c1, c2, c3, c4, c5) = (27. 711, 82.848, 89. 817, 103.896, 94. 371). The convergence is achieved after 23 iterations.(c1, c2, c3, c4, c5)=(140.5, 149.5, 150.5, 220.2, 250.5): the algorithm converges to the final class centers (c1, c2, c3, c4, c5) = (27.703, 82. 955, 89. 623, 103.884, 94.263) after 49 iterations as illustrated in Table 1 and Figure 7.

5.2. Clustering Time

In order to illustrate the effectiveness of the proposed model for data analysis, a comparative study is performed between the proposed DT2FL method and DFCM method in [10], by using the clustering time variation as metric according to the AVPEs agents.

From Table 2, we see clearly that the clustering time of the DT2FL method using one agent corresponds to the sequential method. Indeed, we see clearly from Figure 8 that for both methods, the clustering time is saved and achieves minimum values from 16 AVPE agents.

5.3. Speedup

The speedup of distributed methods compared to sequential methods is shown in Figure 9. Indeed, we see clearly that the speedup reaches interesting values compared to the sequential method which exceeded 4 by using 16 AVPE agents for two methods.

The main focus of this work is to propose a distributed method for big data analysis and medical image diagnosis. This method used the mobile agent-based model and a distributed approach of the type-2 Fuzzy logic algorithm.

The type-2 Fuzzy logic method theory and applications [11, 12] provide the important features of this method and its future directions for modeling complex systems based on data uncertainty. It was proposed by Zadeh [13], and it is one of the methods that allow modeling human perception effectively [13] and where modeling uncertainty is enhanced compared to Fuzzy type-1 [14]. This method has had great interests in several inspiring works in the literature at different domains. For example, in the medical domain, this method is implemented [15] to develop an automated tumor detection system. It is also widely investigated for medical image segmentation [16] and data classification applications [17] as a combined method with wavelets.The method has taken place as well as in image processing [18], and pattern recognition fields [19]. The type-2 Fuzzy method has a great interest in medical applications [20] for decision making [21], where the method is embedded with the neural network model. To illustrate, the authors in [22] proposed a hybrid model for 2-lead cardiac arrhythmia classification based on neural networks and type-1 and type-2 Fuzzy systems. Thus, the Type-2 Fuzzy drives interesting works in medical, and several other fields, such as in [2333].

In [27], the authors mention the great role of the cognitive agent’s ability to handle complex tasks. They formulate, implement, and evaluate a cognitive agent, which combines learning by, examples, with machine learning. The Fuzzy logic-based agent approach is also the aim of interest of different works. In [28], the authors proposed multiagent model predictive control including Fuzzy negotiation, which is implemented in a JAVA-based platform. It is also investigated [29] for designing and implementing an intelligent method for information extraction when information is abundant, vague, or imprecise. Furthermore, the multiagent approach is investigated for solving problems in traffic fields, [30] and for decision making of maintenance [31]in real time, and for effective coordination scheme [32] based on Type-2 Fuzzy decision making. The combination of the two aspects, the Fuzzy logic and multiagent approach, provides promising solutions which lead to extension the standard type-2 Fuzzy logic algorithm to a new distributed one based on the multiagent architecture. Namely, in [33] the authors proposed a multiagent architecture for the design of the hierarchical interval type-2 beta Fuzzy system.

Thus, these works’ foundations are developed in the following way. The proposed distributed type-2 Fuzzy logic in this paper is based on the multiagent approach and implements the given properties:Scalable medical MRI image analysis method, which is based on the distributed approach of type-2 Fuzzy logic algorithm, within mobile agents: this method grants fast distributed data analysis, within its computational type-2 Fuzzy clustering model.Accurate machine learning model-based type-2 Fuzzy logic for medical application: this method allows deep data analysis, within its embedded cognitive decision making model.Autonomous and hybrid Fuzzy logic system with two combined models: HPC computational intelligent medical data analysis, and cognitive decision making. The cognitive aspect of the method, combined with HPC clustering requirements provides a powerful system for medical applications.

7. Conclusions

In this paper, we presented a distributed type-2 Fuzzy logic method and its application for MRI medical image analysis. This method is implemented on machine learning model-based mobile agents. The obtained results demonstrate the accuracy and efficiency of the proposed method. That is, it can meet the data requirements of scientists and engineers for testing and implementing their models. Indeed, the DT2FL method provides an aspect of adaptability with machine learning models. Thus, it allows reducing the complexity of the fuzzy clustering algorithms and modeling flexible and high performance systems for medical applications. Furthermore, intensive works are driven for designing an expert system for medical healthcare applications based on the proposed method within the deep learning technique and its implementation in cloud computing.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.