Abstract

Because the static cloud platform cannot satisfy the diversity of mobile Internet service and inefficient data mining problems, we presented a reliable and efficient data mining cloud platform construction scheme based on the mobile Internet service opportunistic driving and application perception. In this scheme, first of all data selection mechanism was established based on mobile Internet service opportunistic drive. Secondly, through the cloud platform different cloud and channel aware, nonlinear mapping from the service to a data set of proposed perceptual model is applied. Finally, on the basis of the driving characteristics and extraction of perceptual features, the cloud platform would be constructed through the service opportunities of mobile Internet applications, which could provide robust and efficient data mining services. The experimental results show that the proposed mechanism, compared to the cloud platform based on distributed data mining, has obvious advantages in system running time, memory usage, and data clustering required time, as well as average clustering quality.

1. Introduction

The cloud platform has high performance of data management and efficient data mining [1], which has been widely applied in various fields, such as knowledge discovery and Leaf Spot dynamics [2], and ERP applications [3]. So, the cloud platform [4, 5] has played an important role for quality of service guarantee and data mining of mobile Internet.

In data mining, Rakocevic et al. [6] embedded a distributed data mining algorithm into a sensor network, which used local predictors on each sensor node to make a local prediction and offers several original voting schemes. The architecture of the huge amount 3D video data was proposed by [7], which designed a model of Key Encryption Model for protecting the privacy video data. Ronowicz et al. [8] researched the cause-effect relationships between pellet formulation characteristics, as well as the selected quality attribute.

About the service driver, Ghosh et al. [9] developed the cost analysis and optimization framework by using stochastic availability and performance models of an IaaS cloud. The economic model as a Bayesian network was proposed by Ye et al. [10] for selecting and composing cloud services. The importance and challenges in designing event-driven mobile services were discussed [11], which could detect conditions of interest to users and notify them accordingly.

In addition, the fast scalable video coding- (SVC-) based channel-recommendation system for IPTV on a cloud and peer-to-peer (P2P) hybrid platform was studied in [12]. The mechanisms for orchestrating cloud-enabled hardware and software resources were proposed by Petcu et al. [13], which should be supported by a recently developed open-source platform as a service. The novel monitoring architecture was proposed by Alcaraz Calero and Aguado [14], which should be addressed to the cloud provider and the cloud consumers.

However, these research results ignored the relationship between data set of cloud platform and service requirements of mobile Internet and lack the in-depth study of influence of Internet application on cloud platform. Hence, we proposed the cloud platform with mobile Internet service opportunistic drive and application aware data mining.

The rest of the paper is organized as follows. Section 2 describes the mobile Internet services driven data selection mechanism. In Section 3, we design the application perception model. In Section 4, we proposed the efficient data mining cloud platform. Simulation results are given in Section 5. Finally, we conclude the paper in Section 6.

2. Data Selection Mechanism Driven by Mobile Internet Services

In mobile Internet, we assume a service data set function, which was used to optimize data selection for satisfying the service demands and provided the optimal data set for service. This is the role scheme which was used to solve the combination issue of mobile Internet and cloud computing. In general, the mobile Internet service architecture model is shown in Figure 1. Data input was driven by service set and stored in the cloud platform. When the service request was received successfully, the cloud platform would select the data and sent output feedback to the user according to the request.

There are some problems that should be considered in the progress of data selection, which are as follows:(1)whether the data is corresponding to the service and whether it can meet the service needs,(2)lack of user information feedback mechanism which is used to judge whether the user is satisfied with the data,(3)how to coordinate multiple user needs,(4)how to coordinate the input and output of data without increasing the load of the cloud platform.

The mapping relationship of user set and data set is shown in Figure 2. Here, the relationship was constructed by the services, which include the following cases:(1)single user to intersection of multiple data sets,(2)single data set to multiple users,(3)single user to the union of data sets.According to the above problems, we give the following service driven definition:(1): service set and its length which is ,(2): data set and its length which is ,(3): user set and its length which is ,(4)QE(): service satisfaction of the data,(5): the number of data services in the unit time,(6): the time consumed by the data service,(7): the number of service sessions between and .According to the above stations, there is the relationship shown as formula (1) in mobile Internet application:Here, let denote the number of sessions. Let denote the length of data packet. Let denote the time of completing the services. Let denote the throughput.

So, formula (2) gives the data selection model based on service opportunistic driven. Here, service request launched the service opportunistic driven in accordance with the probability. The data sets to meet the service request through the cloud platform. For the request of hybrid multiple data sets of service, the data set could be computed with XOR operation to form a single data set, for selecting the data without interference and avoiding the performance degradation:Here, let denote the service opportunistic request driven factor. Vector denotes the service opportunistic weight. The value of dataType could be set according to the ability of satisfying the service requests. If dataType is 0, the current data set can be independent of the service request. If dataType is 1, more than 2 data sets could satisfy the current service request. Let denote the number of data sets. denote the data set which gives feedback to users.

3. Application Perception Model

In the processing progress of mobile Internet applications, cloud platforms can be achieved through nonlinear mapping between different clouds and channel from the service to the data set, which could meet the needs of different data sets from different cloud data fusion. Here, it is important to coordinate transmission and fusion of multicloud applications, as shown in Figure 3. Here, the cycle symbol of Figure 3 denotes the cloud of platform.

Based on the same service request and being located in the same area, data transmission coordination network composed of multicloud needs to be aware of the application and keep the time synchronization. When the application requests are met in different regions, in order to guarantee the space synchronization, the time must keep asynchronous, as shown in formula (3). Based on the above time and space of synchronous request and asynchronous operation, formula (4) illustrated the application of perception matrix and data output matrix and relationship:Here, and represent the different application request. Let function denote the data set location of the application mapping:Here, let denote application perception matrix. Let denote the multicloud coordination flux. Let OMI denote the output matrix of data set.

Hence, the data output flow of mobile Internet application perception is as follows:(1)initializing the vector and parameters according to the principles and models,(2)computing the matrix AMI according to formula (4),(3)OMI is set to 1, which means service request is active and waiting for data output,(4)data selection and integration after the activation of cloud devices,(5)the consistency and time of the region are judged according to formula (3),(6)if the space is consistent, the time synchronization is obtained by formula (4),(7)If the space is not the same, the time is asynchronous. The cloud platform is in a stable state. In an asynchronous cycle, the multiple cloud segmentation data are activated. The end of the independent data set is obtained after data integration.

4. Efficient Data Mining Cloud Platform

According to the service opportunities driving characteristics and extraction of perceptual features, how to build cloud platform for mobile Internet applications to provide robust and efficient data mining service has become a key issue of QoS guarantee in mobile Internet. In cloud platform, the customer service satisfaction could be computed by adding each data service satisfaction as shown in formula (5):If customer service demand satisfaction of data sets is larger than the user specified minimum value, the data sets are robust. Otherwise, the data sets should be reconstructed through data mining.

In order to facilitate the service opportunity to play the driving characteristics, we defined the service drive tree. The tree consists of leaf nodes. Each leaf node represents a service identifier and the opportunity probability. The intermediate node indicates the subtree ability which is denoted by the summary of service opportunities driven weight of leaf nodes. The service driven tree is shown in Figure 4. If the current node and the child nodes are time synchronization, the link will be recorded as 1; otherwise, it is denoted by 0. From the root node to any leaf node, a set of binary bit strings could be obtained. Each leaf node can access the service based on adaptive driving vector, which could reduce data mining complexity. The service opportunities driven tree is shown in Figure 5.

Here, we set 4 leaf nodes and service initiation probability of each leaf node is given. The sum of 4 leaf nodes probability is equal to 1. The service opportunity driven weight of root node is equal to 1. The intermediate nodes are composed of 2 clouds, which are shown as in Table 1.

As shown in Table 1, although the SE4 opportunity weight is low, the service driven path vector is all asynchronous, the data mining complexity is low, and the data clustering service can be obtained with higher real-time. Although the SE3 has the best chance, but the service driven path is heterogeneous, the data mining is complex, so the data clustering is poor, and the real-time performance is poor. In addition, the value of SE1, SE2, SE3, and SE4 could be computed based on the service requests and service driven tree as shown in Figure 4.

As shown in Table 2, with opportunistic driven scheme, the data mining complexity of 4 leaf nodes has been reduced. For SE2 and SE3, we can establish the path of multiple data transmission through the opportunity weighting, so as to ensure the real-time and accuracy of data clustering. In addition, the cloud platform could be constructed by choosing the same area of the current leaf nodes and service opportunities driven. Based on the above service opportunity driving features and application of data clustering mining, the architecture of the cloud platform is as shown in Figure 6. Here, ① expresses the analysis of service opportunistic drive characteristics; ② denotes the feature extraction of application perception; ③ denotes the clustering data mining. The construction algorithm is described as follows.

Algorithm 1. Construction of cloud platform is as follows.
Input. It includes , , , QE(), and data mining complexity threshold
Output. It includes data set and cloud platform
BeginWhile (1)Initial parameters: , ;Obtain the value of While If QE() = 1 or QE() = 3Count time;++;End whileComputing the value of and based on QE() and ;Obtain the service opportunity request drive factor ;Constructing the service opportunity driven tree;Analyzing the data mining complexity DM;If DM is larger than Updating the tree;ElseIf Time synchronizationElseTime asynchronousEnd ifEnd ifComputing multicloud coordination flux Select the optimal clouds to construct the cloud platform Activate the multicloud segmentation data and obtain the final independent data set after the fusion dataEnd while
End

5. Performance Evaluation

In order to verify the cloud platform performance and data mining performance of the proposed cloud platform based on opportunities in the service of mobile Internet driver and data mining of application perception (CP-SAD), we designed the experiment with 1 GB storage space and 100 data sets randomly distributed in 10 pieces of cloud equipment and evaluated with the transaction data base server. Verification metrics include the system running time and memory occupancy rate, the time required for data clustering, and the average clustering quality, which compared with the cloud computing mechanism with distributed data mining (CP-DDM); the results are as shown in Figure 7.

From Figure 7(a), we found that CP-DDM mechanism system running time increases linearly with increasing data set, while the operating time of the proposed CP-SAD mechanism remained stable, because the use of service opportunities driven model and real-time analysis of the characteristics of the driver can accurately grasp the service request, optimize the system initialization and service driven and application aspects of perception in the delay, not only shorten the operation time but also smooth the delay jitter.

Figure 7(b) proved that the proposed mechanism could optimize the memory capacity, because of the application of perceptual features extraction mechanism, which can not only be aware of the applications required data scale and distribution, but also optimize the composition of the cloud platform, by combining with the opportunity to serve driver tree. The proposed mechanism can increase the utilization ratio of memory, avoiding the cloud platform system performance degradation caused by memory overflow.

From Figure 7(c), we can see that the required time of data mining clustering increases first and then decreases with the increase of cloud scale activation. This is because the large computation result is more time consuming in initialization phase. However, the proposed CP-SAD mechanism can complete cloud equipment initialization and reduce data mining time faster and earlier. The CP-SAD mechanism can complete initialization after the first cloud device activation, and CP-DDM mechanism complete initialization when three-cloud device is activated, which wasted a lot of time for data mining clustering stage. This is because the distributed data mining is composed of a cloud platform data distributed storage and application features and service requests are not built for nonlinear mapping, resulting in data mining delay being large.

Clustering quality average is shown in Figure 7(d) with the increase of the size of the cloud. In the activation of the cloud number from 0 to 4, the average clustering quality of two mechanisms changes in the same law, and the gap is about 10 percentage points. With a further increase in the cloud, the average clustering quality of CP-DDM mechanism began to decline, until 10 clouds are activated to rise to 60%. However, the average clustering quality of CP-SAD mechanism has stalled when there are four clouds and then increases linearly. This is because the CP-SAD mechanism can effectively grasp the opportunity to drive characteristics and extraction of perceptual features and construct, for mobile Internet applications to provide robust and efficient data mining service cloud platform, mobile Internet applications for improving the efficient and reliable quality of service.

The iteration number analysis results of time complexity of CP-DDM and CP-SAD were shown in Figure 7(e). The proposed mechanism could compute the data efficiently and mine the data accurately with less iteration number than one of CP-DDM, which has the very low time complexity.

6. Conclusions

In order to improve the cloud platform of the mobile Internet services with quality assurance and data mining efficiency, we researched the opportunities in the service of mobile Internet driver, as well as the reliable and efficient application aware data mining. Firstly, a data selection mechanism is designed based on the analysis of the characteristics of the mobile Internet service opportunity driving. Second, based on the relationship between the Internet service and data set, the nonlinear mapping between cloud and the Internet channel was established, and it established an Internet application perception model. Third, based on the characteristics of service opportunity driven and application perception, a cloud platform is proposed to provide efficient and robust data mining services for mobile Internet applications. The experimental results show that the proposed mechanism in the runtime and memory occupancy rate, data clustering, and the time required for the average clustering quality are better than the cloud platform based on distributed data mining.

Conflict of Interests

The author declares that there is no conflict of interests regarding the publication of this paper.