Abstract

Aiming at the problem of low response speed and unbalanced distribution of data resources of production process (DRPP) for the distributed workshop production environment, an optimization scheduling method of DRPP based on a multicommunity cooperative search algorithm is proposed. A heuristic data resource service scheduling framework including a load manager and dynamic scheduling engine is first built to deal with the uncertainty of data resource service response and the imbalance of resource allocation; a core scheduling optimization mathematical model with the objectives: resource service efficiency, reduced response time, and load balancing, is established. Then, a multicommunity cooperative search algorithm for the scheduling model is presented, and the mapping relationship between the particle position vector and resource allocation is established via binary coding. Thus, the optimization algorithm is mapped to discrete data space, and the multicommunity bidirectional driving evolutionary mechanism is used to realize the cooperative and interactive search between common and model community, which enhances the adaptability of the algorithm to dynamic random scheduling tasks. Finally, the effectiveness of the proposed method is verified by an example of multiprocess quality prediction service scheduling in silk production process, which provides an effective means for solving the complex scheduling problem of production process data.

1. Introduction

The intelligent workshop integrates modern sensing technology, network technology, automation technology, and other advanced technologies, and a large number of intelligent equipment such as sensors and data acquisition devices have been put into use in the workshop [13]. And thus, the production workshop has become the collection center of information flow, material flow, and control flow. In the process of product production, a large amount of production, environment, status, and equipment operation data are generated at an unprecedented speed, thus forming workshop big data, which presents the new characteristics of multitask, cross-process, heterogeneous, and polymorphic. However, data has no subjective initiative. To realize the real-time perception and prediction of the production process, we must first realize the scheduling and optimization of “data,” that is, the scheduling and optimization of production process data. It is noted that the traditional manufacturing mode, data of information flow, material flow, and control flow are still isolated from each other in each stage of production execution, and it is difficult to form a joint force due to the lack of effective data resource scheduling mechanism, which restricts the further improvement of production efficiency and system intelligence level [4]. Therefore, the research on on-demand scheduling of workshop data resources is one of the core problems of intelligent manufacturing in production workshop.

Recently, most scholars at home and abroad focus on the methods and algorithms of workshop data collection, analysis and mining, such as machine vision preprocessing algorithm [5], neural network prediction algorithm [6, 7], intelligent decision algorithm [8], and multiobjective optimization algorithm [9]. However, the data have no subjective initiative; the data-based analysis and processing algorithm can not actively serve the business needs such as perception, decision-making, and execution of the production process; and the current research has not comprehensively considered the coupling and impact among demand, service, resources, and energy efficiency in the production process.

However, the research of domestic and foreign scholars on workshop process production scheduling and collaborative job scheduling has a good reference for the development of this work. Considering workshop process and production scheduling, literature [10] based on the dependence of the production planning, and scheduling problem of continuous production line on timing, a repair and optimization solution is proposed to solve the problem of energy efficiency in the production process. For the scheduling problem of complex products in multiworkshop production, literature [11] studies the characteristics of BOM structure and process route of complex products. Based on the construction of multilevel process network diagram, an improved particle swarm optimization algorithm is used to ensure the effectiveness of the algorithm search path. Literature [12] considers the problems of process connection and blocking of prefabricated parts in the process of workshop assembly line operation and establishes a scheduling model to minimize the total penalty cost of advance and delay, which improves the production efficiency of Prefabrication Yard. Literature [13] uses a machine learning method to assign jobs based on the priority rules of the decision tree as the scheduling method, which shows good performance in the case scenario with completion goal and total delay goal. Considering from workshop production collaborative job scheduling, Literature [14] estimates the process processing time in the production process through machine learning and uses the estimated processing time to schedule and optimize parallel machines, which reduces the maximum completion time by about 30% on average. Aiming at the optimal comprehensive production and transportation plan of a group of parallel batch machines, literature [15] constructs a 0-1 mixed integer programming model, solves the model, and completes the comprehensive scheduling through an improved genetic algorithm, which reduces the transportation cost. The above research provides an idea for this paper to realize on-demand scheduling data resources for production process.

On the other hand, it should be noted that the workshop production process involves multiprocess cross-production line business collaboration and business requirements. In the process of data resource scheduling, we should not only consider the association and cooperation relationship between different production tasks but also consider the transmission time of data resources between different production tasks, in particular, the uncertainty of concurrent service access affecting demand response, and the impact of these dynamic and uncertain factors on the balance of data resource allocation [1618].

This paper is concerned with production-oriented data resources scheduling, thus transforming DRPP into a service, and finally into economic benefits. Consequently, this study is to integrate the load manager and dynamic task scheduling engine, and combine them with scheduling processes to form a scheduling scheme, so as to provide intelligent support for production process. Following this idea, this paper is organized as follows: the heuristic data resource service scheduling framework is constructed in Section 1. The problem to be studied and the scheduling mathematical model are proposed in Section 2. Section 3 is devoted to establish the asynchronous parallel scheduling strategy and optimization method, simulation results are given in Section 4, and conclusions are made in Section 5.

2. DRPP Scheduling Process Analysis

According to the execution status of the DRPP in the scheduling center, the data in the resource pool are mobilized to form the optimal execution scheme of tasks. DRPP scheduling is one of the key links in the production decision process. Figure 1 presents a framework of DRPP scheduling that includes decomposition of business requirements, service task analysis, dynamic scheduling of DRPP, load monitoring, and service task execution. Firstly, during production operation, different processes send requests to the scheduling center according to task execution requirements. By queuing, merging, and analyzing the service requirements, the task analyzer degrades the vague and miscellaneous service tasks to form a set of low-granularity service tasks that can be directly served by DRPP. Secondly, the dynamic scheduler preliminarily matches the DRPP according to the task request and then matches the execution task characteristics with the static and real-time attributes of the DRPP to obtain the DRPP set that meets the current production business requirements. The load supervisor of the scheduling center dynamically adjusts the DRPP by monitoring the operation and load of the DRPP in the business process and solves the service interruption caused by uncertain events to ensure the accomplishment of the service process. Finally, the DRPP scheduling engine uses the integrated intelligent optimization algorithm to combine and match the state information of the DRPP and the real-time information of the service task to form the optimal resource service allocation scheme and submits it to the center for execution, so as to complete the scheduling process of the resource service.

Throughout the entire scheduling process, multiservice tasks are executed interactively, and there is a complex relationship between tasks. With the dynamic growth of production business scale, the response time of service tasks must be considered. Additionally, a large number of dynamic and uncertain factors will seriously affect the ability and effectiveness of DRPP service scheduling. The traditional resource scheduling method has low search efficiency and accuracy, which can easily lead to the problems of low response speed and the uneven distribution of resources in the service process. It is difficult to adapt to the allocation of STRs on demand. Therefore, this paper considers improving service efficiency while solving the problem of the unreasonable allocation of DRPP caused by the uncertainty of service response and the unbalanced load of nodes.

3. The Multiobjective Optimal Scheduling Model of DRPP

3.1. The Response Uncertainty Modeling

The invocation relationship of DRPP in the business process is complex, especially when a data resource service is invoked by multiple business processes, and these business processes run simultaneously; there will be concurrent access. In different service scenarios, there are some fluctuations and uncertainties in the access frequency, concurrency probability, and response time of users in the entity industry. There are certain fluctuations and uncertainties in the access frequency, concurrency probability, and response time of different business links in the production process, such as sequence, selection, parallelism, and the cycle of business processes [19, 20]. Therefore, the uncertainty of the service response of DRPP can be described by service access frequency.

In the following formula, represents the number of service tasks. represents the total DRPP resources. represents the process of any service tasks. represents the probability of performing service task . represents the probability of invoking DRPP for the service task . represents the probability of a service business being accessed. represents the probability of any process branch being selected.

Assuming that there are DRPP for subtasks to call when the scheduling center performs a certain service task, and these subtasks are completed on a specific node according to the service process; then, the probability that the process of the service task is executed is

When multiservice tasks are executed interactively, concurrent access often occurs in service invocation. Accomplishing a task involves invoking multiple resource service processes. When there is a selection structure in the process and a process branch covers concurrent access services, the probability of concurrent access to services is as follows:

When all the selected branches in the service process cover the current concurrent access service, the probability of concurrent access to the services is

3.2. The Modeling of Unbalanced Resource Allocation

Suppose a service task needs to call subtasks to complete, the total number of the DRPP that can provide services is . These subtasks are completed in different tasks node according to the service process and resource requirements. The expected completion time of task invoking technology resource is defined as where is the total instruction length of service task and is the execution speed that the DRPP are distributed invocation.

The average load of different service tasks scheduling DRPP is defined as the quotient between the total instruction length of service tasks and the total execution speed of data resources distributed scheduling, i.e., the completion time of the total service task is

For the above scheduling scheme, the load balancing of service resources invoked can be defined as where represents the completion time of the service task . Obviously, a smaller indicates a more balanced task load service scheduling.

3.3. The Multiobjective Optimal Scheduling Model

Considering the DRPP response and the unbalance of resource allocation, this paper established a multiobjective optimal scheduling mathematical model including the service efficiency, response time, and load balance of resource invocation.

In the production process, if represents the state of service node performing subtask , represents the subtasks, and represents the total number of DRPP provided by service tasks; then, the state set of resource services is , . The set of service efficiency is , the response frequency set is , and the load balancing set is . In these equations, is the service efficiency of service node executing scheduling task , is the response time of executing resource scheduling task for service node , and is the service efficiency load balancing of resource scheduling task for service node . The service status of any resource service node executing a task can then be expressed as

The set of DRPP mapped by service tasks is , where is a DRPP invoked for a service subtask. Thus, the mathematical model of multiobjective optimal scheduling considering the uncertainty of service response and the imbalance of resource allocation is as follows: where is the minimum service efficiency value in line with business requirements, is the maximum service response time that a DRPP node can take, and is the highest load balancing of a DRPP node.

Set the weights of the service efficiency, load balancing, and response frequency of user requesting DRPP as , , and , respectively, and .

The optimal scheduling model of DRPP considering the target weight of user demand is as follows:

4.1. The Evolution Model of Multicommunity Cooperative Network

The basic particle swarm optimization (PSO) algorithm is a single-community optimization model with global optimal particles as its core, which cannot solve the mixed and changeable scheduling problem very well. If this model is extended to task-related multicommunity cooperative optimization, the evolutionary information interaction and association will be generated among these communities, and then, a multicommunity cooperation network (MCCN) with high adaptability to the task will be formed [21, 22]. From a mathematical point of view, a network can be regarded as a combination of a vertex set and edge set. To better describe MCCN and establish its evolution model, the following definitions are first provided.

Definition 1. The threshold for community type determination is where is the global optimal fitness of community and is the number of communities in the cooperative network.

According to the threshold of community type determination, the particle community in the collaboration network can be divided into the model community and common community. If satisfies the criterion , the community has a strong ability of local optimization, which can be divided into model communities and recorded as . On the contrary, if satisfies the criterion , the community has a strong ability of global exploration, so it can be divided into common communities and recorded as .

Definition 2. Let the cooperative search activity among different communities be a binary group , where is the sequence of communities participating in the cooperative search activity and is the interdependency among communities in the search process. is called the cooperative relationship unit, where represents the cooperative relationship between a model community and a common community, is the cooperative relationship between any two model communities, and is the cooperative relationship between any two common communities. The number of cooperative units among different communities in a cooperative relationship set is called the module of the cooperative relationship set, which is recorded as .

Generally, if , there is an edge between two cooperative communities, and the more cooperative relationship units, the greater the edge weight between two different nodes. If there is no cooperative relationship between different communities, then .

Definition 3. Let be the cooperative weights among different communities in MCCN, where the cooperative weight between and is also called the edge weight of MCCN.

To complete the comprehensive quantitative evaluation of community nodes, the evaluation indexes of the optimum value of community nodes are introduced: collaboration distance and responsivity .

Definition 4. Collaboration distance. The global optimal value of the community is, respectively, compared with the individual optimal position of the particles, and the absolute value is obtained, that is to say, the cooperative distance of the global optimal value is .

Definition 5. Responsivity. The threshold of the qualified distance is set. According to the formula , the response value of the community particle to the optimal value of the node can be obtained by traversing the cooperative distance , and then, the responsivity of the global optimal value can be obtained by adding the response values in sequence.

Definition 6. Community node strength. In MCCN, the strength of the community nodes is defined as where is the cooperative weight between the community node and , is the responsivity of the community node, and is the neighborhood of the community node , and it satisfies

Generally, MCCN can be represented by its adjacency matrix as . If is the responsivity matrix of MCCN, then the node strength matrix is as follows: .

Definition 6 shows that the strength of community nodes not only takes into account the cooperative weights among the nodes of the community but also the optimization of the particles within the node itself. It is a comprehensive evaluation of the community’s local information and the ability of the community itself, which can better reflect the community’s ability to seek optimal guidance in the entire cooperative network.

Therefore, MCCN can be represented by undirected weighted graphs . represents different types of cooperative community node set, represents cooperative relationship edge set, is the cooperative edge weight set among, and is the strength set of community nodes, where is the value of the i-th row of the node strength matrix, representing the attributes of community nodes to measure their search ability. By Definition 6, MCCN can be expressed by adjacent augmentation matrix as follows: . The evolution model of the MCCN cooperative network is therefore

On this basis, the asynchronous parallel search strategy among different communities is formulated to reduce the communication between communities, and the efficient search is realized through the driving evolution mechanism to improve the optimization ability of the algorithm to the task scheduling. The rules of multi-population coevolution are as follows.

Rule 1. Evolutionary rules within microbial communities. In the process of multicommunity coevolution, the particles in a single community can be iteratively optimized according to formula (13) for speed and location updating, and the global optimum value can be generated within the community. where is the number of iterations of particle search, ω is the inertial weight, is the acceleration constant, and and are two random functions varying in the range of .

Rule 2. Driving coevolution rules between communities.

Rule 2.1. , , , and . The common community is , and the model community is . The particulate in enters , and the last community in is eliminated. After introducing the model learning factor into the internal evolution rules of , the new iterative evolution formula is as follows: where and is a random function and satisfies the convergence constraints .

Rule 2.2. , the community node strength satisfies for any .

Global optimum value of model community: .

Rule 2.3. , the community node strength satisfies for any .

Global optimum value of common community: .

4.2. The Coding Strategy for Optimal Scheduling of DRPP

Particle swarm optimization (PSO) is a computational model for real continuous space, and it is difficult to solve the task scheduling problem in discrete space [23]. Therefore, the binary system is used to encode the speed and position of particles, and the mapping from the particle swarm optimization algorithm to discrete space, and from the particle search space to the optimal scheduling scheme, is realized by reconstructing the particle expression.

In the above algorithm, an row, column matrix is defined as the position vector matrix of particles. The rows represent the situation of providing STR when any service task is executed, the columns indicate the distribution of service tasks in the scheduling process, and any particle represents the potential solution of the scheduling problem. The coding of the particle position is as follows: where , .

According to the coding scheme, each row of the location matrix has one and only one element value is 1, which indicates that DRPP are allocated to service task . Each DRPP can be invoked by multiple service tasks simultaneously, and the execution of any scheduling task cannot be interrupted.

The defined speed is shown in equation (16), which represents the basic exchange order of particle’s assignments to the execution of tasks.

The exchange operations of addition, subtraction, multiplication, and division in the algorithm are defined as , , , and , respectively. The specific operation rules are as follows: (1): represents in position matrix and velocity matrix ; on the contrary, it is 1; (2): indicates in position matrix and velocity matrix ; the other elements are randomly chosen as 0 or 1(3): indicates whether the particle performs a operation or not with matrix according to the corresponding probability value of the random number (4): represents , , , in position matrix and velocity matrix . According to the above definition of switching operation rules, formula (10) can be updated as follows:

The coding scheme is simple and feasible and thus meets the requirements of multiservice task scheduling of DRPP. It also clearly describes the mapping relationship between the particle population evolution space and the service task scheduling scheme, thus avoiding repeated searches in the process of particle evolution.

4.3. The Optimal Scheduling Algorithms of DRPP Based on Multicommunity Collaborative Search

Based on the multicommunity cooperative search algorithm and its coding scheme, the optimal scheduling process of multiservice tasks for distributed DRPP is shown in Figure 2. The specific steps are as follows.

Step 1. Initialization of population particles. According to the encoding strategy between the particle search space and the task scheduling scheme described in Section 3.2, the initialization of communities is carried out, and a random location (DRPP allocation scheme) and speed of population particles are given. The number of communities, the number of iterations of particles within the community members, and the acceleration coefficient of particles and the inertia weight coefficient are set.

Step 2. Initialized population particles are evenly distributed into the process to form a community of size . Residual particles are randomly allocated to the process. The fitness of each particle in the community is calculated according to the comprehensive optimization scheduling function constructed in Section 3.3.

Step 3. Asynchronous parallel evolutionary computation is performed by running each community separately in the process.

Step 4. Calculate the fitness values of each community and divide all communities into either the model community or common community according to the threshold value.

Step 5. According to the interactive evolution mechanism between different particle populations in Section 3.1, the position and velocity of particles are updated according to formula (14), and the global optimal locations of the model and common communities are saved to the optimal storage area.

Step 6. If all the particle populations satisfy the search termination condition, then the algorithm ends, the global optimal solution is obtained from the storage area, and the optimal scheduling scheme is output; otherwise, it will return to step 5.

5. Application Cases and Analysis

In this section, a DRPP scheduling case for the silk production line quality prediction and early warning service are given to validate the proposed model and algorithm. As shown in Figure 3, the quality prediction and early warning service include the “ single operation quality prediction and early warning,” “ multiprocess quality prediction and early warning,” “ quality prediction and optimization of the whole production line,” and many other services, where each service activity needs to invoke DRPP using service tasks, such as the modle, standards, algorithm, and component to provide on-demand service for different business links of the the tobacco production line.

For the task of quality prediction and optimization of silk drying process service in the service platform, it is divided into five subtasks: online data reading, prediction algorithm call, online real-time prediction, prediction result analysis, optimization algorithm call, and optimization parameter return, which is recorded as task set . And then, the available DRPP set corresponding to task set is. Following the idea proposed in Sections 1 and 2, the DRPP invoking process can be implemented as follows: (1)Coding settings

Define the location vector of the particle in the scheduling algorithm as the matrix , as shown in Table 1. Row represents the allocation of service task , and column represents the service situation of DRPP; if , it means that the task is served by data resources . At this time, each particle represents a service task scheduling scheme. (2)Scheduling algorithms

To verify the effectiveness of the presented scheduling method, the simulation experiment is carried out on the service platform based on the Xeon E5-2609V2 processor and RAM64G, and the service efficiency, response time, and load balancing in the process of multiservice task scheduling are collected as sample data. For comparison, the previously reported ERTPSO algorithm [24], DPSO algorithm [25], LAPSO algorithm [26], and the M-CBDCSM algorithm proposed in the present study are used to solve the optimal scheduling scheme for the engine fault identification and maintenance service. The simulation parameters are specified as the population size -500, the dimension 30, the evolutionary algebra , the inertia weight , and the acceleration constant . Moreover, a random function is introduced into the service response time ; , where varies randomly in a range , and . All the simulation experiments were carried out 500 times, and other parameters are the same to those used in the relevant literature.

Simulation results of the relevant algorithm are provided in Figure 4. It is shown that the multicommunity cooperative search algorithm can better adapt to the random changes of service response time in the process of multiservice task scheduling, in particular for the interactive evolutionary rules between communities are selected adaptively, the algorithm converges to the global optimal value quickly and stably. In particular, the optimal scheduling scheme can be found before 50 generations under different search conditions corresponding to the discrete particle swarm optimization (DPSO). Especially in the face of dynamic and random multitask scheduling, hybrid genetic algorithm is difficult to adaptively carry out individual mutation and cross-operation and then can not track the dynamic change of service scheduling, which makes it difficult to avoid premature phenomenon under high random search conditions, and its algorithm performance is far lower than multicommunity cooperative search algorithm.

To further validate the strong adaptability of the algorithm in the face of multiple scheduling tasks, the experiments of large population based on multi scheduling tasks is also simulated. For this purpose, the population and the scheduling tasks can be given as , -20. The corresponding results are provided in Table 2. Although the number of service tasks is increasing, the convergence rate of multi community cooperative search algorithm does not decline significantly in the process of scheduling. From aforementioned results, one can conclude that the proposed algorithms in this paper can achieve better performance for large population and tasks-varying parameters in terms of convergence speed and steady-state errors.

6. Conclusion

This paper addresses the multitask adaptive scheduling of DRPP. A heuristic scheduling framework are employed to deal with the uncertainty of DRPP service response and the imbalance of resource allocation. The load manager and dynamic scheduling engine are employed to approximate the uncertainty of scheduling service. Moreover, we propose novel cooperative search algorithm of the task scheduling model driven by scheduling objectives with service efficiency, reduced response time, and load balancing, so that fast scheduling convergence can be proved even in the dynamic random sense. The proposed scheduling schemes are robust against dynamic random disturbances. To guarantee the discrete data space of optimization algorithm, a binary coding strategy to map the particle position vector to resource allocation is introduced. Simulation examples are provided to verify the efficacy of the proposed algorithm.

Data Availability

The production process data used to support the findings of this study have not been made available because the enterprise of the data source requires confidentiality.

Conflicts of Interest

The authors declare that there is no conflict of interest regarding the publication of this paper.

Acknowledgments

This study is supported by the National Natural Science Foundation of China (52065033) and Yunnan provincial major science and technology special plan projects: digitization research and application demonstration of Yunnan characteristic industry (202002AD080001).