Discrete Dynamics in Nature and Society

Research Article

Fuzzy Theory-Based Data Placement for Scientific Workflows in Hybrid Cloud Environments

Mapping from encoded particles to data placement results.

	Procedure dataPlacement(G, DC, X)
Input: G, DC, X
Output:
1: Initialization: set the current storage capacity of data centers dc_cur(i) to 0 and the fuzzy data transmission time to (0, 0, 0)
2: for each ds_iof DS_ini//Determine whether the particle would cause the data center overloaded
3: dc_cur(X[i]) + = //Place the dataset ds_iin the data center dc_X[i]
4: ifdc_cur(X[i]) > V_X[i]
5: return *this particle is infeasible*
6: end if
7: end for
8: for j = 1 to \|T\|//Determine whether the data center is overloaded during task execution
9: Place the task t_j in the data center dc_j with the least fuzzy data transmission time
10: if dc_cur(j) + sum(I_j) + sum(O_j) >
11: return this particle is infeasible
12: end if
13: Place the output dataset O_j of t_j into the corresponding data center
14: Update the storage capacity of the data center
15: end for
16: for j = 1 to \|T\|//Calculate the fuzzy transmission time of the corresponding data layout
17: Find the data centers ds_i in the placement of task t_j’s input dataset I_j
18: Calculate the fuzzy data transmission time generated by the input dataset I_j to ds_j
19:
20: end for
21: Output and the corresponding data placement strategy
End procedure