Research Article

Policy Decomposition for Evaluation Performance Improvement of PDP

Algorithm 1

Greedy algorithm of policy decomposition.
Input: Cost, k
Output: k subsets
(1) Cost′ = Heap_Sort(Cost)
(2) for the first k elements do
(3) distributed respectively to one of k subsets
(4) end for
(5) A min-heap minHeap with the first k elements is built
(6) for the remainder elements of Cost′  do
(7)    distributed successively to one of k elements in minHeap,
  whose sum value is always kept the smallest
(8)    minHeap is adjusted
(9) end for