Research Article

Information-Balance-Aware Approximated Summarization of Data Provenance

Algorithm 1

Information-Balance-Aware Approximated Provenance Summarization Algorithm (IB-PROX).
Require: the original provenance, the annotations in it, a combiner
function and a VAL-FUNC function, the weights for distance and size,
the definition and weight of the IBI score, the selection size, the size
bound TSIZE, and the distance bound TDIST
Returns: Summarized expression
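(1) Initialize the current summary as the original provenance
(2) While the size of the current summary > TSIZE or its distance from the original provenance < TDIST Do
(3) candidateSet = ∅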
(4) For every   Do
(5)
(6) Add to candidate set
(7) End For
(8) selectedSet = the candidates from candidateSet whose size-distance performance is in the top selection-size percent
(9) For every   in  selectedSet Do
(10) If   is optimal Then
(11)
(12)
(13) End if
(14) End For
(15) End While
(16) If the distance of the current summary from the original provenance > TDIST Then
(17) return
(18) End If
(19) return
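
The control flow of Algorithm 1 can be illustrated with a minimal Python sketch. It is not the authors' implementation: several steps of the listing are not recoverable here, so the sketch fills them with labeled assumptions. Specifically, it assumes that candidates are produced by merging one pair of annotations at a time, that "optimal" in step (10) means the lowest IBI value among the selected candidates, and that the previous summary is returned when the distance bound is exceeded. The loop condition is copied from step (2) as printed. The function name `ib_prox_sketch` and all `toy_*` helpers (size, distance, IBI, and combiner) are hypothetical stand-ins, and the IBI weight is omitted for brevity.

```python
import math
from itertools import combinations


def ib_prox_sketch(p0, size, dist, ibi, merge,
                   w_dist, w_size, top_percent, t_size, t_dist):
    """Greedy summarization loop loosely following steps (1)-(19)."""
    prev = cur = p0  # step (1)
    # Step (2): condition copied as printed in the listing.
    while size(cur) > t_size or dist(p0, cur) < t_dist:
        # Steps (3)-(7). ASSUMPTION: one candidate per pair of annotations.
        candidates = [merge(cur, a, b) for a, b in combinations(cur, 2)]
        if not candidates:  # guard added for this sketch; not in the listing
            break
        # Step (8): keep the top percent by weighted size-distance score.
        candidates.sort(key=lambda c: w_dist * dist(p0, c) + w_size * size(c))
        keep = max(1, math.ceil(len(candidates) * top_percent / 100))
        selected = candidates[:keep]
        # Steps (9)-(14). ASSUMPTION: "optimal" means the lowest IBI value.
        best = min(selected, key=ibi)
        prev, cur = cur, best
    # Steps (16)-(19). ASSUMPTION: fall back to the previous summary.
    if dist(p0, cur) > t_dist:
        return prev
    return cur


# Toy stand-ins (all hypothetical). An expression is a tuple of labels;
# a grouped annotation is written "a+b".
def toy_size(expr):
    return len(expr)


def toy_dist(p0, expr):
    # Fraction of the original annotations absorbed into groups.
    grouped = sum(len(label.split("+")) for label in expr if "+" in label)
    return grouped / len(p0)


def toy_ibi(expr):
    # Imbalance: largest group size minus smallest (0 = balanced).
    sizes = [len(label.split("+")) for label in expr]
    return max(sizes) - min(sizes)


def toy_merge(expr, a, b):
    # Toy combiner: replace labels a and b with one grouped label.
    rest = [label for label in expr if label not in (a, b)]
    merged = "+".join(sorted(a.split("+") + b.split("+")))
    return tuple(sorted(rest + [merged]))


p0 = ("a", "b", "c", "d")
summary = ib_prox_sketch(p0, toy_size, toy_dist, toy_ibi, toy_merge,
                         w_dist=0.5, w_size=0.5, top_percent=100,
                         t_size=2, t_dist=1.0)
print(summary)  # ('a+b', 'c+d')
```

With these toy definitions, the second iteration has three candidates of equal size; the balance criterion prefers the two equal groups `('a+b', 'c+d')` over the lopsided `('a+b+c', 'd')`, although the latter has the lower size-distance score. Lowering `t_dist` to 0.6 makes the final check fail and returns the previous summary `('a+b', 'c', 'd')`.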