Journal of Healthcare Engineering

Research Article

A Lightweight API-Based Approach for Building Flexible Clinical NLP Systems

Pseudocode of the API integration algorithm.

	Input: X = [X₁, X₂, …, X_n]: returns of n APIs;
	W = [ω₁, ω₂, …, ω_n]: weights of n APIs;
	γ: similarity threshold;
	θ: extractor threshold;
	Output: T: a list of clinical terms
	Initialisation: ω_α = 0.25 and ω_β = 0.75
	Filter out same/similar terms extracted by one API
(1)	for i = 1 to n do
(2)	for x_a in X_ido
(3)	Get the rest of terms: X_j = X_i − X_a
(4)	for x_b in X_jdo
(5)	calculate the percentage of equal terms over all 10 terms: α
(6)	calculate the percentage of equal terms over top 3 terms: β
(7)	calculate the pairwise similarity:
(8)	if δ ≥ γ then
(9)	discard same/similar term: X_i = X_i − X_b
(10)	end if
(11)	end for
(12)	end for
(13)	end for
(14)	Get filtered arrays of terms: X_δ = [X_1δ, X_2δ, …, X_nδ]
Filter out extracted terms by the weights over all APIs
(15)	Compute weights over all APIs: X_ω =
(16)	for ω_sum, x in X_ωdo
(17)	if ω_sum ≥ θ then
(18)	Add the term the final list: T+ = [x]
(19)	end if
(20)	end for
(21)	return T