Abstract

A new technique utilizing Scalar Quantization is designed in this paper in order to be used for Digital Image Watermarking (DIW). Efficiency of the technique is measured in terms of distortions of the original image and robustness under different kinds of attacks, with particular focus on Additive White Gaussian Noise (AWGN) and Gain Attack (GA). The proposed technique performance is affirmed by comparing with state-of-the-art methods including Quantization Index Modulation (QIM), Distortion Compensated QIM (DC-QIM), and Rational Dither Modulation (RDM). Considerable improvements demonstrated by the method are due to a new form of distribution of quantized samples and a procedure that recovers a watermark after GA. In contrast to other known quantization methods, the detailed method here stipulates asymmetric distribution of quantized samples. This creates a distinctive feature and is expressed numerically by one of the proposed criteria. In addition, several realizations of quantization are considered and explained using a concept of Initial Data Loss (IDL) which helps to reduce watermarking distortions. The procedure for GA recovery exploits one of the two criteria of asymmetry. The accomplishments of the procedure are due to its simplicity, computational lightness, and sufficient precision of estimation of unknown gain factor.

1. Introduction

In modern communications, multimedia plays significant role. Ownership of multimedia data is important and needs to be protected [1]. As a part of nowadays popular multimedia content, digital images are an important class. A protection of digital rights of an owner is implemented by Digital Image Watermarking (DIW). A watermark that is inserted into an image has to be robust [2] as well as invisible [3].

Among the popular and efficient techniques in DIW, Quantization Index Modulation (QIM) is widely used in blind watermarking where neither original media nor watermark is known to the receiver [4, 5]. One of the aspects of robustness of QIM is evaluated by attacking a watermarked image with Additive White Gaussian Noise (AWGN). Unfortunately, all the known on practice implementations of QIM are far from achieving the channel capacity limit that was first derived in [6].

Several different QIM-related approaches are known. Some state-of-the-art realizations will be outlined briefly. According to QIM, intervals of equal length are mapped on the real number line. The oldest known approach is to replace all the original coefficients inside every interval with one of the two endpoints of that interval. The selection of the endpoint depends on a bit of a watermark [7]. The main disadvantage is that for high intensity of noise and the capacity of the oldest QIM is much lower than the theoretical limit. In a more advanced realization of DC-QIM, coefficients from every original interval are mapped into two disjoint subintervals. The gap between the subintervals is controlled by parameter , [8]. Assuming that initial distribution inside original interval and target distributions in subintervals are uniform, the mapping in accordance to DC-QIM is optimal in terms of Mean Square Error (MSE) of quantization. In order to maximize capacity for a given MSE under AWGN of different intensity, parameters , have to be adjusted. Nevertheless, the limit defined in [6] is still well above the one achievable by DC-QIM.

Not all the original coefficients in each interval need to be quantized. This idea has been explored by the authors of Forbidden Zone Data Hiding (FZDH) [9]. Another idea was proposed by the authors of Thresholded Constellation Modulation (TCM) that uses two different quantization rules to modify coefficients inside the original interval [10].

Despite sufficient robustness of QIM under AWGN, the limitation is that synchronization is required in order to reconstruct intervals that are necessary to extract (or decode) a watermark. A type of distortion which scales all the watermarked coefficients is called Gain Attack (GA). The scaling factor might be close to 1 and cause very little visual distortion, but it is unknown to the receiver which causes asynchronous extraction. Retrieval of the watermark is usually complicated by AWGN that follows GA [11].

Improvement of QIM performance under GA is the task of numerous known approaches [12]. Most of them can be classified into two groups where the main idea of the first group is to estimate the unknown factor [13] while the idea of the second is to quantize coefficients of a different kind that are invariant to scaling of original signal.

The solution proposed in [11] contributes to robustness enhancement in case of GA and a constant offset attack followed by AWGN. A pilot signal is embedded for this purpose. Fourier analysis is used during extraction to estimate the gain factor and the offset. Another method of recovery after GA and AWGN is proposed in [14]. It uses information about dither sequence and applies Maximum Likelihood (ML) procedure to estimate the scaling factor.

Watermarking that is invariant to GA demands more complex transform of original signal (e.g., nonlinear) to obtain coefficients. One of the most popular watermarking methods in that category is Rational Dither Modulation (RDM) [15]. For a particular coefficient, a ratio that depends on a norm of other coefficients is being quantized instead of a coefficient itself. In order to quantize the ratio, RDM utilizes the simplest QIM scheme. This implies that the performance of RDM under AWGN (without GA) is close to the simplest QIM. Among others recent blind watermarking methods robust to GA are, for example, detailed in [1618].

A new scalar QIM-based watermarking method is proposed in this paper. It provides high robustness under conditions of AWGN and GA. Among the new features of the method are IDL and a new form of distribution of quantized samples.

The organization of the rest of the paper is as follows. Section 2 explains the choice of the distribution of quantized samples and contains description of the procedure of recovery after GA. Concept of IDL and quantization model are described in Section 3 using formal logic approach. The aspects of analytic-based estimation of robustness under AWGN are discussed in Section 4. Next, Section 5 contains experimental results obtained under AWGN and GA. Discussion of the details of the experiment and comparison of the performance are given in Section 6. Section 7 concludes the paper. The list of the key variables and their meaning is given in Nomenclature section.

2. Distribution of Quantized Samples and Procedure for Recovery after GA

An asymmetric distribution of quantized samples is proposed and parametrized in this section. Asymmetry is the quality that can be easily expressed quantitatively. Under symmetric attack, like AWGN, such quantitative index remains sufficiently indicative. On the other hand, it can be affected by GA. Such semifragility is favorable for restoration of the right condition for decoding. The restoration is done by the procedure for recovery after GA which uses criterion of asymmetry. Compared to the known estimation procedures [14], the one proposed in this section depends on a single variable which is the unknown gain factor. This makes the technique simple and more precise.

For encoding, in our case, asymmetric distribution requires substantially more variables for description compared to common QIM methods. Because of that, it is advisable to refer to Nomenclature section.

2.1. Distribution of Quantized Samples

Symbol will be used to denote a random variable whose domain is the space of original coefficients of a host. A particular realization of will be denoted as . We will further consider manipulation of original values that are in some th interval of size and its left endpoint is . Such an interval is referred further as embedding interval. For any we define and will be used to denote a random variable which represents . The value of should be small enough so that the distribution of can be considered uniform. A random variable that represents quantized coefficients inside th interval is denoted as and its realization is denoted as . Each pair of an original and corresponding quantized   belongs to the same th embedding interval so that an absolute shift is never larger than . Correspondingly, a random variable that represents quantized coefficients on the whole real number line is denoted as and its realization is denoted as .

In order to provide efficient recovery after GA, we propose the following asymmetric distribution of quantized samples inside th embedding interval (Figure 1(a)):where and are two different kinds of truncated distributions defined asThe other parameters are constrained in the following way: ,   (see Nomenclature section). The meaning of parameters , , , will be discussed later in Section 3. In Figure 1(b) we can see the distribution of the quantized coefficients outside th embedding interval as well.

2.2. Procedure for GA Recovery

It is assumed that under GA the original length of embedding interval is altered by unknown gain factor and the resulting length is . In addition to that, AWGN attack is applied. The procedure for GA recovery is the estimator whose result is based on a criterion having higher values for the right length of embedding interval. The uniqueness of the distribution of quantized samples is exploited by two different criteria and . The procedure itself represents a brute force approach that substitutes guessed values of the length of embedding interval into a criterion. Guessed value of which maximizes it ( or ) should be selected:where is the final output of the procedure. Some interval for guessed values should be defined in advance. For instance, and works well in most cases because the diapason of scaling factor is quite limited on practice.

For each particular value , the index defined according to the criterion is calculated by projecting noisy quantized samples on a single embedding interval:This is needed to reconstruct the distribution of quantized samples inside embedding interval.

Two criteria are proposed for the assessment of the distribution of random variable (subscript “” means affected by noise):

Here, is the th central moment. Odd moments are zero for symmetric distributions, but for asymmetric distributions their values can be sufficiently large. If the assumption about is wrong, then the values of both criteria are low. In that case the distribution of is very close to uniform (which is symmetric). This is because of the effect caused by GA on calculation of in (5). Nevertheless, the distribution of demonstrates asymmetry in case is close to . The explanation is that the distribution of quantized samples inside embedding interval (before GA is introduced) is indeed asymmetric. In spite of utilization of brute force optimization, the procedure is simple and the computational demand is low. On practice, the number of brute force steps is much smaller than the number of quantized elements. Therefore, the complexity is in that case. For instance, for recovery with high accuracy it is enough to perform 103 brute force steps with values from the interval .

3. Quantization

A quantization model is introduced in this section. In order to represent it in a compact form, we combine all the quantization conditions in a single logical expression. Previously proposed distribution of quantized samples is assured. However, additional parameter of the quantization model implies different distribution of the samples associated with labels “0” and “1.”

3.1. Two Approaches for Quantization

Quantized samples are modified according to the model described in this subsection. A watermark bit is denoted as . Each sample with value inside th embedding interval has index according to its order in the host sequence. During watermarking a bit is assigned to each index . Different frameworks might be used for description of the quantization model. We will use first order predicate logic to describe our approach. This choice can be reasoned as follows. A closed-from expression has to be defined for quantization and it is important to show that the derived solution minimizes MSE between initial and target distribution. The kind of proposed target distribution is not common for QIM-based watermarking methods. Therefore, we find it necessary to explain in detail the process of derivation of quantization expression. Also, samples interpreting “0” should be quantized in a different way to samples interpreting “1.” Predicate logic is a suitable tool for description of embedding because logical construction can incorporate all the possible quantization conditions in a compact form.

Two-place predicate is to denote correspondence between some index and the value of coefficient. For example, is true if a coefficient with order has value . We will further use notation of the set which contains all the pairs that provide true value of . One-place predicate is to denote bit value assigned to a coefficient with particular index. For instance, is true if watermark bit is assigned to a coefficient with index and ~ is true if . Two-place predicates or will be used to define that some th sample with value has label “0” or “1,” respectively:

Sets and contain all the pairs that provide true values of and , respectively. Initial PDFs of inside , , and are considered to be uniform: (Figure 2).

Also, each coefficient is labeled either as IDL or non-IDL depending on its value and index . Samples labeled as IDL are quantized in a different way which reduces the total embedding distortion. Both types of coefficients (IDL and non-IDL) are being modified during quantization. However, after quantization, interpretation of a bit of each IDL coefficient is incorrect. The purpose of quantization is to provide that all the non-IDL samples can be extracted correctly and the resulting distribution of all the samples is the one depicted in Figure 1(a). Parameters and represent fractions of IDL for and , respectively. Parameters and represent fractions of non-IDL samples for and , respectively. The fraction of zeros in a watermark data is and fraction of ones is . It is required that .

We define IDL and non-IDL samples using two-place predicates , , , and in the following way (Figure 2):where , , and .

Sets , , , and will be used in order to specify all the coefficients that satisfy , , , and , respectively. Fractions , , , and can be expressed in terms of cardinalities of sets , , , , and . For example, .

In this paper, two different quantization techniques are proposed. Since predicate logic is used to describe watermark embedding, a suitable logical construction should be able to distinguish between the techniques. According to our model, each kind of quantization can be represented by setting a corresponding logical value (“0” or “1”) for zero-place predicate . Hence, is used to define one out of two possible quantization techniques. For each kind of quantization, is split on two subsets and . For two-place predicates and formulas and are defined in the following way:

Using information about distribution inside , , , and it is easy to derive distribution inside and . Let us introduce variable of natural numbers domain (not a logical variable) which satisfies . Common arithmetical operations can be performed with which makes it possible to express PDF in the following compact form:where .

Therefore can be expressed as (Figures 3 and 4)

Elements of sets and are modified during quantization so that new sets and are obtained, respectively. Therefore, for successful quantization, we require the following formula to be true:

As a result of quantization, variables and are modified in a way that the resulting and are distributed according to some desired distributions. For each kind of quantization (depending on the value of ), the pair of desired distributions is different. We propose the following distributions that can be expressed as (Figures 3 and 4)It can be seen that, for any logical value of , the distribution of inside is the same and matches the distribution represented in Figure 1. It means that the efficiency of the procedure of GA recovery (proposed in the previous section) cannot be affected by the selection of .

In addition to the necessity of providing desired distribution of the quantized samples, we need to minimize quantization distortions. Both requirements can be expressed by two two-place predicates and :

The idea of minimization of embedding distortions can be explained in the following example. Assuming two samples , , we infer that quantization in a way in which implies less distortion than in case when . Let us sort elements in and in the dimension of and , respectively. Then, for some (index is an order in a host sequence) the number of elements in with value less than should be equal to the number of elements in that have value less than . Integration should be used in case we switch from discrete distribution of samples in and to continuous one. Further, throughout the paper we assume that the constant of integration is zero for indefinite integrals. Hence, the truth values for both predicates and are defined asFurther, we introduce logical formula and state that argumentis valid. The task of watermark embedding is to assure that the mentioned argument is sound. For that purpose, a procedure that makes true should be proposed.

3.2. Quantization Equations

Quantization equations and their solutions are needed to satisfy formula during embedding. For this purpose, we will analyze conditions that enforce qualities of predicates and . Due to the large number of variables in the text we recommend to refer to Nomenclature section for clarity. We can rewrite elements of (17) in the following way:From (21) it is clear thatThe equation above means that the following is true:We introduce two two-place predicates and :According to (21) and (24) the following can be derived:whereNow, let us analyze conditions that enforce quality of predicate . Elements of (18) can be represented asWe can see that according to (25)This means that the following expression is true:Next, two two-place predicates and are defined asAccording to (27) and (30) the following can be derived:whereWe can express using and in the following way:Also, we can express using and :Further, utilizing property we can obtainHere, each premise should be true. With the aim to provide this, equations in (25) and (31) should be solvable. It can be seen that the solutions are straightforward:where, for example, in (36), denotes the values of that turn either or true for or , respectively. The diagram of quantization is represented in Figure 5. Each th original sample is chosen from array on th iteration. The corresponding bit of a watermark is chosen from array . Vector contains parameters of the quantization. At the end of each iteration, quantized value of th sample is written to array .

4. Robustness under AWGN

In this section, we will analytically estimate the robustness of the proposed watermarking scheme under AWGN. Robustness is reflected by the term “extracted information” which denotes mutual information between embedded and detected messages. In contrast to channel capacity, the index of extracted information is practical but depends on the algorithm of detection. Also, throughout this section we assume that the original samples are distributed uniformly inside the quantization interval.

The derivations for extracted information are less involved when is “false.” Therefore, only that condition is considered here. In order to estimate extracted information we first find error rates. The rates depend on the attack severity (represented by ), , and parameter set . Moreover, we derive a stronger statement that information about and is sufficient to perform analytic estimation of error rates for our watermarking scheme. Finally, we will demonstrate how error rates can be expressed using and .

4.1. Estimation of Error Rates

For our estimation, it is considered that, during watermark extraction, in each embedding interval samples that interpret “0” are separated from samples that interpret “1” using a threshold (e.g., hard decision region detector). The position of the threshold in th embedding interval is (dashed vertical lines in Figure 1(b)). Therefore, the whole real number line can be seen as a union of two domains:During extraction, all the elements in will be labeled “0” and all the elements in will be labeled “1.”

After noise is added, elements quantized in th embedding interval might spread over its limits and other notations should be used. We notate sample values that are affected by noise as . Also, belongs to some embedding interval and inside this interval we use . Random variables and represent and , respectively (alternatively we use and to save space in lower subscript part). Therefore, two modified sets are obtained: ; . For noise variance we might, for instance, estimate the expected fraction for each of the noisy sets and in . Fractions of and that belong to can be found in a trivial manner. In that way we obtain error rates for “0” and “1.”

However, instead of appealing directly to sets and , we use an indirect but computationally lighter approach. In case is “false” we can conclude for the following distributions of quantized samples (not affected by AWGN yet) thatwhereAlso, we can conclude that the following distributions are also identical:whereFor any , (40) means that, for example, the fraction of elements from that after AWGN appear in is equal to that of and can be calculated using . This fraction will be denoted as . The PDF of AWGN with variance is denoted as using parameters and . ThereforeFraction of elements from that after AWGN appear in will be denoted as :Error rates are calculated using and :In order to demonstrate that error rates can be calculated based on , we analyze expression for (expression for can be analyzed in a similar way). Function is present in (44). According to (2) it is defined using parameters , . Parameters , are also present in (2) as well as in (44). Parameters , have clear constraints (the same is true about , , , ). It is possible to express , using , , , , , . In the realization of our method parameter is set asDefining new parameter asit can be seen that does not depend on the choice of .

Using property of PDF, the following is obtained from (2):It is easy to derive from (48) and (49) thatAccording to (50), it is also obvious that parameteris independent of .

One of the properties of PDF of AWGN isTherefore, we can rewrite (44) in the following manner:Now, it can be demonstrated that domainis independent of if during extraction parametercan be set without information about (e.g., may be set as ). Hence, (53) can be represented in the following way:Here, for integration we consider and , where both domains and depend only on . Except the terms and , only , appear under the integrals. Therefore, the result of integration, , depends only on , .

Further we will express in terms of and which confirms that and can be defined using only and .

4.2. Estimation of

Measure is widely used in watermarking. It expresses relation between watermark and noise energies and in AWGN case iswhere is the energy of the watermark. Plot of robustness index in respect to is one of the characteristics that are the most meaningful for practical implementation [4, 11]. Therefore it is important to be able to express error rates using and the set of embedding parameters . For this purpose, we first express using and .

Parameter in (57) can be seen as a distortion of a host signal, caused by the quantization. There are many different approaches that adequately assess quality degradation for digital images [19, 20]. Nevertheless, in this paper we are using simple and well-known distortion measure based on MSE between original and quantized samples [21]. We will define and factor it in a form , where depends only on . It is assumed that original samples are distributed uniformly inside embedding interval. Distortion is needed to obtain quantized sets , , , and . Therefore, further we will consider as a sum of four kinds of distortion: . Each of the distortion components is defined as follows:

It can be demonstrated that each of the distortion components can be factored using . For instance, considering the next result can be obtained:whereHereThe rest of the distortion components can also be factored in a similar way, where

Factorization in the form can be done based on . Therefore, according to (57) can be expressed in the following way:

5. Experimental Results

In this section, two different settings are considered for experiments. AWGN attack is investigated assuming the first kind of settings and GA attack is investigated assuming the second kind of settings. For the first type, the obtained results are compared with the results of QIM and DC-QIM. For the second type, the performance is compared with the results of RDM (DC-QIM was not considered here as it is vulnerable to GA). Here, in each type of experiment, the goal is to estimate the highest possible amount of extracted information of the method for a given intensity of attack. We explore optimization of embedding parameters. During watermark embedding, parameters , and define sets and . In addition to the mentioned parameters, , , and are needed to define and . Extracted information is maximized over , , , , , , , and by brute force approach. With the aim to reduce computations, parameters , , , and are constrained according to our considerations.

5.1. Information Extracted under AWGN

During the experiment, parameters , and were constrained as , . It can be explained by our intention to use a detector based on median thresholding inside embedding interval. Therefore, if one detects the watermark message right after embedding, the fraction for “0” and “1” will be and , respectively. One of the advantages of median thresholding is that no additional information is needed for detection even though the distribution of quantized samples is asymmetric and controlled by many parameters. The parameter for median-based (hard decision) detector is calculated using two steps: (a) for all find according to (5); (b) calculate .

For other methods that were used for comparisons, the standard hard decision detector was used (with equal length of decision intervals for “0” and “1”).

For the proposed watermarking method occurs only if the condition holds. In that case, different values of cause different robustness characteristics. However, IDL might not be suitable for some application in Digital Watermarking. For instance, in semifragile watermarking a fraction of lost data can be interpreted as the presence of an attack (which increases false positive rate). Therefore, condition has been investigated first.

In Figure 6, amount of extracted information toward WNR is plotted for the proposed method, DC-QIM and QIM [8]. During watermark extraction, the value of the normalized threshold was set to .

Error rates were calculated according to (46). However, only the integers from were used as a set for in (38) instead of the whole set . The purpose of the limitation is to reduce computational complexity while still maintaining high fidelity of the result. Then, the maximized amount of extracted information was calculated according toHere, denotes joint probability of embedding symbol and extracting symbol ; and denote probabilities of symbol to be embedded and extracted, respectively. Using joint probabilities, we calculate probabilities of extracting a particular bit:Joint probabilities can be expressed using and error rates:As it was mentioned earlier, embedding probabilities areFrom Figure 6 it can be seen that with no IDL the proposed method performs better than QIM for  dB and better than DC-QIM for WNR values less than −2 dB.

While comparing the designed method with DC-QIM, explanation of a slightly better performance should be found in the new distribution of quantized samples and thresholding technique. Optimal solution for the problem of informed data hiding has been theorized by Costa [6], and every known practical answer, including DC-QIM, uses structured codebook which lacks certain desirable characteristics [11]. We believe that the minor advantage of the presented method is due to larger number of variables that were adjusted for embedding.

In case IDL is acceptable (for a particular watermarking application), much better results are achievable for both “true” and “false” under low WNRs (Figure 7). Obviously, the demonstrated superiority is due to IDL only. Additionally, it can be seen that “false” logical value of provides slightly more beneficial outcome under AWGN compared to when is “true.” As a reference, Costa Theoretical Limit (CTL) [6] is plotted in Figure 7:

5.2. Information Extracted under GA

In this subsection, we explore performance of the proposed quantization approach under GA. For comparison, RDM is chosen instead of DC-QIM as it is known to be vulnerable to GA. We describe conditions and the results of the simulations based on real images and assuming watermark embedding followed by GA.

For the experimental evaluation, we used 92 natural grayscale images with resolution 512 × 512. Each image was split on 4 × 4 blocks and first singular values of Singular Value Decomposition (SVD) were quantized to embed a watermark [22]. The watermarking was arranged without IDL and . Document to Watermark Ratio (DWR) was set to 28 dB, whereand is the variance of the original coefficients. An example of original and watermarked diagnostic images is given in Figure 8.

For some healthcare organizations, protection of personal data is a high priority task. On the other hand, diagnostic data might need to be shared between experts from other organizations. For that purpose, DIW is a suitable tool [23]. However, an important additional constraint is imposed in that case: an expert conclusion (diagnosis) should not be affected by watermarking. According to the judgements of collaborating group of medical imaging experts, the diagnostic statement for the watermarked image in Figure 8(b) is identical to the statement for the original one in Figure 8(a).

For watermark embedding (encoding), a brute force optimization over and was repeated for each new value of . Obviously, this needs to be done only once as the optimal parameters can be stored. In addition to the concept of optimal parameters, we investigated efficiency of a constrained version of the proposed quantization approach, where and were constant and equal to and , respectively. The common sense behind such a modification is that actual of AWGN might not be known on practice during watermark embedding (because the attack happens after embedding).

For watermark extraction (hard decision decoding), these two steps are required: (1) apply GA recovery procedure; (2) define threshold. In accordance with the proposed procedure for GA recovery, criterion was used for the estimation of actual during the experiment. The condition of GA was simulated by ignoring information about value that was used for embedding. Hence, the value was estimated by the procedure of GA recovery. No information except initial guess interval with , was used for watermark extraction. In contrast to that, RDM does use the information about the exact value of quantization step. For RDM, the value of a given quantized coefficient was calculated using the information about the previous 100 coefficients.

Two types of thresholding are possible and two types of extraction conditions exist. Under condition when and are constants, no additional information needs to be transferred to the decoder. However, if and are optimized on encoder’s side, information about them might need to be sent. This is necessary if thresholding is established in the way that (e.g., the threshold is in the middle of separating zone in quantization interval). Since the requirement for additional information seems impractical, we proposed median thresholding as well.

For each method that took part in the experiment, the resulting amount of extracted information is plotted toward AWGN variance (Figure 9).

As it can be seen from Figure 9, under both mentioned conditions of embedding, the proposed approach outperforms RDM. The advantage is more evident for larger AWGN variance.

6. Discussion

In the experimental section, robustness of the proposed quantization method was estimated under AWGN and GA. The proposed approach provides higher amount of extracted information compared to the other state-of-the-art reference methods, like DC-QIM and RDM. The reasons of its superiority will be discussed in more detail in this section.

Asymmetric distribution of quantized samples and the proposed procedure for GA recovery is a successful combination that provides robustness under GA. Compared to other estimations of the scaling factor from the literature, the proposed estimation approach is light. For instance, in order to estimate the scaling factor, a model of a host is used in [14] which complicates estimation and reduces its precision. On the other hand, an approach different to estimation is exploited by RDM [15]. However, Distortion Compensation is not present in RDM. In contrast to that, the proposed quantization method has Distortion Compensation and outperforms RDM because AWGN is introduced (as a second stage of GA).

The proposed quantization approach demonstrates higher robustness under AWGN compared to well-known DC-QIM. The advancement that causes such superiority is IDL. Parameter was introduced in order to distinguish between two different ways of realization for IDL. Target distribution for all the samples in quantization interval remains the same for any logical value of (which guarantees equally successful recovery from GA). Distributions are different if “ones” and “zeros” are considered separately. This influences the resulting performance. As it has been demonstrated by the experiment, modification of the quantization approach with “false” performs slightly better. In general, usage of IDL is beneficial under low WNRs. The common sense here is that predicting the loss of some information (as a result of an attack) we might accept the scenario when a part of information is lost initially. Compared to DC-QIM, such quantization behavior enables redistribution of embedding distortion from samples that are likely to be misinterpreted to the other (non-IDL) samples that can be more robust.

Unlike DC-QIM, the proposed quantization method has many parameters that need to be set up for watermark embedding. Some additional parameters might be needed depending on the technique for watermark extraction. For instance, the thresholding that depends on may be applied, which requires parameters , to be communicated to the receiver.

On the other hand, no parameters are needed for extraction if the proposed median thresholding is used (absolutely blind extraction). Parameter can be estimated using procedure for GA recovery taking as input only rough interval . This is an advantage compared to DC-QIM that always requires to be known to the decoder.

We do not consider any case with malicious attacks (that analyze and deliberately change the watermarked signal) as they are not the objectives of our paper. However in case a key is used to protect a watermark, it will also be needed for decoding.

Lastly, we need to emphasize that the computational cost of our scheme is low. Optimization of embedding parameters conducted in Section 5 is computationally heavy, but it needs to be done only once. The optimized parameters can be used for embedding then. The computational cost of embedding is comparable with that of DC-QIM (please, refer to the quantization diagram). For extraction, complexity of the proposed procedure of GA recovery is .

7. Conclusions

A new scalar QIM-based watermarking method has been proposed in this paper. It provides higher robustness under AWGN and GA compared to other quantization methods. The benefits of the method are due to the introduced procedure of recovery after GA as well as new distribution of quantized samples with IDL.

For the new distribution of quantized samples there is no symmetry inside embedding interval. The nonsymmetric distribution of quantized samples is exploited by the introduced procedure of recovery after GA. Two different criteria are proposed to be used within the procedure. During experiment it has been confirmed that the procedure is computationally light and efficient.

In addition to the new kind of distribution of quantized samples, the proposed QIM-based method benefits from IDL. Utilization of IDL can reduce embedding distortions introduced to a host signal. This is done by letting some watermark bits to be interpreted incorrectly during embedding and before any attack occurs. A model that describes quantization process assumes that IDL can be implemented in two different ways depending on the logical value of parameter . The proposed realization of IDL is beneficial for any under highly intensive AWGN attack. However, “false” value of provides slightly higher robustness compared to “true.”

Considerable performance improvements are due to the abovementioned advancements. The amount of information extracted (using hard decision decoder) under AWGN is at the same or of a higher level compared to DC-QIM. Usage of IDL is the most advantageous under AWGN for WNRs close to −12 dB, where it performs up to 104 times better than DC-QIM. Under GA, the performance of the proposed method is up to 103 times higher than that of RDM. Finally, visual quality degradation caused by the proposed quantization method was also estimated in a subjective way by a group of medical imaging experts. It was confirmed that as a result of watermarking, important diagnostic characteristics did not change.

Nomenclature

:Unique integer index for each particular sample
:Random variable for the domain of original samples
:Particular realization of
:The length of embedding interval
:The left endpoint of th embedding interval
:Random variable for the domain of original samples inside embedding interval
:Particular realization of
:Random variable for the domain of quantized samples inside embedding interval
:Particular realization of
:Random variable for the domain of quantized samples
:Particular realization of
, :Truncated distributions for quantized samples inside embedding interval
, ,, , :Parameters of ,
, , , :Fractions of samples that are labeled as non-, , non-, and , respectively
:The length of embedding interval that is required for watermark extraction after GA
:Uniformly sampled guessed values of
:Best-fit value from according to the estimator
(or ):Random variable for the domain of quantized samples affected by attack/noise
:Particular realization of
(or ):Random variable for the domain of noisy quantized samples inside embedding interval
:Particular realization of
:Parameter of quantization model
:The set of
, :Two disjoint subsets of defined by and , respectively
:The set of
, :Two disjoint subsets of
:Random variable from
:Random variable from
:Random variable from
:Random variable from
:PDF of
:PDF of
:PDF of
:PDF of
:The threshold used by the detector
:The normalized threshold used by the detector, for example,
:Decision region “0” for the detector
:Decision region “1” for the detector
:The set of all the elements of influenced by an attack
:The set of all the elements of influenced by an attack
, :Two disjoint subsets of , defined by and , respectively
, :Two disjoint subsets of , defined by and , respectively
, :Two disjoint subsets of , defined by and , respectively
, :Two disjoint subsets of , defined by and , respectively
:Embedding distortion necessary to transform to
:Embedding distortion necessary to transform to
:Embedding distortion necessary to transform to
:Embedding distortion necessary to transform to
:Bit Error Rate for
:Bit Error Rate for
:Maximized mutual information between embedded and detected messages.

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

Acknowledgments

The paper was supported by University of Tampere. Additional gratitude for provided diagnostic images and postwatermarking quality assessment is to Dr. Oleksandr Iatsyna and other members of collaborating oncological expert group from National Cancer Institute, Kiev, Ukraine.