The Scientific World Journal

Volume 2014 (2014), Article ID 895063, 12 pages

http://dx.doi.org/10.1155/2014/895063

## A New Pixels Flipping Method for Huge Watermarking Capacity of the Invoice Font Image

^{1}Institute of Graphics and Image, Hangzhou Dianzi University, Hangzhou, Zhejiang 310018, China^{2}Institute of Digital Media, Hangzhou Dianzi University, Hangzhou, Zhejiang 310018, China^{3}University of Yamanashi, Yamanashi-ken 409-3866, Japan^{4}Department of Information Engineering and Computer Science, Feng Chia University, Taichung 40724, Taiwan^{5}Department of Computer Science and Information Engineering, Asia University, Taichung 41354, Taiwan

Received 22 July 2014; Accepted 2 September 2014; Published 12 November 2014

Academic Editor: Fei Yu

Copyright © 2014 Li Li et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

#### Abstract

Invoice printing just has two-color printing, so invoice font image can be seen as binary image. To embed watermarks into invoice image, the pixels need to be flipped. The more huge the watermark is, the more the pixels need to be flipped. We proposed a new pixels flipping method in invoice image for huge watermarking capacity. The pixels flipping method includes one novel interpolation method for binary image, one flippable pixels evaluation mechanism, and one denoising method based on gravity center and chaos degree. The proposed interpolation method ensures that the invoice image keeps features well after scaling. The flippable pixels evaluation mechanism ensures that the pixels keep better connectivity and smoothness and the pattern has highest structural similarity after flipping. The proposed denoising method makes invoice font image smoother and fiter for human vision. Experiments show that the proposed flipping method not only keeps the invoice font structure well but also improves watermarking capacity.

#### 1. Introduction

With the development of multimedia technology, more attention is drawn to the certification of digital media content in the aspects of security, confidentiality, authenticity, and integrity, which, as a result, promotes the development of information hiding and digital watermark technology.

Nowadays, a large number of documents such as financial, insurance, and patents have been transformed into digital documents and stored as binary image documents. To guarantee the certification appears on the invoice font image, binary image watermarking technique is applied. According to human visual system model, few changes to the pixels on a color image or a grey image would not be perceived by human eyes because of the large pixel value range and high redundancy [1–5]. However, binary images only have two colors (black and white) which means little redundancy. When changing the black (white) pixels of a binary image into white (black) ones, human eyes can catch the change easily. Therefore, it is more difficult to embed watermark into binary images than into color images and grey images.

At present, there are some watermarking algorithms on binary image [6–11]. These algorithms can be classified into two categories. One category is based on adjusting the structure information of image content. Tan et al. [12] proposed a binary image watermarking algorithm based on changing the direction of the stroke, but it is for Chinese character image. The other category is to flip the pixels of binary image, that is to embed watermark information by changing the black (white) pixels into white (black) ones. Wu and Liu [8] proposed a scoring standard to judge the “flappable” pixels and embedded watermark by flipping the pixels of the binary image. Zhao and Koch [13] proposed a method to embed watermark by changing the rate of numbers of the black pixels and white pixels in one block. Qi et al. [10] embedded watermark according to the multiplicative transformation model to modify black pixels numbers of character picture.

Most binary image watermarking algorithms based on flipping pixels would use the scoring standard proposed by Wu and Liu [8]. But Wu’s method has some weakness because the flippable pixels are obtained by the connectivity and smoothness of pixels. Firstly, the structure information around pixels is not considered; secondly, when a huge number of pixels need to be flipped, Wu’s method may not be able to meet the flipping quantity requirement; lastly, with the numbers of flipped pixels increasing, the flipped binary image would have some noises just like some burrs using Wu’s method. This results in bad visual quality of the flipped binary image.

To solve those problems, we proposed a novel algorithm to flip pixels for the binary font image. It includes one novel scaling interpolation for binary image, one novel pixels flippable evaluation mechanism, and one novel denoising method based on gravity center and chaos degree. The scaling interpolation method ensures that the invoice font image keeps features well after scaling. The pixels flippable evaluation mechanism ensures that the pixels keep better connectivity and smoothness and keeps the pattern highest structural similarity after flipping. The proposed denoising method makes the flipped invoice font image smoother and fitter.

#### 2. The Proposed Pixels Flipping Method for the Invoice Binary Image

The proposed pixels flipping method includes a few steps as shown in Figure 1. Step 1, by the small proportion of the scaling for the invoice font image to reduce the flipping number. The font binary image is obtained by small proportion scaling interpolation of the original font binary image based on the flipping number. Step 2 is to calculate the pixels flipping number , of and , respectively, and choose the image as the final flipping object which has less flipping number. Step 3 is to choose and flip the flippable pixels based on proposed flippable evaluation mechanism. Step 4 is to denoise the watermarked invoice font binary image based on gravity center and chaos degree to get a better image quality.

The remainder of this section contains the following. In Section 2.1, the proposed scaling interpolation method for binary image is given. In Section 2.2, the proposed pixels flippable evaluation mechanism is given. In Section 2.3, the proposed pixels flipping strategy is given. In Section 2.4, the denoising method based on gravity center and chaos degree is proposed.

##### 2.1. The Proposed Scaling Interpolation for Binary Image

The huge watermarking capacity of invoice binary image needs to flip large number of pixels. According to human visual system (HVS), few changes of pixels on an image cannot be perceived by human eyes. The invoice font image after scaling interpolation will add or reduce some black pixels, which effectively reduce the number of pixels to be flipped.

The invoice font image after scaling keeps the original font structure features well, such as Figures 2(a) and 2(b). After scaling, Figure 2(b) keeps the whole font structure well except generating some noises which look like sawtooth though and the change is not sensitive. Therefore, before flipping the pixels, the font image needs scaling interpolation according to the ratio between numbers of pixels to be flipped and numbers of the font image’s total black pixels.

Traditional interpolation methods include nearest-neighbor interpolation, bilinear interpolation, bicubic interpolation, and spline interpolation. The interpolation methods mentioned above can achieve well effect to image scaling when the image is colored or grey. A new pixel value between 0 and 255 can be obtained using interpolation methods mentioned above. But a binary image only has two color values 0 and 255, and any other values between 0 and 255 are not allowed. In addition, the sawtooth effect caused by traditional interpolation methods will lead to bad effect to the flappable pixels which will be address later. To resolve these two weaknesses, we proposed a new interpolation algorithm for binary images.

Here we used the way similar to nearest-neighbor interpolation and bilinear interpolation. The new pixel is obtained by interpolating the 4 adjacent pixels of the original pixel. According to the 4 adjacent pixels, the new pixel to be interpolated can be placed in five different locations, as follows.(1)Four pixels around are all white. See Figure 3(a).(2)Three pixels are white among the four pixels around. See Figure 3(b).(3)Two pixels are white among the four pixels around. See Figures 3(c) and 3(d).(4)Only one pixel is white among the four pixels around. See Figure 3(e).(5)Four pixels around are all black. See Figure 3(f).

We applied different strategies to acquire the pixel value to be interpolated aiming for the five cases above.

In case , pixel has four white pixels around it, so we think in this case should be placed in white flat area and its value must be white. Otherwise, the binary image may show some noises.

In case , pixel is not in a flat area but the number of white pixels is larger than that of black ones, so pixel must be white. If is black, will become a noise with the diagonal being white.

In case , when the four pixels are distributed like in Figure 3(c), we think that the font image’s stroke has the direction of , and is the edge of the font from black to white. If we set to black pixel, the stroke direction of the original font image will be destroyed, so pixel must be white. When the four pixels are distributed like in Figure 3(d), we are not able to judge the direction of the stroke, so we find a pixel which is nearest to pixel from pixels , , , and , and then set pixel ’s value to its nearest one’s value.

In case , there is only one white pixel around pixel , as shown in Figure 3(e). Pixel is not in a flat area but in the edge of the stroke from black to white. In this case, we think that the stroke of font image has the direction of diagonal , and whether ’s value is black or white must be determined by the position of —whether it is inside or outside of triangle , so we just need to judge that is in bottom right or top left of line . This can be determined by computing the distance of to and to . The Euclidean distance of to and to is defined as and separately. is defined as the pixel value to be interpolated. Their relations are as follows:

Case is similar to case , and pixel is set to black.

The interpolation effect acquired from the method above can reduce sawtooth effect efficiently, and it is specially fit for scaling interpolation of font image. Only nearest-neighbor interpolation among traditional interpolation methods can be applied to binary image. So we compare our method against the nearest-neighbor interpolation method and simultaneously magnify the image by two sizes. We only showed a partial area of the font image because the image resolution is too high after magnifying. The comparison result shows that our method can reduce sawtooth effect efficiently, as shown in Figure 4.

##### 2.2. The Proposed Pixels Flippable Evaluation Mechanism

Wu’s method chooses the flippable pixels only according to the connectivity and smoothness of pixels without the structure information. When a large number of pixels need to be flipped, it may not be able to choose enough flipping quantity. To solve those problems, we proposed a novel flippable evaluation mechanism to determine the pixels to be flipped and then only select those flippable pixels to be flipped. The flippable evaluation mechanism includes the score of connectivity and smoothness and the pattern substitution based on SSIM, and the pixels which have higher score are flippable but to those pixels which have lower score we search the pattern which has the maximum SSIM to substitute the original pattern and the pattern is flippable.

###### 2.2.1. Wu’s Method on the Score of Pixels

Binary images only have two values, white and black which are denoted by 0 and 1, respectively, so the visual quality can be reduced greatly even if one value is changed. The binary image watermarking method is generally based on the modified pixels of the boundaries. Wu’s method gives the score of the flippable pixels based on the smoothness and connectivity of the pattern window. The method determines the scores dynamically by observing the smoothness and connectivity. The smoothness is measured by the horizontal, vertical, and diagonal transitions in a local window , and the connectivity is measured by the number of the black and white clusters. For example, Figure 5(a) has better visual quality when flipping the center pixel than Figure 5(b) using Wu’s method.

The higher score pattern indicates higher priority to flip the center pixel because of keeping good smoothness and connectivity after flipping. Using Wu’s method, we flip those flippable pixels whose scores are greater than or equal to 0.3 to ensure the flipped image quality. We summarized the ratio of different score pixels in four most-commonly used invoice font binary images as shown in Figure 7. The invoice is shown in Figure 6.

It is concluded that the sum of ratio of 0.625, 0.375, and 0.25 in the four font binary images are 9.09%, 6.71%, 8.44%, and 8.03%, respectively. The ratio of flappable pixels in binary image is less than 10%. Therefore, when there are more than 10% pixels to be flipped, we cannot find enough flappable pixels using Wu’s method. Moreover, Wu’s method does not consider the structural of window pattern when flipping. So, in this paper, we adapted the window pattern substitution based on SSIM.

###### 2.2.2. The Proposed Pattern Substitution Based on SSIM

Wang et al. [14] introduced an alternative complementary framework for quality assessment based on the degradation of structural information. Based on that, a measure of structural similarity (SSIM) that compares local patterns of pixel intensities that have been normalized for luminance and contrast.

The SSIM can be described as and are two images. The is the structural similarity of and . is the luminance comparison function between and , is the contrast comparison function, is the structure comparison function, and is the combination function. The three comparison functions are defined as follows: and are the mean intensity of and , respectively, is the dynamic range of the pixel value, and and are the two parameters which are far less than 1. and are the standard deviation of and , correlation coefficient between and .

Finally, combine the three comparisons of (3), (4), and (5) and obtain the similarity measure SSIM index between signals and :

The value of SSIM is between 0 and 1; the larger the value is, the higher the similarity of the two signals is.

The SSIM not only is the method to evaluate two images but also gives us the idea of how to flip pixels. In this paper, we define the local window in binary image pattern. As shown in Figure 8, the pattern has 4 black pixels; if we add black pixels to , we can calculate the SSIM of all the patterns where its black pixels count greater than 4 and choose pattern which has the maximum SSIM value to substitute pattern .

##### 2.3. The Proposed Pixels Flipping Method for Huge Watermarking Capacity

Assuming the original font binary image is , the resulting image is obtained by flipping pixels of . The quantity of flipping pixels is determined by watermarks. The novel pixels flipping method for huge watermarking capacity is as follows.

*Step **1*. Scale the image using our proposed interpolation method and determine the original font binary image or the scaled font binary image to be flipped.

The black pixels count of is ; the black pixels count of is . The scaling ratio based on and is estimated using the following formula: is scaled by using our proposed interpolation method according to the ratio , the scaled font binary image is denoted by , and the black pixels count of is . The distance between and is calculated. The distance between and is calculated, respectively:

*Step **2*. Select the smaller to be flipped by comparing and :
is the one to be flipped. The resulting flipping quantity is obtained by the following formula:

*Step **3*. Flip pixels based on the pixels scoring of smoothness and connectivity on . Based on the threshold , sort all the pixels point by the scoring and flip the pixels which score greater than . If pixels point’s score is greater than , then there are pixels to be flipped.

When there are a lot of pixels that need to be flipped, those sorted pixels whose scores are less than or equal to are of sequential pattern substitution until achieving the flipping quantity.

##### 2.4. The Proposed Denoising Method Based on Gravity Center and Chaos Degree

The flipped invoice font image which is generated by pixels flipping and pattern replaced by SSIM may produce burr or hole noises. In order to further reduce noises, we proposed an effective denoising method of pixels moving based on the gravity center and chaos degree of pattern.

The gravity center of an image is the center of the weight of the image. The weight of the binary image is the count of its black pixels; therefore, the black pixels distribute around the center of the gravity. The black pixels gather around the center of gravity, so the nearer the center of gravity of area is not allowed to have the appearance of white pixels. But before that, there should be one mechanism to judge whether the pixel in binary font image is noisy or not. So, we brought up the concept of chaos degree in pattern.

Assuming that the black pixels value in a binary image is 0 and white pixels value is 1, the chaos degree is the sum of difference between the pixels and its adjacent pixels of all the pixels in pattern where its size is . The greater the chaos degree is, the more disorderly the pattern is. In this section, the patterns are noise patterns if their chaos degree is equal to or greater than 5. The definition of the chaos degree is as follows: is the pattern where its size is in binary image and is the chaos degree. In Figure 9, the two examples of calculation of chaos degree are given and the pattern in Figure 9(a) has good quality and will not bring about noise because its chaos degree is 4, but the chaos degree of the pattern in Figure 9(b) is 12; it is very messy in visual and this noise pattern must bring about burr or hole noise in font binary image.

Based on the proposed chaos degree, we can denoise the font binary image as follows.

*Step **1*. Calculate the chaos degree and find out the noise pattern , the coordinate center of as the center outward expansion to form a wider range pattern ; for example, the size of can be , , or ;

*Step **2*. Calculate the gravity center of , assuming the coordinate of gravity center is ; the equation is as follow:

*Step **3*. Calculate the distance of gravity center for each pixels point in pattern :

Exchange the positions of the two points which have maximum distance black pixel and minimum distance white pixels to gravity center, respectively. By moving the black pixel which has the maximum distance to gravity center of to the position of white pixels which have the minimum distance, it ensures that the nearer the center of gravity of area is not allowed to have the appearance of white pixels. The process is repeated until the chaos degree of pattern falls below 5 or it cannot fall below 5 after having been repeated three times.

This denoise method can make the strokes of font binary image smoother and also fitter. The burr and hole noise in binary image can be effective reduced. Figure 10 gives an example of denoise in one pattern; we can see that the pattern with chaos degree 12 can be reduced to 4 in three steps and the resulting pattern has a good quality and is fitter for the stroke of font binary image.

In Figure 11, it is concluded that the proposed denoising method based on gravity center and chaos degree can effectively reduce the burr and hole noise and make the font binary image smoother and fitter.

#### 3. Experimental Results and Comparisons

A lot of experiments are conducted and the results show that the proposed algorithm is effective. In this section, we give the experiments results and the comparisons. Our comparisons focus on the image quality and pixels flipping capacity with Wu’s method.

Wu’s flipping method is an effective flipping method in binary image, but there are still some weaknesses.(1)It only considers the smoothness and connectivity but does not take into account the structure information around the flipping pixels.(2)When there are many pixels to be flipped, some burr or hole noises may occur.(3)When there are many pixels that need to be flipped, it may not be able to meet the flipping quantity requirement.

According to Wu’s method, we flipped the font binary image in Figure 12(a) with a different pixel flipping quantity.

It proved that the boundaries of the font stroke have a lot of burr when flipping count is more than 900 which results in greatly reduced visual quality. The proposed denoising method based on gravity center and chaos degree can reduce these noises. Denoise the font binary images after score as shown in Figure 13.

When flipping quantity is more than 1200, the number of flappable pixels is less than 1200. So it cannot work when flipping more than 1200 pixels unless it repeats flipping. The flipping method proposed can increase the capacity of flipping count and the pattern substitution based on SSIM ensures the structural similarity of font binary image. The proposed pixels flipping method is tested in Figure 14.

In Figure 14, the quality of flipping is twice as large as good as Wu’s. The font binary images have better visualization compared to Wu’s at same flipping quality, such as Figure 12(g) compared to 14(c) and 12(i) compared to 14(d).

The proposed flipping method can dynamically determine the object which is pixels flipping is original font binary image or the scaled font binary image which is using our interpolation algorithm. For example, when flipping 800 pixels the scaled font binary image was chosen as the flipping object. This method can increase the capacity of flipping efficiently as shown in Figure 15.

We can clearly see that Figure 15(c) has better visual quality than Figure 15(d) and is fitter for human vision.

#### 4. Conclusion

We proposed a new pixels flipping method for huge watermarking capacity of invoice font image, including the proposed scaling interpolation of the binary image, pixels flippable evaluation mechanism, and the proposed denoising method based on gravity center and chaos degree. The proposed scaling interpolation method ensures that the font image keeps features well after scaling. The pixels flippable evaluation mechanism ensures that the pixels after flipping keep better connectivity and smoothness and the pattern after substitution has the highest structural similarity. The proposed denoising method makes the font binary image smoother and fitter for human vision. The experiment shows that our proposed flipping method not only keeps the font structure feature but also has more watermark embedding quantity.

#### Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

#### Acknowledgments

This work was partially supported by the National Key Technology Research and Development Program of the Ministry of Science and Technology of China (no. 2012BAH91F03) and National Natural Science Foundation of China (no. 61370218).

#### References

- I. J. Cox and M. L. Miller, “The first 50 years of electronic watermarking,”
*Eurasip Journal on Applied Signal Processing*, no. 2, pp. 126–132, 2002. View at Google Scholar · View at Scopus - F. A. P. Petitcolas, R. J. Anderson, and M. G. Kuhn, “Information hiding—a survey,”
*Proceedings of the IEEE*, vol. 87, no. 7, pp. 1062–1078, 1999. View at Publisher · View at Google Scholar · View at Scopus - F. Hartung and M. Kutter, “Multimedia watermarking techniques,”
*Proceedings of the IEEE*, vol. 87, no. 7, pp. 1079–1107, 1999. View at Publisher · View at Google Scholar · View at Scopus - M. M. Yeung and F. C. Mintzer, “Invisible watermarking for image verification,”
*Journal of Electronic Imaging*, vol. 7, no. 3, pp. 578–591, 1998. View at Publisher · View at Google Scholar · View at Scopus - I. J. Cox, J. Kilian, F. T. Leighton, and T. Shamoon, “Secure spread spectrum watermarking for multimedia,”
*IEEE Transactions on Image Processing*, vol. 6, no. 12, pp. 1673–1687, 1997. View at Publisher · View at Google Scholar · View at Scopus - K. Matsui and K. Tanaka, “Video-steganography: how to secretly embed a signature in a picture,” in
*IMA Intellectual Property Project Proceedings*, pp. 187–205, 1994. View at Google Scholar - M. S. Fu and O. C. Au, “Data hiding by smart pair toggling for halftone images,” in
*Proceedings of the IEEE Interntional Conference on Acoustics, Speech, and Signal Processing (ICASSP '00)*, pp. 2318–2321, IEEE, June 2000. View at Scopus - M. Wu and B. Liu, “Data hiding in binary image for authentication and annotation,”
*IEEE Transactions on Multimedia*, vol. 6, no. 4, pp. 528–538, 2004. View at Publisher · View at Google Scholar · View at Scopus - N. F. Maxemchuk and S. H. Low, “Marking text documents,” in
*Proceedings of the International Conference on Image Processing (ICIP '97)*, vol. 3, p. 13, Marking Text Documents. in. ICIP (3, Santa Barbara, Calif, USA, October 1997. View at Publisher · View at Google Scholar - W.-F. Qi, X.-L. Li, B. Yang, and D.-F. Cheng, “Document watermarking scheme for information tracking,”
*Journal on Communications*, vol. 29, no. 10, pp. 183–190, 2008. View at Google Scholar · View at Scopus - H. Cao and A. C. Kot, “EAG: Edge adaptive grid data hiding for binary image authentication,” in
*Proceedings of the 4th Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC '12)*, pp. 3–6, December 2012. View at Scopus - L. Tan, X. Sun, and G. Sun, “Print-scan resilient text image watermarking based on stroke direction modulation for Chinese document authentication,”
*Radioengineering*, vol. 21, no. 1, pp. 170–181, 2012. View at Google Scholar · View at Scopus - J. Zhao and E. Koch, “Embedding robust labels into images for copyright protection,” in
*Proceedings of the International Congress on Intellectual Property Rights for Specialized Information, Knowledge and New Technologies*, pp. 242–251, Citeseer, 1995. - Z. Wang, A. C. Bovik, H. R. Sheikh, and E. P. Simoncelli, “Image quality assessment: from error visibility to structural similarity,”
*IEEE Transactions on Image Processing*, vol. 13, no. 4, pp. 600–612, 2004. View at Publisher · View at Google Scholar · View at Scopus