Abstract

Smart mobile devices are playing a more and more important role in our daily life. Cancelable biometrics is a promising mechanism to provide authentication to mobile devices and protect biometric templates by applying a noninvertible transformation to raw biometric data. However, the negative effect of nonlinear distortion will usually degrade the matching performance significantly, which is a nontrivial factor when designing a cancelable template. Moreover, the attacks via record multiplicity (ARM) present a threat to the existing cancelable biometrics, which is still a challenging open issue. To address these problems, in this paper, we propose a new cancelable fingerprint template which can not only mitigate the negative effect of nonlinear distortion by combining multiple feature sets, but also defeat the ARM attack through a proposed feature decorrelation algorithm. Our work is a new contribution to the design of cancelable biometrics with a concrete method against the ARM attack. Experimental results on public databases and security analysis show the validity of the proposed cancelable template.

1. Introduction

Nowadays, mobile devices, e.g., smartphone, have become one of our daily necessities. They are used to store personal data and handle private communication. Unfortunately, there are some privacy and security issues along with the use of smartphones; for example, a user’s private data, e.g., photo, contacts, and bank details, can be compromised, if his/her smartphone without any protection is stolen or lost. Secret knowledge-based approaches, e.g., password or PIN, are commonly used for authentication. However, these methods rely heavily on the user to ensure continued validity, and poor use of password or PIN may lead to great security breaches [1].

Fingerprint recognition has been extensively used in various applications, e.g., authentication on mobile devices. With good recognition accuracy and high convenience, fingerprint-based authentication systems hold more markets than other biometrics, e.g., face, iris, and voice. Fingerprint authentication is usually composed of two stages, the enrollment stage and the verification stage [2]. In the enrollment stage, feature data extracted from a captured fingerprint image are considered as a template, which is stored in a database. In the verification stage, the query feature data, extracted in the same way as template feature data, are compared with the stored template, and a match or nonmatch verdict will be made.

The use of fingerprint authentication systems eliminates the need of remembering long passwords or PIN, which is the disadvantages of traditional password- or PIN-based authentication schemes. However, fingerprint-based biometric systems have one main drawback; that is, once a fingerprint is compromised, it will be lost forever. To address this issue, cancelable fingerprint templates [3], among other biometric template protection schemes, have been proposed. Instead of storing raw fingerprint data as a template, it is distorted through a one-way transformation function in the enrollment stage. Such a transformation is intentional and repeatable. One important property of cancelable fingerprint templates is noninvertibility, which means that it should be computationally hard to recover the raw fingerprint data from the transformed fingerprint template [4]. In the verification stage, the same transformation is applied to the query data. Matching between the transformed template and query is conducted in the transformed domain. In this way, if the stored template is compromised, a new version of it can be generated by changing the transformation parameter(s) [5].

1.1. Related Work
1.1.1. Mobile Biometrics

A lot research effort has been devoted to the design of more precise, usable, and secure biometric authentication schemes on mobile devices. For instance, in [16], Clarke and Furnell introduced a method for authenticating users by getting them to input telephone numbers or write text messages. This method is called biometric keystroke analysis. In [17], Kim and Hong proposed to use teeth together with voice to authenticate users, which is the first research work using the teeth and voice combination. The matching scores of each individual trait are calculated and fused using a weighted-summation operation. The experiments are conducted by using a dataset that contains one thousand teeth images and voices collected by smartphones. Later they proposed an enhanced multimodal authentication system [18], which adds another biometric trait, face, on top of teeth and voice to achieve better results.

In [19], Lee et al. designed a mobile multimodal biometric system based on finger-vein and fingerprint. The proposed system can obtain fingerprint and finger-vein images simultaneously and also is able to overcome some limitations of unimodal biometric systems, e.g., lack of accuracy. In [20], Tao and Veldhuis developed a face-based biometric authentication system on mobile devices, which contained detailed information about the process, including face detection, registration, illumination normalization, verification, and information fusion.

In [21], Chen et al. proposed a fingerprint-based remote authentication method using mobile devices. In their method, both fingerprint and password are involved to improve the security level of the system. Moreover, hashing functions are used to implement mutual authentication. In [22], a palmprint-based recognition system is proposed for mobile devices. Specifically, a hand-shaped guide window is introduced for fast image acquisition and an enhanced competitive code is used to cope with image variation.

In [23], Rattani et al. investigated gender prediction from ocular images acquired by smartphones so as to enhance the accuracy of the integrated biometric authentication and mobile healthcare system. In [24], Marsico et al. compared the performance of several participant methods in the Mobile Iris Challenge Evaluation-1 contest. Furthermore, some analysis is given to image covariate and interoperability.

Several survey papers, e.g., [25, 26], discussed the biometric authentication methods on mobile devices, including current development, trends, and challenges. The significance of template protection on mobile devices is described in [26]. However, we notice that most of above-mentioned biometric authentication methods do not protect the biometric template on mobile devices, which potentially put important personal information at risk. With good recognition accuracy and high convenience offered by fingerprint, in this paper, we propose a fingerprint-based authentication system using the cancelable technique to provide template protection on mobile devices.

1.1.2. Cancelable Biometrics

The concept of cancelable biometrics was initiated by Ratha et al. in [27]. Later, they constructed a practical cancelable fingerprint authentication system [4] by using three different transformation functions, namely, Cartesian transformation, polar transformation, and functional transformation. The transformation functions are able to distort the fingerprint minutiae feature into a new data format. This method is registration-based and hence relies on precise detection of the reference points, e.g., singular, core, or delta points. However, fingerprint uncertainty caused by displacement, nonlinear distortion, and rotation during the process of fingerprint capturing is unavoidable, thus making accurate registration hard to achieve. A matching error can possibly be caused by a registration error [28].

To relinquish the process of global registration and also reduce the impact of nonlinear distortion [29, 30], registration-free local structure based methods have been proposed; see, e.g., [511, 13, 14, 3140]. In [5], Yang et al. proposed a cancelable template design based on geometric transformation. Each local Delaunay triangle-based structure instead of each single minutia acts as a unit to be transformed under the guidance of two transformation matrices. In [31], Farooq et al. presented a cancelable fingerprint template based on a set of triangles derived from any three minutiae. The features extracted from the set of triangles are further converted into the binary format. Then this binary string was randomly permuted into a different feature representation under the conduct of a user specific key. The cancelability of the feature representation can be achieved by applying different keys. Lee et al. [32] proposed an approach to calculating a rotation- and translation-invariant value from the orientation information of neighboring local regions around each minutia. The invariant value is then utilized as the input of two transformation functions to generate transformed features. The cancelable template is governed by these two transformation functions. In [33], in order to avoid global alignment, the authors used localized matching, which consists of matching minutia triplets constructed by each minutia and its two nearest neighbors. Invariant features extracted from these triplets are varied and secured by the symmetric hash functions. A major drawback of this approach is the assumption that the genuine query sample minutiae, being described as locations in a complex plane, are linearly transformed from the template minutia set. However, the most challenging issue for fingerprint minutia-based matching is the nonlinear elastic distortion of minutia locations. Therefore, this assumption is not realistic. Ahn et al. [6] applied geometrical properties, e.g., local relation, from minutiae triplets to hiding the minutiae information. Generation of these geometrical features is conducted via a transformation function in an attempt to keep the discriminating capability.

In [7], Yang et al. introduced the geometrically aligned and protected minutia vicinity for template protection. The proposed method transforms the original minutia vicinity by adding some parameter guided offsets into each minutiae group so as to destroy the original local topological relationship among those minutiae. The original minutia vicinity is defined by a minutia together with its closest neighboring minutiae. In [34], minutia pairs applying redundant combinations of two minutiae points are formed to counter some image noise. A bit-string cancelable template is derived from the minutia pairs. Similarly to [31], the same user specific tokens are used to guarantee that the bit-string features are permuted in the same manner during both enrollment and verification stages. A different local structure represented by a 3D array was proposed in [35]. In this structure, each minutia is chosen as the reference point and other minutiae are rotated and translated based on the orientation and position of the reference point so as to map the minutiae into the 3D array. Each minutia after transformation falls into a specific cell of the 3D array and each cell is marked as 1 if more than one minutia locates in it; otherwise it is marked as 0. A resultant bit-string is then permuted by using a user specific PIN. Yang et al. [8] proposed to use a dynamic random projection method to protect the biometric features extracted from local structures composed of a reference minutia and three closest minutiae around it. Although this method can dynamically choose a projection matrix from a set of candidate projection matrices, it is a primitive direct biometric key generation technique, which has poor error tolerance for low quality fingerprint images.

Ahmad et al. [9] built a pair-polar coordinate-based alignment-free structure. The pair-polar structure uses the relative position of each minutia to other minutiae in a polar coordinate space. Three local features are extracted from any two minutiae and then a functional transformation is applied to these local features to achieve the resultant cancelable template. In [36], a circular region is constructed around each minutia and the circular region can be divided into different levels according to different radiuses. Then the circular regions are encrypted by two transformation functions and stored as the cancelable template. In [37], Jin et al. generated a revocable fingerprint template via a polar grid-based method. For each reference minutia, the polar transform is performed first to align the remaining minutiae. After that, a 3-tuple quantization technique is utilized to generate the local feature in the form of a bit-string. The user specific token based permutation technique is also utilized for feature transformation. Das et al. [38] proposed to use the minimum distance graphs, which involve a set of interminutia minimum distance vectors starting from the core point, as rotation- and translation-invariant features to conduct hash transformation so as to protect the original template.

In contrast to the pair-polar based structure proposed in [9], Wang and Hu [1012, 41] mainly considered the noninvertible transformation functions. In [10], a densely infinite-to-one mapping method is presented to accomplish the transformation; in [11], a curtailed circular convolution is used to achieve noninvertible transformation; in [41], the identifiability condition in blind channel estimation is deliberately violated to protect the source input—the binary string’s frequency samples; and, in [12], the binary biometric representations are securely protected by the partial Hadamard transformation, which transforms them into complex vectors. In [39], Wong et al. designed a multiline code for generating the cancelable fingerprint template. The multiline code is a string-based minutia descriptor extracted from a set of minutiae surrounding a virtual line within a specified range. Then a user specific secret key guided permutation is performed to achieve feature transformation. In [13], Jin et al. developed a two-dimensional random projection technique to secure the minutiae-based fingerprint template generated from minutia vicinities. Each minutia vicinity is formed by a minutia and three nearest minutiae around it. The feature matrix generated from a set of minutia vicinities is transformed/mapped onto a random subspace determined by an external orthogonal random matrix, which is generated by a user specific token. In [14], Jin et al. adopted the same local structure as [13] but incorporated a different noninvertible transformation method named randomized graph-based hamming embedding (RGHE) to protect the original features. In [40], Zhang et al. presented two methods, which are a designed combo plate and a functional transformation, to produce cancelable templates based on the MCC code, which associates a local structure to each minutia [42]. Note that the method named P-MCC in [42] is not a cancelable template as it does not provide the property of revocability. Subsequently, a partial permutation based scheme named 2P-MCC [15] was proposed to add revocability to P-MCC. However, 2P-MCC suffers from the ARM. P-MCC is not revocable, which means that the feature vector that contains binary values in P-MCC remains unchanged in different applications. The 2P-MCC scheme uses a user specific key as the index and chooses () binary values from those binary values of to create a new feature vector . When unlinkability is required, at least one element of the binary values from any two templates, e.g., , should be different. By combining templates from at most (k-c+1) applications, the feature vector can be restored. Once feature vector is obtained by the adversary and due to being the same in different applications, the adversary can verify the correctness of the inverted feature generated from template and the user specific key s by comparing it with .

In [43], Kaur and Khanna presented a cancelable biometric method, named random slop, which can reduce the feature dimensions by up to 75%. This method has been tested on many biometric databases, such as face, palmprint, palmvein, and fingervein, except fingerprint, although it can be implemented to fingerprint features. In order to further enhance recognition accuracy and security of cancelable unibiometric systems, in [44], Paul et al. developed a cancelable biometric template creation algorithm using random biometric feature fusion, random projection, and selection based on face and ear. In [45], Yang et al. proposed a fingerprint and finger-vein based cancelable multibiometric system, in which a feature-level fusion strategy with three fusion options are designed. In the meantime, an enhanced partial discrete Fourier transform based noninvertible transformation is applied.

1.2. Motivation and Contributions

From the above-mentioned cancelable fingerprint template systems, it is observed that the rotation- and transformation-invariant local region based structures are employed and the main difference between these local structures is the region size. For example, small-sized local structures are used in [68, 13, 14, 33]. In these systems, the local structures are only composed of one reference minutia and two or three of its neighboring minutiae. Small-sized structures can resist nonlinear distortion to some extent. However, since these local structures only contain a moderate number of minutiae, the feature data extracted from them are not discriminative enough. By contrast, relatively large local structures are employed in [911, 31, 34, 35, 3740]. Large-sized structures consist of more minutiae and tend to include relatively more information that can be exploited for matching. However, the positional change of minutiae for large-sized structures is likely to occur under nonlinear distortion, especially when minutiae are located far away from the reference point. Because of nonlinear distortion, for example, minutiae , , and in the template image and their corresponding minutiae , , and in the query image are possibly located in different cells in a polar coordinate system, as shown in Figure 1. To strike a balance and achieve good matching performance, some authors [32, 36] proposed to use parameters to control the region size. However, testing different parameter settings in a practical authentication process would create additional computational load and resource burden or even be infeasible.

Another issue that troubles the existing cancelable template systems is the security concern. The methods in [5, 31, 34, 35, 37, 39] use permutation matrices for feature transformation. Since the permutation functions are invertible [10], original fingerprint features are not secured safely if the permutation matrices are compromised. Although the methods in [7, 915, 40, 41] that use the noninvertible many-to-one mapping strategy can avoid the above issue, they suffer from the attacks via record multiplicity (ARM) [46, 47], if multiple transformed templates and their corresponding transformation parameters are acquired by an adversary. Readers can refer to [46, 47] for more details about the ARM.

To address the above issues, in this paper, we propose a new cancelable fingerprint template system. Not only can the proposed system reduce the impact of nonlinear distortion on those minutiae that are located far away from the reference point, but it also can defend the system against the ARM attack. In particular, the new cancelable fingerprint template system processes both local and global structures and fuses two different schemes at the score level, so that the overall system achieves better recognition accuracy than a single scheme only. The overall processing flow of the proposed system is shown in Figure 2. Specifically, in the enrollment stage, two feature sets, the polar coordinate-based feature set (T1) and the Delaunay triangulation-based feature set (T2), are extracted from the polar coordinate-based local structure and Delaunay triangulation-based global structure, respectively. Next, feature set T1 is processed by the proposed feature decorrelation algorithm and then varied by a random projection transformation function. Feature set T2 is permutated under the guidance of the feature codes. Both transformed feature sets, T1 and T2, are stored in the database. In the verification stage, the same transformations used in the enrollment stage are applied to the query feature sets, Q1 and Q2. Matching between the transformed template and query feature sets is conducted in the transformed domain.

The main contributions of this paper are highlighted as follows.

(1) Two schemes, polar coordinate-based scheme and Delaunay triangulation-based scheme, are utilized and fused on the score level. Specifically, the polar coordinate-based scheme uses the feature set from the polar coordinate space and acts as our basic scheme, which can provide reasonable feature discrimination. The Delaunay triangulation-based scheme that uses the feature set from Delaunay triangulation serves as a complement to the basic scheme. It can effectively reduce the negative impact of nonlinear distortion on the minutiae that are located far away from the reference minutia. As a result, compared with the system that only uses the polar coordinate-based scheme, higher recognition accuracy is achieved.

(2) A main reason that cancelable biometrics suffer the ARM attack is due to feature correlation which exists among cancelable templates stored across multiple applications but actually derived from the same biometric features. To address the issue, we propose a feature decorrelation algorithm so that the feature vectors, which are generated from the same feature set, are uncorrelated in different applications. Without feature correlation, the adversary would not have adequate information to determine the original feature data.

The rest of the paper is organized as follows. In Section 2, two feature sets, polar coordinate-based feature set and Delaunay triangle-based feature set, are introduced. The proposed cancelable fingerprint authentication system, which can defeat the ARM attack, is presented in Section 3. In Section 4, experimental results and security analysis are demonstrated and discussed. The conclusion is given in Section 5.

2. Generation of Two Feature Sets

In this section, we introduce two feature sets that are extracted from the polar coordinate-based and Delaunay triangulation-based structures, respectively. These two structures are derived from the same minutiae set of a fingerprint image. Each polar coordinate-based structure is composed of a reference minutia and other minutiae in a predefined range. The use of the polar coordinate system allows the relationships between the reference minutia and other minutiae to be readily defined and measured. The Delaunay triangulation-based structure formed by a set of minutiae is a triangulation where no minutia in that set of minutiae is inside the circumcircle of any triangle in the triangulation. Specifically, given a set of minutiae , where is the number of minutiae, each minutia can be represented by a vector , where and are the , coordinates in the Cartesian coordinate system, is the orientation in the range of , and is the minutia type. For each fingerprint image, the following two feature sets are generated from the above two structures.

2.1. Polar Coordinate-Based Feature Set

In the polar coordinate-based structure, if the minutia, e.g., , is considered as the origin of the polar coordinate and the remaining minutiae in range of (=300 pixels) are rotated and translated with respect to such that the orientation of equals 0 degree in the polar coordinate system, then any minutia can be converted and expressed as a triplet , where is the radial distance, is the radial angle, and is the relative orientation of minutia to . An example of the polar space centered around minutia is shown in Figure 1. In order to tolerate small distortion, polar grid-based quantization [35] is performed on all the minutiae in the range of . We assume that the step sizes of , , and are , , and , respectively ( and ). Then the polar space centered around can be quantized into a 3D cube containing cells, where , , and . The cell where the minutia is located in the 3D cube is , where , , and . By this means, we obtain a vector of length containing only ‘0’s and ‘1’s, in which ‘1’ means the appearance of one or more minutiae in the corresponding cell. varies under different parameter settings of , , and . It is obvious that when these parameter settings change, the value of changes accordingly. In our application, different parameter settings are chosen for different databases in order to achieve best performance. Within the value range of these parameter settings, the largest value of that can be obtained is 34560. Therefore, we use a fixed value of for all the chosen databases and pad ‘0’ to those feature vectors with elements less than 34560. By applying the above approach to every minutia in the set of minutiae , a polar coordinate-based feature set is generated.

2.2. Delaunay Triangulation-Based Feature Set

As a complement to the polar coordinate-based feature set, we propose the second feature set, which is extracted from the Delaunay triangulation-based structure and takes advantage of the desirable features [4850] of Delaunay triangulation. In the presence of nonlinear distortion, Delaunay triangulation has a stable local neighborhood structure. Minutiae will keep the same neighboring structure if the nonlinear distortion does not move minutiae out of the tolerance region. Also, noise in fingerprint images influences the Delaunay triangulation only locally. Spurious or missing minutiae affect only those local Delaunay structures which contain them. A brief description about the generation of a Delaunay triangulation is given below; readers can refer to [51] for more details.

For the set of minutiae , a Voronoi tessellation, which divides the whole fingerprint image region into several smaller regions centering on each minutia, is created first as shown in Figure 3(a). All the points in the region around are closer to than to any other minutia. The Delaunay triangulation is generated by connecting the centers of every neighboring region as shown in Figure 3(b).

Assume that there are Delaunay triangles generated from minutiae; several invariant features can be defined from each Delaunay triangle. Taking triangle as an example, four rotation- and translation-invariant features are defined as follows:(i) is the orientation differences between and .(ii) is the length of edge .(iii) is the angle between edge and .(iv) is the concatenation of minutia type of , , and .

, , and . A feature data set, e.g., , can be extracted from each Delaunay triangle. To tolerate the variation caused by nonlinear distortion that is inherent in fingerprint images, quantization is applied to each feature set. The quantization step sizes are set to be , , and for , , and , respectively (, ). Each element of , after quantization, can be expressed as , , , and . If , , , and are represented by , , , and bits, respectively, then the triangle can be represented by a bit-string by putting them in sequence together. The integer value of this bit-string falls in the range of , where . With Delaunay triangles constructed from a given fingerprint image, each of them should match a value in and so the corresponding bin is indexed by 1, as shown in [34]. As a result, the Delaunay triangulation-based feature set can be represented by a binary vector of length .

We now explain why we add the second feature set. In the Delaunay triangulation-based feature set, each Delaunay triangle rather than each minutia is treated as a feature unit because a Delaunay triangle is more robust against nonlinear distortion than each individual minutia. For example, as mentioned in Section 1, minutiae , , and in the template image may not seem to match their corresponding minutiae , , and in the query image in a polar coordinate system since they fall in different cells due to nonlinear distortion, as shown in Figure 1. However, if we consider the triangle constituted by minutiae , , and as a unit, no matter where is moved, features such as the edge length and angle, extracted from it, still remain invariant. Thus, using these features, and can match each other.

3. Design of the Cancelable Fingerprint Authentication System

In this section, we present the two general stages included in the proposed cancelable fingerprint authentication system.

3.1. Enrollment Stage

In the enrollment stage, the original feature sets are transformed. The detail of the enrollment stage is explained below.

Given a template image (letter means template), the untransformed feature sets and as introduced in Section 2 can be extracted from the polar coordinate-based and Delaunay triangulation-based structures as shown in Figure 4. However, feature sets and are vulnerable. Because if the adversary acquires the vector , he/she would be able to know the minutiae’s location in the 3D cube. Subsequently the feature triplet of each minutia can be recovered. Similarly, it is not difficult to reveal the original minutiae information from . Hence, it is crucial to protect the feature sets and . To achieve this, the polar coordinate-based scheme and the Delaunay triangulation-based scheme are utilized to protect each element of and , respectively.

3.1.1. The Polar Coordinate-Based Scheme

In the polar coordinate-based scheme, the feature set is processed by a feature decorrelation algorithm and then varied by a random projection matrix generated under the participation of a user specific key . The loss of key means the loss of the projection matrix . The feature decorrelation algorithm can eliminate feature correlation in different applications, thus making it robust against the ARM attack.

For an element, e.g., , of the feature set , where is the number of minutiae in the template image , before performing random projection, it is first processed by the feature decorrelation algorithm as shown in Algorithm 1. This feature decorrelation algorithm is useful because only includes values of 1 and 0 and has a sparse distribution, which might restrict the search space if the random projection is directly implemented on it. Furthermore, if is applied in multiple applications, its feature correlation can be utilized by the adversary to launch the ARM attack. For the above reasons, Algorithm 1 is proposed to eliminate feature correlation in different applications.

INPUT: the original feature
Process:
Step 1: Each position of value 1 that is located in
the binary string , is input into a folding
function , where is an
application-specific parameter and set to be smaller
than . is a modulo operation, for example,
mod(7,5) = 2 and mod(12,5) = 2.
Step 2: With each value of 1 in relocated to a
new position depending on , a new binary string
of length is generated.
Step 3:   is converted into a real-value vector
through the Discrete Fourier Transform (DFT) as
.
OUTPUT: decorrelated feature

It follows from Steps 1 and 2 in Algorithm 1 that the new binary string , generated from the original feature , can be varied by simply adjusting the application-specific parameter in different applications. Note that even if one bit in is changed, the output vector will be totally different due to the nature of the Discrete Fourier Transform (DFT) in Step 3. Therefore, feature correlation does not exist in the real-value vector used in different applications.

The modulo operation in Step 1 is a many-to-one mapping and controlled by the parameter . Different values may lead to varying matching performance, which is discussed in Case 3 of Section 4.1. Under the assumption that the random projection based transformation is conquered, the modulo operation might not be strong enough to safeguard from the compromised if is set to be a large value. For example, when and is set to be 20000, for values of ‘1’ that appear in from positions 1 to 14560, e.g., 123, its original position in only has three possible locations, either 123 or 20123 or both. To increase security, we further propose Algorithm 2 as an enhancement of Algorithm 1.

INPUT: the original feature
Process:
Step 1:   is divided into two parts, and ,
where contains the first elements of ,
while contains the remaining elements.
Step 2: is evenly divided into segments, so
and each segment is of length .
Step 3: For each segment , calculate ,
where . is a weighted sum of
position and its binary value for segment . Then
all the elements in are circularly shifted left by
bits to generate a permutated segment .
Step 4: All the permutated segments are
concatenated into a new binary vector , which is
further concatenated with to create a binary vector
Step 5: Perform Steps 1 to 3 in Algorithm 1 with
input .
OUTPUT: decorrelated feature

Algorithm 2 produces a binary code based segment permutation before carrying out the steps in Algorithm 1. The segment permutation is guided by an on-the-fly binary code , which is calculated using the elements’ positions and values instead of a user specific key, so its security does not suffer from the lost key attack. Moreover, the binary code is calculated based on just binary elements in , which is only a portion of the original feature . This means that the error in one does not affect the permutation of other segments of . The parameter is the length of each segment, which impacts on the matching performance and security strength. A larger value of means that it would require more computational effort to invert to , but it would be more likely to cause errors in , thus having a negative effect on matching accuracy. The matching performance and security related to parameter are discussed in Case 3 of Section 4.1 and Section 4.4, respectively. We remark that Algorithm 2 offers an alternative pathway for tighter security at the expense of performance, as shown in Section 4.1 and that when , Algorithm 2 reverts to Algorithm 1.

The output vector of Algorithm 1 or 2 is then transformed by projecting onto a random space with the help of the random projection matrix of size , where . The transformation of using the random matrix can be compactly expressed bywhere is the inner product of and . The above transformation makes the dimension of transformed feature reduced to Y so that the feature set is protected. Different values of Y affect the performance of the proposed system, which is discussed in Section 4. Moreover, revocability is achieved—a new template can be issued easily by just changing the user specific key , which is analyzed in Section 4. By applying Algorithm 1 or 2, and random projection to each element of , can be transformed to be .

3.1.2. The Delaunay Triangulation-Based Scheme

In the Delaunay triangulation-based scheme, the feature set , which is a binary vector of length , is transformed by permutation guided by feature codes. Specifically, for each triangle, e.g., , a feature code is calculated first. Some stable local features can be used to obtain the feature code. These features are defined as follows:(i) is the orientation differences between and .(ii) is the length of edge .(iii) is the length of edge .(iv) is the angle between edges and .

, , and . Quantization is applied to each of these features to tolerate small variation. Accordingly, a quantized four-element array can be extracted from triangle which is further input into (2), based on [52], to generate the feature code , which corresponds to the triangle , as follows:where . is the output of function corresponding to inputs to and a random parameter . is set to be different in different applications, which enables to be diverse. In this way, one set of feature codes are computed as such from , where is the number of Delaunay triangles in the template image . After the feature code is obtained, the original bin to which is matched in , which is decided by its corresponding integer value , can be permutated under the guidance of the feature code . So the new bin becomes +. To increase the permutation randomness, an extra key guided permutation could be added on top of above permutation. By the same token, bins of other Delaunay triangles can be shuffled to new locations determined by their corresponding feature codes in . The vector array , after permutation, is transformed into a new version . Since each feature code in is generated from its corresponding Delaunay triangle feature and not saved in the database or on the smart card, it is hard to figure out the original bin location, even if the new bin position is acquired by the adversary from .

The reason why permutation is used to protect the feature set rather than random projection, as in the case of the feature set , is that permutation does not reduce feature dimension, which can help minimize the impact on the system’s matching performance. Moreover, feature code set is not saved in the database or on the smart card, which greatly improves the security of the original feature set . Most importantly, noise in fingerprint images influences the Delaunay triangulation only locally. Spurious or missing minutiae affect only those local Delaunay structures which contain them. Each Delaunay triangle is formed by only three minutiae, which means that if a feature code, e.g., , is incorrect, it only influences the permutated position of that Delaunay triangle, from which the feature code is generated. On the contrary, each polar coordinate-based structure is composed of a reference minutia and a number of other minutiae (usually more than ten minutiae) in a certain range. Obviously, the chance of generating an incorrect feature code from the polar coordinate-based structure is much higher than that from the Delaunay triangle, which only includes three minutiae. If we choose fewer minutiae from the polar coordinate-based structure, say, only three minutiae are chosen, which is the same number as that in the Delaunay triangle, then how to correctly determine the three minutiae from a query image and their corresponding minutiae from a template image is a tricky issue, especially under the presence of biometric uncertainty, e.g., spurious or missing minutiae. Therefore, compared with permutation, random projection is a better option for the protection of the feature set .

3.2. Verification Stage

In the verification stage, the same projection matrix used in the enrollment stage is used to transform the polar coordinate-based feature set extracted from the query image, and the Delaunay triangulation-based feature set can also be transformed under the guidance of the feature codes. Finally, matching between the template image and query image is conducted using the transformed features in the transformed domain. The detailed steps of the verification stage are explained below.

Given a query image (letter means query), untransformed query feature sets and together with a set of feature codes are extracted first, where , is number of minutiae, and is the number of Delaunay triangles in the query image .

In the polar coordinate-based scheme, suppose that denotes the transformed local feature of the query and denotes the transformed local feature of the template. Then the similarity score between them is calculated bywhere denotes the 2-norm. Each polar coordinate-based local feature in the query is compared with each polar coordinate-based local feature in the template to output a similarity score. Upon the completion of the comparison, there will be a score matrix of size . The maximum value in this score matrix is considered to be the matching score of the first feature set between the template and query images.

In the Delaunay triangulation-based scheme, the set of the query is permuted under the guidance of the feature codes to generate the transformed version . The permutation procedure is the same as that in the enrollment stage. The similarity score between and can be calculated as where represents the mean value.

The final score between the template image and query image is calculated using both and from the polar coordinate-based scheme and the Delaunay triangulation-based scheme, respectively, that is, where is a score normalization function; is the weight of score , which is set to be 0.7 in our application so as to give more relevance to the use of our basic scheme, the polar coordinate-based scheme. If the final matching score is larger than a predefined threshold , then the template image and query image are considered to be matching.

4. Experimental Results and Security Analysis

Three databases (DB1, DB2, and DB3) of FVC2002 and one database (DB2) of FVC2004 were used to evaluate the proposed fingerprint cancelable template design. Detailed information about these four databases is listed in Table 1. To extract minutiae from fingerprint images, a software package called VeriFinger 4.0 from Neurotechnology [53] was utilized.

The performance of the proposed fingerprint cancelable template is evaluated by four performance indices, namely, genuine acceptance rate (GAR), false acceptance rate (FAR), false rejection rate (FRR), and equal error rate (EER). GAR is defined as the ratio of successful genuine tests to the total number of genuine tests. FAR is defined as the ratio of successful imposter tests to the total number of imposter tests, FRR (=1-GAR) is defined as the ratio of failed genuine tests to the total number of genuine tests, and EER is defined as the error rate when FRR and FAR are the same. For all the databases, the 1VS1 matching protocol and standard FVC matching protocol [42] were utilized in the experiments. In the 1VS1 matching protocol, the first two images of each finger were chosen for testing, while, in the FVC matching protocol, all eight images of each finger were used.

4.1. Performance of the Proposed System under the Lost Key Attack

The lost key attack was tested in the experiments by allocating the same key to all the genuine and imposter tests. Three different cases were evaluated as follows.

Case 1 (performance of the system under two different instances). The performance of the system under three different instances is compared: (i)Instance 1: using only the basic polar coordinate-based scheme(ii)Instance 2: using only the Delaunay triangulation-based scheme(iii)Instance 3: combining the polar coordinate-based scheme and Delaunay triangulation-based scheme

The polar coordinate-based scheme in both instances 1 and 3 uses Algorithm 1 and the same parameter settings . The comparison between Instances 1, 2, and 3 is conducted over the database FVC2002 DB2 using the 1VS1 matching protocol. ROC curves are drawn in Figure 5. It can be observed from Figure 5 that Instance 3 (EER = 0.64%) performs better than Instance 1 (EER = 1.00%) and Instance 2 (EER = 2.00%), which only uses the single structure under the same parameter setting ().

Case 2 (performance of the proposed system using the untransformed feature sets and the transformed feature sets). Performance of the proposed system that uses the untransformed feature sets and the transformed feature sets was tested to evaluate the effect of feature transformation using Algorithm 1 and parameter settings . This test was conducted on two databases, FVC2002 DB2 and DB3 with the 1VS1 matching protocol. For database DB2, we obtained EER = 0.62% using the untransformed feature sets and EER = 0.64% using the transformed feature sets. For database DB3, EER = 4% was obtained using the untransformed feature sets, while EER = 4.57% was obtained with the transformed feature sets. In this case, matching performance becomes worse than that before feature transformation.

Case 3 (effect of different parameter settings on the performance of the proposed system). The different parameter settings of and in Algorithms 1 and 2 result in a trade-off between security and matching performance [54]. Here, we investigate the effect of different parameter settings of and on the system’s matching performance over databases FVC2002 DB2 and DB3 using the 1VS1 matching protocol. As shown in Table 2, choosing a smaller value of in Algorithm 1 makes matching performance worse. This is because the modulo operation in Algorithm 1 is a many-to-one mapping and smaller increases the possibility of multiple ‘1’s folded to the same position, leading to a decrease in feature discriminative ability. Algorithm 2 is proposed to increase the difficulty of obtaining the original feature vector. Under the same setting of , we can see from Table 2 that a larger value of reduces matching accuracy, for example, on database FVC2002DB2, EER is 3.37% when , as opposed to EER is 1.00% when . Both EERs are worse than EER = 0.64% when (which is equivalent to just using Algorithm 1) because the feature-dependent segment permutation is controlled by the binary code and larger , which represents a longer portion of the original feature vector, is more likely to make incorrect.

From the above analysis, we can see that smaller or larger can decrease matching performance. However, such a parameter setting makes the retrieval of the original feature vector harder under the assumption that random projection based transformation is conquered. Detailed security analysis is discussed in Section 4.4.

We also evaluated the performance of the proposed system using different values of Y over databases FVC2002 DB2 and DB3 with the 1VS1 matching protocol under the parameter setting in Algorithm 1. Security of the first feature set is provided by the random projection matrix which is of size , (). The feature vector , extracted from the polar coordinate-base d structure, is transformed using Algorithm 1 and (1). The smaller the value of Y is set, the more the dimensions are reduced. A lower-dimensional transformed feature vector is more secure as less information of the original feature vector is kept. In the tests, we set the value of Y to be 300 and 50, respectively. The EER performance is listed in Table 3. It can be observed that the smaller the value of Y, the higher the EER. This is because less information about the original features is preserved with more dimension cut (smaller Y), leading to performance degradation. Moreover, under the same parameter setting (), the proposed system demonstrates better performance on database FVC2002 DB2 with EER = 0.64% than the performance on FVC2002 DB3 with EER = 4.57%. The main reason for this is the vast difference of image quality between these two databases. The first two images from database FVC2002 DB2 have much better quality than the first two images of FVC2002 DB3 [10, 11].

4.2. Performance Comparison with the Similar Work

In this section, we compare the performance of the proposed system with other similar systems. The EER comparison under the lost key scenario is reported in Table 4. It can be observed that the proposed method outperforms all the existing similar methods under the 1VS1 matching protocol. Under the FVC matching protocol, the performance of proposed method is worse than that of original 2P-MCC64,64; however, 2P-MCC suffers from the ARM, as analyzed in Section 1.1. By contrast, the proposed scheme is resilient to this attack. In [8] transformation parameters are derived directly from biometrics via quantization, causing them rarely to be identical for the same user when large minutia variance exists in fingerprint images, which leads to poor performance. The structures in [911] are constructed by each minutia with all other minutiae in the image, resulting in more computational complexity. By contrast, in the proposed method structures are only formed by each minutia and its local neighbors in the range of . More importantly, the methods [7, 914] are vulnerable to the ARM attack [47].

4.3. Revocability and Unlinkability

Revocability is an essential property that a qualified cancelable template design should possess. Once a template is compromised, another transformed template should be generated and the newly generated template should be totally different from the compromised template and suffer no performance degradation. To measure revocability of the proposed system, we generated 50 different templates from the image of each finger by 50 different user specific keys. Then the transformed templates were matched against the original ones. The imposter distribution (with different keys) and pseudo-imposter distribution on database FVC2002 DB2 are shown in Figure 6. It can be seen that the imposter distribution and pseudo-imposter distribution are similar, which means that even if multiple templates are generated from the same image, they are distinct from the original template and there is no performance degradation. The mean and standard derivations of the imposter distribution are 0.2457 and 0.0408, respectively, while the values of these two indicators of the pseudo-imposter distribution are 0.2419 and 0.0042, respectively.

To protect users’ privacy, unlinkability is another essential property of cancelable fingerprint templates. It requires that the transformed templates generated from the same finger using different keys should be different to one another, as if they were transformed from different fingers. In this way, templates from the same individual used in different applications cannot be cross-matched. To verify the unlinkability property of the proposed method, two indicators [55], the false cross-match rate (FCMR) and the false non-cross-match rate (FNCMR), are used in two cases over database FVC2002 DB2. Case 1: FNCMR is the ratio of unsuccessful matching attempts between the transformed templates of the first and second images of each finger. Feature transformation is based on different keys. Case 2: FCMR is the ratio of successful matching attempts between the transformed templates of the first image from each finger and the first image of different fingers. The FCMR and FNCMR curves are shown in Figure 7, from which it is clear that , which conforms to the expected behavior of a cross-comparator [55].

4.4. Security Analysis

Given a template image , we obtain feature sets and from the polar coordinate-based and Delaunay triangulation-based structures. The polar coordinate-based feature set is protected by algorithms included in the polar coordinate-based scheme. Specifically, each element in first goes through the proposed feature decorrection algorithm, Algorithm 1 or 2, which transforms it into another format , making the transformed feature sets uncorrelated in different applications, governed by application-specific parameters, and . Since the DFT is invertible, is further protected by random projection through the projection matrix , as shown in (1), which is essentially a many-to-one mapping. is of size and is set to be smaller than X. Hence, the transformed feature vector has a reduced dimension compared with . The adversary can only launch the ARM attack by obtaining multiple transformation matrices and transformed feature sets from the same original feature set. Thanks to the feature decorrelation algorithm, Algorithm 1 or 2, the ARM attack can be defended by the proposed method, because the transformed feature set is uncorrelated in different applications, which is achieved by varying the application-specific parameters, and .

Here we give an example to demonstrate how the proposed method defends the ARM. The whole transformation process is divided into two parts: A and B, as shown in Figure 8. Part A represents the procedure of Algorithm 1 and Part B represents the random projection based transformation. In Part A, with the original feature vector of length , which only contains values of 0 and 1, under the ARM scenario, we assume that the same is used in two applications. Without loss of generality, is chosen for Application 1 and for Application 2, where . The folding function in Algorithm 1 is expected to change the input bit stream, yielding two different binary strings for Applications 1 and 2, respectively. The purpose of the DFT (Step 3 in Algorithm 1) is to convert the two new (folded) binary strings into different real-value vectors, i.e., for Application 1 and for Application 2. In Part B, the feature vectors and are, respectively, transformed into and by random projection. Due to the absence of correlation between and , even if the adversary acquires the transformed feature vectors and as well as the projection matrices and , he/she cannot launch the ARM to obtain either or , because it would be impossible for the ARM to find sufficient number of relevant systems equations matching the number of independent unknown variables.

It is worth noting that the outputs, and , of Part A are the inputs to Part B as shown in the overlapped area of Part A and Part B in Figure 8. and are intermediate products, which are not stored in the database or on the smart card, because only the resultant feature vectors and are needed for similarity score calculation. Therefore, the adversary does not know and unless he/she can recover them. One may argue that if the random projection based transformation (or Part B) is conquered, then can be retrieved. We now show why this is computationally infeasible. The random projection based transformation in (1) effectively constitutes an underdetermined system of linear equations. Since the projection matrix is of size with Y being smaller than X, rank() is no greater than , which is less than the number of unknowns, namely, elements of . It is a well-known result in linear algebra [56] that when the coefficient and augmented matrices of (1) have the same rank, (1) has an infinite number of solutions. Clearly, is just one solution among so many solutions, making the search for tremendously hard, especially with having a relatively flat spectrum due to the DFT.

Let us analyze the security provided by Algorithms 1 and 2, under the assumption that is obtained by adversary. The DFT in Algorithm 1 is invertible and not meant to protect . The DFT is applied for the purpose of rendering a dense data representation of the frequency samples of so that the search space for cannot be narrowed down [57]. The modulo operation in Algorithm 1 can provide certain protection to the original feature vector depending on parameter settings. With an element of value ‘1’ found at position in , where , the number of positions in that could yield the value of ‘1’ at position in is . The number of possibilities that could result in the value of ‘1’ at position in is in theory, where means choosing from positions, but, in reality, it is unlikely that a majority of these positions in contain ‘1’. Without loss of generality, we assume that at most two of these positions are ‘1’ for the rest of our discussions unless stated otherwise; then the computational complexity is bits in order to search all the possible combinations by brute force attack. Suppose that there are   ‘1’s in ; then the computational complexity for brute force search will be . When , under the parameter settings and , , the computational complexity is 330 bits. However, if is set to be a large value, e.g., , for elements of value ‘1’ found in from positions 1 to 14560, say for value ‘1’ found at position 123, its original position in only has three possibilities, either position 123 or position 20123 or both. In this case, the number of possibilities provided by the modulo operation is considerably reduced. To strengthen the security of Algorithm 1 for the case that is set to be a large value, e.g., 20000, we propose Algorithm 2 on top of Algorithm 1.

In Algorithm 2, the first part of original feature vector is first divided into segments and each segment, e.g., , is permutated by a segment-based binary code, e.g., . Take the segment of length as an example. For convenience, trivial segments containing full ‘0’s or full ‘1’s are excluded in the following discussion as permutation does not change the segment bit distribution. For a permuted segment , there exists number of different segments originating from through the circular shift permutation. One-bit difference in one segment implies a different entire feature vector which contains segments. The computational effort to determine all possible due to the circular shift permutation will be under the brute force attack. Assume there are nontrivial segments that contain both zero and nonzero elements, then the actual computational complexity reduces to be under the brute force attack. Take, for example, and , which has been observed in many of our experiments. In this case, the number of security bits imposed by Algorithm 2 is = 120 bits, which is a substantial amount. Clearly, the added security from Algorithm 2 is at the expense of matching performance, as shown in Section 4.1, which reflects the fact that there is always a compromise between security and recognition accuracy. Note that, for a given permuted segment, it is hard to tell whether it is produced by an actual permutation or it has not been permuted at all. Therefore instead of can be used to account for all possible shift combinations which can help increase the security strength.

Regarding the security of the second feature set , since the feature is obtained from under the guidance of feature codes , which are generated from their corresponding triangle feature and not saved in the database or on the smart card, the adversary has no idea about the original location of a triangle feature, even if is hacked. Under this situation, the adversary may try to guess the feature code through the brute force attack. The number of security bits for calculating one feature code, e.g., , which is given by through (2), can be expressed by because the minutiae orientation range is , the edge length is , and the range of an angle is . In the experiments, the quantization steps for calculating the feature code were set to be , = 25 pixels and = on database FVC2002 DB2. With these quantization settings, the number of security bits is about 14 bits for one feature code. However, this feature code can only be used to find the original location of one triangle in and the adversary cannot verify the correction of the original position. To exactly restore , the original locations of triangles have to be determined at the same time, which means that the security of is bits. The average value of is 32 on database FVC2002 DB2. Since a certain degree of error tolerance is allowed in the matching process, the real security is lower than bits, depending on the matching score threshold . Note that, even if is revealed, the original location of minutiae in the fingerprint image remains unavailable, because only contains the relative information of the three minutiae forming the Delaunay triangle rather than the absolute coordinates on the image. This means that compromising would not threaten the security of .

Some other issues are discussed here: (1) In our application, polar coordinate-based local structures are employed. Assume that and are two neighbor minutiae and are considered as the origin of the polar coordinate of two polar coordinate-based local structures and . Since and are neighbor minutiae, some of the minutiae points included in local structure are also included in . The feature vectors, e.g., and , extracted from local structures, and , respectively, may have some correlation; however, they are different. Here ‘correlation’ indicates the existence of some common minutiae in different polar coordinate-based structures, but it does not mean that the features extracted from different polar coordinate-based structures are the same, because the features extracted from different polar coordinate-based structures are based on different reference minutiae and the information of the reference minutiae is nonpublic. For instance, according to Section 2.1, the positions of ‘1’ in and are decided by the triplet values and . The radial distance is equal to , but and are different from and , respectively. A moderate variation on the orientation or/and its related line angle can change the values of or . So even if minutiae and are neighbor minutiae, the triplet values are different from , which leads to different feature vectors and . As the radial distance is equal to , so and may have some correlation that can be utilized by the adversary. However, any correlation based attack needs at least a compromised feature vector, e.g., or , as a base, which unfortunately is infeasible because or has been decorrelated and protected by the folding, permutation, and random projection. (2) Assume the number of ‘1’s, in feature vector , is . According to our statistics on the databases, is 34, which is much smaller than the length (e.g., ) of the feature vector . If these ‘1’s are uniformly distributed on , the computation complexity is about bits in order to correctly recover all the 34 ‘1’s in by brute force attack. One may claim that these binary ‘1’s are not uniformly distributed. We give an example here. Assume that all the 34 ‘1’s are only located in the first 1000 bins out of a total of 34560 bins of the feature vector . Although this is a highly unlikely case, computational complexity for this example is about bits under the brute force attack, which demonstrates that the proposed method is still secure enough. Actually, it is nearly impossible if one can generate a binary string that is absolutely uniform. Our statistics of the probability distribution of bit ‘1’ over unfolded raw features, e.g., , shows that the features are with a good degree of uniform distribution. For example, the maximum probability of being ‘1’ of a bin is , which shows that dominant probability does not exist. Moreover, even after removing bins with probability being ‘1’ that are equal to or less than 20% of the maximum probability, the remaining bins are still more than a thousand. The estimated figures, e.g., 330, 120, and 384 bits, in our security analysis, may be not tight. However, these could be the best analytical results one can get unless an accurate distribution model can be developed, which is far beyond the scope of this paper as such distribution model can be a research topic by its own. (3) There is a type of statistical attack, named Moore-Penrose inverse [58], to find possible solutions of the linear system (random projection is a linear system). If the linear system has any solutions, they are given by , where is the Moore-Penrose inverse of . It is well known that a Moore-Penrose inverse based solution has the minimum Euclidean norm . There is no theoretic basis for any deterministic relationship between the Moore-Penrose inverse based solution and the ground truth , if is random. Some probabilistic relationship might exist for some cases, for example, based on the values of the Moore-Penrose inverse based solution , one may estimate the position of ‘1’s in binary vector in by general statistical relation that large values in corresponding to ‘1’s in . However, for a specific application, the defender can select suitable random projection matrix, e.g., , so that this statistical relation can be lessened. We have experimentally verified that a solution that is based on the Moore-Penrose inverse incorporating the statistical relation can find only one correct position of ‘1’ in .

4.5. Suitability/Feasibility on Mobile Devices

In this section, the template size and computational complexity of the proposed system are discussed. In our application, the template consists of two parts. One is the transformed polar coordinate-based feature set and the other one is the transformed Delaunay triangulation-based feature set . The size of each element in depends on the parameter setting Y of the transformation matrix. If Y is set to be 300, the size of is 4.7 KB, and then the size of is KB. The feature set is of size bits . In terms of matching time, it takes about 0.000171 seconds to match an element from against an element from . Assume that there are elements in template feature set and elements in query feature set , then the matching time between and is seconds. The matching time between and is about 0.00872 seconds.

Take the fingerprint image 1_1.tif and image 1_2.tif in FVC2002 DB2 for example. Assume that image 1_1.tif is template image and 1_2.tif is the query image. The value of is 31 in the template, so the template size is 148.2 (=) KB. The value of is 33 in the query, so the matching time between the template and query is 0.18263 (=) seconds. The above experiment is conducted using MATLAB on a laptop with Intel processor: i5-2450M dual-core CPU of 2.50 GHz, 2.50 GHz, RAM of 8GB, and Operation System of 64-bit Win 7. With the powerful storage and computing capability of today's mobile devices, e.g., smartphones, the imposed storage size and computational load are never an issue.

5. Conclusion

In this paper, a new fingerprint cancelable template system has been proposed for mobile device authentication. The new cancelable template can mitigate the negative impact of nonlinear distortion by combining multiple feature sets. Since the proposed system with two feature sets contains more feature information than most existing cancelable templates with just a single feature set, the discriminative power of the proposed system gets increased and thus recognition performance is enhanced. Furthermore, the proposed method can defeat the ARM attack through eliminating the feature correlation in different applications, which is a clear advantage over those existing cancelable templates that are vulnerable to the ARM attack. In the future work, the proposed scheme will be applied to other biometrics such as cancelable palmprint [59].

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Acknowledgments

This paper is supported by the Natural Science Foundation of China through Projects 61672083, 61370190, 61532021, 61472429, and 61402029 and by Defence Science and Technology Group (DST) of Australia through Project CERA 221.