Abstract

One of the key challenges in the area of signal processing on graphs is to design transforms and dictionary methods to identify and exploit structure in signals on weighted graphs. In this paper, we first generalize graph Fourier transform (GFT) to spectral graph fractional Fourier transform (SGFRFT), which is then used to define a novel transform named spectral graph fractional wavelet transform (SGFRWT), which is a generalized and extended version of spectral graph wavelet transform (SGWT). A fast algorithm for SGFRWT is also derived and implemented based on Fourier series approximation. Some potential applications of SGFRWT are also presented.

1. Introduction

In traditional signal processing, the most commonly used tools are transforms, among which Fourier transform and wavelet transform play key roles [1]. Fourier transform and wavelet transform were generalized and extended in many contexts in recent years.

On the one hand, Fourier transform and wavelet transform were extended to fractional domain, obtaining fractional Fourier transform (FRFT) [223] and fractional wavelet transform (FRWT) [2430]. From a mathematical viewpoint, the FRFT can be seen as a parametric Fourier transform [7], whose fractional order θ is a free parameter which offers more flexibility and generalization properties compared to the classical Fourier transform. From a signal processing point of view, the FRFT can be interpreted as a rotation in the time-frequency plane, i.e., the unified time-frequency transform. With the order θ increasing from 0 to 1, the FRFT can show the characteristics of the signal changing from the time domain to the frequency domain [8]. There are roughly three main research directions for investigating FRFT: firstly, the application of FRFT to deal with many signal processing problems [9], for example, filtering, compression, image encryption, digital watermarking, pattern recognition, edge detection, antennas, radar and sonar, and communication; secondly, the discretization algorithms of the FRFT [1017]; and lastly, the extension of the fractional idea of FRFT to other transforms, for example, fractional cosine, sine and Hartley transforms [1821], fractional Krawtchouk transform [22], and short-time FRFT [23]. By cascading of the FRFT and the ordinary wavelet transform, Mendlovic et al. [24] first proposed FRWT. Recently, by introducing a new structure of the fractional convolution [25] associated with the FRFT, Shi et al. [26] proposed a simplified definition of the FRWT which analyzes the signal in time-frequency-FRFD domain, where FRFD denotes fractional Fourier domain. The definition of FRWT in [26] was further improved by Prasad et al. [27], who analyzed the signal only in time-FRFD domain. More recently, Dai et al. [28] presented a new FRWT which displays the time and FRFD-frequency information jointly in the time-FRFD-frequency plane. Srivastava et al. [29] proposed a certain family of fractional wavelet transformations and Shi et al. [30] proposed a novel fractional wavelet packet transform. In general, compared to traditional transforms, fractional transforms may lead to (1) better signal filtering results due to the rotation of the time–frequency plane [9]; (2) better image compression ratios [31], image recognition results [32], and image segmentation results [33] due to the flexible choices of the fractional orders; and (3) better image encryption [34] and image watermarking [35] performances since the fractional order can be used as an additional secret key.

On the other hand, Fourier transform and wavelet transform were extended to graph domain, obtaining graph Fourier transform (GFT) [3643] and graph wavelet transform (GWT) [4449] to handle signal defined on the vertices of weighted graphs. Two basic approaches to signal processing on graphs have been considered: the first is rooted in the spectral graph theory [50] and builds upon the graph Laplacian matrix [36]. Although spectral decomposition of generalized graph Laplacian with negative edges has been considered for the construction of graph frequency domain [51, 52], in this paper, we limit this approach to undirected graphs with real and nonnegative edge weights for the sake of clarity. The second approach, discrete signal processing on graphs [40, 41], is rooted in the algebraic signal processing theory [53, 54] and builds on the graph shift operator. The latter works as the elementary operator that generates all linear shift-invariant filters for signals with a given structure. In particular, the graph Fourier transform in this framework expands a graph signal into a basis of eigenvectors of the adjacency matrix, and the corresponding spectrum is then given by the associated eigenvalues. Besides graph-based transforms [3649], recent research works on graph also include, among others, sampling and interpolation on graphs [55, 56], graph signal recovery [5760], semisupervised classification on graphs [61, 62], graph dictionary learning [63, 64], and graph convolutional neural networks [6572]. Please refer to [73] for more references on graph signal processing.

To the authors’ knowledge, there are only few research works done for discrete signal processing on fractional graph domain, which is a combination of fractional transform domain and graph transform domain. Wang et al. [74, 75] proposed a definition of fractional Fourier transform on graphs named graph fractional Fourier transform (GFRFT), which is based on algebraic signal processing theory [53, 54]. However, similar to Fourier transform, GFRFT is a global transform and does not provide useful localization properties in fractional graph domain. Furthermore, GFRFT does not allow multiresolution analysis of graph signals. In this paper, based on spectral graph theory, we first propose a new spectral graph fractional Fourier transform (SGFRFT), which is then used to define a new spectral graph fractional wavelet transform (SGFRWT), which can be seen as an extended version of spectral graph wavelet transform (SGWT) [44].

The remainder of the paper is organized as follows. Section 2 recalls the main foundations of fractional wavelet transform, spectral graph theory, and spectral graph wavelet transform (SGWT). The spectral graph fractional Fourier transform (SGFRFT) and the spectral graph fractional wavelet transform (SGFRWT) are defined in Section 3. Section 4 is dedicated to the Fourier series approximation-based fast algorithm for forward and inverse SGFRWT. Several applications of SGFRWT for addressing different problems are shown in Section 5. Section 6 concludes the paper.

2. Preliminary

In the following, we first recall the forward and inverse fractional Fourier transform (FRFT) [25] and then the forward and inverse fractional wavelet transform (FRWT) [24, 2628] and also show how scaling operator may be expressed in the FRFT domain. Table 1 summarizes the definitions of various transforms for classical signal processing and also those for graph signal processing.

2.1. Fractional Fourier Transform (FRFT)

The θ-order forward FRFT [25] is the decomposition of a function f according to the fractional Fourier kernel, i.e.,wherewhere θ indicates the rotation angle of the transform and represents the Dirac distribution.

The inverse FRFT is given by [25]

The kernel in equation (2) can be obtained by its spectral expansion [5]:

in equation (5) is the kth-order normalized Hermite function [11, 12], which is the eigenfunction of the Fourier transform. From (5), we can see that FRFT is a generalized version of the Fourier transform, since they share the same eigenfunction and the eigenvalues of the FRFT are the θth root of the eigenvalues of the Fourier transform [2]. When the rotation angle θ = π/2, the FRFT defaults to Fourier transform.

The discrete form of equation (5) can be expressed as follows [11, 12]:where diag(.) denotes a diagonal matrix formed from its vector argument. denotes the θ-order forward FRFT matrix of size . , vk is the kth-order DFT Hermite eigenvector. Note that becomes identity matrix when θ = 0 and Fourier matrix when θ = π/2.

2.2. Fractional Wavelet Transform (FRWT)

There are several definitions for fractional wavelet transform (FRWT) [24, 2628]. In this part, we choose the definition in [28] since it displays the signal in the joint time-FRFD-frequency plane.

The θ-order forward FRWT is defined as [28]wherewith .

The unitarity property of the FRFT applied to (7) leads to an equivalent form of FRWT:where stands for the θ-order FRFT of f. Here also, as for the classical CWT [1], the scaling is operated in the transformed domain “u” and not directly in the time domain.

The inverse of the FRWT is given by [28]when the following admissibility condition is satisfied [28]:

2.3. Spectral Graph Theory and Spectral Graph Wavelet Transform

In this section, we briefly review the spectral graph theory and also the spectral graph wavelet transform [44].

2.3.1. Spectral Graph Theory

We consider an undirected, connected, weighted graph , composed of a finite set of vertices (with ), a set of edges , and an adjacency symmetric and positive-valued matrix W. A real-valued signal f defined on the vertices of the graph is an vector where each entry is the value assigned to the vertex n. The (nonnormalized) graph Laplacian operator is the matrix defined by , where D is the diagonal degree matrix defined by . dm,m is the degree of vertex m.

As the matrix L is real and symmetric, then it is diagonalizable and its eigenvectors , sorted according to the ascending order of the corresponding eigenvalues , form an orthonormal basis. Therefore, with the unitary column matrix:and the diagonal matrix , we havewhere the superscript H denotes the Hermitian transpose operation. The graph Fourier transform (GFT) of the signal f is then the sequence provided by the scalar products [36]:and f can be recovered (inverse GFT) by means of the reconstruction formula:

Using vectors and matrices notations, equations (14) and (15) become

The graph Fourier transform obeys the Parseval theorem; that is, for any signals f and h defined on the graph , we have

2.3.2. Spectral Graph Wavelet Transform (SGWT)

The spectral graph wavelet transform (SGWT) of the signal f with the kernel is defined by [44]whereand the kernel is continuous positive-valued function defined on satisfying

Using equation (14), the SGWT becomeswith

The signal f can be recovered up to its mean value using the inverse formula [44]:

3. Spectral Graph Fractional Transforms

We first propose a novel spectral graph fractional Fourier transform (SGFRFT) in Section 3.1 and then give the definition of spectral graph fractional wavelet transform (SGFRWT) in Section 3.2.

3.1. Spectral Graph Fractional Fourier Transform

Similar to (13), we define the graph fractional Laplacian operator as follows:where 0 < θ ≤ 1 and γ is the power matrix given bywith being the unitary (i.e., inverse graph Fourier transform matrix) shown in (12). Note that as is unitary, then is also unitary. Andthat is,

Considering , and from (27), we can easily get . The reason why we choose as the definition of graph fractional Laplacian operator will be discussed in Section 3.2.

The forward spectral graph fractional Fourier transform (SGFRFT) of any signal f defined on the vertices of the graph is defined byor by the following matrix form:

For θ = 1, the proposed SGFRFT in (29) is no more than the GFT in (18) defined by Shuman et al. [36]. For other values of θ, we can compute by using Schur–Padé algorithm [76, 77] which is found to be superior in accuracy and stability to several alternatives, including eigendecomposition method and also the approaches based on the formula .

The inverse SGFRFT is given byor by the following matrix form:

The graph fractional Fourier transform obeys the Parseval theorem; that is, for any signals f and h defined on the graph , we have

Note that the forward and inverse SGFRFTs defined here are different from the forward and inverse graph fractional Fourier transforms (GFRFTs) [74, 75], which are, respectively, given bywhereand is obtained by the Jordan decomposition of the adjacency matrix W as follows [74, 75]:where is a matrix composed of eigenvector and JW is a block-diagonal matrix. For more details, please refer to Appendix A of [74].

From (13), (29), (31), and (33)–(36), we can see that the differences are that the proposed SGFRFTs are based on the decomposition of the graph Laplacian matrix L, while the GFRFTs are based on the decomposition of the adjacency matrix W.

3.2. Spectral Graph Fractional Wavelet Transform (SGFRWT)

Similar to (21), we can define the spectral graph fractional wavelet transform (SGFRWT) operator as follows:where

Why we choose in (24) as the definition of graph fractional Laplacian operator and choose in (39) as the basis of SGFRWT? There are two reasons. (1) We want to define the basis of SGFRWT based on a new function instead of the same function as basis of SGWT in (23). The SGFRWT and SGWT will have different characteristics since they are based on different functions and , respectively. (2) We want to define the basis of  SGFRWT which changes directly with . Since the parameter acts on the eigenvalues of the Fourier matrix in the form of exponents in the spectral expansion of FRFT in (5), similarly, we also want to obtain the form that the parameter can directly affect in the proposed definition of SGFRWT. Two other possible definitions of SGFRWT are also discussed in Appendices A and B, respectively.

From (37), we can see that the proposed SGFRWT is a generalization of SGWT in (21) defined by Hammond et al. [44]; that is, when θ = 1, the proposed SGFRWT defaults to the SGWT. The proposed SGFRWT can be seen as a parametric generalization of SGWT [7]. Furthermore, the proposed SGFRWT converts a graph signal into an intermediate domain between vertex and spectral graph wavelet. With the order θ increasing from 0 to 1, the SGFRWT can provide much more domain choices of expressions for a graph signal.

The scaling functions are then obtained byand the coefficients by

Stable recovery of the original signal f from the SGFRWT coefficients will be assured if the quantity , where sj denotes jth scale, is bounded away from zero on the fractional Laplacian . Moreover, under the same conditions, Lemmas 5.1, 5.2, 5.3, and 5.4 and Theorems 5.5 and 5.6 in [44] are verified by the SGFRWT, whose properties are shown in Appendix C.

4. Fourier Series Approximation and Fast SGFRWT

If we compute the SGFRWT by using equation (37) directly, we have to compute the entire set of eigenvectors and eigenvalues of shown in (24). Both the computational complexity and the memory consumption become unacceptable when the data is larger than hundreds of thousands or millions of dimensions. Therefore, in this section, we propose a fast algorithm for computing the SGFRWT based on approximating the scaled generating kernels by low-order Fourier series. It should be noted that the proposed fast SGFRWT algorithm is an extension of the fast SGWT algorithm from real domain to complex domain. The proposed fast SGFRWT algorithm is different from the fast graph Fourier transform proposed by Magoarou et al. [39], who use the modified Jacobi eigenvalue algorithm for approximating the graph Laplacian matrix. Note that the proposed SGFRWT algorithm can approximate an arbitrary matrix while the algorithm in [39] is only practicable for approximating symmetric matrix.

The following lemma shows that the polynomial approximation of may be taken over a finite range containing the spectrum of .

Lemma 1. Let rmax ≥ rN−1 be any upper bound on the fractional spectrum of . For fixed s> 0, let be a polynomial approximant of with L error . Then the approximate SGFRWT coefficients satisfy .

The proof of Lemma 1 is similar to Lemma 6.1 in [44].

In [44], Hammond et al. approximated shown in (19) by truncated Chebyshev expansions. The reason is that is real as the original shown in (13) is a real matrix. However, the Chebyshev polynomial approximation method is not suitable for our problem of approximating shown in (38), because may be complex as shown in (33) may be a complex matrix. Therefore, in this part, we will use the truncated Fourier series expansion instead of Chebyshev expansions to approximate shown in (38).

The Fourier series of a general function is given by [78]where

We now assume a fixed set of wavelet scales sn. For each n, approximating for can be done by scaling the domain of , , using the transformation x = yrmax/(2π). Then, can be written bywith

Note that

The above equation is due to the fact that

For each scale sj, can be approximated by the truncated Fourier expansion (42) to terms; that is,

We may choose an appropriate Mj in practice to achieve a balance between accuracy and computational complexity. We can also use exactly the same scheme to approximate the scaling function kernel hθ shown in (40) by p0. Then, the approximated SGFRWT wavelet and scaling function coefficients are, respectively, given bywhere

Note that (52) is easily obtained by applying the function Fk(.) in (46) to the matrix Lθ, and we have

The efficiency of this algorithm depends on the following recursive formula:

As the signal , taking equation (53) into consideration, equation (54) can only be computed for , and the other can be obtained by conjugate operation.

The computational complexity of the proposed fast SGFRWT algorithm is analyzed as follows:(1)The computation of from in (25) by using Schur–Padé algorithm [76, 77] needs O(N3) operations.(2)The computation cost of each in (54) is O(6|E|), where |E| is the number of nonzero edges in the graph, and 6|E| because is a complex matrix and a complex multiplication requires 4 real multiplications and 2 real additions (6 operations). Therefore, the computation of all needs O(6Mj|E|) operations.(3)It requires O(6N) operations at scale j for each k ≤ Mj in (50) and (51). Therefore, the computation of (50) and (51) needs operations.

Therefore, the total computational complexity of the fast SGFRWT algorithm is .

Implementation of (54) requires memory of size 2N. The total memory requirement for the proposed fast SGFRWT is .

4.1. Fast Computation of Adjoint

Given a fixed set of wavelet scales , and including the scaling functions , the SGFRWT can be seen as a linear map , where denotes the complex domain. Let denote the approximated fractional wavelet transform by using the fast SGFRWT algorithm. In this section, we show that both the adjoint and the composition can be computed efficiently by using the Fourier series approximation.

For any , we consider η as the concatenation with each for 0 ≤ j ≤ J; we have

Considering , we have

Similar to (50) and (51), the adjoint operator in (56) can be approximated by

In the following, we derive a fast algorithm for the computation of , which is an approximation of the composition . In general, when computing , we first apply and then apply by the fast SGFRWT, which needs 2J Fourier series expansions. However, the computational complexity of can further be reduced. Note that

In the following, we derive a fast algorithm for the computation of .

Lemma 2. Set , which has degree . Then, can be computed bywhere and .

Proof. Note that is used in the above derivation.
Therefore,The proof of Lemma 2 is now completed.
Combining (58) and (59), we haveFrom (62), we can see that can be computed by a single Fourier series expansion with twice the length, which reduces the computational cost by a factor J comparing to the direct computation of .

4.2. Reconstruction by Using the Fast Adjoint Algorithm

In this part, we show the reconstruction of proposed SGFRWT by using the fast adjoint algorithm in Section 4.1.

The SGFRWT is an overcomplete transform since it maps an input signal f of size N to the N(J + 1) factional wavelets coefficients

The pseudoinverse of the above equation can be obtained by solving the following square matrix equation:

Equation (64) can be solved by the conjugate gradients method [79], whose computation in each step is dominated by applying to a single vector. What is more, can be fast approximated by shown in (62). Therefore, we can reconstruct input signal f from c.

5. Application Examples

Similar to [44], in this section, we will show several application examples of the SGFRWT on different real and synthetic datasets by modifying the GSPBOX toolbox [80]. The first and the third experiments are implemented in MATLAB programming language on a PC machine, which sets up Microsoft Windows 7 operating system and has an Intel® Core™ i7-2600 CPU with speed of 3.40 GHz and 16 GB RAM. The second experiment is implemented using PyTorch on a PC machine, which sets up Ubuntu 16.04 operating system and has an Intel® Core™ i7-4790 CPU with speed of 4.00 GHz and 32 GB RAM and has also two NVIDIA GTX1080-Ti GPUs.

5.1. SGFRWT Design Details

We should design two kernels in SGFRWT. One is the wavelet kernel , and the other is the scaling function h(x).

For the wavelet kernel g(x), we choose the same cubic spline in [44] as follows:where α and β are integers and they are used to determine the decay rate of spline. x1 and x2 are boundary of transitions region. Note that is normalized such that  =  = 1. The coefficients of the cubic polynomial s(x) are determined by s(x1) = s(x2) = 1,  = α/x1, and  = −β/x2. All of the examples in this paper were produced using α = β = 2, x1 = 1, and x2 = 2; in this case, s(x) = −5 + 11x 6x2 + x3. For the SGFRWT, we can easily substitute x in (65) by shown in (24) to get .

For the scaling function kernel h(x), we take with .

5.2. SGFRWT Application Examples
5.2.1. SGFRWT on Synthetic Graph Data

In the first example, we perform the proposed SGFRWT on “Swiss roll,” which is a dataset whose points are sampled randomly from a two-dimensional (2D) manifold as follows:

Figure 1 shows the Swiss roll data cloud. We take 500 points sampled uniformly on the manifold and then build a weighted graph, whose edge weight is defined as follows:

Note that the definition of the edge weight is a critical issue in graph construction. The most direct way to compute the edge weight is based on a given (dis)similarity measure. In practice, the edge weight is generally defined using different measures for better interpretability. As discussed in [81], Gaussian kernel and its variants (i.e., heat kernel or RBF kernel) are one of the most popular schemes to assign edge weights for a graph. Therefore, we take the Gaussian kernel in (67) to calculate the edge weight to simplify our discussion in this paper. We perform SGFRWT with fractional order θ ∈ [0.1, 1.0] on Swiss roll data cloud with J = 5 wavelet scales, and the results of some fractional orders θ ∈ {0.1, 0.5, 0.8, 1.0} are shown in Figure 2. The last column of Figure 2 shows the results of performing SGWT [44] on Swiss roll, from which we can see that SGWT [44] searches for the optimal expression domain only along the dimension of scale. However, the proposed SGFRWT searches for the optimal expression domain not only along the dimension of scale but also along the dimension of fractional order. That is, compared with SGWT, the proposed SGFRWT can express Swiss roll in higher dimensions, which makes the expression space larger and is therefore more likely to obtain a better expressive domain for the original problem. Just like [3133], the fractional order θ is a hyperparameter depending on the original problem and you should search for a suitable value of θ. For example, the fractional order θ = 0.4 is an appropriate value for gland segmentation when using fractional wavelet scattering network [33].

As a comparison, we also give the results of the proposed spectral graph fractional Fourier transform (SGFRFT) and graph fractional Fourier transform (GFRFT) proposed in [74, 75] on the “Swiss roll” in Figure 3, respectively. As we can see from Figure 3, the results of the proposed SGFRFT are different from those of GFRFT in [74, 75]. Furthermore, both SGFRFT and GFRFT are global transform and are not easily to be used for local analysis. However, let us see the third column of Figure 2, that is, the results of SGFRWT with J = 5 and fractional order θ = 0.8; the localization is more and more obvious along scale dimension. Comparing the third column of Figure 2 with the third column of Figure 3, we can see that, compared with SGFRFT and GFRFT, the proposed SGFRWT allows localization in fractional graph domain, which may lead to better local analysis in spectral graph domain. The experiment results of Figure 3 verify the local properties of SGFRWT shown in Appendix C (i.e., Lemmas C2–C4 and Theorem C5) to a certain extent.

5.2.2. SGFRWT on MNIST Database

In the second example, we will see that the proposed SGFRWT can be used as a data augmentation method in the preprocessing of convolutional neural networks, which require as many input images as possible. Because, as shown in the first example, the proposed SGFRWT allows deriving many images from the original image by considering several fractional orders. In the following, a series of experiments are performed on the MNIST dataset to verify the effectiveness of the proposed SGFRWT as a method of data augmentation.

We construct the graph for each image in MNIST dataset. The vertices of the graph are made up of pixels in the image and each pixel is connected with its vertical and horizontal neighbor pixel. The edge weight is calculated by a Gauss kernel weighting function [36] as follows: where is the weight on the edge connection of the vertex and , and dist (i, j) represents the distance between pixel i and pixel j.

In order to prove the effectiveness of SGWT and SGFRWT, we augment MNIST dataset by the following strategies: (i) The original MNIST database has a training set of 60,000 images and a testing set of 10,000 images, which are used as a control group. (ii) For each image of the MNIST training set, we perform SGWT [44] with J = 5 wavelet scales and generate a total of 360,000 images. The original 10,000 test images are still used as the testing set. (iii) For each image of the MNIST training set, we apply the proposed SGFRWT with J = 5 wavelet scales and fractional order θ ∈ {0.2, 0.4, 0.6, 0.8, 1.0}, generating a total of 1,800,000 training images. The details of the generated four datasets are shown in Table 1. Figure 4 shows all sample images by processing an image of the digit “zero” by SGFRWT with J = 5 wavelet scales and fractional order θ ∈ {0.2, 0.4, 0.6, 0.8, 1.0}.

To evaluate the data augmentation performance of the proposed approach, we train and test a simple convolutional neural network, which is shown in Figure 5, using the three datasets mentioned above. The CNN model consists of 2 convolutional layers, 2 pooling layers, and a fully connected layer with a 10-way Softmax classifier. The inputs of the CNN are images of size 28 × 28. The first convolutional layer has 32 kernels with size 5 × 5, while the other layer has 64 kernels with size 3 × 3. The Rectified Linear Units (ReLUs) are chosen as the activation function. Every convolutional layer is followed by a nonoverlapping max-pooling with filter of size 2 × 2. Stochastic gradient descent algorithm is employed to optimize the network with a learning rate of 0.001. When the network converges, we evaluate the classification accuracy on the testing set with an average value of 10 times. The last column of Table 2 shows the classification performance comparison of the three different datasets. As we can see from Table 2, if we use traditional data augmentation methods (flipping, rotating, and adding noise), then the recognition rate can achieve 98.41%. When the training dataset of MNIST is augmented by SGWT without traditional data augmentation, the recognition rate can increase to 98.71%. From the last row of Table 2, we can see that our proposed SGFRWT can further improve the accuracy.

5.2.3. Face Recognition by Fusing SGFRWT Features

In the third example, the application of SGFRWT to provide features for face recognition is investigated. We use four classical face databases, whose MATLAB formats are provided in [82, 83]. (1) Yale Database [84]. It contains 165 grayscale images (15 individuals, 11 images per individual). (2) AR Database [85]. We use a nonoccluded subset of AR database containing 700 images (50 male subjects and 50 female subjects, 7 images per subject). (3) CMU PIE Face Database [86]. We also use a nonoccluded subset of CMU PIE database which contains 11560 images (68 persons, 170 images per person). (4) ORL Database [87]. It contains 400 grayscale images (40 individuals, 10 images per individual). Note that all the images are resized to 32 × 32 pixels.

The experiment is carried out by the following three steps:(1)For each image of the four face databases, we apply the proposed SGFRWT with wavelet scale J = 10 and fractional order θ ∈ {0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0}, obtaining 100 times the images of original datasets.(2)We then extract the SGFRWT features of three fractional orders which have the highest trace ration [88] as follows:where tr(.) denotes the trace. and are between-class scatter matrix and the within-class scatter matrix, respectively,where the superscript T denotes transposition. C denotes the number of classes. A is the mean of Ai and Ai is the mean image of class Xi and Ni is the number of images in class Xi.| X(θ, j)| denotes the modulus of SGFRWT coefficient with the fractional order θ and the wavelet scale j.(3)The SGFRWT features of three fractional orders are fused by canonical correlation analysis (CCA) [89] to obtain the feature vector, whose dimension is then reduced by principal component analysis (PCA) and Linear Discriminant Analysis (LDA). Finally, we feed the feature vector to the support vector machine (SVM).

We conduct experiments on 18 different datasets which are constructed by four classical face databases mentioned above. Each set of datasets is compared with three different methods (baseline, SGWT, and SGFRWT), and we evaluate the classification accuracy on the testing set with an average value of 10 times. The recognition comparison results of the proposed SGFRWT feature building, the SGWT feature, and the baseline (only PCA + LDA + SVM used) on four face databases are shown in Table 3, from which we can see that the proposed SGFRWT feature building approach can achieve promising results and perform better than SGWT feature and also the SVM baseline.

6. Conclusion

This paper investigates the issue of extension of spectral graph wavelet transform (SGWT) to fractional domain. The main contributions of this paper can be summarized as follows. (1) A novel transform named spectral graph fractional wavelet transform (SGFRWT) is defined. (2) A Fourier series approximation-based fast algorithm for SGFRWT is derived and implemented since the SGFRWT includes complex domain computations compared to SGWT. (3) Applications of SGFRWT to synthetic and real datasets are also given to highlight its potential usefulness. In summary, the proposed fractional spectral graph wavelets provide a new choice for the graph signal processing. Further research may include the extension of the proposed SGFRWT for dealing with the directed graphs [40, 41] and the extension of the idea of SGFRWT to critically sampled graph wavelets like GraphBio [46].

Appendix

A. Definition of SGFRWT by Using the Idea of Traditional FRWT

According to (8), we can define the SGFRWT in (37) in another form by using the idea of traditional FRWT as follows:where

From (A.3), we can see that this definition has two defects:(1)The basis of SGFRWT is still based on ; however, we want to define the basis of SGFRWT based on a new function, for example, . If we use the definition of (A.3), then the characteristic of will be very similar to in (22) since they are based on the same function .(2)We want to define the basis of SGFRWT which changes directly with ; that is, the parameter should have an effect on directly.

Therefore, we do not use (A.1)–(A.3) to define the SGFRWT.

B. Definition of SGFRWT by Using Another Graph Fractional Laplacian Operator Lθ

Different from the definition in (24), the graph fractional Laplacian operator can also be defined as follows:

Comparing (B.1) and (13), we can see that the decomposition of L and Lθ shares the same χ. That is, if we define Lθ as the graph fractional Laplacian operator, then, similar to (21), we can define the spectral graph fractional wavelet transform (SGFRWT) operator as follows:where

From (B.4), we can see that this definition has one defect: the basis of SGFRWT is based on ; however, we want to define the basis of SGFRWT based on a new function, for example, . If we use the definition of (B.4), then the characteristic of will be very similar to in (21) since they are based on the same function .

Therefore, we do not use (B.1)–(B.4) to define the SGFRWT.

C. Some Properties of SGFRWT

In this appendix, we give some properties of proposed SGFRWT without proof, since the proof of these properties is very similar to that of those properties of SGWT in [44].

Lemma C.1. If the SGFRWT kernel satisfies the following admissibility condition:and , thenwhere . In particular, the complete reconstruction is given by .

Lemma c.1 shows that the mean of f may not be recovered from the zero-mean fractional graph wavelets. Lemma c.1 is a generalization of Lemma 5.1 in [44].

Lemma C.2. Let be a weighted graph, the graph fractional Laplacian operator, and t > 0 an integer. For any two vertices m and n, if then .

Note that is the shortest-path distance, i.e., the minimum number of edges for any paths connecting m and n. Lemma c.2 shows the localization result for integer powers of the fractional Laplacian . Lemma c.2 is a generalization of Lemma 5.2 in [44].

Lemma C.3. Let and be the fractional wavelets at scale s generated by the fractional kernels and . If for all , then for each vertex m. Additionally, .

Lemma c.3 shows that if two fractional kernels and are close to each other in some sense, then the resulting graph fractional wavelets should be close to each other. Lemma 3.3 is a generalization of Lemma c.3 in [44].

Lemma C.4. Let be K + 1 times continuously differentiable, satisfying for all i < K, and . Assume that there is some such that for all . Then, for , we have for all .

Lemma c.4 shows that if a fractional kernel has a zero of integer multiplicity at the origin, then can be approximated by a single monomial for small scales. Lemma c.4 is a generalization of Lemma 5.4 in [44].

Theorem C.1. Let be a weighted graph with fractional Laplacian . Let gθ be a kernel satisfying the hypothesis of Lemma 3.4, with constants and B. Let m and n be vertices of such that . Then, there exist constants D and , such that for all .

Theorem c.1 shows that the localization of the fractional wavelet must include a renormalization factor in the limit of small scales. In general, as for all m and n, due to the normalization chosen for the graph fractional wavelets. Theorem c.1 is a generalization of Theorem 5.5 in [44].

Theorem C.2. Given a set of scales , the set forms a frame with bounds A, B given by and , where .

Note that the basic definition of a frame is as follows: Given a Hilbert space , a set of vectors form a frame with frame bounds A and B if the inequality holds for all . Theorem C.6 shows that the numerical stability of recovering the vector f from the inner product depends on the frame bounds A and B. Theorem c.2 is a generalization of Theorem 5.6 in [44].

Data Availability

The data used in Section 5.2.1 are synthetic graph data. The data used in Section 5.2.2 are MNIST dataset, which are downloaded from the homepage of Yann LeCun (http://yann.lecun.com/exdb/mnist/). The data used in Section 5.2.3 are supported in [82, 83].

Conflicts of Interest

The authors declare that they have no conflicts of interest regarding the publication of this paper.

Acknowledgments

This work was supported in part by the National Natural Science Foundation of China under Grants 61876037, 31800825, 61871117, 61871124, and 61773117, in part by the National Science and Technology Major Project of the Ministry of Science and Technology of China under Grant 2018ZX10201002-003, in part by the Short-Term Recruitment Program of Foreign Experts under Grant WQ20163200398, and in part by INSERM under the Grant IAL.