Abstract

Estimating the cosmological microwave background is of utmost importance for cosmology. However, its estimation from full-sky surveys such as WMAP or more recently Planck is challenging: CMB maps are generally estimated via the application of some source separation techniques which never prevent the final map from being contaminated with noise and foreground residuals. These spurious contaminations whether noise or foreground residuals are well known to be a plague for most cosmologically relevant tests or evaluations; this includes CMB lensing reconstruction or non-Gaussian signatures search. Noise reduction is generally performed by applying a simple Wiener filter in spherical harmonics; however, this does not account for the non-stationarity of the noise. Foreground contamination is usually tackled by masking the most intense residuals detected in the map, which makes CMB evaluation harder to perform. In this paper, we introduce a novel noise reduction framework coined LIW-Filtering for Linear Iterative Wavelet Filtering which is able to account for the noise spatial variability thanks to a wavelet-based modeling while keeping the highly desired linearity of the Wiener filter. We further show that the same filtering technique can effectively perform foreground contamination reduction thus providing a globally cleaner CMB map. Numerical results on simulated Planck data are provided.

1. Introduction

In mid-2009, European Spatial Agency (ESA) put in orbit the latest space observatory Planck to investigate the Cosmological Microwave Background (CMB). These data are of particular scientific importance as it will provide more insight into the understanding of the birth of our universe. Most cosmological parameters can be derived from the study of these CMB data. After a series of successful CMB experiments (Archeops, Boomerang, Maxima, COBE, and WMAP [1]), the Planck ESA mission is providing more accurate data which will be the reference full-sky high-resolution CMB data for the next decades. More precisely, recovering useful scientific information requires disentangling the CMB from the contribution of several astrophysical components, namely, Galactic emissions from dust and synchrotron, Sunyaev-Zel’dovich (SZ) clusters, Free-Free emissions, CO emission to only name a few, see [2]. Classically, the CMB map is estimated via the application of some source separation methods. However, the application of very sophisticated source separation methods (see [36]) does not prevent the final estimated map from being contaminated with various spurious components: (i) instrumental noise and (ii) foreground residuals. Dealing with these various sources of contaminations is of paramount importance for most cosmologically relevant tests or evaluations: this includes CMB lensing reconstruction [7] or non-Gaussian signatures search. Tackling the problem of noise and foreground residual reduction is therefore important and is at the heart of this paper.

(a) Instrumental Noise
The very specific scanning pattern in full-sky CMB experiments leads to an instrumental noise closely Gaussian that exhibits significant spatial variation: Planck noise is far from being homogeneous. More precisely, the statistics of the noise vary spatially; in the field of statistics, it is said that the noise is nonstationary. As an illustration, Figure 1 displays a Planck simulated root mean square (RMS) map of instrumental noise at GHz. These data originate from the publicly available simulations described in [3]. Furthermore, while noise is insignificant at large scale, it is largely predominant at small scale (e.g., typically for for Planck). Most source separation methods applied so far to CMB data are linear methods: the estimated CMB map can be expressed as a linear combination of all the pixels of the input observations. As such, the propagated instrumental noise in the CMB maps is also typically assumed nonstationary Gaussian.
Noise can be a significant limitation for most cosmological tests or reconstructions. High noise level can hamper non-Gaussianity tests as the noisy map is likely to be “more Gaussian” than the noise-free map. The classical approach in the field of cosmology generally consists in reducing the noise of the CMB via a Wiener filter in spherical harmonics. If stands for the theoretical power spectrum of the CMB and the power spectrum of the noise, the Wiener filter is defined as follows: where (resp., ) stands for the spherical harmonics coefficients of the filtered map (resp, the input map ). While this approach makes profit of some knowledge of the energy distribution of the noise in frequency, it does not account for its spatial variations or nonstationarity. It is very likely that such a nonstationarity may create global non-Gaussian signatures at high frequency when such variations are sharp. Strong nonstationarities are known to have a strong impact on CMB lensing reconstruction (see [8]), creating an undesired reconstruction bias.
Another less traditional but commonly encountered noise reducing technique is the local Wiener filtering. It boils down to apply the Wiener filter in the Fourier space on patches in the pixel domain. The main drawback of this method is that while large patches should be favored to capture the nonstationarity of noise, they are enable to capture correlations at scales larger than the patch size.

(b) Foreground Residual
The first evaluation of already sophisticated CMB-dedicated source separation methods has been performed in [3]. Since, novel very effective methods have been proposed Needlet ILC [4], Generalized ILC [9], SMICA [10] or GMCA [6]. However, due to the very high complexity of the data, none of these very effective methods is able to provide a CMB map that is guaranteed to be free of foreground contaminations. If the residual contamination is generally low at large scales (e.g., for for Planck), it is generally significant at higher frequencies. It is likely that the remaining contamination mainly originates from point sources or strong features in the galactic plane. Obviously the presence of these residuals highly biases any cosmological tests to be performed on contaminated CMB map. The usual approach consists in masking the known spurious pixels of the CMB prior to any post-processing (e.g., 20 to of the sky are usually masked). However masking raises several issues: (i) it limits the number of samples to which any test can be applied, limiting its statistical relevance and (ii) masking generally makes any post-processing much more complicated to perform properly (e.g., inpainting of the mask is generally needed prior to any CMB lensing reconstruction [7]). Avoiding or at least limiting the extent of the mask to be applied should greatly help improving any cosmological test to be made on the estimated CMB map. Furthermore, masking does not prevent from the presence of residuals of undetected point sources which are likely be be numerous all over the sky. Further reducing the amount of foreground residuals should clearly be helpful.

(c) Contribution of This Paper
The contribution of this paper is twofold:(i)It introduces a novel noise reduction framework that, in opposition to the classical spherical-harmonics based Wiener filter, accounts for the nonstationarity of the noise. This new method preserves the linearity of the filtering. In fact, linearity is crucial in this field as it makes the study of error propagation via Monte-Carlo simulations much more convenient to carry out. In this framework, we make use of a simple, but efficient, wavelet-based modeling of the noise that allows for the modeling of nonstationarities at different scales. This will be discussed in Section 3.(ii)We discuss an extension of this method to further estimate an approximate contribution of the foreground residuals per wavelet scales by means of RMS maps. Thereby contamination reduction could be tackled within the same filtering framework. The noise/contamination reduction problem, recast as a quadratically regularized least square problem, is then effectively solved using recently introduced algorithms based on proximal calculus. Extensive numerical experiments are provided that show the effectiveness of the proposed method in reducing noise in Section 2.3 and foreground residuals in Section 3.3.

2. CMB Map Filtering with Nonstationary Noise

2.1. CMB Map Filtering

In a very general context and more precisely in the context of Planck, it is customary to assume that the input noise data, denoted by in the following, is modeled as the linear combination of the sought after noise-free signal and the noise term :Decorrelation between and is also classically assumed. In what follows, we will assume that each of these signals is sampled on the sphere according to the Healpix [11] sampling grid. The sampled CMB signal is assumed to be composed of samples . Recovering an estimate of can then be formulated as an inverse problem. According to the Bayesian way of solving inverse problems, some a priori knowledge about the signal of interest has to be expressed. This inference framework is particularly well suited to the CMB denoising issue as strong prior assumptions can be formulated. Indeed, since [12], it is well established that the cosmological microwave background can be approximated as an isotropic Gaussian random field with a high accuracy. This particularly means that it is fully characterized by its power spectrum .

More precisely, the CMB can be defined by its expansion in spherical harmonics: Assuming that the CMB is fully Gaussian with power spectrum , its spherical harmonics are independently distributed following a Gaussian distribution with mean 0 and variance : The power spectrum of the CMB is generally estimated with the pseudo-power spectrum defined as follows: where stands for the conjugate of . The a priori probability distribution of the CMB map is a normal distribution with covariance matrix where stands for the spherical harmonics basis and its adjoint. The matrix is diagonal with . This yields the following CMB prior probability distribution: In this section, we assume that the instrumental noise can be modeled as a nonstationary but uncorrelated Gaussian field in the pixel domain such that

Written differently, is a realization of a multivalued Gaussian variable with mean 0 and covariance matrix . The case of correlated noise will be discussed later on in this section. From the Gaussian modeling of , the likelihood function is trivially obtained as follows: Following Bayes rule, the posterior distribution for is the following: The CMB map can be estimated using the maximum a posteriori (MAP) estimator with the following analytical solution: The reader would here recognize a classical Wiener filter. Contrary to the classically used Wiener filter in spherical harmonics, the solution of (11) explicitly account for the nonstationarity of the noise. However, the computational evaluation of this Wiener-like solution raises some numerical issues: the noise covariance matrix is diagonal in the pixel domain while the CMB covariance matrix is diagonal in the spherical harmonics domain. The solution filter that appears in (11) is formulated in the pixel domain:In this expression, the matrix is the inverse covariance matrix of the CMB in the pixel domain. This matrix is clearly not diagonal. Recalling that is of the order of for WMAP and for Planck, storing and handling this matrix is way too unrealistic. As an alternative, the most effective approach in this situation would amount to solve the linear system of equations using a conjugate gradient (CG, [13]) for instance. However, in the next section, we will rather make use of recently introduced proximal algorithms. This optimization framework leads to algorithms which exhibits a lower speed of convergence than CG but with the gain of more flexibility to handle different filtering techniques or better constrain the solution.

2.1.1. Iterative Filtering

As said clearly in the previous paragraph, computing the MAP solution requires inverting a system of equations which turns to be intractable due to the large scale of Planck data. The most straightforward numerical solver is the well-known conjugate gradient. However, while this solver is known to be a fast and accurate numerical method for solving linear systems of equations, it lacks the flexibility of more modern optimization techniques. For this reason we rather opt for the more flexible optimization framework of proximal calculus (see [14] and references therein).

(a) A Flexible Optimization Framework: Forward-Backward Splitting
The problem in (12) can be written as minimization of the sum of two operators and : The function is convex and lower semicontinuous and differentiable; the function is strictly convex, continuous and differentiable. Such problem as therefore a unique solution (see [14]). If denotes the gradient of , we will also assumed that this gradient is -Lipschitz. In the specific setting of interest in this article, and . In this context, an effective algorithmic framework is the forward-backward splitting (FPS) technique defined in [14]. In the paradigm of proximal calculus, any convex lower semicontinuous function (see [14]) admits a well-defined proximal operator: In the FBS framework, the solution to the problem in (14) is computed via an iterative fixed point algorithm such that at iteration As proved in [14], the convergence of the above fixed point algorithm is guaranteed under the condition ; note that in the setting of CMB map recovery with uncorrelated nonstationary noise, .

(b) The Proximal Operator of
In the problem of interest in this paper, . As this function is continuous and differentiable, it is a fortiori lower semicontinuous. Following the definition in (16), the proximal operator of is defined as follows: The solution to this problem has a simple closed-form expression: The proximal operator of is a mere Wiener filter in the spherical harmonics space.To sum everything up, the resulting iterative Wiener-like filtering is as we can see in Algorithm 1.

(1) Initialization: set the number of iterations , the step size , and the starting point .
(2) Main loop: apply iteratively the following steps at each iteration :
 (a) Forward step of the FPS:
 (b) Backward step of the FPS:
(3) Stop when .

(c) Computational Cost
The computational cost of this iterative Wiener algorithm is particularly low. The first step (2-a) only requires entrywise multiplications and additions of vectors. It is therefore of the order of . The second step (2-b) is by far the most computationally demanding; it requires entrywise multiplications and additions of vectors of the order of (where generally ) and single forward/backward spherical harmonics transforms. In the numerical experiments of Sections 2.3 and 3.3, the starting point will be chosen as having entries equal to zero. The algorithm stops when the relative variance—in euclidian norm—between consecutive estimates of the CMB map is lower than a certain threshold: . The total number of iterations is generally no greater than .

(d) Linearity and Bias of the Iterative Wiener Filter
The proposed iterative filtering technique preserves the linearity of the overall CMB map processing. In fact, in every step of the proposed algorithm, each output depends linearly on their inputs. This property is essential as it allows for a simple way to study the propagation of errors via Monte-Carlo simulations; each simulation then has to undergo exactly the same processing step described previously. It is worthwhile to outline that the solution to the proposed iterative Wiener filter is exactly the solution of the Wiener filter described in (11). Therefore, the proposed method exhibits exactly the same bias and variance as the Wiener estimator. Similarly to the global Wiener filter, the bias of this estimator does not vanish and can be shown to depend on the noise statistics and the signal itself. The same holds for the variance as well.

2.2. The Case of Correlated Noise

Assuming a perfect calibration of the data, the nonstationarities of instrumental noise mainly originate from the nonuniform scanning of the sky. The lower the number of scanning time in one area is, the higher the noise level is and vice versa. Figure 1 shows the RMS map of noise at channel GHz; mathematically this is no more than the square root of the diagonal of . However this is also well known that, contrary to what has been assumed in the previous section, the noise is far from being uncorrelated. There are mainly two origins for the correlation of noise:(i)The way the sky is scanned by Planck makes the noise correlated along the scanning direction. This means that the noise is correlated along elongated patterns in the pixel domain; for numerical reasons, this can hardly be accounted for in the pixelwise covariance matrix .(ii)Most modern CMB-dedicated source separation methods, Needlet ILC [4], Generalized ILC [9], SMICA [10] or GMCA [6] rely on a local analysis of the data in the wavelet or needlet domain. Furthermore, the final CMB map is generally obtained as the mixture of observations that do not share the same resolution. This means that the noise that finally contaminates the CMB is, in the pixel domain, correlated.

It then seems natural to model non-stationary-correlated noise: (i) in the wavelet domain to capture its correlation and (ii) locally to capture its nonstationarity.

(1) Wavelets: the Basics
Wavelets are well known to be the tool of choice to analyze nonstationary signals [15]. When considering spherical data, there exist several implementations of wavelets on the sphere. In this paper, we will make use of the isotropic undecimated wavelet transform (IUWT) introduced in [16]. The IUWT can be seen as an extension of the celebrated à trous algorithm to spherical data (see the seminal book [15] and references therein).As any wavelet transform, the IUWT is first defined by a scaling function defined by a fixed frequency cut-off . As emphasized in [16], this scaling function is built so that it is azimuth-invariant on the sphere (i.e., it does not depend on ). Classically, each rescaled version of the scaling function is associated to a low-pass filter or equivalently in spherical harmonics. Traditionally, the wavelet-based multiscale analysis is performed by computing successive smooth approximations of . These smooth approximations are obtained by convolving with dyadically rescaled versions of the scaling function: where is the total number of scales. The so-called wavelet function at scale , , is defined as the difference of two consecutive rescaled scaling functions: The wavelet function at scale can be equivalently associated to a high-pass filter or equivalently in spherical harmonics. The wavelet coefficients at scale are then computed by convolving the smooth approximation of at scale with the th wavelet function . In spherical harmonics, this amounts to a simple multiplication of the signal’s spherical harmonic coefficients with the wavelet filter . The decomposition being redundant, several strategies can be designed to reconstruct from the wavelet coefficients and the coarse resolution . The simplest is the one advocated by the à trous algorithm: In the following, noise modeling will be performed in each wavelet scale; that is, on each set of coefficients .

(2) Wavelet-Based Statistical Modeling of Correlated Noise
As emphasized previously, one elegant way to model the instrumental noise is to consider the distribution of its expansion coefficients in the spherical wavelet domain. The basic idea consists in assuming that the wavelet coefficients of noise are approximately decorrelated. This idea takes its roots in the field of multiscale statistical modeling: it has been shown that wavelets exhibit (almost)-decorrelating properties for some classes of nonstationary stochastic processes. To give only one example, this is the case for fractional Brownian motion [17] which has been extensively used to model power-law following stochastic processes. In the following, will denote the wavelet coefficient of the noise term in scale at pixel with the convention: corresponds to the finest scale and to the coarse resolution. From this definition, we will assume that these wavelet coefficients verify:where stands for the local variance of the noise at pixel and for the Kronecker delta function. This entails that the covariance matrix of the noise in wavelet scale is diagonal and equal to: The local standard deviation of the noise, , has to be estimated from the data beforehand. In the numerical experiments in Section 2.3, these parameters are estimated from a single noise realization. It is commonly assumed that one has access to noise realizations which generally originate from simulations or from the data themselves. Assuming that the local variances of the noise vary slowly from one pixel to another, it can be approximated at pixel by the empirical estimate of the variance in a fixed surrounding neighborhood; in the following this neighborhood will be a patch of size centered about the pixel : where stands for the indices of the pixels that compose the patch that surrounds the pixel . Note that the patch size will highly depend on how fast the noise variance varies spatially. This particularly means that should be smaller in the finest scale and larger when increases. Following the natural dyadic scaling of the wavelet transform, the patch size in scale will scale as follows: where stands for the patch size at the finest scale.

2.3. Numerical Experiments
2.3.1. Experimental Dataset

In the following, numerical experiments will be performed on simulated Planck data described in [3]; to our knowledge, these are the only full-sky simulated data that are publicly available. The input CMB map has been computed by applying a recently introduced source separation technique coined Local-GMCA (L-GMCA, see [6, 18]) to Planck simulated data. These data were already deployed in [3] to evaluate the performances of source separation methods. For more technical details on these data, we refer the interested reader to [3]. To our knowledge this dataset is the only full-sky realistic Planck simulated dataset that is publicly available. It contains contributions as various as synchrotron and free-free emission, anomalous dust, dust, SZ effect and CMB. The instrumental noise on each of these Planck simulated observations was assumed to be uncorrelated but nonstationarity.

The noise that contaminates the raw CMB map at the output of most component separation techniques is nonstationary and correlated. The CMB map estimates being estimated from several observations at different resolutions, the output noise is obviously correlated. Source separation techniques like L-GMCA [6, 18] or Needlet-ILC [4] also analyze the data on small patches in different wavelet or needlet scales. Such local/multiscale process is also at the origin of the correlation of the noise that contaminates the CMB map.

2.4. Experimental Results

In this section, we particularly focus on the noise reduction aspect of the proposed method. The reference denoising method in the field of astrophysics and more specifically CMB data analysis is the global Wiener filter applied in spherical harmonics. In opposition to this classical filtering technique, the proposed iterative filtering approach accounts for the nonstationarity of the noise; thereby, it should particularly provide better solution than global Wiener.

As detailed in the previous section, the modeling of the noise contribution is performed in the wavelet domain: the noise is assumed to be nonstationary but decorrelated at each wavelet scale . In each wavelet scale, the noise covariance matrix is estimated locally from a single noise realization. The noise covariance per wavelet scale is evaluated locally on patches of size where, by convention, is defined as the finest wavelet scale. As an illustration, Figure 8 displays the estimated noise RMS map at scale .

The global Wiener solution is computed by filtering the original CMB map with the linear filter defined in spherical harmonics by (11). The CMB power spectrum is a WMAP7 best-fit estimate; the noise power spectrum is estimated via the pseudo-power spectrum of the noise realization.

Iterative Wiener has been applied to the original noisy CMB map . The single parameter to be tuned is the number of iterations ; in the following experiments, it has been fixed to 250. Figure 2 shows the noisy map, the LIW-Filtering solution and the noise realization used to estimate the noise local variance in the finest wavelet scale. Figure 3 displays the residual maps of the global Wiener and LIW-Filtering solution. As expected the nonstationarity footprint of the noise appears clearly on the residual LIW-Filtering map where the bias of the estimated map is lower.

Figure 4 features the power spectra of the original and filtered maps. We would like to point out that these power spectra have been computed from the raw maps; no masking has been applied. This particularly means that these estimates are likely to be biased by strong foreground from the galactic plane. The first observation is that the global Wiener filter does not fully remove the noise at high contrary to the iterative filtering technique. This can be explained by the presence of significant nonstationarities at high which can produce a substantial bias at high frequency. Interestingly, the iterative technique succeed in largely reducing noise at high thus supporting the significant contribution of the noise nonstationarities in producing a bias on the power spectrum. More technically, it is worth noting that the power spectrum of a stochastic process is properly defined as long as this process is wide sense stationary [19]. Obviously, the nonstationarities of the noise make it depart from this assumption; thereby the power spectrum has no rigorous statistical meaning in this context. Said differently, the power spectrum of the noise does not contain all the information that a nonstationary noise does in the pixel domain.

It is well known that the Wiener filter provides a CMB map that has a biased power spectrum. Thereby the Wiener filter—in red in Figure 4—yield a power spectrum that is negatively biased at high when the noise highly predominates upon the CMB—typically for .

Figure 5 shows the power spectrum of the residual: . This quantity is particularly well suited to visualize the power discrepancy in frequency between the original CMB map and its estimate . As expected, the global Wiener filtering reduces the contribution of noise at high (see the blue line in Figure 5). Still, the filtered map largely departs from the original map. Interestingly, the proposed iterative filtering technique (in red in Figure 5) largely outperforms the classical Wiener filter from to . More precisely, the residual power is reduced by up to an order of magnitude for .

A more complete measure of discrepancy between the original map and one of its estimate consists in evaluating the mean squared error (MSE) in the wavelet domain. To that end, Figure 6 displays the MSE of different estimators of the CMB map in 5 wavelet scales in logarithmic scale. The normalized MSE between some CMB map estimate and the “true” CMB map is computed as follows: . Again, this figure shows that the iterative technique which directly account for the nonstationarity of the noise, performs better than the global Wiener filter. The impact of the noise tends to vanish for larger scales (typically for ).

To better illustrate the differences between the global Wiener filter and its iterative counterparts, Figure 7 displays the MSE of the different estimators of the CMB map which has been computed in the wavelet domain per bands of latitude of width 10°. By convention, the galactic plane is centered around the latitude 0°. Figure 7(a) displays the MSE which has been evaluated from the finest wavelet scale ; this corresponds to an analysis window that ranges from to . Again, taken globally, one can point out the clear improvement between the global Wiener filter and its iterative version. As expected the Wiener filter is optimal in the sense of the MSE. More interestingly, the MSE of the two filtering techniques seems to be rather constant for high latitudes and sharply increases in the range . As already emphasized, the impact of any of these linear filtering techniques is the more significant when the signal to noise ratio (SNR) decrease. This is obviously the case at smaller scales. At larger scales, the SNR increases and the impact of the filtering is much less significant. This is exactly what can be observed on the plot on the right of Figure 7(b): the MSE of any of the filtered CMB maps at scale , which grossly correspond to an analysis window ranging from to , are approximately the same.

As a brief conclusion for these experimental results, it appears clearly that the nonstationarity of the noise has an impact on the CMB estimate. The most classical noise reducing technique in the field, that is, the global Wiener filter, only relies on the power spectra of the CMB and the noise . Obviously, the behavior of the noise power spectrum contains almost no information about its spatial behavior. These experiments have shown that the noise nonstationary has a clear impact on the CMB. Further accounting for the spatial variations of the noise variance helps improving the noise reduction process by a dramatic amount.

3. From Noise Reduction to Contamination Reduction

As stated in the introduction in Section 1, one of the purposes of this paper is to tackle the problem of post-separation contamination reduction. In this setting, contamination generally means instrumental noise and residuals of galactic foregrounds. In the case of Planck, CMB maps are generally obtained by applying sources separation methods. Most of these methods provide a solution that can be formulated as a linear combination of the original Planck multichannel observations. Therefore, it is relevant to assume that these contaminants essentially contribute additively to the CMB map: where stands for the foreground contamination and for the instrumental noise. Furthermore, it is also natural to assume that these three components , and are mutually uncorrelated. The filtering technique introduced in the previous sections mainly relies on the fact that the contaminants are characterized by their covariance matrix . The questions that we aim at tackling in this paragraph is: How far do we know and how can it be estimated from the data ?

3.1. Foreground Residuals Modeling

Unlike noise, the foreground residual are generally non-Gaussian, with the exception of the CIB (see [20]). We further restrict the modeling of residuals to their second order statistics thus opting for a Gaussian approximation of their contribution. The contribution of foreground residuals are obviously not known in advance; this means that their covariance matrix has to be estimated from the data directly. From the uncorrelation of the different components, the additive contamination model in (29) then reads at the level of second statistics:

In this equation, the correlation of foreground pixels implies that is non-diagonal. Similarly to the noise, second order dependencies of foreground pixels are way too complex to be modeled in the pixel domain.

So as to capture the correlation of foreground pixels, a natural and simple strategy boils down to adopting the wavelet-based statistical modeling used to model correlated noise in the previous Section 2.2.(2). To this end, will denote the wavelet coefficient of in scale at pixel with the convention: corresponds to the finest scale and to the coarse resolution. From this definition, we will assume that these wavelet coefficients verify: where stands for the local variance of the foreground at pixel . The covariance matrix of the foreground residual in wavelet scale is diagonal and equal to: In practical situations, has to be estimated from the data themselves. Similarly to the modeling of noise, the local noise variance of the foreground will be approximated by the empirical estimate of the variance in a surrounding neighborhood of size centered about the pixel :

3.2. Contamination Filtering

In the rest of this paper, we will generally use the term contamination for the contribution of both the foreground residuals and the instrumental noise. In the following, as suggested previously, we also opt for exactly the same model to capture the pixel dependencies of the instrumental noise. Therefore, we will now define locally at pixel and scale the variance of contamination (noise and foreground) as . The joint contribution of noise and foreground residuals will be evaluated jointly from the data.

From the estimation of the local variance ; one has to recall that the contaminants’ variance is not directly accessible but is equal to:where stands for the variance of the CMB and . Let use note that, from the stationarity of the CMB map, does not explicitly depend on ; this holds as long as the patch size at scale is large enough compared to the CMB fluctuations (i.e., correlation length) in the th wavelet scale. The value of the CMB variance at scale th is directly related to the CMB power within this scale; its value can be derived from the CMB power spectrum and the wavelet analysis filters. To that end, let us denote by the spherical harmonics filter from which the wavelet scale is defined. As emphasized in Section 1 the spherical harmonics coefficients of the CMB map should be mutually uncorrelated. This particularly entails that the power of the CMB in the th wavelet scale is exactly: It follows that the variance of the contamination at pixel is approximately equal to: Thereby, from the a priori knowledge of the CMB power spectrum , an approximate contribution of the residual can be evaluated at each pixel and each wavelet scale. The reader has to keep in mind that the measurement of this contribution is only approximate and has to be considered under the light of the following assumptions: (1) the contribution of the residual is modeled as a nonstationary Gaussian field where the covariance matrix is diagonal in each wavelet scales and (2) its variance being evaluated on small patches of size , it is assumed to be approximately stationary in a given analysis patches. However, even with these approximations, our model offers a much more realistic modeling of the complexity of the data than the traditional Wiener method. Improvements of this modeling will be discussed in Section 4.1.

Let . Then, in the algorithm introduced in Section 2, the noise matrix is substituted with the following covariance matrix:

where stands for the isotropic wavelet transform and its adjoint. Along with the iterative technique introduced in Section 2, the resulting wavelet-based filtering will be coined LIW-Filtering for Linear Iterative Wiener Filtering in the following.

3.3. Contamination Reduction on Planck Simulated Data

The dataset used in this section is exactly the same that we used to study noise reduction. It is described in Section 2.3.

As emphasized in Section 3, the wavelet-based noise/contamination variance modeling used so far makes it possible to account for the contribution of the foreground contamination in the filtering process. Estimating the contribution of the foreground contamination, as detailed in Section 3, makes the reduction of contamination possible. In the following experiments, the contamination, which includes the contribution of the noise and an approximate contribution of the foreground residuals, has been modeled in the wavelet domain [16] using 6 scales. Within each wavelet scale, with the exception of the coarsest scale to which no filtering is applied, the contribution of the contamination is assumed to be Gaussian and locally stationary on patches of size . Figure 9 features the estimated variance map in logarithmic scale. As a remark, if the morphology of this contaminant variance map resembles at first glance the noise-only variance map, the dynamic range of the former is clearly larger by 3 orders of magnitudes. These differences are mostly significant on the galactic plane and on local features that are likely point sources or residuals of SZ clusters.

The proposed Wiener-based iterative method has been applied on the aforementioned Planck simulated data. Figure 10 shows the difference map between the noise-only LIW-Filtering solution and the proposed contamination-aware LIW-Filtering solution in the finest wavelet scale. It appears clearly that the difference between the map are mainly concentrated on the galactic plane and on likely point sources at high latitude.

Figure 11 displays the power spectra of the original and filtered CMB maps along with the theoretical power spectrum of the simulated map in black dash-dotted line. At first glance, accounting for the contribution of the foreground residuals, even approximately, yields improvements from . Perhaps more interesting, Figure 12 shows the power spectrum of the residual maps : the contaminant-aware filtering technique clearly outperforms the previous methods from to with a dramatic improvement in the range .

Figure 13 displays the wavelet-based MSE we described in the previous paragraph. The results of the contaminant-aware filtering technique have been further added to the plot in Figure 6. Let us recall that the noise reducing techniques described in the previous section mainly help reducing the noise in the noise-dominant regime. This particularly explained why the MSE remains almost unaltered in wavelet scales . Interestingly, accounting for an approximate contribution of the foregrounds helps reducing their impact in larger scales, typically for .

Figure 14 presents the MSE computed at different latitudes in the finest wavelet scale — plot on Figure 14(a)—and —plot on Figure 14(b). Accounting for the contaminant contribution improves the MSE of the filtered map at all latitudes. Interestingly, the MSE is decreased by a large amount—a factor of approximately 5—on the galactic (latitude 0°). The same holds in a slighter amount in wavelet scale .

Assuming that the Planck instrumental noise is Gaussian is widely considered as a reasonable assumption. This is obviously not the case for foreground residuals the presence of which is likely to largely distort the search for non-Gaussian features in the CMB. Thereby, reducing the amount of contaminant should help preventing non-Gaussianity test from the non-Gaussian impact of foreground residuals. In the field of CMB non-Gaussianity evaluation, a classical technique boils down to measuring higher-order statistics in the wavelet domain [21]. One of the statistics of choice is the statistics of order 4, denoted , a.k.a. the kurtosis. The kurtosis of the wavelet coefficients of the estimated CMB map has been evaluated in 5 scales; non-Gaussianity test results are shown in Figure 15. The first observation is that the proposed method helps reducing non-Gaussian contamination even when only the noise is modeled as a spurious component. This suggests that the nonstationary behavior of the noise can generate undesired non-Gaussian signatures. Thereby, accounting for the nonstationary behavior of the noise in the noise reduction process decreases the kurtosis of the filtered map by an order of magnitude in the finest wavelet scale.

The red dashed line in Figure 15 shows a clear reduction of the kurtosis of the filtered map when the contribution of the foreground residuals is modeled. More precisely, the value of is grossly reduced by 4 orders of magnitude for , 2 orders of magnitude for and 1 order of magnitude for . While noise-only filtering yields map that departs from the Gaussian assumption for , the contamination-aware filtered map is now compatible with the Gaussian assumption in all wavelet scales with .

Figure 16 presents more detailed non-Gaussianity tests; the three plots that this figure displays show the evolution of , in the three finest wavelet scales , which has been evaluated in bands of latitude of width . The galactic plane is centered about the point . The non-Gaussian contamination mainly originates from the presence of galactic foreground residuals which are mainly concentrated on the galactic plane. This explains the very large value of in the range of latitude . The second well-known source of non-Gaussian signatures at spatial high frequency are the radio and infrared point sources. These foregrounds are roughly uniformly scattered all over the sky. Point sources are likely to be the predominant source of non-Gaussian signatures at high latitude thus explaining the relatively large value of in this range of latitudes. As shown in Figure 16(a), further filtering the contamination (see the red dashed line) yields a dramatic reduction of up to 5 orders of magnitude on the galactic plane. The filtered CMB map is likely to be compatible with the Gaussian assumption at every latitude, even on the galactic plane. In the second wavelet scale , the same reduction of non-Gaussian signatures is also visible by a lesser extent, mainly on the galactic plane in the range . In the third wavelet scale, the proposed filtering is likely to reduce the non-Gaussian signatures in the galactic plane; in this range of scales (i.e., the third wavelet scale roughly corresponds to an analysis window that spans the range ), the CMB is largely predominant which entails that: (i) the input map is already quite compatible with the Gaussian assumption and (ii) the impact of the filtering may be less visible when non-Gaussianity is measured.

4. Discussion, Prospects, and Conclusion

4.1. Discussion and Prospects
4.1.1. Filtering the Maps, for What Use?

Important cosmological tests and evaluations are performed on estimated CMB maps; this is particularly the case for CMB lensing reconstruction [7] and non-Gaussian signature detection to only name two. These tests are particularly sensitive to spurious components that contaminate the estimated CMB map, whether it is noise or foreground residuals. This is therefore crucial to estimate a CMB map that is the less contaminated possible. The second point we would like to emphasize is that the aforementioned cosmological tests or evaluations are generally performed on the full-sky data; prior to any post-processing the estimated map is masked to get rid of the impact of galactic foregrounds and point sources. This has two major consequences: (i) it limits the size of the samples to which these tests/evaluations are performed thus increasing the statistical variance of these tests and (ii) any of these tests has to deal with this mask (e.g., via an inpainting procedure in [7]) thus making the analysis of the CMB map generally more complex.

As shown in the previous experiments, the proposed filtering clearly limits the impact of noise by taking into account its nonstationary behavior. Furthermore, results are presented that clearly show that modeling the contribution of foreground residuals make contamination reduction effective even on the galactic plane. This has two major consequences: (i) it makes it possible the use of a much smaller mask prior to any analysis of the estimated CMB map and (ii) it helps reducing the non-Gaussian features that originate from the presence of foreground residuals.

4.1.2. Beyond the Proposed Methods

Whether noise or foreground residual is at stake, the central assumption that is at the heart of the modeling of contamination is the decorrelation hypothesis we made in Section 3. It is assumed that the contaminants have potentially nonstationary but decorrelated samples in each wavelet scales. The basic idea that supports this assumption is the well-known decorrelating property of wavelets bases. If this assumption almost holds for rather theoretical stochastic processes (e.g., fractional Brownian motion [17] to only name one), this is only roughly approximate for more general signals such as galactic foreground or complex correlated noise. The basic idea behind this assumption is that the multiscale analysis basis is able to capture the correlation morphology of the component to be modeled. In surveys involving raster scanning such as WMAP or Planck, it is very likely that instrumental noise will be highly correlated along the scanning direction. This means that the correlation of the Planck instrumental noise is better modeled as a decorrelated stochastic process in a multiscale signal representation that best represent elongated structures such as the Curvelet tight frame [16]. We can extrapolate the same argument to the modeling of foreground residuals.

The contamination modeling used so far in this paper makes profit of the decorrelation assumption; it particularly helps simplifying the filtering process by only requiring the handling of diagonal matrices (i.e., root mean squared maps). Departing from the decorrelation assumption will largely increase the complexity of the proposed filtering techniques. However, a straightforward way of extending the proposed methods is to choose multiscale signal representations which are better adapted to the morphology or structure of the signal to be modeled.

4.1.3. Potential Impact of Contamination Reduction on CMB Non-Gaussianities

It is important to wonder whether the contamination reduction filtering introduced in Section 3 might affect or reduce potential CMB-related non-Gaussianities. One important point is that the way the local variance of the contamination is estimated in Section 3 leads, in practice, to an estimation of foreground contaminations that largely exceeds the average CMB power in each wavelet scale. Therefore, the filtering should have a much lesser impact on non-Gaussianities whose magnitude is of the order or lower than the magnitude of the CMB. It is worth pointing out that the rule in (36) to estimate the local contamination variance can also be chosen differently to only account for non-Gaussianities that are guaranteed to largely exceed the magnitude of the CMB thus avoiding for sure any effect on potential CMB-related non-Gaussianities. Future work will focus on evaluating the potential impact of the proposed filtering technique on CMB lensing reconstruction and non-Gaussianity detection.

4.2. Conclusion

Cosmological microwave background maps estimated from full-sky surveys such as WMAP or more recently Planck generally suffers from various sources of contaminations: (i) instrumental noise is generally nonstationary which may generate non-Gaussian signatures and (ii) foreground residuals generally remain even after the application of state-of-the-art source separation methods. In this context, the most classical denoising technique, aka. global Wiener filter; despite its simplicity, it is not able to account for the nonstationarity of the instrumental noise. To that end, we introduce a novel noise reduction technique coined LIW-Filtering for Linear Iterative Wavelet Filtering which combines the linearity of the global Wiener filter while accounting for the potential nonstationarity of the noise. In this framework, the noise is modeled as a nonstationary but decorrelated process in the wavelet domain. The denoising problem then takes the very simple form of a quadratically regularized least square where the noise covariance matrix is diagonal in the wavelet domain and the signal covariance matrix is also diagonal in the spherical harmonic domain (i.e., CMB power spectrum). The solution is computed using recently introduced proximal algorithms. When noise reduction is at stake, we showed that the proposed iterative technique succeeds in reducing the mean squared error (MSE) of the filtered solution with respect to the global Wiener filter classically applied in the field. Moreover, the modeling/estimation framework introduced in this paper makes the reduction of foreground contamination possible. Similar to the nonstationary instrumental noise, foreground contamination is modeled as a nonstationary but decorrelated process in the wavelet domain. Numerical experiments show that: (i) the MSE of the filtered map is improved; specifically on the galactic plane, and (ii) very interestingly, non-Gaussian signatures are also dramatically reduced. This particularly provides arguments supporting the application of the proposed filtering technique as post-processing step to be applied to the CMB with the crucial aim of reducing noise and perhaps more importantly foreground contamination. It is also important to point out that, similarly to the global Wiener filter, LIW-Filtering is—at this acronym indicates—a linear filtering technique. This means that LIW-Filtering is also relevant when studying errors and their propagation on the estimated CMB map via Monte-Carlo simulations are unavoidable.

Future work will focus on refining the modeling of noise and foreground residuals. Without losing the simplicity of this approach, we will more specifically focus on studying more adapted signal representation to better model the noise/foreground contamination spatial behavior.

The developed IDL code will be released with the next version of ISAP (Interactive Sparse astronomical data Analysis Packages) via the following web site: http://jstarck.free.fr/isap.html.

Acknowledgments

This work was supported by the French National Agency for Research (ANR-08-EMER-009-01) and the European Research Council Grant SparseAstro (ERC-228261).