## Testing the Gaussianity and Statistical Isotropy of the Universe

View this Special IssueReview Article | Open Access

# Primordial Non-Gaussianity in the Large-Scale Structure of the Universe

**Academic Editor:**Dragan Huterer

#### Abstract

Primordial non-Gaussianity is a potentially powerful discriminant of the physical mechanisms that generated the cosmological fluctuations observed today. Any detection of significant non-Gaussianity would thus have profound implications for our understanding of cosmic structure formation. The large-scale mass distribution in the Universe is a sensitive probe of the nature of initial conditions. Recent theoretical progress together with rapid developments in observational techniques will enable us to critically confront predictions of inflationary scenarios and set constraints as competitive as those from the Cosmic Microwave Background. In this paper, we review past and current efforts in the search for primordial non-Gaussianity in the large-scale structure of the Universe.

#### 1. Introduction

In generic inflationary models based on the slow roll of a scalar field, primordial curvature perturbations are produced by the inflaton field as it slowly rolls down its potential [1–4]. Most of these scenarios predict a nearly scale-invariant spectrum of adiabatic curvature fluctuations, a relatively small amount of gravity waves, and tiny deviations from Gaussianity in the primeval distribution of curvature perturbations [5–7]. Although the latest measurements of the cosmic microwave background (CMB) anisotropies favor a slightly red power spectrum [8], no significant detection of a -mode or some level of primordial non-Gaussianity (NG) has thus far been reported from CMB observations.

While the presence of a -mode can only be tested with CMB measurements, primordial deviations from Gaussianity can leave a detectable signature in the distribution of CMB anisotropies *and* in the large-scale structure (LSS) of the Universe. Until recently, it was widely accepted that measurement of the CMB furnished the best probe of primordial non-Gaussianity [9]. However, these conclusions did not take into account the anomalous scale dependence of the galaxy power spectrum and bispectrum arising from primordial NG of the local type [10, 11]. These theoretical results, together with rapid developments in observational techniques, will provide large amount of LSS data to critically confront predictions of non-Gaussian models. In particular, galaxy clustering should provide independent constraints on the magnitude of primordial non-Gaussianity as competitive as those from the CMB and, in the long run, may even give the best constraints.

The purpose of this paper is to provide an overview of the search for a primordial non-Gaussian signal in the large-scale structure. We will begin by briefly summarizing how non-Gaussianity arises in inflationary models (Section 2). Next, we will discuss the impact of primordial non-Gaussianity on the mass distribution in the low-redshift Universe (Section 3). The main body of this paper is Section 4, where we describe in detail a number of methods exploiting the abundance and clustering properties of observed tracers of the LSS to constrain the amount of initial non-Gaussianity. We conclude with a discussion of present and forecasted constraints achievable with LSS surveys (Section 5).

#### 2. Models and Observables

Because they assume (i) a single dynamical field (the inflation), (ii) canonical kinetic energy terms (i.e., perturbations propagate at the speed of light), (iii) slow roll (i.e., the timescale over which the inflaton field changes are much larger than the Hubble rate), and (iv) an initial vacuum state, single-field slow-roll models lead to a small level of primordial non-Gaussianity [6, 7, 12]. The lowest-order statistics sensitive to non-Gaussian features in the initial distribution of scalar perturbations (we will adopt the standard CMB convention in which is Bardeen's curvature perturbation in the matter era) is the 3-point function or bispectrum , which is a function of any triangle (as follows from statistical homogeneity which we assume throughout this paper). It has been recently shown that, in the squeezed limit , the bispectrum of *any* single-field slow-roll inflationary model asymptotes to the local shape (defined in (3)) [13–15]. The corresponding nonlinear parameter predicted by these models is
where is the tilt or spectral index of the power spectrum , which is accurately measured to be [8]. Therefore, any robust measurement of well above this level would thus rule out single-field slow-roll inflation.

##### 2.1. The Shape of the Primordial Bispectrum

Large, potentially detectable amount of Gaussianity can be produced when at least one of the assumptions listed above is violated, that is, by multiple scalar fields [16, 17], nonlinearities in the relation between the primordial scalar perturbations and the inflaton field [7, 12], interactions of scalar fields [18], a modified dispersion relation, or a departure from the adiabatic Bunch-Davies ground state [19]. Generation of a large non-Gaussian signal is also expected at reheating [20] and in the ekpyrotic scenario [21]. Each of these physical mechanisms leaves a distinct signature in the primordial 3-point function , a measurement of which would thus provide a wealth of information about the physics of primordial fluctuations. Although the configuration shape of the primordial bispectrum can be extremely complex in some models, there are broadly three classes of shape characterizing the local, equilateral, and folded type of primordial non-Gaussianity [22, 23]. The magnitude of each template “X” is controlled by a dimensionless nonlinear parameter which we seek to constrain using CMB or LSS observations (instead of attempting a model-independent measurement of ).

Any non-Gaussianity generated outside the horizon induces a 3-point function that is peaked on squeezed or collapsed triangles for realistic values of the scalar spectral index. The resulting non-Gaussianity depends only on the local value of Bardeen's curvature potential and can thus be conveniently parameterized up to third order by [7, 9, 12, 24] where is an isotropic Gaussian random field and , are dimensionless, phenomenological parameters. Since curvature perturbations are of magnitude , the cubic-order correction should always be negligibly small compared to the quadratic one when . However, this condition is not satisfied by some multifield inflationary models such as the curvaton scenario, in which a large and a small can be simultaneously produced [17]. The quadratic term generates the 3-point function at leading order where (cyc.) denotes all cyclic permutations of the indices and is the power spectrum of the Gaussian part of the Bardeen potential. The cubic-order terms generate a trispectrum at leading order.

Equilateral type of non-Gaussianity, which arises in inflationary models with higher-derivative operators such as the DBI model, is well described by the factorizable form [25] It can be easily checked that the signal is largest in the equilateral configurations and suppressed in the squeezed limit . Note that, in single-field slow-roll inflation, the 3-point function is a linear combination of the local and equilateral shapes [13].

As a third template, we consider the folded or flattened shape which is maximized for [26]: and approximate the non-Gaussianity due to modification of the initial Bunch-Davies vacuum in canonical single-field action (the actual 3-point function is not factorizable). As in the previous example, is suppressed in the squeezed configurations. Unlike , however, the folded shape induces a scale-dependent bias at large scales (see Section 4.3).

##### 2.2. Statistics of the Linear Mass Density Field

Bardeen's curvature potential is related to the linear density perturbation at redshift through
where
Here, is the matter transfer function normalized to unity as , is the present-day matter density, and is the linear growth rate normalized to . -point correlators of the linear mass density field can thus be written as
Smoothing inevitably arises when comparing observations of the large-scale structure with theoretical predictions from, for example, perturbation theory (PT), which are valid only in the weakly nonlinear regime [27], or from the spherical collapse model which ignores the strongly nonlinear internal dynamics of the collapsing regions [28, 29]. For this reason, we introduce the *smoothed* linear density field
where is a (spherically symmetric) window function of characteristic radius or mass scale that smoothes out the small-scale nonlinear fluctuations. We will assume a top-hat filter throughout. Furthermore, since and are equivalent variables, we shall indistinctly use the notations and in what follows.

##### 2.3. Topological Defects Models

In addition to inflationary scenarios, there is a whole class of models, known as topological defect models, in which cosmological fluctuations are sourced by an inhomogeneously distributed component which contributes a small fraction of the total energy momentum tensor [30, 31]. The density field is obtained as the convolution of a discrete set of points with a specific density profile. Defects are deeply rooted in particle physics as they are expected to form at a phase transition. Since the early Universe may have plausibly undergone several phase transitions, it is rather unlikely that no defects at all were formed. Furthermore, high-redshift tracers of the LSS may be superior to CMB at finding non-Gaussianity sourced by topological defects [32]. However, CMB observations already provide stringent limits on the energy density of a defect component [8], so we shall only minimally discuss the imprint of these scenarios in the large-scale structure. Phenomenological defect models are, for instance, in which the initial matter density (rather than the curvature perturbation ) contains a term proportional to the square of a Gaussian scalar field [9], or the model (also known as isocurvature CDM model) in which [33].

#### 3. Evolution of the Matter Density Field with Primordial NG

In this section, we summarize a number of results relative to the effect of primordial NG on the mass density field. These will be useful to understand the complexity that arises when considering biased tracers of the density field (see Section 4).

##### 3.1. Setting Up Non-Gaussian Initial Conditions

Investigating the impact of non-Gaussian initial conditions (ICs) on the large-scale structure traced by galaxies and so forth requires simulations large enough so that many long-wavelength modes are sampled. At the same time, the simulations should resolve dark-matter halos hosting luminous red galaxies (LRGs) or quasars (QSOs), so that one can construct halo samples whose statistical properties mimic as closely as possible those of the real data. This favors the utilization of pure N-body simulations for which a large dynamical range can be achieved.

The evolution of the matter density field with primordial non-Gaussianity has been studied in series of large cosmological N-body simulations seeded with Gaussian and non-Gaussian initial conditions; see, for example, [11, 34–43]. For generic non-Gaussian (scalar) random fields, we face the problem of setting up numerical simulations with a prescribed correlation structure [44]. While an implementation of the equilateral and folded bispectrum shape requires the calculation of several computationally demanding convolutions, the operation is straightforward for primordial NG described by a local mapping such as the or the model. In the latter case, the local transformation is performed before multiplication by the matter transfer function (computed with publicly available Boltzmann codes [45, 46]). The (dimensionless) power spectrum of the Gaussian part of the Bardeen potential is the usual power-law . Unless otherwise stated, we shall assume a normalization at the pivot point . To date, essentially all numerical studies of structure formation with inflationary non-Gaussianity have implemented the local shape solely, so we will focus on this model in what follows.

Non-Gaussian corrections to the primordial curvature perturbations can renormalize the input (unrenormalized) power spectrum of fluctuations used to seed the simulations [47]. For the local model with , renormalization effects are unlikely to be noticeable due to the limited dynamical range of current cosmological simulations. However, they can be significant, for example, in simulations of a local cubic coupling with a large primordial trispectrum [48]. The cubic-order term renormalizes the amplitude of the power spectrum of initial curvature perturbations to , where For scale-invariant initial conditions, has a logarithmic divergence at large and small scales. In practice, however, the finite box size and the resolution of the simulations naturally furnish a low- and high- cutoff. The effective correction to the amplitude of density fluctuations in the model thus is where is the number of mesh points along one dimension and is the simulation box length. For , , and , for instance, we obtain .

To generate the initial particle distribution, the Zeldovich approximation is commonly used instead of the exact gravitational dynamics. This effectively corresponds to starting from non-Gaussian initial conditions [49]. Since the transition to the true gravitational dynamics proceeds rather gradually [50], one should ensure that the initial expansion factor is much smaller than that of the outputs analyzed. Alternatively, it is possible to generate more accurate ICs based on second-order Lagrangian perturbation theory (2LPT) [51]. At fixed initial expansion factor, they reduce transients such that the true dynamics is recovered more rapidly [52].

##### 3.2. Mass Density Probability Distribution

In the absence of primordial NG, the probability distribution function (PDF) of the initial smoothed density field (the probability that a randomly placed cell of volume has some specific density) is Gaussian. Namely, all normalized or reduced *smoothed* cumulants of order are zero. An obvious signature of primordial NG would thus be an initially nonvanishing skewness or kurtosis [36, 53, 54]. Here, the subscript denotes the connected piece of the -point moment that cannot be simplified into a sum over products of lower-order moments. At third order, for instance, the cumulant of the smoothed density field is an integral of the 3-point function:
where
is the bispectrum of the smoothed linear density fluctuations at redshift . Note that, owing to , the product does not depend on redshift. Over the range of scale accessible to LSS observations, is a monotonic decreasing function of that is of magnitude 10^{-4} for the local, equilateral, and folded templates discussed above (Figure 1). Strictly speaking, all reduced moments should be specified to fully characterize the density PDF, but a reasonable description of the density distribution can be achieved with moments up to the fourth order.

Numerical and analytic studies generally find that a density PDF initially skewed towards positive values produces more overdense regions while a negatively skewed distribution produces larger voids. Gravitational instabilities also generate a positive skewness in the density field, reflecting the fact that the evolved density distribution exhibits an extended tail towards large overdensities [49, 55–59]. This gravitationally induced signal eventually dominates the primordial contribution such that, at fixed normalization amplitude, non-Gaussian models deviate more strongly from the Gaussian paradigm at high redshift. The time evolution of the normalized cumulants can be worked out for generic Gaussian and non-Gaussian ICs using, for example, PT or the spherical collapse approximation. For Gaussian ICs, PT predicts the normalized cumulants to be time independent to the lowest nonvanishing order, with a skewness , whereas, for non-Gaussian ICs, the linear contribution to the cumulants decays as [60, 61].

The persistence of the primordial hierarchical amplitude sensitively depends upon the magnitude of , , relative to unity. For example, an initially large nonvanishing kurtosis could source skewness with a time-dependence and amplitude similar to those induced by nonlinear gravitational evolution [60]. Although there is an infinite class of non-Gaussian models, we can broadly divide them into weakly and strongly non-Gaussian. In weak NG models, the primeval signal in the normalized cumulants is rapidly obliterated by gravity-induced non-Gaussianity. This is the case of hierarchical scaling models where -point correlation functions satisfy with at large-scales. By contrast, strongly NG initial conditions dominate the evolution of the normalized cumulants. This occurs when the hierarchy of correlation functions obeys the dimensional scaling , which arises in the particular case of initial conditions [62] or in defect models such as texture [37, 63, 64]. These scaling laws have been successfully confronted with numerical investigations of the evolution of cumulants [37, 38]. We note that the scaling of the contribution induced by gravity is, however, different for the kurtosis [65], suggesting that the latter is a better probe of the nature of initial conditions.

Although gravitational clustering tends to erase the memory of initial conditions, numerical simulations of non-Gaussian initial conditions show that the occurrence of highly underdense and overdense regions is very sensitive to the presence of primordial NG. In fact, the imprint of primordial NG is best preserved in the negative tail of the PDF of the evolved (and smoothed) density field [40]. A satisfactory description of this effect can be obtained from an Edgeworth expansion of the initial smoothed overdensity field [66]. At high densities , the non-Gaussian modification approximately scales as whereas, at low densities , the deviation is steeper, . Taking into account the weak-scale dependence of further enhances this asymmetry.

##### 3.3. Power Spectrum and Bispectrum

Primordial-non-Gaussianity also imprints a signature in Fourier-space statistics of the matter density field. Positive values of tend to increase the small-scale matter power spectrum [10, 40, 67] and the large-scale matter bispectrum [10, 68]. In the weakly nonlinear regime where 1-loop PT applies, the Fourier mode of the density field for growing-mode initial conditions reads [56, 69] The kernel , where is the cosine of the angle between and , describes the nonlinear 2nd-order evolution of the density field. It is nearly independent of and and vanishes in the (squeezed) limit as a consequence of the causality of gravitational instability. At 1-loop PT, (15) generates the mass power spectrum Here, is the linear matter power spectrum at redshift , and are the standard one-loop contributions in the case of Gaussian ICs [69, 70], and is the correction due to primordial NG [67]. This terms scales as , so the effect of non-Gaussianity is largest at low redshift. Moreover, because vanishes in the squeezed limit, (17) is strongly suppressed at small wavenumbers, even in the local model for which is maximized in the same limit (i.e., ). For , the magnitude of this correction is at a percent level in the weakly nonlinear regime [41, 42, 71], in good agreement with the measurements (see Figure 2). Extensions of the renormalization group description of dark-matter clustering [72] to non-Gaussian initial density, and velocity perturbations can improve the agreement up to wavenumbers [73, 74].

It is also instructive to compare measurements of the matter bispectrum with perturbative predictions. To second order in PT, the matter bispectrum is the sum of a primordial contribution and two terms induced by gravitational instability [56, 75] (we will henceforth omit the explicit -dependence for brevity): where is the primordial trispectrum of the density field. A similar expression can also be derived for the matter trispectrum, which turns out to be less sensitive to gravitationally induced nonlinearities [76]. The reduced bispectrum is conveniently defined as For Gaussian initial conditions, is independent of time and, at tree-level PT, is constant and equal to for equilateral configurations [56]. For general triangles, moreover, it approximately retains this simple behavior, with a dependence on triangle shape through [10]. Figure 3 illustrates the effect of primordial NG of the local type on the shape dependence of for a particular set of triangle configurations. As can be seen, the inclusion of 1-loop corrections greatly improves the agreement with the numerical data [77]. An important feature that is not apparent in Figure 3 is the fact that the primordial part to the reduced-matter bispectrum scales as for approximately equilateral triangles (and under the assumption that is scale independent) [10]. This anomalous scaling considerably raises the ability of the matter bispectrum to constrain primordial NG of the local type. Unfortunately, neither the matter bispectrum nor the power spectrum is directly observable with the large-scale structure of the Universe. Temperature anisotropies in the redshifted 21 cm background from the prereionization epoch could in principle furnish a direct measurement of these quantities [78–80], but foreground contamination may severely hamper any detection. Weak lensing is another direct probe of the dark matter, although we can only observe it in projection along the line of sight [81].

As we will see shortly, however, this large-scale anomalous scaling is also present in the bispectrum and power spectrum of observable tracers of the large-scale structure such as galaxies. It is this unique signature that will make future all-sky LSS surveys competitive with CMB experiments.

##### 3.4. Velocities

Primordial non-Gaussianity also leaves a signature in the large-scale coherent bulk motions which, in the linear regime, are directly related to the linear density field [55]. The various non-Gaussian models considered by [82] tend to have lower-velocity dispersion and bulk flow than fiducial Gaussian model, regardless of the sign of the skewness. However, while the probability distribution of velocity components is sensitive to primordial NG of the local type, in defect models it can be very close to Gaussian, even when the density field is strongly non-Gaussian, as a consequence of the central limit theorem [83, 84]. In this regard, correlation of velocity differences could provide a better test of non-Gaussian initial conditions [85].

To measure peculiar velocities, one must subtract the Hubble flow from the observed redshift. This requires an estimate of the distance which is only available for nearby galaxies and clusters (although, e.g., the kinetic Sunyaev-Zel'dovich (kSZ) effect could be used to measure the bulk motions of distant galaxy clusters [86]). So far, measurements of the local galaxy density and velocity fields [87] as well as reconstruction of the initial density PDF from galaxy density and velocity data [88] are consistent with Gaussian initial conditions.

#### 4. LSS Probe of Primordial Non-Gaussianity

Discrete and continuous tracers of the large-scale structure, such as galaxies, the Ly forest, the 21 cm hydrogen line, and so forth, provide a distorted image of the matter density field. In CDM cosmologies, galaxies form inside overdense regions [89] and this introduces a bias between the mass and the galaxy distribution [90]. As a result, distinct samples of galaxies trace the matter distribution differently, the most luminous galaxies preferentially residing in the most massive DM halos. This biasing effect, which concerns most tracers of the large-scale structure, remains to be fully understood. Models of galaxy clustering assume, for instance, that the galaxy biasing relation only depends on the local mass density, but the actual mapping could be more complex [91, 92]. Because of biasing, tracers of the large-scale structure will be affected by primordial non-Gaussianity in a different way than the mass density field. In this section, we describe a number of methods exploiting the abundance and clustering properties of biased tracers to constrain the level of primordial NG. We focus on galaxy clustering as it provides the tightest limits on primordial NG (see Section 5).

##### 4.1. Halo Finding Algorithm

Locating groups of bound particles, or DM halos, in simulations is central to the methods described below. In practice, we aim at extracting halo catalogs with statistical properties similar to those of observed galaxies or quasars. This, however, proves to be quite difficult because the relation between observed galaxies and DM halos is somewhat uncertain. Furthermore, there is freedom at defining a halo mass.

An important ingredient is the choice of the halo identification algorithm. Halo finders can be broadly divided into two categories: friends-of-friends (FOF) finders [93] and spherical overdensity (SO) finders [94]. While the mass of an SO halo is defined by the radius at which the inner overdensity exceeds (typically a few hundred times the background density ), the mass of an FOF halo is given by the number of particles within a linking length from each other ( in unit of mean interparticle distance). These definitions are somewhat arbitrary and may suit specific purposes only. In what follows, we shall present mainly results for SO halos as their mass estimate is more closely connected to the predictions of the spherical collapse model, on which most of the analytic formulae presented in this section are based. The question of how the spherical overdensity masses can be mapped onto friends-of-friends masses remains a matter of debate (e.g., [95]). Clearly, however, since the peak height depends on the halo mass through the variance (see below), any systematic difference will be reflected in the value of associated to a specific halo sample. As we will see shortly, this affects the size of the fractional deviation from the Gaussian mass function.

Catalogs of mock galaxies with luminosities comparable to those of the targeted survey provide an additional layer of complication that can be used, among others, to assess the impact of observational errors on the non-Gaussian signal. However, most numerical studies of cosmic structure formation with primordial NG have not yet addressed this level of sophistication, so we will discuss results based on statistics of dark matter halos only.

##### 4.2. Abundances of Voids and Bound Objects

It has long been recognized that departure from Gaussianity can significantly affect the abundance of highly biased tracers of the mass density field, as their frequency sensitively depends upon the tails of the initial density PDF [96–98]. The (extended) Press-Schechter approach has been extensively applied to ascertain the effect of primordial NG on the high mass tail of the mass function.

###### 4.2.1. Press-Schechter Approach

The Press-Schechter theory [99] and its extentions based on excursion sets [100–102] predict that the number density of halos of mass at redshift is entirely specified by a multiplicity function , where the peak height or significance is the typical amplitude of fluctuations that produce those halos. Here and henceforth, denotes the variance of the initial density field smoothed on mass scale and linearly extrapolated to present epoch, whereas is the critical linear overdensity for (spherical) collapse at redshift . In the standard Press-Schechter approach, is related to the level excursion probability that the linear density contrast of a region of mass exceeds , where the last equality assumes Gaussian initial conditions. The factor of 2 is introduced to account for the contribution of low density regions embedded in overdensities at scale . In the extended Press-Schechter theory, evolves with and is the probability that a trajectory is absorbed by the constant barrier (as is appropriate in the spherical collapse approximation) on mass scale . In general, the exact form of depends on the barrier shape [103] and the filter shape [104]. Note also that , which ensures that all the mass is contained in halos.

Despite the fact that the Press-Schechter mass function overpredicts (underpredicts) the abundance of low (high) mass objects, it can be used to estimate the fractional deviation from Gaussianity. In this formalism, the non-Gaussian fractional correction to the multiplicity function is , which is readily computed once the non-Gaussian density PDF is known. In the simple extensions proposed by [105, 106], is expressed as the inverse transform of a cumulant-generating function. In [106], the saddle-point technique is applied directly to . The resulting Edgeworth expansion is then used to obtain . Neglecting cumulants beyond the skewness, one obtain (we momentarily drop the subscript for convenience) after integration over regions above . In [105], it is the level excursion probability that is calculated within the saddle-point approximation. This approximation better asymptotes to the exact large mass tail, which exponentially deviates from the Gaussian tail. To enforce the normalization of the resulting mass function, one may define with , and use [105, 107] The fractional deviation from the Gaussian mass function then becomes Both formulae have been shown to give reasonable agreement with numerical simulations of non-Gaussian cosmologies [41, 108, 109] (but note that [11, 110] have reached somewhat different conclusions). Expanding at the first order in shows that these two theoretical expectations differ in the coefficient of the term. Therefore, it is interesting to consider also the approximation which is designed to match better the Edgeworth expansion of [106] when the peak height is [48]. When the primordial trispectrum is large (i.e., when ), terms involving the kurtosis must be included [48, 105, 106, 111]. In this case, it is also important to take into account a possible renormalization of the fluctuation amplitude, (12), to which the high mass tail of the mass function is exponentially sensitive [48].

Figure 4 shows the effect of primordial NG of the local type on the mass function of SO halos identified with a redshift-dependent overdensity threshold (motivated by spherical collapse in a CDM cosmology [112]). Overall, the approximation (25) agrees better with the measurements than (24), which somewhat overestimates the data for , and than (22), which is not always positive definite for . However, as can be seen in Figure 5, while the agreement with the data is reasonable for the SO halos, the theory strongly overestimates the effect in the mass function of FOF halos. Reference [109], who use a FOF algorithm with , makes the replacement with to match the Gaussian and non-Gaussian mass functions. A physical motivation of this replacement is provided by [113, 114], who demonstrate that the diffusive nature of the collapse barrier introduces a similar factor , regardless of the initial conditions. However, the value of the diffusion constant was actually measured from simulations that use a SO finder with constant [115]. In the excursion set approach of [116], the value of is obtained by normalizing the Gaussian mass function to simulation (i.e., it has nothing to do with the collapse dynamics) and, therefore, depends on the halo finder. Figure 5 demonstrates that this is also the case for the non-Gaussian correction : choosing as advocated in [109] gives good agreement for FOF halos, but strongly underestimates the effect for SO halos, for which the best-fit is close to unity. As we will see below, the strength of the non-Gaussian bias may also be sensitive to the choice of halo finder.

More sophisticated formulations based on extended Press-Schechter (EPS) theory and/or modifications of the collapse criterion look promising since they can reasonably reproduce both the Gaussian halo counts and the dependence on [114, 117, 118]. The probability of first upcrossing can, in principle, be derived for any non-Gaussian density field and any choice of smoothing filter [119, 120]. For a general filter, the non-Markovian dynamics generates additional terms in the non-Gaussian correction to the mass function that arise from 3-point correlators of the smoothed density at different mass scales [114]. However, large error bars still make difficult to test for the presence of such subleading terms. For generic moving barriers such as those appearing in models of triaxial collapse [121, 122], the leading contribution to the non-Gaussian correction approximately is [117] where and the last equality assumes . For SO halos, (26) with does not fit to the measured correction better than (25). However, the ellipsoidal collapse barrier with matches better the Gaussian mass function for moderate peak height [118].

Parameterizations of the fractional correction based on N-body simulations have also been considered. While [43] considers a fourth-order polynomial fit to account for values of as large as 750, [11] exploits the fact that, for sufficiently small , there is a one-to-one mapping between halos in Gaussian and non-Gaussian cosmologies. In both cases, the fitting functions are consistent with the simulations at the few percent level.

###### 4.2.2. Clusters Abundance

Rich clusters of galaxies trace the rare, high-density peaks in the initial conditions and thus offer the best probe of the high-mass tail of the multiplicity function. To infer the cluster mass function, the X-ray and millimeter windows are better suited than the optical-wave range because selection effects can be understood better (see, however, [123]).

Following early theoretical [96, 97, 124–126] and numerical [35, 127–129] work on the effect of non-Gaussian initial conditions on the multiplicity function of cosmic structures, the abundance of clusters and X-ray counts in non-Gaussian cosmologies has received much attention in the literature. At fixed normalization of the observed abundance of local clusters, the protoclusters associated with high redshift () Ly emitters are much more likely to develop in strongly non-Gaussian models than in the Gaussian paradigm [39, 110, 130]. Considering the redshift evolution of cluster abundances thus can break the degeneracy between the initial density PDF and the background cosmology. Simple extensions of the Press-Schechter formalism similar to those considered above have been shown to capture reasonably well the cluster mass function over a wide range of redshift for various non-Gaussian scenarios [131]. Scaling relations between the cluster mass, X-ray temperature and Compton -parameter calibrated using theory, observations and detailed simulations of cluster formation [132, 133], can be exploited to predict the observed distribution functions of X-ray and SZ signals and assess the capability of cluster surveys to test the nature of the initial conditions [134–140].

An important limitation of this method is that the impact of realistic models of primordial non-Gaussianity on cluster abundances is small compared to systematics in current and upcoming surveys [141–143]. Given the current uncertainties in the redshift evolution of clusters (one commonly assumes that clusters are observed at the epoch they collapse [142]), the selection effects in the calibration of X-ray and SZ fluxes with halo mass, the freedom in the definition of the halo mass, the degeneracy with the normalization amplitude (for positive , the mass function is more enhanced at the high mass end, and this is similar to an increase in the amplitude of fluctuations [144]) and the low number statistics, the prospects of using the cluster mass function only to place competitive limits on with the current data are small. A two-fold improvement in cluster mass calibration is required to provide constraints comparable to CMB measurements [143].

###### 4.2.3. Voids Abundance

The frequency of cosmic voids offers a probe of the low density tail of the initial PDF [145]. The Press-Schechter formalism can also be applied to ascertain the sensitivity of the voids abundance to non-Gaussian initial conditions. Voids are defined as regions of mass whose density is less than some critical value or, alternatively, whose three eigenvalues of the tidal tensor [146] lie below some critical value [66, 118, 145, 147]. An important aspect in the calculation of the mass function of voids is the overcounting of voids located inside collapsing regions. This voids-in-clouds problem, as identified by [148]), can be solved within the excursion set theory by studying a two barriers problem: for halos and for voids. Including this effect reduces the frequency of the smallest voids [118]. Neglecting this complication notwithstanding, the differential number density of voids of radius is [145, 147] where . While a positive produces more massive halos, it generates fewer large voids [118, 145]. Hence, the effect is qualitatively different from a simple rescaling of the normalization amplitude . A joint analysis of both abundances of clusters and cosmic voids might thus provide interesting constraints on the shape of the primordial 3-point function. There are, however, several caveats to this method, including the fact that there is no unique way to define voids [145]. Clearly, voids identification algorithms will have to be tested on numerical simulations [149] before a robust method can be applied to real data.

##### 4.3. Galaxy 2-Point Correlation

Before [90] showed that, in Gaussian cosmologies, correlations of galaxies and clusters can be amplified relative to the mass distribution, it was argued that primeval fluctuations have a non-Gaussian spectrum [150, 151] to explain the observed strong correlation of Abell clusters [152, 153]. Along these lines, [154] pointed out that primordial non-Gaussianity could significantly increase the amplitude of the two-point correlation of galaxies and clusters on large-scales. However, except from [155] who showed that correlations of high density peaks in non-Gaussian models are significantly stronger than in the Gaussian model with identical mass power spectrum, subsequent work focused mostly on abundances (Section 4.2) or higher order statistics such as the bispectrum (Section 4.4). It is only recently that [11] have demonstrated the strong scale-dependent bias arising in non-Gaussian models of the local type.

###### 4.3.1. The Non-Gaussian Bias

In the original derivation of [11], the Laplacian is applied to the local mapping in order to show that, upon substitution of the Poisson equation, the overdensity in the neighborhood of density peaks is spatially modulated by a factor proportional to the local value of . Taking into account the coherent motions induced by gravitational instabilities, the scale-dependent bias correction reads where is the linear, Eulerian bias of halos of mass . The original result missed out a multiplicative factor of which was introduced subsequently by [157] upon a derivation of (28) in the limit of high density peaks. The peak-background split approach [103, 158, 159] promoted by [156] shows that the scale-dependent bias applies to any tracer of the matter density field whose (Gaussian) multiplicity function depends on the local mass density only. In this approach, the Gaussian piece of the potential is decomposed into short- and long-wavelength modes, . The short-wavelength piece of the density field is then given by the convolution where is the transfer function (7). Ignoring the white-noise term, this provides an intuitive explanation of the effect in terms of a local rescaling of the small-scale amplitude of matter fluctuations or, equivalently, a local rescaling of the critical density threshold, Assuming that the mass function depends only on the peak height , the long-wavelength part of the halo overdensity becomes [156] (see also [11, 71, 160]) Upon a Fourier transformation and using the fact that, in the Gaussian case, with the Lagrangian bias , we recover the non-Gaussian bias correction (28) provided that the tracers move coherently with the dark matter, that is, [161]. As emphasized in [11], the scale-dependence arises from the fact that the non-Gaussian curvature perturbation is related to the density through the Poisson equation (6) (so that ). There is no such effect in the (local) model, (10), nor in texture-seeded cosmologies [162] for instance.

The derivation of [157], based on the clustering of regions of the smoothed density field above threshold , is formally valid for high density peaks only. However, it is general enough to apply to any shape of primordial bispectrum. The 2-point correlation function of that level excursion set, which was first derived by [124], can be expressed in the high threshold limit () as where . For most non-Gaussian models in which the primordial 3-point function is the dominant correction, this expansion can be truncated at the third order and Fourier transformed to yield the non-Gaussian correction to the power spectrum. Assuming a small level of primordial NG, we can also write where , and eventually obtain The dependence on the shape of the 3-point function is encoded in the function [157, 163] where . Note that, for , this first order approximation always breaks down at sufficiently small because .

Figure 6 shows the non-Gaussian halo bias (33) induced by the local, equilateral and folded bispectrum [163]. In the local and folded non-Gaussianity, the deviation is negligible at , but increases rapidly with decreasing wavenumber. Still, the large-scale correction is much smaller for the folded template, and nearly absent for the equilateral type, which make them much more difficult to detect with galaxy surveys [163]. To get insights into the behavior of at large-scales, let us identify the dominant contribution to in the limit . Setting and expanding at second order in , we find after some algebra assuming no running scalar index, that is, . The auxiliary function is defined as Hence, we have . As can be seen in Figure 6, these approximations capture relatively well the large scale non-Gaussian bias correction induced by the equilateral and folded type of non-Gaussianity. For a nearly scale-invariant spectrum , the effect scales as and const., respectively. Another important feature of the equilateral and folded non-Gaussian bias is the dependence on the mass scale through the multiplicative factor . Indeed, choosing instead of as done in Figure 6 would suppress the effect by a factor of 3. In the high peak limit, which cancels out the dependence on redshift but enhances the sensitivity to the halo bias, that is, for the equilateral and folded shapes whereas in the local model.

At this point, it is appropriate to mention a few caveats to these calculations. Firstly, (28) assumes that the tracers form after a spherical collapse, which may be a good approximation for the massive halos only. If one instead considers the ellipsoidal collapse dynamics, in which the evolution of a perturbation depends upon the three eigenvalues of the initial tidal shear, should be replaced by its ellipsoidal counterparts which is always larger than the spherical value [121]. In this model, the scale-dependent bias is thus enhanced by a factor [11, 160]. Secondly, (28) assumes that the biasing of the surveyed objects is described by the peak height only or, equivalently, the hosting halo mass . However, this may not be true for quasars whose activity may be triggered by merger of halos [164, 165]. Reference [156] used the EPS formalism to estimate the bias correction induced by mergers so the factor should be replaced by . The validity of this result should be evaluated with cosmological simulations of quasars formation. In this respect, semianalytic models of galaxy formation suggest that merger-triggered objects such as quasars do not cluster much differently than other tracers of the same mass [166]. However, this does not mean that the same should hold for the non-Gaussian scale-dependent bias. Still, since the recent merger model is an extreme case it seems likely that the actual bias correction is . Thirdly, the scale-dependent bias has been derived using the Newtonian approximation to the Poisson equation, so one may wonder whether general relativistic (GR) corrections to may suppress the effect on scales comparable to the Hubble radius. Reference [167] showed how large-scale primordial NG induced by GR corrections propagates onto small scales once cosmological perturbations reenter the Hubble radius in the matter dominated era. This effect generates a scale-dependent bias comparable, albeit of opposite sign to that induced by local NG [163]. More recently, [168] argues that there are no GR corrections to the non-Gaussian bias and that the scaling applies down to smallish wavenumbers.

We can also ask ourselves whether higher-order terms in the series expansion (32) furnish corrections to the non-Gaussian bias similar to (28). The quadratic coupling induces a second order correction to the halo power spectrum which reads [48] Its magnitude relative to the term linear in , (28), is approximately at redshift and for a halo mass . Although its contribution becomes increasingly important at higher redshift, it is fairly small for realistic values of . In local NG model, the power spectrum of biased tracers of the density field can also be obtained from a local Taylor series in the evolved (Eulerian) density contrast and the Gaussian part of the initial (Lagrangian) curvature perturbation [47, 71]. Using this approach, it can be shown that the halo power spectrum arising from the first order terms of the local bias expansion can be cast into the form [47] Hence, we also obtain a second order term proportional to which, however, contributes only at very small wavenumber . All this suggests that (28) is the dominant contribution to the non-Gaussian bias in the wavenumber range .

Finally, a non-Gaussian, scale-dependent bias correction can also arise in the local, deterministic bias ansatz [169] if the initial density field is non-Gaussian. Here, is the th-order bias parameters (here again, the first-order bias is ). In this approach, the correction is induced by the correlation between the linear and quadratic term in the galaxy biasing relation (which is in fact a collapsed or squeezed 3-point function) and thus reads [67, 68] Even though in the high-threshold limit , behaves very differently than for moderate peak height because is proportional to the second derivative of the mass function . So far however, (28) appears to describe reasonably well the numerical results for a wide range of halo bias.

###### 4.3.2. Comparison with Simulations

In order to fully exploit the potential of forthcoming large-scale surveys, a number of studies have tested the theoretical prediction against the outcome of large numerical simulations [11, 41–43, 71, 109].

At the lowest order, there are two additional albeit relatively smaller corrections to the Gaussian bias which arise from the dependence of both the halo number density and the matter power spectrum on primordial NG [41]. Firstly, assuming the peak-background split holds, the change in the mean number density of halos induces a scale-independent offset which we denote . In terms of the non-Gaussian fractional correction to the mass function, this contribution is It is worth noticing that has a sign opposite to that of , because the bias decreases when the mass function goes up. In practice, the approximation (25), which matches well the SO data for , can be used for moderate values of the peak height. For FOF halos with linking length , one should make the replacement with in the calculation of the scale-independent offset. It is sensible to evaluate at a mass scale equal to the average halo mass of the sample. Secondly, we also need to account for the change in the matter power spectrum (see Figure 2 in Section 3). Summarizing, local non-Gaussianity adds a correction to the bias of dark matter halos that reads [41] at first order in . As can be seen in Figure 7, the inclusion of these extra terms substantially improves the comparison between the theory and the simulations. Considering only the scale-dependent shift leads to an apparent suppression of the effect in simulations relative to the theory. Including the scale-independent offset considerably improves the agreement at wavenumbers . Finally, adding the scale-dependent term further adjusts the match at small scale by making the non-Gaussian bias shift less negative. Along these lines, [71] find that the inclusion of to the bias also improves the agreement with measurements of obtained for FOF halos, and show that taking into account second- and higher-order corrections could extend the validity of the theory up to scales .

The non-Gaussian bias correction can be measured in the cross- and autopower spectrum of dark matter halos, and . To compute these quantities, dark matter particles and halo centers are interpolated onto a regular cubical mesh. The resulting dark matter and halo fluctuation fields, and , are then Fourier transformed to yield the matter-matter, halo-matter and halo-halo power spectra , and , respectively. is then corrected for the shot noise, which is assumed to be if dark matter halos are a Poisson sampling of some continuous field. This discreteness correction is negligible for due to the large number of dark matter particles. On linear scales (), the halo bias approaches the constant value which needs to be measured accurately as it controls the strength of the scale-dependent bias correction . In this respect, the ratio is a better proxy for the halo bias since it is less sensitive to shot-noise.

Auto- and crosspower analyses may not agree with each other if the halos and dark matter do not trace each other on scale where the non-Gaussian bias is large, that is, if there is stochasticity. Figure 8 shows and averaged over 8 realizations of the models with [41]. The same Gaussian random seed field was used in each set of runs so as to minimize the sampling variance. Measurements of the non-Gaussian bias correction obtained with the halo-halo or the halo-matter power spectrum are in a good agreement with each other, indicating that non-Gaussianity does not induce stochasticity and the predicted scaling (28) applies equally well for the auto- and cross-power spectrum. However, while a number of numerical studies of the model have confirmed the scaling and the redshift dependence [11, 41, 43, 109], the exact amplitude of the non-Gaussian bias correction remains somewhat debatable. Reference [41] who use SO halos and [71] who use FOF halos find satisfactory agreement with the theory once the scale-independent offset is included. By contrast, see [43], who use the same FOF halos as [71], argue that the scale-dependent piece requires, among others, a multiplicative correction of the form , with . Similarly, [109] who also use FOF halos find that the theory is a good fit to the simulations only upon replacing by in (33), with . Part of the discrepancy may be probably due to the fact that the last two references do not include , which leads to an apparent suppression of the effect (see Figure 7). Another possible source of discrepancy may be choice of the halo finder which, as seen in Figure 5, has an impact on the strength of the non-Gaussian correction to the mass function. This possibility is investigated in Figure 9, which shows the non-Gaussian bias correction obtained with FOF halos. For this low biased sample, the scale-independent correction is and can thus be neglected. The best-fit values of are significantly below the input values of , in agreement with the findings of [43, 109] (note, however, that this suppression is more consistent with being rescaled by and being unchanged). This indicates that the choice of halo finder may also affect the magnitude of the scale-dependent non-Gaussian bias. Discrepancies have also been observed between the theoretical and measured non-Gaussian bias corrections in non-Gaussian models of the local cubic-order coupling [48]. Understanding all these results clearly requires a better modeling of halo clustering.

###### 4.3.3. Redshift-Space Distortions

Peculiar velocities generate systematic differences between the spatial distribution of data in real and redshift space. These redshift-space distortions must be properly taken into account in order to extract from redshift surveys. On the linear scales of interest, the redshift-space power spectrum of biased tracers reads as [171, 172] where and are the density-velocity and velocity-divergence power spectra, is the cosine of the angle between the wavemode and the line of sight and is the logarithmic derivative of the growth factor. For , the 1-loop correction due to primordial NG is identical to (17) provided is replaced by the kernel describing the 2nd order evolution of the velocity divergence [62]. For , this correction is Again, causality implies that vanishes in the limit . For unbiased tracers with , the linear Kaiser relation is thus recovered at large scales (this is consistent with the analysis of [173]). For biased tracers, we still expect the Kaiser formula to be valid, but the distortion parameter should now be equal to , where is the scale-dependent bias induced by the primordial non-Gaussianity.

###### 4.3.4. Mitigating Cosmic Variance and Shot-Noise

Because of the finite number of large scale wavemodes accessible to a survey, any large-scale measurement of the power spectrum is limited by the cosmic (or sampling) variance caused by the random nature of the wavemodes. For discrete tracers such as galaxies, the shot noise is another source of error. Restricting ourselves to weak primordial NG, the relative error on the power spectrum is , where is the number of independent modes measured and is the shot-noise [174]. Under the standard assumption of Poisson sampling, equals the inverse of the number density and causes a scale-independent enhancement of the power spectrum. The extent to which one can improve the observational limits on the nonlinear parameters will strongly depend on our ability to minimize the impact of these two sources of errors. By comparing differently biased tracers of the same surveyed volume [175, 176] and suitably weighting galaxies (by the mass of their host halo for instance) [177, 178], it should be possible to circumvent these problems and considerably improve the detection level.

Figure 9 illustrates how the impact of sampling variance on the measurement of can be mitigated. Namely, the data points show the result of taking the ratio for each set of runs with same Gaussian random seed field before averaging over the realizations. This procedure is equivalent to the multitracers method advocated by [175]. Here, can be thought as mimicking the power spectrum of a nearly unbiased tracer of the mass density field with high number density. Although, in practical applications, using the dark matter field works better [170], in real data should be replaced by a tracer of the same surveyed volume different than the one used to compute . Figure 9 also shows that upon taking out most of the cosmic variance, there is some residual noise caused by the discrete nature of the dark matter halos. As shown recently [178] however, weighting the halos according to their mass can dramatically reduce the shot noise relative to the Poisson expectation, at least when compared against the dark matter. Applying such a weighting may thus significantly improve the error on the nonlinear parameter , but this should be explored in realistic simulations of galaxies, especially because the halo mass may not be easily measurable from real data [170]. This approach undoubtedly deserves further attention as it has the potential to substantially improve the extraction of the primordial non-Gaussian signal from galaxy surveys.

To conclude this section, it is worth noting that, while the PDF of power values has little discriminatory power (for large surveyed volume, it converges towards the Rayleigh distribution as a consequence of the central limit theorem) [179], the covariance of power spectrum measurements (which is sensitive to the selection function, but also to correlations among the phase of the Fourier modes) may provide quantitative limits on certain type of non-Gaussian models [174, 180].

##### 4.4. Galaxy Bispectrum and Higher-Order Statistics

Higher statistics of biased tracers, such as the galaxy bispectrum, are of great interest as they are much more sensitive to the shape of the primordial 3-point function than the power spectrum [10, 42, 68, 181, 182]. Therefore, they could break some of the degeneracies affecting the non-Gaussian halo bias (e. g., the leading order scale-dependent correction to the Gaussian bias induced by the local quadratic and cubic coupling are fully degenerated [48]).

###### 4.4.1. Normalized Cumulants of the Galaxy Distribution

The skewness of the galaxy count probability distribution function could provide constraints on the amount of non-Gaussianity in the initial conditions. As discussed in Section 3, however, it is difficult to disentangle the primordial and gravitational causes of skewness in low redshift data unless the initial density field is strongly non-Gaussian. The first analyzes of galaxy catalogs in terms of count-in-cells densities all reached the conclusion that the skewness (and higher-order moments) of the observed galaxy count PDF is consistent with the value predicted by gravitational instability of initially Gaussian fluctuations [50, 57, 60, 183–185]. Back then however, most of the galaxy samples available were not large enough to accurately determine the at large-scales [186]. Despite the two orders of magnitude increase in surveyed volume, these measurements are still sensitive to cosmic variance, that is, to the presence of massive super-clusters or large voids. Nevertheless, the best estimates of the first normalized cumulants of the galaxy PDF strongly suggest that high order galaxy correlation functions indeed follow the hierarchical scaling predicted by the gravitational clustering of Gaussian ICs [187]. There is no evidence for strong non-Gaussianity in the initial density field as might by seeded by cosmic strings or textures [188].

The genus statistics of constant density surfaces through the galaxy distribution measures the relative abundance of low and high density regions as a function of the smoothing scale and, therefore, could also be used as a diagnostic tool for primordial non-Gaussianity. For a Gaussian random field, the genus curve (i.e., the genus number as a function of the density contrast) is symmetric about regardless the value of . Primordial NG and nonlinear gravitational evolution can disrupt this symmetry [189]. The effect of non-Gaussian ICs on the topology of the galaxy distribution has been explored in a number of papers [35, 190–193]. For large values of and realistic amount of primordial NG, the genus statistics can also be expanded in a series whose coefficients are the normalized cumulants of the smoothed galaxy density field. In other words, the genus statistics essentially provides another measure of the (large-scale) cumulants. So far, measurements from galaxy data are broadly consistent with Gaussian initial conditions [194, 195].

###### 4.4.2. Galaxy Bispectrum

Most of the scale-dependence of the primordial -point functions is integrated out in the normalized cumulants, which makes them weakly sensitive to primordial NG. However, while the effect of non-Gaussian initial conditions, galaxy bias, gravitational instabilities and so forth, are strongly degenerated in the , they imprint distinct signatures in the galaxy bispectrum , an accurate measurement of which could thus constrain the shape of the primordial 3-point function.

In the original derivation of [181], the large-scale (unfiltered) galaxy bispectrum in the model is given by Again, and are the first- and second-order bias parameters that describe the galaxy biasing relation assumed local and deterministic [169]. The first term in the right-hand side is the primordial contribution which, for equilateral configurations and in the model, scales as like in the matter bispectrum, (18). The two last terms are the contribution from nonlinear bias and the tree-level correction from gravitational instabilities, respectively. They have the smallest signal in squeezed configurations.

As recognized by [68, 182], (45) misses an important term that may significantly enhance the sensitivity of the galaxy bispectrum to non-Gaussian initial conditions. This contribution is sourced by the trispectrum of the smoothed mass density field At large-scale, this simplifies to the sum of the linearly evolved primordial trispectrum and a coupling between the primordial bispectrum (linear in ) and the second order PT corrections (through the kernel ). In the case of local non-Gaussianity and for equilateral configurations, the first piece proportional to scales as times the Gaussian tree-level prediction, with the same redshift dependence. Hence, it is similar to the second order correction that appears in the halo power spectrum (see (39)). The second piece linear in generates a signal at large-scales for essentially all triangle shapes in the local model as well as in the case of equilateral NG. This second contribution is maximized in the squeezed limit (where it is one order of magnitude larger than the result obtained by [181]) which helps disentangling it from the Gaussian terms. Note that a strong dependence on triangle shape is also present in other NG scenarios such as the model [62].

This newly derived contributions are claimed to lead to more than one order of magnitude improvement in certain limits [182], but it is not yet clear whether these gains can be fully realized with upcoming galaxy surveys. To accurately predict the constraints that could be achieved with future measurements of the galaxy bispectrum, a comparison of these predictions with the halo bispectrum extracted from numerical simulations is highly desirable. To date, the only numerical study [42] has measured the halo bispectrum for some isosceles triangles (). While the shape dependence is in reasonable agreement with the theory, the observed -dependence appears to depart from the predicted scaling.

##### 4.5. Intergalactic Medium and the Ly Forest

Primordial non-Gaussianity also affects the intergalactic medium (IGM) as a positive enhances the formation of high-mass halos at early times and, therefore, accelerates reionization [196–198]. At lower redshift, small box hydrodynamical simulations of the Ly forest indicate that non-Gaussian initial conditions could leave a detectable signature in the Ly flux PDF, power spectrum and bispectrum [199]. However, while differences appear quite pronounced in the high transmissivity tail of the flux PDF (i.e., in underdense regions), the Ly 1D flux power spectrum seems little affected. Given the small box size of these hydrodynamical simulations, it is worth exploring the effect in large N-body cosmological simulations using a semianalytic modeling of the Ly forest [200], even though such an approach only provides a very crude approximation to the temperature-density diagram of the IGM in hydrodynamical simulations. Figure 10 shows the imprint of local type NG on the Ly 3D flux power spectrum (which is not affected by projection effects) extracted at from a series of large simulations. The Ly transmitted flux is calculated in the Gunn-Peterson approximation [201]. A clear signature similar to the non-Gaussian halo bias can be seen. As expected, it is of opposite sign since the Ly forest is anti-biased relative to the mass density field (overdensities are mapped onto relatively low flux transmission).

To estimate the strength of the signal (see [200] for the details), one can assume that the (real space) optical depth to Ly absorption at comoving position is approximately [202] where is the gas density, is the optical depth at mean gas density and is some parameter that depends on the exact thermal history of the low density IGM. The above relation holds for the moderate overdensities that are responsible for most of the Ly absorption features. To relate the gas density to the smoothed linear density field, we could make the simple ansatz [203]. In this linear approximation, however, the large-scale bias of the Ly flux density field is much larger than that measured in detailed numerical simulations (e.g., at [204]). Therefore, one may want to consider the lognormal mapping [205, 206] to better capture nonlinearities in the gas density field. Expanding at second order and noticing that in the presence of weak non-Gaussianity, the joint PDF can generically be expanded into an Edgeworth series where the primordial 3-point function is the dominant correction, it is straightforward to compute the Ly 3D flux power spectrum for nonzero . Upon a Fourier transformation, we arrive at where is some auxiliary function of . This result is valid for any model of primordial NG characterized by an initial bispectrum. In the