A decade ago cosmological simulations of increasingly higher resolution were used to demonstrate that virialized regions of Cold Dark Matter (CDM) halos are filled with a multitude of dense, gravitationally bound clumps. These dark matter subhalos are central regions of halos that survived strong gravitational tidal forces and dynamical friction during the hierarchical sequence of merging and accretion via which the CDM halos form. Comparisons with observations revealed that there is a glaring discrepancy between abundance of subhalos and luminous satellites of the Milky Way and Andromeda as a function of their circular velocity or bound mass within a fixed aperture. This large discrepancy, which became known as the “substructure” or the “missing satellites” problem, begs for an explanation. In this paper, the author reviews the progress made during the last several years both in quantifying the problem and in exploring possible scenarios in which it could be accommodated and explained in the context of galaxy formation in the framework of the CDM paradigm of structure formation. In particular, he shows that the observed luminosity function, radial distribution, and the remarkable similarity of the inner density profiles of luminous satellites can be understood within hierarchical CDM framework using a simple model in which efficiency of star formation monotonically decreases with decreasing virial mass satellites had before their accretion without any actual sharp galaxy formation threshold.

1. Introduction

In the hierarchical scenario of galaxy formation [1], theoretically rooted in the Cold Dark Matter (CDM) structure formation model [2], galaxies form via cooling and condensation of gas in dark matter halos, which grow via an hierarchical sequence of mergers and accretion. The density perturbations in these models have amplitude that increases with decreasing scale down to 1 comoving parsec or below [3], with the smallest fluctuation scale defined by the specific properties of the particles assumed to constitute the majority of the CDM. Smaller perturbations thus collapse first and then grow and merge to form larger and larger objects, with details of the evolution determined by expansion history of the universe (i.e., by parameters describing the background cosmological model) and by the shape of the density fluctuation power spectrum [4].

An example of such evolution in the flat CDM model is illustrated in Figure 1, which shows collapse of a object. The figure shows that during the early stages of evolution the matter that is incorporated into the final halo collapses into a large number of relatively small clumps with a filamentary, web-like spatial distribution. Further evolution, mediated by the competition between gravity and expansion of space, is a sequence of accretion and mergers that builds objects of progressively larger mass until the single system is formed during the last several billion years of evolution. The figure also shows that cores of some of the small clumps that merge with and are incorporated into larger objects survive until later epochs and are present in the form of halo substructure or subhalos: small dense clumps within virialized regions of larger halos. (Survival of subhalos is not a trivial result and is due to a relatively compact distribution of mass in CDM halos. Ensuring their survival requires a rather large dynamic range in spatial and mass resolution, which had not been achieved until the late 1990s [59].)

The CDM model of structure formation is remarkably successful in explaining a wide range of observations from temperature fluctuations of the cosmic microwave background [10] to galaxy clustering and its evolution [11] both qualitatively and, in many cases, quantitatively. Nevertheless, many key details of the model are still being developed [12, 13] and its testing is by no means complete.

One area of active investigation is testing predictions of the CDM models at scales from a few kpc to tens of pc (i.e., the smallest scales probed by observations of galaxies). In particular, there is still tension between predictions of the central mass distribution in galaxies [14, 15] and sizes and angular momenta of galactic disks and observational results [16, 17]. Notably, this tension has not gone away during the past 10–15 years, even though both theoretical models and observations have improved dramatically.

Another example of tension between CDM predictions and observations that has been actively explored during the last decade is the fact that satellite systems around galaxies of different luminosity are qualitatively different, even though their dark matter halos are expected to be approximately scaled down versions of each other [18], with their total mass as the scaling parameter. Faint dwarf galaxies usually have no luminous satellites at all, Milky Way and Andromeda have a few dozen, but clusters of galaxies often have thousands of satellites around the brightest cluster galaxy.

The number of gravitationally bound satellite subhalos at a fixed mass relative to the mass of their host CDM halo, on the other hand, is expected to be approximately the same [8, 19, 20]. This is illustrated in Figure 2, which shows distribution of dark matter out to approximately two virial radii around the centers of two CDM halos of masses different by two orders of magnitude. It is clear that it is not easy to tell the mass of the halo by simply examining the overall mass distribution or by counting the number of subhalos. This is a visual manifestation of approximate (but not exact, see, e.g., [21, 22]) self-similarity of CDM halos of different mass. If we would compare similar images of distribution of luminous matter around galaxies and clusters, the difference would be striking.

The manifestly different observed satellite populations around galaxies of different luminosities and expected approximately self-similar populations of satellite subhalos around halos of different mass is known as the substructure problem [8, 23, 24]. In the case of the best studied satellite systems of the Milky Way and Andromeda galaxies, the discrepancy between the predicted abundance of small-mass dark matter clumps and the number of observed luminous satellites as a function of circular velocity (see Section 2) has been also referred to as the “missing satellites problem.’’ (The name derived from the title “Where are the missing galactic satellites?’’ of one of the papers originally pointing out the discrepancy [24].) The main goal of this paper is to review theoretical and observational progress in quantifying and understanding the problem over the last decade.

2. Quantifying the Substructure and Luminous Satellite Populations

In order to connect theoretical predictions and observations on a quantitative level, we need descriptive statistics to characterize population of theoretical dark matter subhalos and observed luminous satellites. Ideally, one would like theoretical models to be able to predict properties of stellar populations hosted by dark matter halos and subhalos and make comparisons using statistics involving directly observable quantities, such as galaxy luminosities. In practice, however, this is difficult as such predictions require modeling of still rather uncertain processes shaping properties of galaxies during their formation. In addition, the simulations can reach the highest resolution in the regime when complicated and computationally costly galaxy formation processes are not included and all of the matter in the universe is modeled as a uniform collisionless and dissipationless component (i.e., the component that cannot dissipate the kinetic energy it acquires during gravitational collapse and accompanying gravitational interaction and relaxation processes). Such simulations thus give the most accurate knowledge of the dark matter subhalo populations, but can only predict dynamical subhalo properties such as the depth of their potential well or the total mass of gravitationally bound material. Therefore, in comparisons between theoretical predictions and observations so far, the most common strategy was to find a compromise quantity that can be estimated both in dissipationless simulations and in observations.

2.1. Quantifying the Subhalo Populations

Starting with the first studies that made such comparisons using results of numerical simulations [8, 24] the quantity of choice was the maximum circular velocity, defined as where is the spherically averaged total mass profile about the center of the object. is a measure of the depth of the potential (the potential energy of a self-gravitating system is ) and can be fairly easily computed in a cosmological simulation once the center of a subhalo is determined. (The detailed description of the procedure of identifying the centers of subhalos is beyond the scope of this paper, but is nevertheless pertinent. While many different algorithms are used in the literature [6, 9, 2529], all algorithms boil down to the automated search for density peaks (most often in configuration space, but sometimes in the phase space) field smoothed at a scale comparable to or smaller than the size of the smallest subhalos in the simulations. Once the peaks are identified, the gravitationally bound material around them is usually found by iteratively removing the unbound particles.) The attractive feature of is that it is well defined and does not require estimate of a physical boundary of subhalos, which is often hard to determine. The price is that resolution required to get the correctly for a subhalo is higher, compared, for example, to the total bound mass of subhalo, because is probing the mass distribution in the inner regions of halos.

The total gravitationally bound mass of a subhalo, , is less sensitive to the resolution, but requires careful separation between real subhalo particles and unbound particles from the diffuse halo of the host dark matter halo. This can be quite difficult in the inner regions of the host system where density of the background diffuse halo is comparable to the internal density of subhalo or when two subhalos overlap substantially.

An alternative option is to define mass of a subhalo within a fixed physical radius. For suitably chosen radius value, the mass can be measured unambiguously both in simulations and in observations. We will discuss the measurement of the enclosed mass and comparisons between simulations and observations below in Sections 3 and 4.3 (see Figures 8 and 12).

Figure 3 shows the cumulative circular velocity and mass functions (CVF and CMF) of subhalos within the virial radius (defined as , where is the mean matter density in the universe and , corresponding to the virial overdensity suggested by the spherical collapse model in the CDM cosmology [30]) of a simulated Milky Way sized halo, formation of which was illustrated in Figure 1. Both cumulative functions can be approximated by power laws over the ranges of circular velocity and in units of and virial mass of the host: and with the slopes of and , respectively. At large circular velocities deviations from the power law can be significant due to small numbers of subhalos.

The highest-resolution simulations (to date) of the individual MW-sized DM halos formed in the concordance CDM cosmology [3133] show that the power laws with the slopes in the range indicated above describe the CVF and CMF down to and . Note, however, that over a wider range of subhalo masses the power law can be expected to change slowly reflecting the changing slope of the rms fluctuations as a function of scale, which controls the abundance of halos as a function of mass [34, 35].

The amplitude of the mass and velocity functions is sensitive to the normalization of the power spectrum on small scales [31, 36, 37] and is thus sensitive to the cosmological parameters that control the normalization (such as tilt and normalization ).

For a given cosmology, the normalization of the CVF and CMF scales approximately linearly with the host halo mass [19]: . The halo-to-halo scatter in the normalization of CVF and MCF for a fixed host halo virial mass is described by the Poisson distribution [19]: . The fractional scatter is, therefore, quite small for small and (large ).

The mass and circular velocity functions within a given radius describe the overall abundance of subhalos of different mass, but not their radial distribution. The latter depends rather sensitively on how the subhalo samples are selected [38]. This is because subhalos at different distances from their host halo center on average experience different tidal mass loss, which affects different subhalo properties by different amount. Subhalo mass is the most affected quantity as large fraction of halo mass when it accretes is relatively loosely bound and is usually lost quickly. Although circular velocity is determined by the inner mass distribution in the inner mass of subhalos, it is still affected by tidal stripping (albeit to a less degree and slower than the total mass [39]).

The average mass loss experienced by subhalos increases with decreasing distance to the center of the host halo [38]. Therefore, selecting subhalos based on their current bound mass or circular velocity biases the sample against subhalos at smaller radii and results in the radial distribution much less concentrated than the overall mass distribution of the host halo [6, 26, 3842]. Conversely, one can expect that if the selection of subhalos is made using a quantity not affected by stripping, the bias should be smaller or even disappear altogether.

Figure 4(a) shows the radial distribution of subhalos (the same population as in Figure 3) selected using their current bound mass or circular velocity and density profile of dark matter within an MW-sized host halo. The figure shows that the subhalo distribution is less radially concentrated compared to the overall density profile because selection using current subhalo properties affected by tidal evolution biases the sample against the inner regions. Figure 4(b) shows the radial distribution of subhalos in the same host halo but now selected using circular velocity and mass the subhalos had before accretion (which are of course not affected by the tides). In this case, the radial profile is very close to that of the dark matter distribution. This dependence of the radial profile on the property used for subhalo selection should be kept in mind when the observed and predicted radial distributions are compared. The latter are selected based on their luminosity (i.e., the stellar mass), which may be affected by tides much less than either the total bound mass or circular velocity [38, 43].

Finally, the spatial distribution of satellites is not completely spherically symmetric, but is triaxial, which reflects their accretion along filaments and subsequent evolution in the triaxial potential of their host halos [4446].

2.2. Quantifying Populations of Luminous Galactic Satellites

Although we currently know only a few dozens of nearby satellite galaxies around the Milky Way and Andromeda, these galaxies span a tremendous range of the stellar densities and luminosities. The two brightest satellites of the Milky Way, the Large and Small Magellanic Clouds (LMC and SMC), are easily visible by the naked eye in the southern hemisphere and have, therefore, been known for many hundreds of years, while the faintest satellites have been discovered only very recently using sophisticated search algorithms and the vast data sets of stellar photometry in the Sloan Digital Sky Survey and contain only a few hundred stars [47, 48]. Up until the late 1990s, only a dozen dwarf galaxies were known to exist within 300 kpc of the Milky Way, with a similar number around the Andromeda [49]. These galaxies have luminosities and morphologies of the three types: () dwarf irregular galaxies (dIrrs, e.g., LMC and SMC)—low surface brightness galaxies of irregular appearance which have substantial amount of gas and thus exhibit continuing star formation, () dwarf spheroidal galaxies (dSphs, e.g., Draco or Fornax)—low surface brightness galaxies with spheroidal distribution of stars and no (or very little) ongoing star formation, and () dwarf elliptical galaxies (dEs, e.g., M32)—high-surface brightness, low-luminosity ellipticals with no gas and no current star formation. dSph and dE galaxies tend to be located within 200 kpc of their host galaxies, while dIrr galaxies are distributed more uniformly. This tendency is called the “morphological segregation’’ [49] and appears to exist in other nearby groups of galaxies [50]. The properties of these “classical’’ dwarf galaxies are reviewed extensively by [49] (see also recent study of scaling relations of dwarf galaxies by [51]).

Despite the wide range of observed properties, all of the nearby dwarfs share some common features in their star formation histories (SFHs). The SFHs of all classical dwarfs are characterized by a rather chaotically varying star formation rates. Most bright dwarfs form stars throughout their evolution, although the majority of stars may be formed in several main episodes spread over ten or more billion years [5255], and have at least some fraction of stars that formed in the first two billion years of the evolution of the universe. In terms of their SFHs, the main difference between the dwarf irregular and dwarf spheroidal galaxies is in the presence or absence of star formation in the last two billion years [54].

The radial distribution of the classical satellites around the Milky Way is rather compact. For the dwarf galaxies within 250 kpc of the Milky Way, the median distance to the center of the Galaxy is 70 kpc [39, 57], while the predicted median distance for subhalos is 120–140 kpc [39] (see Figure 13). The distribution of the satellites around the Andromeda galaxies is consistent with that of the Milky Way satellites, but is less accurately determined due to larger errors in distances. The spatial distribution of satellites about the Milky Way and Andromeda is also manifestly nonisotropic with the majority of the satellites found in a flattened structure nearly perpendicular to the disk [5862].

In 1994, a new faint galaxy was discovered in the direction toward the center of the Galaxy (in the Sagittarius constellation [63]). The galaxy is similar to other nearby dwarf spheroidal galaxies in its properties but is remarkably close to the solar system (the distance of only 28 kpc) and is in the process of being torn apart by the tidal interactions with the Milky Way. This interaction has produced a spectacular tidal tail, which has wrapped several times along the orbit of the Sagittarius dwarf [64].

The discovery of this new satellite has alerted researchers in the field to the possibility that other satellites may be lurking undiscovered in our cosmic neighborhood. The advent of wide field photometric surveys, such as the Sloan Digital Sky Survey and the targeted surveys of regions around the Andromeda galaxy, and new search techniques has resulted in dozens of new satellite galaxies discovered during the last decade [47, 48, 6571] with many more discoveries expected in the near future [56, 72]. The majority of the newly discovered galaxies are fainter than the “classical dwarfs’’ known prior to 1998. Due to their extremely low luminosities (as low as 1000  in the case of the Segue 1 [48]), they have collectively been referred to as the “ultra-faint’’ dwarfs. Such low luminosities (and implied stellar masses) indicate an extreme mode of galaxy formation, in which the total population of stars produced during galaxy evolution is smaller than a star cluster formed in a single star formation event in more luminous galaxies. (Alternatively, extremely low luminosities of the ultra-faint dwarfs can also be explained as a highly stripped remnants of more luminous dwarfs [43]. While this possibility is not excluded, it is disfavored by the fact that ultra-faint dwarfs appear to lie on the continuation of the luminosity-metallicity relation of more luminous dwarf galaxies [73].)

More practically, the extreme faintness of the majority of dwarf satellites implies that we have a more or less complete census of them only within the volume of 30–50 kpc of the Milky Way [56, 74]. Figure 5 shows the distance to which the dwarfs of a given luminosity are complete in the SDSS survey, in which the faintest new dwarfs have been discovered. The figure shows that we have a good census of the volume of the Local Group only for the relatively bright luminosities of the “classical’’ satellites. At the fainter luminosities of the ultra-faint dwarfs, on the other hand, we can expect to find many more systems at larger radii in the future deep wide area surveys. The exact number we can expect to be discovered depends on their uncertain radial distribution, but given the numbers of already discovered dwarfs and our current knowledge of the radial distribution of brighter satellites (and expected radial distribution of subhalos), we can reasonably expect that at least a hundred faint satellites exist within 400 kpc of the Milky Way. This is illustrated in Figure 6, which shows the luminosity function of the Milky Way satellites corrected for the volume not yet surveyed under different assumptions about radial distribution of the satellites [56].

The basis for considering these extremely faint stellar systems as bona fide galaxies is the fact that unlike star clusters, they are dark matter dominated: that is, the total mass within their stellar extent is much larger than the stellar mass expected for old stellar populations [48]. The total dynamical masses of these galaxies are derived using kinematics of stars. (These faint dwarf spheroidal galaxies do not have cold gas and therefore their mass profiles cannot be measured using the gas rotation curve, as is commonly done for more massive dIrr galaxies.) High-resolution spectroscopy of the red giant stars in the vicinity of each galaxy provides the radial velocities of these stars. The radial velocities can then be modeled using using the Jeans equilibrium equations to derive the total mass profile [7580]. This modeling requires certain assumptions about the unknown shape of the stellar distribution and velocity distribution of stars, as well as assumptions about the shape and radial profile of the dark matter distribution. The resulting mass profile, therefore, has some uncertainty associated with these assumptions [75, 78, 80].

Additionally, the ultra-faint dwarfs follow scaling relations of the brighter classical satellites such as the luminosity-metallicity relation [73] and, therefore, seem to be the low luminosity brethren within the family of dSph galaxies.

3. Defining the Substructure Problem

As I noted above, comparison of theory and observations in terms of the directly observable quantities such as luminosities is possible only using a galaxy formation model. These models, although actively explored [39, 8187] (see also Section 4.3) are considerably more uncertain than the predictions of dissipationless simulations on the properties of dark matter subhalos. Given that observed dwarf satellites are very dark matter dominated, the dissipative processes leading to formation of their stellar component are expected to have a limited effect on the distribution of the dynamically dominant dark matter. Fruitful comparison between simulation predictions and observations is, therefore, possible if a quantity related to the total mass profile can be measured in the latter.

The first attempts at such comparisons [8, 9] assumed isotropy of the stellar orbits and converted the line-of-sight velocity dispersion of stars in dSph satellites, , to estimate their maximum circular velocities as . The admittedly oversimplistic conversion was adopted simply due to a lack of well-measured velocity profiles and corresponding constraints on the mass distribution at the time. Figure 7 shows such a comparison for the classical satellites of the Milky Way and subhalo populations in Milky Way-sized halos formed in the concordance CDM cosmology.(I did not include the new ultra-faint satellites in the comparison both because their values are much more uncertain and because their total number within the virial radius requires uncertain corrections from the currently observed number that probes only the nearest few dozen kpc. The velocity dispersions of the ultra-faint dwarfs are very similar to each other (5 km/s) and they, therefore, formally have similar values according to this simple conversion method (hence, they would all be “bunched up’’ at about the same  km/s value). The maximum circular velocity of the halos of these galaxies is expected to be reached at radii well beyond the stellar extent and its estimate from the observed velocity dispersions requires substantial extrapolation and assumptions about the density profile outside the radii probed by stars. The errors of the derived values of can, therefore, be quite substantial [75, 88]. I will compare the predicted luminosity function of the luminous satellites using a simple galaxy formation model in Section 4.3 (see Figure 11).)

The observed velocity function is compared to the predicted VF of dark matter subhalos within a 286 kpc radius of Milky Way-sized host halos. In literature, the term “Milky Way-sized’’ is often used to imply a total virial mass of and maximum circular velocity of  km/s. However, there is some uncertainty in these numbers. Therefore, the figure shows the VFs for the host halos with  km/s and  km/s. The former is measured directly in a simulation of the halo of that circular velocity, while the latter VF was rescaled as , using scaling measured statistically in the simulations [19].

The simple conversion of to has justly been criticized as too simplistic [89]. Indeed, the conversion factor requires a good knowledge of mass profile from small radii to the radius . The mass profile derived from the Jeans equation has errors associated with uncertainties in the anisotropy of stellar orbits, as well as with uncertainties of spatial distribution of stellar system and/or its dynamical state [76, 90]. Most importantly, the mass profile is only directly constrained within the radius where stellar velocities are measured, . If this radius is smaller than , conversion factor depends on the form of the density profile assumed for extrapolation. The uncertainties of the derived mass profile within the stellar extent will of course also be magnified increasingly with increasing ratio [90].

Thus, for example, Stoehr et al. [89] have argued that the conversion factor can be quite large (–4) if the density profile is shallow in the inner regions probed by the stars. Such large conversion factor would shift the low points in Figure 7 to the right closer to the subhalo VF [78, 89, 91] and would imply that there is a sharp drop in the luminosity function of satellites below a certain threshold circular velocity ( km/s). High-resolution simulations of individual satellites, on the other hand, have demonstrated that the CDM satellites retain their cuspy inner density profiles, even as they undergo significant tidal stripping [92]. (The profiles can be even steeper than predicted by dissipationless simulations due to the effects of adiabatic contraction during gas condensation towards the center of the dwarf progenitors [93]. Although the effect of baryon condensation in dwarfs are uncertain at present, one can argue that they can reasonably be expected to be modest, given large mass-to-light ratios of faint dwarfs [48].) This implies that is likely not as large as advocated by Stoehr et al. The main uncertainty in its actual value for a specific satellite is then due to the uncertainty in the density profile and ratio .

Recently, Peñarrubia et al. [78] combined the measurements of stellar surface density and profiles to estimate values for individual observed MW satellites under the assumption that their stellar systems are embedded into NFW dark matter potentials. Such procedure by itself does not produce a reliable estimate of , because NFW potentials with a wide range of can fit the observed stellar profiles. To break the degeneracy, Peñarrubia et al. have used the relation between and expected in the concordance CDM cosmology. They showed that this results in estimates of which imply -3 [43, Figure  4]. This estimate does not take into account effects of tidal stripping on the evolution of the - relation. Typically, subhalos located in the inner regions of the halo are expected to have lost 60%–90% of their initial mass by due to tidal stripping [22, 38, 39]. For such mass loss changes only by 20%–30% [39, 78] but should change by a factor of 2-3 [43, 92].

In a subsequent study, Peñarrubia et al. [43] used controlled simulations of subhalo evolution to argue that tidal stripping does not significantly affect their inferred conversion factor [43, Figure  9]. This conclusion, however, was drawn based on the systems in which both stellar system and DM halo were significantly stripped. In such system, is close to the stellar radius and and evolve in sync. For systems with more realistic mass loss and with stars deeply embedded within , however, stellar system (and ) may not be affected, while can evolve significantly. For such systems, the method of [78] will lead to a significant overestimate of . Indeed, the systems in [43, Figure  9] for which the method overestimates (by a factor of 1.4) the most are the systems with moderate total mass loss and least affected stellar systems. Note that even these systems have likely experienced more tidal loss than most of the real dSph satellites.

Another factor in estimates of is anisotropy of stellar velocities in dSph (e.g., [92]). For example, recent analysis of observed velocity dispersion profiles of “classical’’ dSph by [79], in which the anisotropy of stellar orbits was treated as free parameters, results in estimates of of their host subhalos in the range 10–25 km/s, smaller than would be suggested if correction factor was -3.

Regardless of the actual conversion factor value, however, it is clear that it cannot change the main difference between the observed and predicted VFs—the large difference in their slope—unless strongly depends on (there is no observational evidence for this so far).

Another promising approach is to abandon attempts to derive altogether and to measure instead the observed mass within the radius where the uncertainty of the measured mass profile is minimal. Such radius is close to the stellar extent of observed galaxies [78, 80, 88, 90]. Figure 8, adopted from [90], shows comparison of the mass functions of subhalos in the Via Lactea I simulation [94] and observed satellites of the Milky Way, where masses are measured within a fixed physical radius of 600 pc (see also the discussion in Section 4.3 and Figure 12). The figure shows that the simulated and observed mass functions are different, the conclusion similar to that derived from the comparison of the circular velocity functions.

Thus, the discrepancy that is clearly seen in the comparison of circular velocity functions, measured with more uncertainty in observations, persists if the comparison is done using a much better measured quantity. Unfortunately, the stellar distribution in most of the newly discovered ultra-faint dwarf galaxies does not extend out to 600 pc radius and therefore, cannot be measured as reliably for these faint systems as for the classical dwarfs. Similar comparisons have to be carried out using masses within smaller radii. This puts stringent requirements on the resolution of the simulations, as they need to reliably predict mass distribution of subhalos within a few hundred parsec radius. Such high-resolution simulations are now available [3133].

We can draw two main conclusions from the comparisons of the circular velocity functions and the more reliable mass functions of subhalos and observed satellites presented in the previous sections, even taking into account existing uncertainties in deriving circular velocities and the total dynamical masses for the observed satellites. First, the predicted abundance of the most luminous satellites is in reasonable agreement with the data, even though the statistics are small. Most MW-sized halos simulated in the concordance CDM cosmology have 1 SMC/LMC sized (–70 km/s) subhalos within their virial radius.

This is not a trivial fact because the abundance of the most massive satellites is determined by a subtle interplay between the accretion rate of systems of corresponding circular velocity and their disruption by the combined effects of the dynamical friction and tidal stripping [95]. Dynamical friction causes satellites to sink to the center at a rate which depends on the mass and orbital parameters of satellite orbit. Orbital parameters, in turn, depend on the cosmological environment of the accreting host halo and are mediated by the tidal stripping which reduces satellite mass as it sinks, thereby rendering dynamical friction less efficient [9699]. The fact that the concordance CDM model makes an ab initio prediction that the number of massive satellites that can host luminous dwarfs is comparable to observations can therefore be viewed as a success of the model.

Second, the slopes of both the circular velocity function and the mass function are different in simulations and observations. This implies that we cannot simply match all of the luminous satellites to the subhalos with the largest and , as was sometimes advocated [89, 91]. The mass function comparison, in particular, indicates that there should be some subhalos with the that do not host the luminous galaxies, and some that do. As I discuss in Section 4.2, this has a strong implication for the physical interpretation of the difference in terms of galaxy formation scenarios.

In summary, the substructure problem can be stated as the discrepancy in the slopes of the circular velocity and mass functions inferred for observed satellites of the Milky Way and the slopes of these functions predicted for dark matter subhalos in the MW-sized host halos formed in the concordance CDM cosmology.

I believe that stated this way the problem is well defined. Defining the problem in terms of the difference in the actual number of satellites and subhalos is confusing at best, as both numbers are fairly strong functions of subhalo mass or stellar luminosity. Thus, for example, even though the discovery of the ultra-faint dwarfs implies the possible existence of hundreds of them in the halo of the Milky Way [49, 75] (this fact has been used to argue that the substructure problem has been “alleviated’’), the most recent simulations show that more than 100 000 subhalos of mass should exist in the Milky Way [31, 32]. (Indeed, it is not obvious a priori that subhalos of mass - are too small to host luminous stellar systems of stellar mass - [100]—the stellar masses corresponding to the luminosities of the faintest recently discovered dwarfs. After all, the halos of this mass are expected to be hosting formation of the very first stars [101].) The substructure problem stated in the actual numbers of satellites is therefore alive and well and has not been alleviated in the least.

I would like to close this section by a brief discussion of the comparison of spatial distribution of observed satellites and subhalos. As I noted above, the radial distribution of the observed satellites of the Milky Way is more compact than the radial distribution of subhalos selected using their present day mass or circular velocity [39, 57, 102]. In addition, the observed satellites are distributed in a quite flattened structure with its plane almost perpendicular to the disk of the Milky Way [5862]. Although the spatial distribution of all subhalos is expected to be anisotropic, reflecting the anisotropy of their accretion directions along filaments [44, 45] and, possibly, the fact that some of the satellites could have been accreted as part of the same group of galaxies [103], the expected anisotropy is not as strong as that of the observed Milky Way satellites.

Thus, both the radial distribution and anisotropy of the observed satellites do not match the overall distribution of subhalos in CDM halos. This may be yet another side of the same substructure problem coin and the overall spatial distribution of observed satellites needs to be explained together with the differences in the circular velocity function. (Although distribution of satellites around other galaxies is not mapped as accurately as around the Milky Way, there is evidence for similar planar anisotropy around M31 [60]. The evidence for such anisotropy in other disk galaxies is weak at best [104], although measurements for distant galaxies are more difficult as they require very careful handling of projection effects [105].)I will review a few possible explanations for the substructure problem and differences in the spatial distribution in the following section.

4. Possible Solutions

4.1. Modifications to the CDM Model

One possible way to account for the differences of the predicted and observed circular velocity and mass functions is to assume that CDM model is incorrect on the small scales probed by the dwarf galactic satellites. Indeed, the abundance of satellites is sensitive to the amplitude of the power spectrum on the scales corresponding to the total mass of their host halos. For a halo of mass the comoving scale of fluctuations that seed their formation is where is the present-day total matter density in units of the present-day critical density, and is the current Hubble constant in units of km/s/Mpc.

If the amplitude of density fluctuations on such scales is considerably suppressed compared to the concordance CDM model used in most simulations, the abundance of subhalos can then also be suppressed. Such suppression can be achieved either by suitably varying parameters controlling the amplitude of the small-scale power spectrum within the CDM model itself [36], such as the overall normalization of the power spectrum or its large-scale tilt, or by switching to models in which the amplitude at small scales is suppressed, such as the warm dark matter (WDM) structure formation scenarios [36, 106109]. In these models the abundance of satellites is suppressed both because fewer halos of dwarf mass form in the first place (due to smaller initial amplitude of fluctuations) and because halos that do form have a less-concentrated internal mass distribution, which makes them more susceptible to tidal disruption after they accrete onto their host halo. Models in which dark matter was assumed to be self-interacting, a property that can lead to DM evaporation, have also been proposed and discussed [110, 111], but these models both run into a contradiction with other observational properties of galaxies and clusters [112115] and are now strongly disfavored by observational evidence indicating that dark matter self-interaction is weak [116].

The problem, however, is more subtle than simply suppressing the number of satellites. As discussed above, differences exist between observed and predicted slopes of the circular velocity and mass. The slope is controlled by the slope of the primordial fluctuation spectrum around the scale corresponding to the masses of satellite halos and structural properties of the forming halos. It has not yet been demonstrated convincingly whether both the circular velocity and the mass functions can be reproduced in any of the models alternative to CDM. In fact, recent measurements of mass distribution in the central regions of observed satellites put stringent constraints on the phase space density and “warmness’’ of dark matter [75]. At the same time, measurements of the small-scale density power spectrum of the Lyman forest indicate that fluctuations with the amplitude expected in the CDM model at the scales that control the abundance of dwarf mass halos are indeed present in the primordial spectrum [117, 118].

While the inner density distribution in observed satellites may still be affected by dark matter warmness in the allowed range of parameter space [77] (see, however, [79]), the models with such parameters would not suppress the overall abundance of satellites considerably. In fact, observations of flux ratios in the multiple image radio lenses appear to require an amount of substructure which is even larger than what is typically found in the CDM halos [119121], disfavoring models with strongly suppressed abundance of small mass subhalos.

There is thus no compelling reason yet to think that the observed properties of galactic satellite populations are more naturally reproduced in these models. In the subsequent discussion, I will therefore use Occam's razor and focus on the possible explanations of the differences between observed satellites and subhalos in simulations within the CDM model. The prime suspect in producing the discrepancy is the still quite uncertain physics of galaxy formation. After all, a similar problem exists for objects of larger masses and luminosities if we compare the slope of the luminosity function and the halo mass function [122125] or the predicted and observed abundance of galaxies in the nearby low density “field’’ regions [126].

4.2. Physics of Galaxy Formation

Several plausible physical processes can suppress gas accretion and star formation in dwarf dark matter halos. The cosmological UV background, which reionized the universe at , heats the intergalactic gas and establishes a characteristic time-dependent minimum mass for halos that can accrete gas [127134]. The gas in the low-mass halos may be photoevaporated after reionization [135137] or blown away by the first generation of supernovae [138141] (see [142]). At the same time, the ionizing radiation may quickly dissociate molecular hydrogen, the only efficient coolant for low-metallicity gas in such halos, and prevent star formation even before the gas is completely removed [143]. Even if the molecular hydrogen is not dissociated, cooling rate in halos with virial temperature  K is considerably lower than in more massive halos [144] and we can therefore expect the formation of dense gaseous disks and star formation to be suppressed in such halos. (The virial virial temperature is related to the virial mass by , where isothermal temperature profile is assumed for simplicity. The virial mass and radius are related by definition as . Assuming appropriate for regardless of , this gives /.) Another potential galaxy formation suppression mechanism is the gas stripping effect of shocks from galactic outflows and cosmic accretion [145, 146]. Finally, even if the gas is accreted and cools in small-mass halos, it is not guaranteed that it will form stars if gas does not reach metallicities and surface densities sufficient for efficient formation of molecular gas and subsequent star formation [39, 147150].

The combined effect of these processes is likely to leave most of dark matter halos with masses dark, and could have imprinted a distinct signature on the properties of the dwarf galaxies that did manage to form stars before reionization. In fact, if all these suppressing effects are as efficient as is usually thought, it is quite remarkable that galaxies such as the recently discovered ultra-faint dwarfs exist at all. One possibility extensively discussed in literature is that they managed to accrete a certain amount of gas and form stars before the universe was reionized [8184, 87, 151153]. Direct cosmological simulations do show that dwarf galaxies forming at bear striking resemblance to the faint dwarf spheroidal galaxies orbiting the Milky Way [154156] and their predicted abundance around the Milky Way is consistent with estimates of the abundance of the faintest satellites [151]. Alternatively, some authors argued [78, 89, 91, 157] that observed dwarf satellite galaxies could be in much more massive subhalos than was indicated by simple estimates of dynamical masses and circular velocities using stellar velocity dispersions. In this case, the relatively large halo mass could allow an object to resist the suppressing effects of the UV background.

Cosmological simulations also clearly show that the subhalos found within virial radii of larger halos at have on average lost significant amount of mass and have been considerably more massive in the past [22, 38, 39]. The dramatic loss of mass occurs due to the tidal forces that subhalos experience as they orbit in the potential of the host. A significant fraction of the luminous dwarf satellites therefore can be associated with those subhalos that have been substantially more massive in the past and hence more resilient against galaxy formation suppressing processes. Such subhalos could have had a window of opportunity to form their stellar systems even if the subhalos they are embedded in today have relatively small mass [39].

4.3. Models for Luminous Satellite Population

Given the galaxy formation suppression mechanisms and evolutionary scenarios listed above, the models aiming to explain the substructure problem can be split into the following broad classes: () the “threshold galaxy formation models’’ in which luminous satellites are embedded in the most massive subhalos of CDM halos and their relatively small number indicates the suppression of galaxy formation in subhalos of circular velocity smaller than some threshold value [78, 89, 157] and () “selective galaxy formation models’’ in which only a fraction of small subhalos of a given current and mass host luminous satellites while the rest remain dark.

In the second class of models the processes determining whether a subhalo hosts a luminous galaxy can be the reionization epoch [81, 87, 152154, 158]: subhalos that assemble before the intergalactic medium was heated by ionized radiation become luminous. The observed faint dwarfs can then be the “fossils’’ of the pre-reionization epoch [154, 155]. Subhalo may also form a stellar system if its mass assembly history was favorable for galaxy formation [39]: namely, luminous subhalos are those that have had sufficiently large mass during a period of their evolution to allow them to overcome the star formation suppression processes.

Several models using a combination of the processes and scenarios outlined above have been shown to reproduce the gross properties of observed population of satellites reasonably well [39, 84, 85, 87]. How can we test different classes of models and differentiate between specific ones?

First, I think the fact the mass function for observed satellites has a different slope compared to simulation predictions (Figure 8) favors the second class of the selective galaxy formation models, at least for the brighter “classical’’ satellites. Indeed, given what we know about the average mass loss of subhalos, it is more natural to associate the observed systems with the halos of the largest mass prior to accretion [39] rather than with the subhalos with the largest current masses. Second, the predicted number of the weakly evolving pre-reionization objects [84, 151, 153], the extended star formation histories of most of the observed dwarf satellites [5355, 159] (in fact, in terms of star formation the main difference between the dIrr and dSph galaxies appears to be presence or lack of star formation in the last 2 billion years before [54]), and the significant spread in metallicities and certain isotope ratios [160] indicate that majority of “classical” dwarfs have not formed most of their stars before reionization but have formed their stars over rather extended period of time (10 Gyr). It is still possible, however, that a sizable fraction of the ultra-faint dwarfs are the “pre-reionization fossils’’ [151, 155, 161, 162], if star formation efficiency in these objects is greatly suppressed [84] compared to that of brighter dwarfs.

Interesting additional clues and constraints on the galaxy formation models available for the dwarf satellites of the Milky Way are the measurements of the total dynamical mass within their stellar extent. Observations show that the total masses within a fixed aperture of the observed satellites are remarkably similar despite a several order of magnitude span in dwarf satellite luminosities [49, 78, 88, 163, 164]. For example, the range of masses within  kpc shown in Figure 8 is only an order of magnitude. Furthermore, existence of the tight correlation between total dynamical mass within the half-light radius, , and the corresponding radius [79] for bright dSph galaxies implies a very similar inner dark matter density profile of their host halos. Recently, Strigari et al. [88] have shown that the mass estimated within the central 300 pc, , for all of the dwarfs with kinematic data varies by at most a factor of four, while the luminosity of the galaxies varies by more than four orders of magnitude.

These observational measurements put constraints on the range of masses of CDM subhalos that can host observed satellites. To estimate this range, we should first note that for the CDM halos described by the NFW profile [18] with concentration (where is the scale radius—the radius at which the density profile has logarithmic slope of ) the dependence of the mass within a fixed small radius on the total virial mass is and is quite weak for  pc: , as shown in Figure 9. Indeed, even for a halo with the Milky Way mass at we expect , a value not too different from those measured for the nearby dwarf spheroidals. Physically, the weak dependence of the central mass on the total mass of the halo reflects the fact that central regions of halos form very early by mergers of small-mass halos. Given that the rms amplitude of density perturbations on small scales is a weak function of scale, the central regions of halos of different mass form at a similar range of redshifts and thus have similar central densities reflecting the density of the universe when the inner region was assembled. At earlier epochs the dependence is stronger because 300 pc represents a larger fraction of the virial radius of halos.

Note that the relation plotted in Figure 9 is for isolated halos unaffected by tidal stripping. Taking into account effects of tidal stripping results in even flatter relation [86]: , which also has a lower normalization (smaller for a given . This is likely due to a combination of two effects: () the halos of larger mass have lower concentrations and thus can be stripped more efficiently and () the halos of larger mass can sink to smaller radii after they accrete and experience relatively more tidal stripping. Overall, the effect of tidal stripping and heating on appears to be substantial and cannot be neglected.

Finally, Figure 9 shows that the virial mass range corresponding to a given range of is quite different for halos that form at compared to those that form at later epochs due to the rapidly evolving NFW concentration concentration for a fixed halo mass [165]. The mass can therefore be only interpreted in the context of a model for subhalo evolutionary histories.

Several recent studies have used such models to show that the nearly constant central mass of the satellite halos is their natural outcome [8486]. This outcome can be understood as a combination of the weakness of the correlation and the fact that in the galaxy formation models galaxy luminosity must be a nonlinear function of in order to produce a faint-end slope of the galaxy luminosity function much shallower than the slope of the small-mass tail of the halo mass function.

For example, if the faint-end slope of the luminosity function is (i.e., ) and the slope of the halo mass function at small mass end is () and we assume for simplicity a one-to-one monotonic matching between galaxies and halos (see [166168] for the detailed justification for such assumption), the implied slope of the relation is , which for the fiducial values of and gives . In semianalytic models, such a steep nonlinear relation is usually assumed to be set by either suppression of gas accretion due to UV heating or by gas blowout due to SN feedback, e.g., [83]). If I instead assume the faint-end slope of -1.6 as suggested by recent measurement of [125], the relation is shallower but is still nonlinear: .

Regardless of the specific processes producing the nonlinear luminosity-mass relation, for a relation of the form we have , where , using the relation above taking into account effects of tidal stripping. To account for a smaller than a factor of four spread in central masses for approximately four orders of magnitude spread in luminosity one needs –8 or –4, the values not too different from the estimate above. Thus, the weak correlation of the and luminosity will be the natural outcome of any CDM-based galaxy formation model which reproduces the slope of the faint end of the galaxy luminosity function.

Within the framework I just described, the slope of the correlation depends on the slope of the correlation . The models published so far [8486], as well as the simple model above with the slope -3, predict that the slope of the relation is shallow but is nevertheless not zero. Constraining this slope with future observations will tighten constraints on the galaxy formation models and will tell us more about the correlation if such exists.

To illustrate the points just made, Figure 10 shows the relation for the observed nearby dwarfs [88] and subhalos found within  kpc around three different MW-sized halos formed in the concordance cosmological model (see [39, 151] for simulation details).(The simulations used here do not reliably resolve the mass within 300 pc. Therefore, in order to calculate the mass I have used the mass and concentration of each subhalo at the epoch when it was accreted and computed evolution of their density profile given the mass loss they experienced by the present epoch, as measured in cosmological simulations, and using results of controlled high-resolution simulations of tidal evolution [43], which predict the evolution of the NFW concentration of halos as a function of tidal mass loss. The mass was then computed from the evolved density profile.) To assign luminosity to a given subhalo I follow the logic of the model presented in [39], which posits that the brightest observed satellites should correspond to the subhalos which have the largest mass before they were accreted. In this model the luminosity of stellar systems should positively correlate with the mass of its host subhalo before it was accreted onto the MW progenitor, : The power law form of the relation is motivated by the approximately power law form of the galaxy luminosity and halo mass functions at faint luminosities and small masses. The actual parameters were chosen such that luminosities of the most massive subhalos roughly match the luminosities of the most massive satellites, such as the SMC and LMC (). (Figure 10 does not show the most massive subhalos which would correspond to the systems such as the Large Magellanic Clouds, which have luminosities outside the range shown in the figure. This is justified because observational points shown in the figure include only the fainter dwarf spheroidal galaxies.) After all, the first order of business for all the models of satellite population is to reproduce the abundance and luminosities of the most massive ( km/s) satellites. The slope of the relation in (4) was set to reproduce the range of observed satellite luminosities and flatness of the relation. Note that this model does not assume any threshold for formation of galaxies. It simply implies that the efficiency with which baryons are converted into stars, , steadily decreases with decreasing at the rate given by (4). The slope of assumed in the equation above is on the lower end of the slopes suggested by matching of the faint end of galaxy luminosity function and small-mass end of halo mass functions discussed above, but is within the current uncertainties in the slope of the former [125]. Assuming such relation for field halos would therefore also reproduce the observed faint end of galaxy luminosity function within current uncertainties.

Figure 10 shows that the model with parameters of (4) is in agreement with observed measurements of the relation. The results will not change drastically if a somewhat steeper (3–3.5) slope is assumed. (The model of (4) is similar to the model 1B in the recent study by Koposov et al. [84], which assumes that stellar mass scales as , These authors find that the model reproduces the luminosity function of satellites for , , and , which gives , quite similar to the relation given by (4). Koposov et al. adjust parameters to match the observed luminosity function and then show that relation is reproduced, while I adopted the opposite route here. The key difference between the models is that their model assumes a ceiling on the value of , while I assume no such ceiling. Absence of the ceiling on star formation efficiency is actually important for bright satellites (see Figure 14).)

Having fixed the parameters of the relation, we can then ask the question of whether the luminosity function of satellites would be reproduced self-consistently by such a model. We can use the observed luminosity functions corrected for completeness for the faintest dwarfs from [56] (shown in Figure 6) to test this. Figure 11 shows the subhalo luminosity functions constructed using subhalos identified within 417 kpc (the same outer radius used in the construction of the observed luminosity function by [56]) in the simulations and the simple luminosity assignment scheme of (4). The points with (Poisson) error bars and the shaded region show the observed luminosity functions for the classical satellites and for ultra-faint dwarfs, as compiled by [56]. As I noted before, the current samples of the ultra-faint satellites exist only over a fraction of the sky and there are reasons to expect that they are incomplete for distances larger than 50 kpc. The shaded region, therefore, represents the likely bracket of the possibilities. The lower edge of the region is the luminosity function in the case when the observed luminosity function of the ultra-faints is simply corrected for the fractional sky coverage. The upper edge of the shaded region shows a combined correction for both the sky coverage and radial distribution assuming that ultra-faints have the same radial distribution as mass-selected subhalos in the Via Lactea I simulation [56].

Figure 11 shows that although the model and observed luminosity functions do not match perfectly, they are in reasonable agreement. One should remember a number of uncertainties in the model luminosity functions. First, the masses of host halos can be a factor of two or so larger than the mass of the Milky Way and the cosmology of the simulations may not be exactly correct. Second, the relation assumed in the model can be somewhat different from that assumed in the figure: for example, the slope can be somewhat steeper and normalization lower, which would make both the relation flatter and the luminosity functions closer to observations. The fiducial relation in (4) was chosen to have the shallowest slope () still reasonably consistent with observations.

Figure 12 shows comparison of the mass functions for the observed “classical’’ dwarf spheroidal satellites of the Milky Way [90] and predicted mass function for the entire subhalo population of the MW-sized halos within 270 kpc (the largest distance of the observed satellite included in the comparison) and the subhalos with (the smallest dSph luminosity included in the observed sample) with luminosities assigned using (4). I have excluded the Sagittarius dwarf from this comparison as its mass has very large errors [90]. The number of the predicted luminous satellites was reduced by three to account for exclusion of the Sagittarius, SMC, and LMC from the comparison. The figure shows that the model predicts the range of quite similar to that measured for the observed luminous dSphs. The shape of the mass function is also in reasonably good agreement with the data. Although there are somewhat more predicted satellites at small masses, this is likely due to the somewhat larger virial mass of the simulated halos () compared to the mass of the Milky Way (). We expect the number of subhalos to scale approximately linearly with host mass and so the difference in the virial mass of the Milky Way and simulated halos can account for the difference with observations in Figure 12. There is some discrepancy at the largest values, but it is not clear just how significant the discrepancy is given that the typical errors on the measurements for these galaxies are 20%–40%.

Finally, Figure 13 compares cumulative radial distribution of the observed “classical’’ Milky Way satellites within 280 kpc and satellites with similar luminosities and within the same distance from their host halo in the model of (4). The figure also shows the cumulative distribution of all subhalos selected using their current . The predicted distribution of bright luminous satellites is somewhat more radially concentrated than the distribution of the -selected subhalos and is in reasonable agreement with the observed distribution both in its median and in the overall shape.

Thus, the observed relation, the luminosity function, the mass function, and the radial distribution of the observed satellites can all be reproduced simultaneously with such a simple dwarf galaxy formation scenario. The observational uncertainties in these statistics are still quite large, which leaves significant freedom in the parameters of (4) and in its functional form. (It is quite possible that relation between luminosity and mass is more complicated than (4). For example, the normalization of the relation can evolve with redshift because luminosity may be determined both by the mass of the halo at the accretion epoch and by the period of time before its accretion during which it was sufficiently massive to withstand star formation suppressing processes. Such redshift dependence would be an extra parameter which would generate scatter in the relation.) It is also possible that all of these statistics may be reproduced in a drastically different scenario. Nevertheless, the success of such simple model is encouraging and it is interesting to discuss its potential implications.

First of all, (4) implies that all of the observed Milky Way dSph satellites had virial masses when they were accreted and these masses may span the range up to (the actual range depends sensitively on the slope of the relation). This shows that progenitors of the observed satellites could have had a wide range of virial masses, even though the range of their and masses is narrow.

An interesting implication of the value of lowest mass of the range of masses above is that whatever gas the small-mass halos () are able to accrete, it should remain largely unused for star formation, and of course would not be blown away by supernovae (given that the model implies that such objects should have no stars or supernovae). If some of this gas is neutral, it can contribute to HI absorption lines in the spectra of quasars and distant galaxies. If this gas is enriched, it can also produce absorption lines of heavier elements. At lower redshifts, the neutral gas in the otherwise starless or very faint halos could manifest itself in the form of the High Velocity Clouds (HVCs) abundant in the Local Group [169172] and around other galaxies.

Second, as I noted above the slope of the relation required to explain the weak dependence of on luminosity is not surprising, given what we know about the faint-end slope of galaxy luminosity function and what we expect about the slope of the mass function of their host halos in CDM scenario [173]. The implied normalization of the relation, however, is quite interesting. For example, it indicates that halo of should have luminosity of . Converting it to stellar mass assuming (appropriate for old populations, e.g., [174]) gives . Results of cosmological simulations with UV heating of gas show that halos of should have been able to accrete almost all of their universal share of baryons, (assuming suggested by the WMAP measurements [10]), even in the presence of realistic UV heating [133, 134]. The derived stellar mass thus implies that only (i.e., ) of the expected baryon mass was converted into stars in such objects. Such small efficiency for systems accretion onto which is not suppressed by the UV heating implies that star formation is dramatically suppressed by some other mechanism.

In fact, the implied efficiency of baryon conversion into the stars, , in this model is a steep, power law function of mass and circular velocity (), as shown in Figure 14. The figure also shows the functional form one would have expected if the dependence of the efficiency on circular velocity was determined by the fraction of gas halos are able to accrete in the presence of the UV radiation. This functional form is almost identical to the fiducial model 3B of Koposov et al. [84] (I used slightly larger value for critical velocity because I use rather than the virial circular velocity used by these authors). The two models have similar behavior at  km/s, but the UV heating model asymptotes to a fixed value of for more massive systems. This model would therefore underpredict luminosities of the most massive satellites in MW-sized halos, which was noticed by Koposov et al. in its failure to reproduce the bright end of the satellite luminosity function. Moreover, the estimates of the efficiency for more luminous galaxies, such as the Milky Way, are in the range 0.05–0.2 [175178], which would lie roughly on the continuation of the relation in Figure 14 to larger circular velocities shown by symbols. (The relation could plausibly flatten at larger , as expected from the halo modeling of the galaxy population, e.g., [173].)

These considerations indicate a very interesting possibility that while the UV heating can mediate accretion of gas into very small mass halos, the efficiency with which the accreted gas is converted into stars in the known luminous galaxies is dramatically suppressed by some other mass-dependent mechanism. This strong suppression operates not only for halos in which gas accretion is suppressed but for halos of larger masses as well. Note that this suppression mechanism is unlikely to be due to blowout of gas by supernovae, which is expected to give (e.g., [140]), a much shallower relation than the scaling in Figure 14.

While discussion of the nature of this suppression mechanism is outside the scope of this paper, the rapidly improving observational data on the satellite population should shed light on the possible mechanisms.

The exercise presented in this section illustrates just how powerful the combination of luminosity function and radial distribution of satellites, high-quality resolved kinematics data and inferred dynamical constraints on the total mass profile, measurements of star formation histories and enrichment histories can be in understanding formation of dwarf galaxies. The rapidly improving constraints on the mass profiles of the dSph galaxies down to the smallest luminosities [79, 88] should further constrain the range of subhalo masses hosting the observed satellites and, by inference, the efficiency of star formation in such small halos. The hints that star formation efficiency is actually a monotonic function of halo mass from galaxies such as the Milky Way to the faintest known galaxies, such as Segue 1, indicates that the physics learned from the “near-field cosmology’’ studies of the nearest dwarfs can potentially give us important insights into formation of more massive galaxies as well.


The author would like to thank Brant Robertson, Anatoly Klypin, Nick Gnedin, Jorge Peñarrubia, Erik Tollerud, and James Bullock for useful comments on the manuscript and stimulating discussions on the topics related to the subject of this paper. This work was partially funded by the NSF grants AST-0507666, and AST-0708154. The research was also partially supported by the Kavli Institute for Cosmological Physics at the University of Chicago through grant NSF PHY-0551142 and an endowment from the Kavli Foundation. He also would like to thank Kavli Institute for Theoretical Physics (KITP) in Santa Barbara and organizers of the 2008 KITP workshop “Back to the Galaxy II’’ where some of the work presented here was carried out, for hospitality and wonderful atmosphere. He has made extensive use of the NASA Astrophysics Data System and arXiv.org preprint server during writing of this paper.