Abstract

Pearl millet (Pennisetum glaucum) is one the most important crops in Burkina Faso. Knowledge of the genetic parameters of different pearl millet landraces grown in Burkina Faso is of great importance for improvement and breeding programs of pearl millet. In this study, 20 pairs of SSR markers were used to compare and analyze the genetic diversity of 86 landrace populations. Results show a total of 67 alleles detected and each locus varies between 2 and 6 with an average of 4 alleles per marker. The probability to having two different alleles per locus was 0.61. The values of genetic diversity parameters of pearl millet populations from the northern region of the country were higher than those from the others regions. The analysis indicates that, over time, alleles are lost and new alleles are added. Analysis of molecular variance showed genetic variation was primarily a function of differences in cultivated populations. Most pairwise FST comparisons between provenances were at a statistically significant level. Our study will provide a scientific basis for the effective management and utilization of germplasm resources of pearl millet populations.

1. Introduction

In Burkina Faso, economy is largely based on crop production and livestock, which engage 90% of the workforce and contribute to 39% of the gross domestic product [1]. Pearl Millet (Pennisetum glaucum) is one the most important cereals where daily consumption per capita is around 70 gr [2]. Its growing areas have increased continuously, and it is cultivated in traditional farming systems. The greatest morphological diversity of this species is found in West Africa, south of the Sahara Desert, and north of the forest zone [3, 4]. Because of its versatility, pearl millet is still a fundamental component of production systems and diets in Sahelian countries, where many prefer it to sorghum for culinary purposes.

The situation of pearl millet landraces is becoming very serious. Climate change impacts this resource. Its conservation represents a high priority for the country. Now, strategies for managing resources have resulted in the use of a molecular tool [5, 6]. At present, molecular characterization is becoming the capital tool in methods and programs for improving agricultural genetic resources [7]. The importance of microsatellite markers is to deepen investigation knowledge of genetic diversity. According to the work in [8], the use of molecular markers has overcome limitations of morphological markers in characterization by identifying variations directly from DNA. Because of their high degree of polymorphism, microsatellites have been used in the study of genetic diversity and in the evolutionary history of pearl millet by several authors [5, 911].

Polymorphism of pearl millet is widely studied throughout the world. However, assessing the genetic diversity of landrace pearl millet in Burkina Faso may provide a better understanding of how to insert landrace pearl millet in a breeding national program. The present study seeks to highlight the genetic diversity of pearl millet landraces using codominant markers (SSRs). Therefore, the aims of the present study are (1) to estimate the level of genetic diversity of pearl millet landraces in Burkina Faso; (2) to assess the distribution of allelic frequencies between geographic regions; and (3) to find potential genetic groups of cultivated pearl millet.

2. Materials and Methods

2.1. Plant Material and Microsatellite Markers Used

In 2014, samples were collected in three agroclimatic zones in different villages of Burkina Faso including 10 landraces of pearl millet from the east zone, 36 landraces from north, 7 landraces from west, and 33 landraces from south (Figure 1). This plant material had been previously used for morphophysiological diversity assessment during the 2015 rainy season in Burkina Faso [12].

A set of 20 microsatellites (Table 1) specific to Pennisetum glaucum (L.) R. Br. and extremely polymorphic was chosen for this study [5, 13].

2.2. Methods
2.2.1. DNA Extraction

Seeds of each accession were sown in pots in a green house in 2016 at the Life and Earth Sciences Training and Research Unit of the University of Joseph KI-ZERBBO. Fresh leaves eighteen (18) days old were collected and packaged in aluminum foil. DNA from each accession plant was extracted by the method in [14]. 20 mg of each sample was ground to which 2000 μl of Tris-EDTA-Sorbitol (TES) buffer was added. After centrifugation (, 10 min, 4°C), the DNA was precipitated from the supernatant using 750 μl MATAB (Mixed Alkyl Trimethyl Ammonium Bromide) and the samples were run in a water bath at 65°C, 2 h 30 min with manual shaking of the tubes every 15 min. 750 μl chloroform and iso-amyl alcohol (CIAA, 24 : 1) was brought into the tubes and recentrifuged (, 4°C, 10 min). Three levels of separations were obtained, and for each sample, the supernatant was taken and put into a tube. 750 μl of iso-propanol stored at −20°C was added to the volume (1 : 1) to precipitate the total DNA. After centrifugation (, 4°C, 10 min), the supernatant was precipitated. The tubes were dried for 1/2 day at room temperature and stored in Tris-EDTA buffer (TE, −20°C).

2.2.2. PCR Amplification

The method used was similar to that in [9, 10]. PCR amplification was performed in a total reaction volume of 20 μl containing 2 μl of genomic DNA, 2.5 μl of buffer, 0.5 μl of each primer (sense and reverse), 0.5 μl of dNTPs, and 0.2 μl of Taq DNA polymerase. PCR program consisted of five steps: (1) initial DNA denaturation of 94°C for 3 min, (2) denaturation of 94°C for 30 s, (3) primer hybridization for 30 s (dependent on primer temperature), (4) elongation at 72°C for 1 min and repeat of 35 cycles for steps 2, 3, and 4, and (5) final elongation at 72°C for 7 min.

2.2.3. Agarose Gel Migration

For the migration, 10 μl of each PCR sample was put into each well on 3% agarose gel. A molecular size standard between 50 and 500 bp was placed as control. The size of the different alleles could be determined very accurately. PCR products were separated under a voltage of 80 V for 1 h 30 min. DNA fragments were separated during migration according to their length, with shorter fragments migrating faster than longer fragments. The gel was transferred to ethidium bromide solution (5%) for 20 min and washed with distilled water for about 3 min. The fragments were visualized using a transilluminator equipped with a camera that detects fluorescent molecules present on the amplified fragments.

2.3. Genetic Diversity Parameters

The degree of polymorphism at each locus was assessed through the Polymorphism Information Content (PIC) using the formula [15] PIC = 1 − ∑P2ij or relative frequency of the jth allele for the ith locus of the set of all alleles. Furthermore, we estimated for each locus the following descriptors of genetic diversity: total number of alleles (A), number of rare alleles (Ar, less than 5%), number of private alleles (Apr), effective number of alleles (Ae), allelic richness (Arich), and Shannon index (I). The expected, heterozygosity (He) was calculated according to the work in [16]. The gene diversity was calculated as n/(n − 1) (1 −  − Ho/2n), where n is the number of individuals, qi is the frequency of the ith allele, and Ho is the number of observed heterozygotes. For genetic differentiation between the groups, F statistics (FST) were estimated according to the work in [17] and genetic distance per pair FST was calculated by the method of [18].

2.4. Data Analysis

Analysis of genetic data was performed using GenAlex 6.501 software used for the calculations of these different molecular genetic diversity parameters. Genetix and DaRwin software were used for the construction of the phylogenetic tree of diversity by the neighbor-joining algorithm method.

3. Results

3.1. Overall Genetic Diversity of the Millet Population
3.1.1. Number of Alleles Observed Per Marker

A total of 67 different alleles were detected using 18 polymorphic microsatellites (Table 2). The number of alleles ranged from 2 for markers Xpsmp 2236, Xpsmp 2231, and Xpgird 49 to 6 for markers Xpsmp2001 and Xpsmp 2273 with an average of 3.72 alleles per marker. The average number of effective alleles found was 3.13. The difference between these two averages is explained by the presence of rare alleles (Ar), whose frequency is less than 5%.

3.1.2. Polymorphism Information Content (PIC) and Shannon Index (I)

The PIC values in the pearl millet population ranged from 0.11 (Xpsmp 2027) to 0.99 (Xpsmp 2263) with an average of 0.68 (Table 2). 83% of the primers revealed a number of alleles from 3 to 6 with a PIC between 0.57 and 0.99 per locus. The collection, with 7 rare alleles out of 67 different alleles detected, has a polymorphism rate (P%) of 97.5%. The number of specific alleles estimated by the Shannon index showed a specific richness ranging from 0.44 to 1.45. Primers Xpsmp2236, Xpsmp2263, Xpsmp2231, Xpsmp2027, and Xpgird49 showed low Shannon index probability (Table 2).

3.1.3. Expected Heterozygosity (He)

The primers studied all showed expected heterozygosity values different from zero. The allele sizes were observed to range from 500 bp to 50 bp (Figure 2). The distribution of markers according to their degree of heterozygosity is given in Table 2, showing the existence of a large variability among markers. An average observed heterozygosity (He) of 0.61 was obtained with values ranging from 0.21 (Xpsmp 2027) to 0.98 (Xpsmp 2273) with an average of 0.61.

3.1.4. Rare Alleles (f ˂ 5%)

Seven alleles showed a frequency of less than 5% (Table 2). If this number alleles is high for a given marker, it puts the allelic richness of the marker into perspective. The number frequency of alleles is lower, ranging from 0 to 2, the maximum being reached with the markers having the highest number of observed alleles (Xpsmp 2273 and Xpsmp 2001).

3.2. Genetic Diversity of Accessions between Different Latitudes

Table 3 compares the genetic diversity of the accessions collected between different degrees of latitude. It can be seen that agroclimatic differences have a weak effect on the average genetic diversity studied. Sixty-seven (67) different alleles are detected with average allele number per marker according to latitude degree varying from 2.44 (latitude 14°) to 3.22 (latitude 11°). Latitude 10° had the lowest number of alleles (37 alleles). Accessions collected in latitude degrees 12° and 14° had relatively fewer alleles (44–49). The highest allele number was observed at 11° and 13° North latitude (58–55). These accounted for 87% of the total allelic diversity. The number of effective alleles for 12° latitude was nearly similar to that obtained for 10° and 13° latitudes but slightly higher than that for 14° and 11° latitudes. In addition, the accessions of latitude degrees 10° and 11° with a relatively low Shannon index reveal a better allelic originality. The private alleles found were 5, revealing that these accessions are likely to possess gene originality to be compared with other accessions. These alleles are from accessions from 10th, 11th, and 13th degree North latitude areas. The analysis also showed 4 others rare alleles, present only in the population of accessions from latitudes 11°, 12°, and 13°. Statistics based on expected heterozygosity revealed a genetic diversity (He) of 0.42 for the accessions from 14° North latitude was significantly low compared to other latitude degrees, showing low intrapopulation diversity in this climatic gradient.

3.3. Genetic Diversity of Landraces According to the Geographic Region

The organization of the collection according to the individual contributions of each geographic region to the total diversity is presented in Table 4. There were nonsignificant differences between regions for the effective number of alleles, which ranged from 2.75 (west) to 3.29 (south). This effective number of alleles is nearly similar for that of the east, north, and south regions. Gene diversity (He) for regional contributions show little variation among regions except for that of the West region (0.37). The number of alleles per locus between geographic regions varied from 2.72 (west) to 3.33 (south). Rare alleles were present in accessions from both the north and south. Statistics based on the number of alleles found indicate that two regions, namely, the north and south, can alone represent the total allelic diversity of the collection (100%).

3.4. Structuring of Accessions of Genetic Diversity

The analysis showed that genetic structuring is weakly related to the factor degree of latitude or geographic regions. This result is revealed by the existence of a nonsignificant genetic differentiation of accessions for the factors interdegrees of latitude and interregions (Tables 5 and 6 ). Indeed, the genetic distance between 14° latitude and each of the other four latitudes is greater than the observed difference between each of the four latitudes. However, the highest minimum Nei distance was observed between 10° and 14° latitude (0.196). In addition, the extreme geographic genetic distances of the accessions were observed between the east and the west (0.120). The correlation studied between the Nei genetic distance matrix and the geographic distance matrix between populations was found to be highly significant with Mantel's test at the 5% threshold (r = 0.83, ) showing molecular diversity related to geographic distances between cultivated accessions. The analysis indicates that cultivars at 14° latitude are genetically more separated from cultivars at other latitudes. The results presented in Tables 7 and 8 show, respectively, that the genetic similarity index of accessions remains high in each of the factors studied.

3.5. Organization of Genetic Diversity
3.5.1. Distance Table Factor Analysis (DTA)

The 1/2 plane axe explained 15.96% and 10.55% of the genetic variation, respectively (Figure 3). Analysis summarizes existing relationships between the five populations of accessions from different latitudes. In this 1/2 plane, the factorial analysis on the genetic distance table shows a clustering of accessions around the central axis, reflecting the low genetic diversity among the studied accessions. It was noted there is no clear pattern of observed structure among the accessions at different latitudes. However, a majority of accessions were located in the upper part of the biplot of the factor analysis. These accessions show a better relationship compared to those located in the lower part of the biplot. The analysis indicates that most of the accessions at 14° latitude are included in the accessions at 13° latitude. The trend of genetic diversity consisting primarily of two blocks emerged from this analysis.

3.5.2. Racial Structuring of Landraces

Racial structuring resulting from the pairwise comparison of the genetic distances between the 86 accessions according to neighbor-joining is represented by the dendrogram (Figure 4). The different accessions were classified within the three groups. Indeed, the dendrogram indicated a classification without clear grouping criteria. However, three main groups are observed from which secondary subgroups diverge. These three genetic populations (K = 3) showed structuring independent of geographic regions and climatic gradients:(i)Genetic group A from geographical regions south, east, and north represents a part of early accessions from the north and semiearly from the south and east has adapted to several environments.(ii)Genetic group B contains a part of accessions of all the geographical regions. This group gathers the majority of semiearly accessions from all geographical regions.(iii)Genetic group C is subdivided into three genetic subpopulations C1, C2, and C3. The most of the accessions are represented in this group. The accessions of the C1 subgroup include accessions from the north, west, and south that are mostly early. In addition, the C2 subgroup is made up only of accessions from the north and south. The C3 subgroup includes accessions from all regions in which late, semiearly, and early accessions are found.

3.6. Genetic Cluster Analysis of Landraces

The level of diversity of the three identified genetic groups is given in Table 9. Indeed, genetic groups B and C show higher levels of diversity than the population of genetic group A for the heterozygosity rate parameter (He), while the latter has the best allelic richness than either of the two groups. Thus, the analysis of genetic diversity at the allelic level of each genetic population through the diversity indices is as follows:(i)Genetic group A contains 39 alleles on average, an average expected heterozygosity (He) of 0.41, and a polymorphism rate of about 67%. This genetic group with its allelic richness represents 42.85% of the allelic diversity within the total population. Only 5.1% of the alleles are specific for this genetic group. Genetic group A appears to be less heterogeneous than the other two genetic groups, although it is poorly represented in cultivated pearl millet populations.(ii)Genetic group B indicates 53 alleles on average, an expected heterozygosity (He) of 0.80, and a polymorphism rate of around 85%. Its allelic diversity corresponds to 71.43% among the general population studied. This genetic group was the one that presented the rarest alleles (7 alleles) but the lowest PIC (0.55) and, therefore, presents less allelic richness compared to the other genetic groups. In this genetic group, some alleles are likely to disappear more rapidly.(iii)Genetic group C has the largest number of alleles (62 alleles) with an average expected heterozygosity (He) of 0.86, a polymorphism rate of 87%, and a PIC of 0.61. It appears as the genetic group with the highest contribution to the total allelic diversity with 89.80%. This population was fundamentally heterogeneous and is made of a mixture of genotypes from all geographical regions. We find 20.97% of specific alleles for this genetic group.

3.7. Genetic Organization

Table 10 presents the minimum Nei distance between pairs of genetic groups of the populations. It was observed the population of genetic group B is at intermediate genetic distance (0.041 and 0.042) from genetic group A and genetic group C, respectively. The most genetically distant groups (0.660) were genetic group A and genetic group C. Genetic distances estimated between populations (FST) showed a value 0.016 after 300 permutations. The coefficient of genetic differentiation (Gst) between populations was 0.46. These reveal the plausible existence of allelic substitutions occurred during the evolution of three populations after their differentiation.

4. Discussion

4.1. Pearl Millet Landrace Genetic Diversity

The frequency of heterozygotes averaged for two different alleles is high. It reveals that the majority loci show polymorphism to be accounted for by selection-mutation balance. However, polymorphism differs also for different kinds of trait because polymorphism is found in almost all natural populations and occurs at all levels of genetic organization, from DNA sequences to major morphological traits. Polymorphism is considered as the source of the variation of quantitative characters and is a fundamental interest to the selection. In another studies with pearl millet taxonomic groups, frequencies of heterozygotes of 0.77 and 0.74, respectively, were reported [6, 19]. The frequency of heterozygote remains high in cultivated pearl millet populations that coexist with wild forms. There are, however, difficulties connected with the effects of inbreeding and with the genetic load incurred if very many loci are kept polymorphic by heterozygote advantage. The genetic erosion of wild forms due to anthropogenic actions reduces the genetic diversity of pearl millet cultivated. Todays, low level of genetic diversity of pearl millet cultivated in sub-Sahelian Africa could be attributed climatic shift and variations and biotic constraints. The genetic erosion of cultivated species in several countries of sub-Saharan Africa is increasing in the context of climate change according to the work in [20].

Allelic diversity of millet populations from rural farmers is low compared to the allelic diversity reported in [6, 13], using the same markers, where a high average number of alleles per locus was observed.

Indeed, in [6, 13], an average number of alleles per locus of 9.6 and 13.3, respectively, was found. Pearl millet collected from Mali and Niger on the one hand and Sudan on the other hand was used, which are centers of primary pearl millet origin [21, 22]. This could explain why the genetic base of pearl millet grown in Burkina Faso is narrowing. Indeed, allelic richness constitutes a source of evaluation of genetic diversity as revealed in [23, 24]. It was considered that the preservation of the alleles of a species is more important than the maintenance of allelic frequencies. However, the allelic richness of pearl millet populations for the case of this collection is also comparable to that in [6, 13].

Based on the concepts related to the polymorphism of a population, the polymorphism rate obtained in our collection showed a proportion of locus quite polymorphic but lower than that in [6]. This rate is higher than that obtained in [5]. Maintaining such a high frequency of secondary alleles above 20% explains a still low level of inbreeding in the study population. This suggests a high potential for obtaining heterosis of interest.

4.2. Regional Genetic Diversity

Although environmental conditions are highly variable in the geographic regions of the country, the genetic variation of pearl millet between regions shows a nonsignificant difference. This genetic diversity has resulted in a significant number of common alleles among the geographical regions. Previous studies based on enzyme markers have shown that the ecotypes of the central, south central, and eastern regions form predominantly homogeneous groups [25]. The present genetic diversity shows that total genetic diversity can be obtained from two geographical regions of the country, indicating a high rate of new allopatric populations simply following exchanges of accessions between different regions and climate change. A majority of common alleles were identified between accessions from different geographical regions revealing the low allelic diversity between traditional cultivated varieties. In general, the high allogamy of pearl millet (85%) is a source of genetic similarity between accessions as well as farmers’ practices for those reported in [19, 26]. This high similarity of our accessions is explained by interfertility, which is the preferred mode of reproduction of millet. This high similarity between accessions is explained by the permanent exchange of seeds between farmers in different geographical regions.

The fact that traditional seed varieties circulate among farmers through inheritance and donation means that the same varieties are found in several growing areas. According to the work in [27], rural populations take their seeds with them when they move to other growing areas. This analysis of the nonsignificant difference between geographical regions was also revealed in Sudan in [6], thus showing that the structuring of genetic diversity of pearl millet populations is independent of geographical regions. The work in [5, 26] justifies the low level of genetic differentiation in pearl millet by gene flow and seed sharing among farmers. In contrast, in [19], it was shown that the similarity of accessions between geographical regions is linked to high climatic variability leading to selection pressures and over time narrows genetic diversity. Rural farmers tend to select crops varieties for agronomic and resistance performance criteria at the expense of conservation of genetic resource. In the phenotypic characterization, it was observed in each geographical region that the different varieties are identified by their morphological and phenological traits. However, these morphological differences do not seem to have as a corollary a great genetic difference between the accessions. These same observations were reported in [28].

4.3. Genetic Organization

Genetic differentiation between accessions was found to be much more important than that between geographic regions as evidenced by the number of groups and subgroups shown by the neighbor-joining tree. The pearl millet populations appear to form a single gene pool given their small genetic distance. This corroborates with the studies in [25] which revealed a single group with the use of biochemical markers. Molecular markers have been shown to be more informative in genetic diversity studies than enzymatic markers and are more conclusive in a conservation study of plant genetic resources and related species. The genetic diversity structure of the accessions showed three different populations with a classification that does not contain geographical regions. This constitutes a source for the development of heterosis cultivars to increase productivity and their resilience to climatic variability. The average values of the genetic diversity of each of the genetic groups obtained indicate that each population is rich in heterozygotes. Our results reveal the existence of three racial pearl millet populations.

However, this group structuring may still be influenced by the markers used, the sample size, and the geographic distances of the accessions revealed in [29]. With this level of observed genetic diversity, a collection plan to build a core collection would be an option to support breeding programs for this species. The presence of northern accessions in all genetic groups would be explained by population migrations from one area to another on the one hand and cultural and ethnic factors on the other hand. The work in [27, 30] revealed that rural populations take their seeds with them when they move from Sahelian to Sudanese zones. Before, the work in [25] attributed the diversity of the northern ecotypes to their proximity to Nigerian millets, which were recognized as the most varied and diverse in the Sahelian zone.

5. Conclusions

The study revealed the existence of a relatively low genetic diversity within the landraces studied. Twenty markers were used of which eighteen were polymorphic. The structuring of the genetic diversity revealed three genetic groups cultivated in the country. This indicates that this diversity can be used in a breeding program. However, genetic group C groups the maximum number of landraces with a broad genetic base. The genetic differentiation obtained shows that millets are not grouped according to a geographic system but rather on the basis of factors related to selection pressure, environmental constraints, and anthropogenic factors.

Data Availability

The data used to support the findings of this study are available from the corresponding author upon request.

Disclosure

This research is part of a Ph.D. thesis called “Etude de la diversité génétique des populations de mil [Pennisteum glaucum (L.) R.Br.] cultivées au Burkina Faso dans le contexte du changement climatique” [31].

Conflicts of Interest

The authors declare no conflicts of interest regarding the publication of this paper.

Authors’ Contributions

This work was carried out in collaboration between both authors. The first author designed the experiment and analyzed the data. Both of the authors interpreted the data, read the final manuscript, and agreed with all contents.

Acknowledgments

The authors gratefully acknowledge the Biosciences Laboratory of the Joseph KI-ZERBO University for the financial support for the experiment and the input in this manuscript.