Abstract

This work applies the three mode principal components analysis to analyze simultaneously the multiple attributes; to fit of models with additive main effects and multiplicative interaction effects (AMMI models) and the regressions models on sites (SREG models); to evaluate, respectively, the multivariate response of the genotype × environment interaction and the mean response of 36 genotypes of corn tested in 4 locations in Brazil. The results were presented by joint plots to identify the best genotypes for their adaptability and performance in the set of attributes.

1. Introduction

The multienvironment trials are conducted to estimate the genetic stability, to evaluate the performance of the genotypes in different environmental conditions, and to quantify and interpret the genotype × environment interaction (GEI), in such a way to select the best genotypes that will be recombined and planted in other years and other environments in the next selection cycle. Some statistical models are used to evaluate the behavior of the genotypes; the most recently proposed methods are based on the singular value decomposition of the GEI matrix.

In [1], the two-way interaction using the principal components analysis (PCA) are explained and considered in their model only one multiplicative term. An extension of these models was made in [2], which applies the PCA to decompose the two-way interaction in several multiplicative terms. This model was called as additive main effects and multiplicative interaction effects model (AMMI) by Gauch [3].

Another class of the Linear-bilinear model used in multienvironment trials is the sites regression model (SREG) described by [4]. In this case, the main objective is to evaluate the response of the genotype in each environment. The basic difference in relation to AMMI models is that the effect of genotype is introduced into the residual interaction. This model was used in [5] for clustering of environmental data with no cross-interaction.

Gabriel [6] describes the least-square adjustment of the AMMI model, first estimating the additive effects of the model and then making the decomposition in singular values of the matrix () of interaction residuals. The results can be interpreted by graphics called biplot [7] that reflects in reduced dimensions the most important aspects of the GEI, in the case the AMMI model, or the performance of genotypes in different environments, for the case of the model SREG.

The researchers in the multienvironmental trials evaluate multiple attributes and must select the best genotypes taking in to account its adaptability and performance in the set of attributes. With the AMMI and SREG models, it is possible to study the genotypes in a single attribute. However, if we want a response with multiple attributes, it is necessary to apply a statistical technique that allows working with three-dimensional structure data.

In [8], a three mode principal components analysis (PCA3) is proposed, in order to find the least-square estimates in the model of Tucker [9]. This procedure was used to study the GEI with multiples attributes [1013]. The method also was applied by [14, 15] to study the three-way interaction in agricultural experiments where genotypes are tested in different locations over several years.

In this work, we used the three mode principal component analysis to explain, in reduced dimensions, the most important aspects of multiattribute performance of the 36 genotypes of maize tested in 4 regions of Brazil. First, we studied the GEI using the AMMI model with multiple attributes as described in [15, 16] and finally studied the genotypes in relation to the mean environment for multiple attributes as described in [17].

2. Materials and Methods

Data are from experiments with 36 genotypes of maize (first mode, ), evaluated in 4 locations of Brazil (: Rio Branco-AC, : Milagres-CE, : Linhares-ES, cohesive soil Oxisol, : Linhares-ES, soil-LVA-Distrophic LVD11; second mode, ). In each of the experiments, 10 attributes were evaluated: : flowering (days), : plant height (cm), : corn-cob length (cm), : percentage of plants laid, : percentage of plants broken, : final stand, : number of spikes, : percentage of patients corn-cobs, : weight of spikes (kg/ha), and : weight of grains (kg/ha) (third mode, ).

The data were obtained using a simple lattice design in duplicate 6 × 6 with 4 replications. Each plot consisted of two rows of 5 m, used entirely. The spacing between rows was 1 m and between hills it was 0.40 m. Each plot consisted of 26 hills and each hole with 3 seeds. The seeds used in the tests were sent by the National Center for Research in Maize and Sorghum—CNPMS of Empresa Brasileira de Pesquisa Agropecuária—EMBRAPA, already counted, and contained in envelopes coded with the number of genotype and number of the parcel.

2.1. Statistical Processing

Each attribute was calculated according to model the estimates corresponding to interactions: () for AMMI model or () for SREG model, where represents the mean corresponding to attribute for the genotype in location ; represents the mean corresponding to the attribute for the genotype ; represents the mean corresponding to the attribute in location ; represents the overall mean of the data for the attribute . Two tridimensional arrays (Z) were constructed with the values obtained using the AMMI model or the SREG model, that is, for each model, the value in the array Z represents the residue to the genotype in the location , for the attribute . Each array was applied to three mode principal components analysis. Therefore, this methodology can be used for any dataset.

2.2. Three Mode Principal Component Analysis

The method consists of finding the least-square estimators for the Tucker model: where , , and are the elements of the principal components matrix (A, B, and C) associated with each respective mode, and in this example, A is the principal components matrix for the genotype mode, B is the principal component matrix for the location mode, and C is the principal component matrix for the attribute. G is a core array of the three way, where the element represents the relationship between the th component of the first mode with the th component of the second mode and with the th component of the third mode. is the rank of the matrix that contains the residuals of the interaction with all the variables, calculated as an AMMI model or as an SREG model. In this matrix, the rows represent the levels of the genotypes and the columns represent combinations of the levels of locations and attributes. Similarly, and are defined as the rank of matrices and .

To select the number of principal components retained for each mode, the algorithm proposed by [18] was used, based on successive eliminations until finding the optimal solution.

2.3. Biplot Representation for Three Marker Matrices

A biplot is a simultaneous representation of rows and columns of a matrix (two-way array) in terms of directions and projections [7]. When we have several matrices, or when we are working with data contained in a three-way array, it is necessary to project on the principal components of one of the modes. Suppose we are going to project onto the components of the second mode (locations) and represents the number of principal components retained in this mode; then we have to obtain the matrices , and require one biplot for each matrix. is the part of G associated with the component of the second mode. This procedure is described in [8] and commonly called a joint plot.

3. Results and Discusions

3.1. AMMI Multiattribute

Table 1 shows the results of the 6 × 2 × 3 solution obtained by applying the three mode principal component analysis to the three-way array with the residuals of interaction obtained for the AMMI model with 10 attributes simultaneously. This solution explained 44.92% of the total variability, therefore, this percentage is significant in relation to ten attributes measured in the research. The components associated with the genotype mode explain, respectively, 12.85%, 11.70%, 6.37%, 6.22%, 4.93%, and 2.84%, the components of the location mode explain, respectively, 25.49% and 19, 44%, and the components of the attribute mode explain, respectively, 22.6%, 11.6%, and 11.0% of the total variability.

Component 1 of the local mode is characterized by all the sites with relatively high weights (Matrix B, Table 1). This component is described by contrast between sites and (Rio Branco-AC and Milages-CE) versus sites and (Linhares-ES oxisol cohesive soil and Linhares-ES LVA-dystrophic-LVD11 soil).

Figure 1 shows the most important genotypes × locations interactions for the different attributes evaluated, mainly for the attribute (percentage of plants laid). Genotype 19 interacts positively in locations and and negatively in locations and for attribute . Genotypes 18 and 31 have a behavior completely contrary to genotype 19, that is, they interact positively in the locations and and interact negatively in locations and for the attribute . In regard to variable (plant height), the behavior of these genotypes is totally contrary to variable , for example, the genotype 19 interacts negatively in locations and and positively in the locations and for the attribute .

Figure 1 also shows the contrast between the genotypes 9 and 12 of the genotypes 17, 24, and 32. The first group interacts positively in the attributes , , and for locations and and negatively in these attributes for the locations and . Genotypes 17, 24, and 32 have positive interaction in the attributes and in the locations and but have negative interactions in locations and in these attributes.

Figure 2 shows the projection biplot onto the second component of locations, which is dominated by contrast of the sites and with respective weights (−0.75 and 0.64) (Matrix B, Table 1). In this graphic, it is possible to study the genotypes × locations interaction in different attributes for the locations and . For example, genotype 15 interacts positively in the local for attribute and interacts negatively in local for this attribute. There is also the contrast of genotypes 4 and 18 in relation to genotype 26. The genotype 26 interacts positively with the attribute in local and interacts negatively in local for this attribute, then in there are a few percentage of plant laid, and in many percentage of plant laid. The contrast between the genotypes 5 and 14 in relation to the genotypes 6 and 35 for attributes , , and can be observed. The first group had positive interactions with and negative with and showed that in the there are high values of final stand, weight of spikes, and weights of grains.

3.2. SREG Multiattribute

Table 2 shows the results of the 5 × 2 × 3 solution obtained by applying the three mode principal components analysis to the three-way array containing the residuals of interaction for the SREG model with 10 attributes simultaneously. This solution explained 53.70% of the total variability. The components associated with the genotype mode explain, respectively, 27.69%, 13.34%, 6.96%, 3.77%, and 1.94%, the components of the local mode explain 46.23% and 7.48%, and the components of the mode attribute explain 27.96%, 17.31%, and 8.42%. In this analysis the first component of the local mode is characterized by all the sites with relatively high weights (Matrix B, Table 2).

Figure 3 studies the behavior of the genotypes in relation to the mean environmental for different attributes. Genotypes 10, 11, 28, 31, and 34 are above mean environmental for the attributes , , and in the 4 locations and, in contrast, the genotypes 13, 14, 15, 16, 17, and 32 have a performance below the mean environmental in these attributes. Genotype 4 has good performance in the variables and to the 4 sites in contrast to genotype 8 which is below the mean environmental in these attributes in the same locations. An opposite behavior occurs in the variable in which genotype 4 is below mean environmental and genotypes 8, 14, and 32 have good performance.

The second component of the local mode is characterized by local and with respective weights of −0.73 and 0.61 (Table 2, Matrix B). This component reflects the contrast between these two locations. In the corresponding joint plot (Figure 4), it can be observed that genotypes 1, 4, 7, 9, 12, and 25 have good behavior with the variable for the local , but are below the mean environmental in local for this same attribute. For the same genotypes, the opposite happens, they have good performance in variable for the local , but they are below mean environmental for the same attribute to the local .

The fundamental difference between AMMI models with multiple attributes and SREG models with multiple attributes is that in the first case study, the two-way interaction considering all the attributes, while in the SREG model, it is possible to study the multivariate behavior of genotypes in relation to the mean environmental.

4. Conclusions

Based on three mode principal components analysis, it is possible to work simultaneously with multiple attributes in the adjustment of AMMI and SREG models, so we can do a multivariate study of the most important aspects of the genotype × environment interaction and the response of genotypes in different environmental conditions.

When we have many combinations of levels of factors, the conditioned joint plot is a powerful tool to represent three arrays of markers and gives us information on the most important aspects of the response of genotypes. It was possible to detect the combinations of genotypes × locations × attributes causing the significant interaction and the differential response, that is, identify the best genotypes in view of its adaptability and performance.