|
|
||||||||
a Dep. of Ciências Exatas, Univ. of São Paulo/ESALQ, Av. Padua Dias 11, Cx.P.09, 13418-900, Piracicaba-SP, Brazil
b School of Mathematical Sciences, Laver Building, North Park Road, Exeter, EX4 4QE, UK
* Corresponding author (ctsdias{at}carpa.ciagri.usp.br)
| ABSTRACT |
|---|
|
|
|---|
Abbreviations: AMMI, additive main effects and multiplicative interaction model COMM, completely multiplicative model DF, degrees of freedom GEI, genotype x environment interaction GREG, genotype regression model IPCA, interaction principal component analysis MET, multi-environment trials NID, normally and independently distributed PCA, principal components analysis PRESS, predictive sum of squares PRECORR, predictive correlation RMSPD, root mean square predictive difference SHMM, shifted multiplicative model SREG, sites regression model SS, sum of squares SVD, singular value decomposition
| INTRODUCTION |
|---|
|
|
|---|
In plant breeding, multienvironment trials (MET) are important for testing general and specific cultivar adaptation. A cultivar grown in different environments will frequently show significant fluctuation in yield performance relative to other cultivars. These changes are influenced by the different environmental conditions and are referred to as GEI. A typical example of a matrix X arises in the analysis of MET, in which the rows of X are the genotypes and the columns are the environments where the genotypes are tested. Presence of GEI rules out simple interpretative models that have only additive main effects of genotypes and environments (Mandel 1971; Crossa, 1990; Kang and Magari, 1996). On the other hand, specific adaptations of genotypes to subsets of environments is a fundamental issue to be studied in plant breeding because one genotype may perform well under specific environmental conditions and may give a poor performance under other conditions.
Crossa et al. (2002) give a comprehensive review of the early approaches for analyzing GEI that include the conventional fixed two-way analysis of variance model, the linear regression approach, and the multiplicative models. The empirical mean response,
ij, of the ith genotype in the jth environment with n replicates in each of the i x j cells is expressed as
ij = µ + gi + ej +
ij +
ij where µ is the grand mean across all genotypes and environments, gi is the additive effect of the ith genotype, ej is the additive effect of the jth environment, (ge)ij is the GEI component for the ith genotype in the jth environment, and
ij is the error assumed to be NID (0,
2/n) (where
2 is the within-environment error variance, assumed to be constant). This model is not parsimonious, because each GEI cell has its own interaction parameter, and uninformative, because the independent interaction parameters are complicated and difficult to interpret.
Yates and Cochran (1938) suggested treating the GEI term as being linearly related to the environmental effect, that is setting (ge)ij =
iej + dij, where
i is the linear regression coefficient of the ith genotype on the environmental mean and dij is a deviation. This approach was later used by Finlay and Wilkinson (1963) and slightly modified by Eberhart and Russell (1966). Tukey (1949) proposed a test for the GEI using (ge)ij = Kgiej (where K is a constant). Mandel (1961) generalized Tukey's model by letting (ge)ij = 
iej for genotypes or (ge)ij =
gi
j for environments and thus obtaining a "bundle of straight lines" that may be tested for concurrence (i.e., whether the
i or the
j are all the same) or nonconcurrence.
Gollob (1968) and Mandel (1969)(1971) proposed a bilinear GEI term
ij =
sk=1
k
ik
jk in which
1
2
...
s and
ik,
jk satisfy the ortho-normalization constraints
i
ik
ik' =
j
jk
jk' = 0 for k
k' and
i
2ik =
j
2jk = 1. This leads to the linear-bilinear model
ij = µ + gi + ej +
sk=1
k
ik
jk +
ij, which is a generalization of the regression on the mean model, with more flexibility for describing GEI because more than one genotypic and environmental dimension is considered. Zobel et al. (1988) and Gauch (1988) called this the Additive Main Effects and Multiplicative Interaction (AMMI) model.
A family of multiplicative models can then be generated by dropping the main effect of genotypes (Site Regression Model, SREG), the main effect of sites (Genotype Regression Model, GREG), or both main effects (Complete Multiplicative Model, COMM). Another multiplicative model, the Shifted Multiplicative Model (SHMM) (Seyedsadr and Cornelius, 1992) is useful for studying crossover GEI (Crossa et al., 2002.)
However, one aspect that has not yet been fully resolved concerns the determination of the number of multiplicative components to be retained in the model to adequately explain the pattern in the interaction. Some proposals have been put forward by, among others, Gollob (1968), Mandel (1971), Gauch and Zobel (1988), Cornelius (1993), and Piepho (1994)(1995). All take into consideration the proportion of the variance accumulated by the components (Duarte and Vencovsky, 1999), and the more recent ones focus on cross validation as a predictive data-based methodology. However, some problems still remain, notably in optimizing the cross-validation process.
In this paper, we first summarize the AMMI model and analysis for genotypeenvironmental data, and sketch out the available methodology for selecting the number of multiplicative components in the model. We then describe two methods based on a full leave-one-out procedure that optimizes the cross-validation process. Both methods are illustrated on some unstructured multivariate data. Their application to analysis of GEI is then exemplified on some experimental data, and a comparison of all available methods is made on data from five multienvironment cultivar trials.
| MATERIAL AND METHODS |
|---|
|
|
|---|
![]() |
The AMMI model postulates additive components for the main effects of genotypes (gi) and environments (ej) and multiplicative components for the effect of the interaction (ge)ij. Thus, the mean response of genotype i in an environment j is modeled by:
![]() |
![]() |
![]() |
Estimates of the overall mean (µ) and the main effects (gi and ej) are obtained from a simple two-way ANOVA of the array of means Y(gxe) = [Yij]. The residuals from this array then constitute the array of interactions:
![]() |
k is estimated by the kth singular value of GE,
ik is estimated by the ith element of the left singular vector
k(gx1), and
jk is estimated by the jth element of the right singular vector
'k
associated with
k (Good, 1969; Mandel, 1971; Piepho, 1995). Correspondences between SVD and PCA are as follows:
k is the kth singular value or the square root of the kth largest eigenvalue of the arrays (GE)(GE)T and (GE)T(GE), which have equal nonnull eigenvalues;
ik is the ith element of the eigenvector of (GE)(GE)T associated with
2k;
jk is the jth element of the eigenvector of (GE)T(GE) associated with
2k.
The GEI in this model is thus expressed as a sum of components, each multiplied by
k, for a genotypic effect (
ik) and an environment effect (
jk). The term
k gives the proportion of the variance due to the GEI interaction in the kth component. The effects
ik and
jk represent weights for genotype i and environment j, in that component of the interaction (
2k). The rank of GE is s = min{g - 1, e - 1}, so the index k in the sum of multiplicative components can run from 1 to s. Use of all s components regains all the variation: SS(GEI) =
2k=1
2k and the model is saturated so it produces an exact fit to the data, with no residual error term against which to test effects (except in the situation when an independent error is estimated). When m < s, the model is said to be truncated. However, for AMMI, one does not try to recoup the whole SS(GEI) but only the components most strongly determined by genotypes and environments. Consequently, the index is generally set to run to m < s, so the estimates are obtained from the first m terms of the SVD of the GE array (Good, 1969; Gabriel, 1978). This is a least-squares analysis that leaves an additional residual denoted by
ij. Thus, the interaction of genotype i with environment j is described by
mk=1
k
ik
jk, discarding the noise given by
sk=m+1
k
ik
jk. Here, as in PCA, the components account, successively, for decreasing proportions of the variation present in the GE array (
21
22
...
2s). Therefore, the AMMI method is seen as a procedure capable of separating signal and noise in the analysis of the GEI (Weber et al., 1996).
Determining the Optimal Number of Multiplicative Terms in the AMMI Model
The main objective is the prediction of the true trait response in the cell of the two-way table of genotypes and environments. To achieve this, a truncated AMMI model should be used and thus criteria for determining the number of components needed to explain the pattern in the GEI term have been the objects of some research (Gollob, 1968; Mandel, 1971; Gauch and Zobel, 1988; Piepho, 1994, 1995; Cornelius, 1993; Cornelius et al., 1996).
Two basic approaches have evolved to determine the optimal number of multiplicative terms to be retained in the GEI component. One approach uses a cross-validation method in which the data are randomly split into modeling data and validation data. AMMI is fitted to the modeling data and the mean squared errors of prediction (expressed as the root mean squared predictive difference, RMSPD) are determined from the validation data. The main criticism of this approach is that the best predictive model computed from a subset of data may not be the best model when all data are considered (Cornelius and Crossa, 1999); if crossvalidation is used in MET data, the data must be adjusted for replicate differences within environments (Cornelius and Crossa, 1999). The other approach for determining the best predictive truncated model is to use tests of hypotheses about the kth component, H0k:
k = 0, using the complete data set (and not a subset like in the cross-validation approach). These tests are based on the sequential sum of squares explained by the multiplicative terms.
We will now briefly review these two approaches. It may be noted that shrinkage estimators of multiplicative models have been shown recently to be good predictors of cultivar performance in environments (Cornelius and Crossa, 1999), but these estimators require df estimates that the authors find problematic. Moreover, other classes of estimators can be better than shrinkage estimators (see Venter and Steel, 1993). So we do not consider them any further.
Tests of Significance of Multiplicative Terms
The sequential sum of squares of the AMMI model for the kth component, Sk, is given by n
2k for k = 1,2,...,rank(GE) (where GE =
ij -
i. -
.j +
..). As in PCA, all of the test criteria involve, at least indirectly, the ratio of the accumulated sum of squares for the first m components to the total SS(GEI), i.e.,
mk=1
2k/SS
.
One of the usual procedures consists of determining the degrees of freedom associated with a particular component of SS(GEI) for each member of the family of AMMI models. This enables mean squares to be computed for each component, together with an error mean square. Since we have an orthogonal partition of the interaction sum of squares, the ratio of the mean square of any interaction component to the error mean square is then assumed to follow an F distribution with the corresponding degrees of freedom. This implicitly assumes a normal distribution for the original response variable, and enables individual interaction components to be subjected to significance tests. However, validity of the F distribution in these circumstances is subject to considerable doubt. The eigenvalues
2k of the matrix (GE)(GE)T (or (GE)T(GE)) are distributed as eigenvalues of a Wishart matrix but do not have a chi-square distribution. Since the Sk are not independent random variables following a chi-square distribution, an F test does not hold. Nonetheless, selection of the optimal model is often based on F tests for the successive terms of the interaction, the number of included terms corresponding to the number of significant components. The Gollob (1968) approximate F test assumes that n
2k/
2 is distributed as chi-square and so obviously does not hold. Computer simulations done by Cornelius (1993) showed that Gollob tests at the 0.05 level are very liberal with Type I error rate of 66% for testing H01:
1 = 0. The F-approximation tests FGH1, FGH2 (Cornelius et al., 1992; Cornelius et al., 1993), effectively control Type I error rates, and are generally more parsimonious than the Gollob test. However, these tests are conservative for testing multiplicative terms for which the previous term is small. Simulation and iteration tests have greater power than the FGH1 and FGH2 tests with good control of Type I error rates. The residual AMMI, collected in the last term of SS(GEI), can also be tested to confirm its nonsignificance.
Turning to the question of degrees of freedom, Gauch and Zobel (1996) mention some methods for attributing degrees of freedom to components of an AMMI model; those of Gollob (1968) and Mandel (1971) are particularly popular. However, the authors warn that, unfortunately, there are disagreements between these methods. Choosing one requires both theoretical and practical considerations. The approach of Gollob (1968) is very easily applied, since the number of degrees of freedom for component m of the interaction is simply defined to be DF(IPCAm) = g + e - 1 - 2m, whereas most other approaches require extensive simulations before they can be used.
For instance, Mandel (1971) defines the number of degrees of freedom for component k to be DF(IPCAk) = E
/
2, where
2 is the population variance. However, simulations then have to be conducted to evaluate the number of degrees of freedom in particular cases. Mandel gives some tables derived from such simulations for a limited set of conditions, whereas Krzanowski (1979) gives some exact versions. These tables, however, are not exhaustive and this reduces the practical utility of the method. By contrast with Gollob (1968), Mandel's (1971) system generally results in a nonlinear decrease in the degrees of freedom for the successive interaction terms, which can still be fractions.
For some years, the degrees of freedom have been obtained by Mandel's (1971) proposal, which was considered exact and therefore correct. However, this proposal has received much criticism recently (e.g., Gauch, 1992), and it is now felt to be less appropriate than the approach of Gollob (1968). The reason for this criticism centers on the assumptions made by Mandel in his simulations, that the matrix contains only noise and not signals, whereas the presence of signal affects the component patterns substantially.
Gauch (1992) discusses the question of obtaining the degrees of freedom for the multiplicative components of an AMMI model. He concludes that rigorous simulations seem unnecessary or impractical, and generally recommends the use of Gollob's system when one is using an F-test approach, bearing in mind that the procedure is an intuitive guide. In cases where there seems to be a clear division between the large components determining the systematic part and small noise components, the author suggests that assigning equal degrees of freedom {DF(IPCAk) = [(g - 1)(e - 1)]/e]} is especially useful for early components because normally there will be little interest in partitioning the noise components. In addition, definitive questions of research require the exact assigning of the degrees of freedom to each multiplicative term. Therefore by Gollob's system, the full joint analysis of variance (computed from means) has the structure as shown in Table 1 .
|
The Cornelius test statistic with m multiplicative terms in the model is as follows:
![]() |
This is the FR test of Cornelius et al. (1992) that may turn out to be liberal as compared with FGH1, FGH, or simulation iteration tests. Under the null hypothesis that no more than m terms determine the interaction, the numerator (i.e., the residual SS(GEI) for the fitted AMMI model) is, approximately, a chi-square variable (Piepho, 1995) so the test statistic has an F distribution with f2 and Error mean square degrees of freedom.
Thus, a significant result for the test suggests that at least one more multiplicative term must be added to the m already included. It can therefore be seen as a test of significance of the first m + 1 terms of the interaction (similar to the test of lack of fit in linear regression). When m = 0, i.e., when no multiplicative term is included, the test is just equivalent to the F test for global GEI in the joint ANOVA. It is an exact test. One also notices that the number of degrees of freedom of the numerator of FR is equal to the degrees of freedom for the whole interaction minus the degrees of freedom attributed by Gollob (1968) for the m first terms. It is concluded, therefore, that the application of FR is equivalent to the test of residual AMMI for GEI, as suggested.
Predictive Assessment Using Cross Validation
Gauch and Zobel (1988) comment that evaluations such as those above by means of distributional assumptions via the F test can be termed "postdictive," in that they search for a model to explain a great part of the variation in the observed data (with high coefficient of determination). Thus, they argue, such methods are not efficient for selecting parsimonious models and are liable to include noise. By contrast, "predictive" criteria of evaluation capitalize on the ability of a model to form predictions with data not included in the analysis, simulating future responses not yet measured, so it would be preferable to base the model choice on such criteria.
To make predictions, in general, it is necessary to use computationally intensive statistical procedures. The less the model choice or assessment of performance of a predictor is based on distributional assumptions, the more general is the result. Thus, methods that are essentially data-based and free of theoretical distributions will have the greatest generality. Such methods involve resampling the given data set, using techniques such as the jackknife, the bootstrap and cross validation. Gauch (1988) introduced the name "predictive evaluation" when it is based on cross validation (Stone, 1974; Wold, 1978), and this is the principle underlying his proposal for selection of number of components in AMMI models.
In his method, the replications for each combination of genotypes and environments are randomly divided into two subgroups: (i) data for the fit of the AMMI model and (ii) data for validation. The responses are predicted for a family of AMMI models (i.e., for different values of m) and these are compared with the respective validation data, calculating the differences between these values. Then, the sum of squares of these differences is obtained and the result is divided by the number of predicted responses. This method was developed further by Crossa et al. (1991). The authors call the square root of this result the mean predictive difference (RMSPD) and suggest that the procedure be repeated about 10 times, getting an average of the results for each member of the family of models. A small value of RMSPD indicates predictive success of the model, so the best model is the one with smallest RMSPD. The chosen model is then used to analyze the data of all the n replications, jointly, in a definitive analysis.
Further modifications have been proposed in recent years. Piepho (1994) suggests obtaining the average value of RMSPD for 1000 different randomizations, instead of the 10 suggested by Crossa et al. (1991). The author considers a modification of the completely random partition of the data (modeling and validation) when the experiment is blocked. In this case, he recommends drawing entire blocks from the experiment and not making components for each combination of genotype and environment. Thus, the original block structure is preserved. However, despite the logical coherence of this type of proposal, studies confirming its effectiveness are still not available. Gauch and Zobel (1996) suggest that the validation data set should always be just one observation for each treatment. This is because that it is most likely, from n - 1 data points, to find a model that is closest to the analysis of the full set of n data points. We thus take up this idea in the present contribution, and describe two methods that optimize the cross-validation process by validating the fit of the model on each data point in turn and then combining these validations into a single overall measure of fit.
Cornelius and Crossa (1999) used cross validation for comparing performance of shrinkage estimators, truncated multiplicative models and best linear unbiased predictor (BLUP) by computing the RMSPD as the square root of the mean squared difference between the predictive value and their corresponding validation data on replication adjusted data of five MET. The authors used a stopping rule for the number of crossvalidations that consisted in calculating the pooled mean square predictive difference on the mth execution of the loop (PMSPDm). The crossvalidation was terminated if the maximum absolute value of PMSPDmPMSPDm-1 was less than 0.01. The maximum number of cross validations required was 64 and the minimum was 39.
Leave-One-Out Methods
We now propose two methods based on a full leave-one-out procedure that optimizes the cross-validation process. In the following, we assume that we wish to predict the elements xij of the matrix X by means of the model: xij = |<|\sum_|<|\mathrm|<|k|>|=1|>|^|<|\mathrm|<|m|>||>||>|dkuikvjk +
ij. The methods are those outlined by Krzanowski (1987) and Gabriel (2002) respectively, in which we predict the value
mij of xij(i = 1,...,g; j = 1,...,e) for each possible choice of m (the number of components), and measure the discrepancy between actual and predicted values as
![]() |
However, to avoid bias, the data point xij must not be used in the calculation of
mij for each i and j. Hence, appeal to some form of cross validation is indicated, and the two approaches differ in the way that they handle this. Both, however, assume that the SVD of X can be written as X = UDVT.
The standard cross-validation procedure is to subdivide X into a number of groups, delete each group in turn from the data, evaluate the parameters of the predictor from the remaining data, and predict the deleted values (Wold, 1976, 1978). Krzanowski (1987) argued that the most precise prediction results when each deleted group is as small as possible, and in the present instance that means a single element of X. Denote by X(-i) the result of deleting the ith row of X and mean-centering the columns. Denote by X(-j) the result of deleting the jth column of X and mean centering the columns, following the scheme given by Eastment and Krzanowski (1982). Then we can write
![]() |
![]() |
Now consider the predictor
![]() |
Each element on the right-hand side of this equation is obtained from the SVD of X, mean-centered after omitting either the ith row or the jth column. Thus, the value xij has nowhere been used in calculating the prediction, and maximum use has been made of the other elements of X. The calculations here are exact, so there is no problem with convergence as opposed to expectation maximization approaches that have also been applied to AMMI, but are not guaranteed to converge.
Gabriel (2002), on the other hand, takes a mixture of regression and lower-rank approximation of a matrix as the basis for his prediction. The algorithm for cross-validation of lower rank approximations proposed by the author is as follows:
For given (GEI) matrix X, use the partition
![]() |
![]() |
![]() |
11.
Similarly obtain the cross-validation fitted values
ij and residuals eij = xij -
ij for all other elements xij, i = 1,...,g; j = 1,...,m; (i,j)
(1,1). Each will require a different partition of X.
These residuals and fitted values can be summarized by PRESS
=
|<|\sum_|<|\mathrm|<|i|>|=1|>|^|<|\mathrm|<|g|>||>||>||<|\sum_|<|\mathrm|<|j|>|=1|>|^|<|3|>||>|e2ijandPRECORR
=Corr
respectively.
With either method, choice of m can be based on some suitable function of
![]() |
However, the features of this statistic differ for the two methods. Gabriel's approach yields values that first decrease and then (usually) increase with m. He therefore suggests that the optimum value of m is the one that yields the minimum of the function. The EastmentKrzanowski approach produces (generally) a set of values that is monotonically decreasing with m. They therefore argue for the use of
![]() |
On a computational level, the best accuracy of prediction seems to be achieved when the entries (xij) in different columns of X are comparable in size and there is relatively little variation among the di. The most stable procedure is thus one in which the mean
j and standard deviation sj of column j (j = 1,...,e) are first found from the values present in that column. Existing entries xij of X are then standardized to x'ij =
/sj, estimates are found by applying
ij = xTi.VD-1UTx.j to the standardized data, and then the final values are obtained from
ij =
j + sjx'ij.
Turning to the case of genotypeenvironment data, it would appear that X should be the array of interactions previously denoted GE. However, since we are merely looking for the appropriate number of multiplicative terms in the model, and any additive constants can be absorbed into the
ij component of the model, we can apply the leave-one-out procedure directly to the data matrix Y. Indeed, this may often be preferable given the small values taken by most elements of GE.
Cornelius et al. (1993) compared results of cross validation with those obtained after computing the PRESS statistics in multiplicative models in a complete MET data. The data splitting involved three replicates for modeling and one replicate for validation. They computed the RMSPD of PRESS by adjusting the value of PRESS as [PRESS/ge + 3s2/4]1/2, where g and e denote the number of genotypes and sites in the MET and s2 is the pooled within-site error variance. The term in s2 is an adjustment for the difference in variance of the validation data on cell means, to make results comparable to the RMSPD from 3-1 data splitting. Results on an MET with nine genotypes and twenty sites showed that PRESS is more sensitive to overfitting than is data splitting. Table 2 shows that PRESS differentiates more clearly the model forms than does data splitting. For some model forms (SHMM and SREG), the model with smallest PRESS predicted data in a deleted cell better than they were predicted by three replicates of data with all cells present. On the other hand, many overfitted models gave very unreliable prediction of a deleted cell.
|
| RESULTS |
|---|
|
|
|---|
Jeffers (1967) described two detailed multivariate case studies, one of which concerned 19 variables measured on each of 40 winged adelgids that had been caught in a light trap. Of the 19 variables, 14 are length or width measurements, four are counts, and one (anal fold) is a presence/absence variable score 0 or 1.
In Table 3 , we show a comparison between the EastmentKrzanowski and Gabriel methods for the soil data. In this case, when the data matrix was standardized, the rank of the resulting matrix was four and all submatrices associated with columns four and five had a singular matrix D. Thus we used a MoorePenrose generalized inverse instead of the ordinary inverse when implementing Gabriel's method. We can see that the PRESS(m) values are much lower with the EastmentKrzanowski method than with the Gabriel method up as far as m = 4.
|
In Table 4 , we show a comparison between both approaches using the Jeffers data. Now, with standardized matrix X, no singular submatrices appeared. We can see from PRESS(m) that the Gabriel values are lower until the third component. From the W statistic, we can see that the EastmentKrzanowski method suggests that four components should be retained in the model, whereas the Gabriel criterion suggests two.
|
|
The result of a full cross validation across 1000 randomizations of the data can be seen in the first three columns of Table 6 . For each randomization, one of the three observations at each treatment combination was randomly selected and used to create the interaction matrix, whereas the other two replicates were averaged to form the validation data. Also shown in the remaining columns of Table 6 are the EastmentKrzanowski and Gabriel leave-one-out results, as obtained from the single matrix of averages across the three replicates. The full cross validation yields minimum RMSPD at four components, although the RMSPD values for three, four, and five components are very similar in size and any could be chosen to represent the optimum number of components for the model. However, both the EastmentKrzanowski and the Gabriel methods suggest that one component should be retained in the model.
|
|
| DISCUSSION |
|---|
|
|
|---|
The first point to make is that the F-test methods all rely heavily on distributional assumptions (normality of data and validity of F distributions for mean squares), which may not be appropriate in many cases. Also, it is documented that the different F tests can come up with conflicting recommendations on a particular data set (Duarte and Vencovsky, 1999), while Piepho (1995) has noted that some of the tests select too many interaction components. This feature can be seen clearly in the comparisons of Table 7 also. So, in general, it seems that a data-based cross-validation method should be more appropriate.
Turning then to the full cross-validation randomization approach, the weakness here is that a large portion of the data must be set aside for the validation set. This means that the model is fitted to only a relatively small part of the data. For example, in the analysis reported in Table 6, the fit was to just one observation at each genotypeenvironment combination, while the assessment was on the mean of the other two replicates. Between-replicate variation may generally be very high, which inflates assessment error sums of squares and has probably contributed to the high number of components selected by this method.
By contrast, the leave-one-out methods make the most efficient use of the data and result in the most parsimonious model (AMMI 1) for the example of Tables 5 and 6. This model has 23 df (5 for years plus 7 for genotypes plus 11 for interaction PCA component 1) and is twice as parsimonious as AMMI 5 (in the sense that AMMI 5 contains twice as many degrees of freedom as AMMI 1). Thus, we conclude that a final model may be constructed by applying AMMI 1 to all the data (i.e., all three replications). The first interaction component recovers 43% of the GEI SS in only 31.4% of the interaction df (Table 5). The higher interaction components are judged by predictive assessment to be just noise for the purpose of yield prediction, and thus may be pooled with the residual.
The feature of parsimony is illustrated most clearly by the results of the EastmentKrzanowski method in Table 7. Trials 1 through 4 are complex trials in which there is clear evidence of GEI, whereas Trial 5 is much simpler and probably free of interaction. From a practitioner's point of view, capturing the essence of any interaction in relatively few components is an attraction as these components can be interpreted clearly, whereas fitting many components may create problems of interpretation. Most of the methods shown in Table 7 exhibit quite a large variability in the number of components selected, with large numbers in some data sets for each method. Such large numbers are undesirable in practice. At the other extreme, PRESS provides a maximum of one component for several complex trials in which the interaction structure is evidently not so straightforward, and several trials with no components including one (Trial 5) in which there is clear evidence of interaction. By contrast, the EastmentKrzanowski method provides a stable pattern of relatively low and hence interpretable numbers of components.
In summary, therefore, distributional F tests are often based on questionable assumptions while full cross-validation randomization methods remove too much of the available data for validation purposes and hence lead to less reliable fitted models. Use of leave-one-out method is therefore recommended in general. Of the two such methods investigated here, the EastmentKrzanowski method, has shown the greater parsimony and stability of fitted model.
| ACKNOWLEDGMENTS |
|---|
Received for publication April 8, 2002.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
H. G. Gauch Jr. Statistical Analysis of Yield Trials by AMMI and GGE Crop Sci., May 18, 2006; 46(4): 1488 - 1500. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| The SCI Journals | Agronomy Journal | Vadose Zone Journal | |||
| Journal of Plant Registrations | Soil Science Society of America Journal | ||||
| Journal of Natural Resources and Life Sciences Education |
Journal of Environmental Quality |
||||