|
|
||||||||
a Istituto Sperimentale per le Colture Foraggere, 29 viale Piacenza, 26900 Lodi, Italy
b Institut Technique des Grandes Cultures, BP 16, El-Harrach, Alger, Algeria
c Istituto Agronomico per l'Oltremare, 4 via A. Cocchi, 50131 Florence, Italy
* Corresponding author (bred{at}iscf.it)
| ABSTRACT |
|---|
|
|
|---|
Abbreviations: AMMI, additive main effects and multiplicative interaction ANOVA, analysis of variance CIMMYT, Centro Internacional de Mejoramiento de Maiz y Trigo GE, genotype x environment GIS, Geographic Information System GL, genotype x location GY, genotype x year ICARDA, International Center for Agricultural Research in the Dry Areas PC, principal component
| INTRODUCTION |
|---|
|
|
|---|
The scope for a specific-adaptation strategy may be explored on the basis of yield response of a germplasm pool that is representative of the available genetic base tested across a representative sample of sites within the target region (Annicchiarico, 2002a). After verifying the relative size of GL interaction effects that arise from the lack of genetic correlation among environments, candidate subregions may be identified on the basis of similarity for GL effects. Decisions on the adaptation strategy should be made on the basis of predicted yield gains offered by each strategy, and possibly verified by actual gains. Breeding for specific adaptation tends to imply (i) higher costs than wide adaptation, because of increased field testing and (ii) greater yield gains, owing to exploitation of GL effects and to increased heritability of yield as a consequence of low GL interaction within subregion (Kang, 1998). A fair comparison of wide- vs. specific-adaptation strategies should have similar costs if the same total number of selection environments are used (Annicchiarico, 2002a). However, there are choices to be made among the several available techniques (DeLacy et al., 1996; Annicchiarico, 2002b) for modeling GL effects and/or classifying locations. In addition, classification of test sites may be insufficient for an accurate environmental definition of subregions and the appreciation of their relative size. Annicchiarico (2002b)(p. 56) proposed extending the classification to nontest locations by means of environmental variables that are associated with GL effects and are capable of discriminating between site groups. However, the reliability of this procedure is unknown.
With reference to the potential durum wheat cropping region of Algeria as defined in a GIS (Delli et al., 2002), we had the following objectives: (i) to compare AMMI vs. joint regression modeling of GL effects, (ii) to verify the reliability of a GIS-based definition of two subregions that extended the site classification based on similarity for GL effects as a function of climatic variables in the GIS, and (iii) to compare wide- vs. specific-adaptation strategies in observed and predicted yield gains. Predicted gains were estimated for scenarios of defined and undefined selection locations.
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
|
|
Definition of Subregions
Information from the first two years, which included records on six climatic variables, was used for preliminary modeling of adaptive responses and definition of subregions (Annicchiarico, 2002a; Annicchiarico et al., 2002a). Seven techniques for classifying sites into two subregions were preliminarily compared in terms of predicted yield gains over the region for specific adaptation vs. wide adaptation, assuming that the best methods for subregion definition maximize the advantage of specific adaptation. Six methods used the modeling of GL interaction effects by joint regression (Finlay and Wilkinson, 1963), AMMI (Gauch, 1992), or factorial regression (Hardwick and Wood, 1972) followed by grouping of locations either by cluster analysis or by using the main crossover point as a cut-off for defining two groups. The main crossover point is the estimated value of site mean yield for joint regression, site score on the first GL interaction principal component (PC) axis or site covariate for unidimensional factorial regression for which the crossover interactions of genotypes with sites reach the highest frequency (Annicchiarico, 2002b). These analyses were performed on log-transformed yields to remove the positive relationship between heterogeneity of genotypic variance and site mean yield of locations, which is frequent in Mediterranean regions (Yau, 1991) and may negatively affect the assessment of site similarity (Cooper et al., 1996). The regression of the within-site phenotypic variance of entry values as a function of site mean yield on a logarithmic scale was used as a criterion for transforming data. A slope value around 2.0 suggested the need to log-transform data (Annicchiarico, 2002b, p. 53). Site classification was also performed by pattern analysis, standardizing entry yields within a location before cluster analysis (DeLacy et al., 1996). All cluster analyses adopted Ward's clustering method (DeLacy et al., 1996) and a squared Euclidean distance as the dissimilarity measure. The AMMI + cluster analysis (the latter performed on site scores on the first PC, the only significant GL interaction PC axis) and pattern analysis provided similar classification results and were the best methods on the basis of predicted yield gains over the region (Annicchiarico, 2002a).
Site classification was extended to the total potential durum wheat cropping area through a discriminant analysis-based procedure. Long-term winter mean temperature (across January and February) for AMMI + cluster analysis and site altitude for pattern analysis were selected as significant (P < 0.10) variables in the discriminant function for site groups (Annicchiarico et al., 2002a). The GIS-based definition of subregions provided by the two classification techniques was very similar (consistency >95%), showing a low-elevation, milder Subregion B (including 6568% of the durum wheat cropping area) and a high-elevation, cold-prone Subregion A. Subregion definition by AMMI + cluster analysis was preferred here because it was based on an environmental variable (the level of cold stress) that directly affected the adaptive responses of genotypes and it was somewhat more reliable in terms of variation between site groups accounted for by the discriminant function (Annicchiarico et al., 2002a). The critical winter mean temperature for discriminating a subregion was 8°C. Site 8, having this temperature (Table 2), was considered borderline and was not assigned to a subregion. Site 7, although close to the critical temperature (Table 2), was included in the cold-prone Subregion A together with Locations 2, 4, 5, 6, 13, and 18 (Fig. 1). Subregion B included Sites 1, 3, 9, 10, 11, 12, 14, 15, and 16.
Modeling of Adaptation Patterns and Verification of Site Classification
The analyses described in this section were performed on the 3-yr data set after excluding Site 15, which was represented by just one test year. The following analysis of variance (ANOVA) model was adopted, given the variable number of test years per site:
![]() | [1] |
Given the unavailability of climatic data for most sites in the third year, only AMMI and joint regression models could be compared for ability to describe GL interaction effects. Testing of GL interaction PC axes was performed by the FR test as described by Annicchiarico (2002b)(p. 38), using the within-location GY interaction as an error term. Heterogeneity of genotype regressions was tested by using deviations from regression.
The need for log-transforming data before classifying sites by AMMI + cluster analysis was verified in two steps. First, the size of the two determinants of the GE interaction variance, i.e., the lack of genetic correlation and the heterogeneity of genotypic variance among environments, was assessed together with the pooled genetic correlation among environments for entry yield. This was performed as described by Cooper et al. (1996), following ANOVAs for single experiments and a combined ANOVA including genotype, environment, and block factors. Then, the regression slope of the within-site phenotypic variance of entry values as a function of site mean yield on a logarithmic scale was estimated. Test sites were classified in two subregions by AMMI + cluster analysis and pattern analysis using the same procedures described for the 2-yr data set and results were compared with those of the GIS-based classification in Fig. 1.
The AMMI-modeled adaptive responses were based on original yields (i.e., the unit of yield gains) of top-ranking entries. They were graphically expressed as nominal yields as a function of the site score on the first GL interaction PC axis (Gauch and Zobel, 1997). Nominal yields sum the estimated entry mean yield and the product of the entry by the site scaled scores on PC 1 (excluding the site main effect, which is irrelevant for entry ranking). The consistency of current PC 1 scores for entries and locations with previous indications based on 2 yr of data (Annicchiarico et al., 2002a, 2002b) was verified by correlation analysis.
The software IRRISTAT, released by the International Rice Research Institute (IRRI) of Manila, was used for AMMI and joint regression modeling. All other analyses described here and in the next section were performed by SAS (1999) software. In particular, PROC GLM was used for different ANOVAs, while genotypic and genotype-environmental variance components were estimated by the maximum-likelihood method in PROC VARCOMP.
Definition of Selection Locations and Comparison of Adaptation Strategies
Comparing adaptation strategies based on observed or predicted yield gains requires some preliminary assumptions, such as the total number of selection environments (sites and years combinations) and the intensity of selection hypothesized for late selection stages (Annicchiarico, 2002a, 2002b). For specific breeding, sites can be assigned to subregions in proportion to their relative size, and selection is based on entry mean yield across selection environments independently for each subregion. For a wide-adaptation strategy, selection is based on entry mean yield across all of the selection environments already defined for specific breeding. This is consistent with Lin and Butler's (1988) suggestion to choose selection sites across the region in a stratified manner and in proportion to the relative size of site groups. In this study, we hypothesized six selection environments over the region for late selection stages, assigning four environments (two sites in 2 yr) to subregion B (the larger) and two (one site in 2 yr) to subregion A (the smaller). Thus, selection for specific adaptation was based on 2-yr entry mean yields across selection environments independently calculated within subregions, whereas selection for wide adaptation was based on 2-yr entry mean yields across all six environments. Selection of three entries out of 24 (12.5%) was hypothesized in all cases.
Selection sites were chosen among those having 3 yr of data. The screening power of a given site for a target area is proportional to the phenotypic correlation between entry mean yields at the site and entry mean yields across the target environments (Cooper et al., 1996). Given the 2-yr period assumed for selection, entry mean yield at the site was always estimated from 2 yr of data, computing correlations for all possible pairs of test years at the site and holding the average correlation with target environments as an indicator of the screening power. Target environments included all environments in the relevant subregion as defined in Fig. 1, except those acting as selection environments in the assessment. To select the two selection sites for Subregion B, we also assessed the screening power of pairs of sites, avoiding pairs that were geographically very close. Site 15 was not assessed for screening power, but was included among the target environments of Subregion B here and in following analyses, whereas Site 8 (borderline between subregions) was excluded from analyses.
Observed yield gains for each adaptation strategy were computed in each subregion as the difference between the mean yield of the three entries selected in the relevant selection environments and the mean yield of all entries across the target environments belonging to the subregion. Yield gains were computed separately for each of the three pairs of test years at selection sites and then averaged. Target environments were represented by all relevant environments in the GIS-based definition of subregions, except those acting as selection environments. For each strategy, the average yield gain over the region was computed as a weighted mean of the average gains
GA and
GB observed within the two subregions (Annicchiarico, 2002a). Weights were the relative sizes of the Subregion A (PA) and the Subregion B (PB) as estimated in the GIS:
![]() | [2] |
Predicted yield gains per selection cycle, assuming adoption of the previously defined selection sites, were computed for the two strategies as correlated genetic gains in the target environments from the selection environments (
GT/M) according to Basford et al. (2004). In general
![]() | [3] |
1.56 for three selected entries out of 24: Falconer, 1989 p. 355), r(S,T) is the phenotypic correlation for entry mean yield between selected and target environments, and sp(T) is the estimated phenotypic standard deviation in the target environments. Yield gains in each subregion were predicted for each strategy separately for each of the three pairs of test years at selection sites and then averaged, providing the average gains
GA and
GB within individual subregions. Target environments were those belonging to the relevant subregion as defined in the GIS, excluding those acting as selection environments. For each set of target environments, sp(T) was estimated by the square root of the denominator of the formula for estimating the broad-sense heritability on an entry mean basis (h2):
![]() | [4] |
Adaptation strategies were also compared in terms of predicted yield gains per selection cycle as described by Annicchiarico (2002a). In this case, no prior choice of selection sites was assumed, generating results for sites with average screening power. The procedure implied the estimation of variance components for s2g, s2ge and s2e for three sets of target environments, i.e., all environments, for wide adaptation; and environments of Subregion A on the one hand and of Subregion B on the other, for specific adaptation. The average yield gain predicted over the region for a wide-adaptation strategy was computed by the formula:
![]() | [5] |
GA) and B (
GB) for a specific-adaptation strategy were computed by Formula [5], inputting in Formula [4] the estimates of variance components for the relevant subset of target environments and the appropriate e and r values (e = 2 for computing
GA; e = 4 for computing
GB; r = 4 in all cases). The average yield gain over the region predicted for specific breeding was derived from
GA and
GB through Formula [2]. | RESULTS |
|---|
|
|
|---|
|
|
|
Reliability of the GIS-Based Subregion Definition
The AMMI + cluster analysis and pattern analysis performed on the 3-yr data set provided the same subdivision of test sites into two groups. The groups coincided with those of the GIS-based subregion definition in Fig. 1, with the exception of Site 18, which was assigned to Subregion B. These results confirmed the reliability of the adopted definition of candidate subregions based on long-term winter mean temperature in the GIS.
Estimates of the GL interaction and within-site GY interaction variance components for original data over the region were 88 and 217%, respectively, of the genotypic variance. The zoning process allowed for a sharp reduction of the estimated GL interaction variance component, which was, on the average, only 24% of the genotypic variance within the individual subregions defined in Fig. 1.
Comparison of Adaptation Strategies
Site 4 was preferred to other candidate selection locations for Subregion A because of its top-ranking average phenotypic correlation with target environments of the subregion (Table 2). Sites 1, 9, 10, and 16, among those of Subregion B having three test years, showed the highest phenotypic correlation with target environments of this subregion (Table 2). Considering all possible pairs formed by these sites, except that of Sites 9 and 10 [which was excluded because of the small geographical distance between these locations (Fig. 1)], the pair of sites 1 and 10 maximized the correlation with entry values across target environments (r = 0.95, P < 0.001) and was, therefore, preferred to select for Subregion B. Sites 1, 4, and 10 jointly acted as selection sites to breed for wide adaptation.
Cultivar mean yields over the region and across environments of either subregion are reported in Table 1. GTA Dur and Chen's were the top-yielding entries over the region and over Subregion B, whereas Ofanto and GTA Dur were top-yielding over Subregion A. The phenotypic correlation between the two subregions for entry mean yield (r = 0.66, P < 0.001) indicated that the consistency of genotype response between the subregions was only moderate.
In each of the three pairs of test years, the three genotypes selected for wide adaptation on the basis of mean yields across relevant selection environments were always coincident with those selected for specific adaptation to Subregion B. They were GTA Dur and Chen's in all cases, Simeto in two cases and Eider in one case. The set of entries selected specifically for Subregion A included Ofanto and GTA Dur in all cases, Sahel 77 in two cases and Simeto in one case. The wide-adaptation pattern of GTA Dur and the specific-adaptation pattern of most other selected entries are showed by AMMI-modeled nominal yields in Fig. 2 (in which specific adaptation to Subregion A is highlighted by a dotted line). Sizeable crossover interactions between top-yielding genotypes only took place at negative PC 1 score values, where sites of Subregion A are located. The graph also shows the response of Polonicum, representative of those of traditional cultivars and old varieties, to show that this germplasm could become top-ranking only at sites with PC 1 scores that are extremely low and marginal even for the unfavorable subregion A.
Mean yields of selected entries, and observed gains relative to mean yield of all entries, are reported for the two strategies in Table 5 as average values across pairs of test years. The two strategies did not differ for Subregion B because of the coincidence of selected entries. Specific adaptation provided yield gains almost 40% higher than wide adaptation for Subregion A. This advantage implied just a 7% advantage over the region, owing to the smaller size and the smaller yield gains in absolute value of Subregion A (Table 5).
|
| DISCUSSION |
|---|
|
|
|---|
The complete consistency of site classification between AMMI + cluster analysis and pattern analysis supports the reliability of site groups determined by using these techniques. The former technique, which is more complex than the latter, may be preferred when modeling adaptive responses. Modeling genotype responses by pattern analysis in its ordination mode is also possible (e.g., Cooper et al., 1996), but, compared with AMMI, it suffers from inability to remove random error from the pattern (Gauch and Zobel, 1997; Annicchiarico, 2002b, p. 50).
The substantial consistency between ordinary classification of sites (by AMMI + cluster analysis or pattern analysis) and extended classification based on 2 yr of data as a function of the long-term winter mean temperature in the GIS is comforting and reinforces interest of procedures for up-scaling results from test sites to all sites within a region. The inconsistent classification of Site 18, assigned to the cold-prone Subregion A on the basis of its long-term winter mean temperature and to Subregion B on the basis of site similarity for GL effects, was already observed for 2 yr of data and attributed to the warm winters that occurred at the site during the test years (Annicchiarico et al., 2002a). Indeed, the up-scaling procedure may also compensate for the effect of unusual years (temporal up-scaling), besides improving the geographical definition and the estimation of the size of subregions (spatial up-scaling). It requires, however, that a sufficiently reliable discriminant function for site groups be defined on the basis of available environmental variables. Earlier GIS-based definitions of subregions relied on different approaches. Pollak and Corbett (1993) based it on site similarity for environmental factors that were supposed to be important (although chosen with no definite assessment of their impact on GL effects). Chapman and Barreto (1996) interfaced a GIS with simulation results for flowering date of different germplasm types. The current GIS has also been interfaced with factorial regression or AMMI-modeling of adaptive responses for cultivar recommendations, defining areas with the same top-yielding material (Annicchiarico et al., 2005).
Each of the procedures adopted for comparing adaptation strategies in terms of yield gains may have some limitations. Observed gains relate to the portion of GL effects that are of practical interest to breeders, i.e., the crossover interaction between top-ranking entries. But they may be more affected than predicted gains by small random errors, which may easily alter entry ranks. Procedures based on defined selection sites may be more realistic for breeding programs, but the estimated screening power of environments may largely depend on specific conditions during the test years that lead to particularly high or low GY interaction and/or experimental error. The Basford et al. (2004) procedure, in particular, was proposed for managed selection environments, where screening power is more reliably assessed because it is less affected by specific conditions during test years. Predicted gains for undefined selection sites, on the contrary, relate to the average screening power of environments within the subregion as estimated from the sample of environments.
In conclusion, the three procedures provide complementary information for comparing strategies, while being inappropriate for estimating actual selection gains that can be expected, unless the evaluation data are based on random sample of breeding lines. The results suggest that a specific-adaptation strategy may provide a modest advantage (likely around 45%) over wide adaptation at similar costs. The distinctly larger efficiency showed by specific adaptation for Subregion A (1039%) suggests for this strategy the additional merit of increasing the food security at the household level in this stressful subregion. The only moderate consistency of entry response between the two subregions (r = 0.66) did not result in a large advantage of specific breeding over the region because the large within-site GY interaction that is typical of Mediterranean regions limits the advantage of this strategy. Specific adaptation was largely superior to wide adaptation for barley (Hordeum vulgare L.) in northern Syria in a larger study (Ceccarelli et al., 1998) in which, however, the same test sites were assigned to a favorable or an unfavorable subregion depending on their mean yield in specific years, thereby reducing the impact of within-site GY interaction effects on the assessment.
Two aspects of the current assessment may have led to underestimating the relative merit of specific adaptation. Information on the actual size of each subregion, based on the proportion of the potential durum wheat area that is devoted to the crop, was lacking. If available, it would likely have increased the relative size of Subregion A (because of the paucity of alternative crops in this area) and the impact over the region of the distinctly larger efficiency of specific adaptation for this subregion. The paucity of material specifically adapted to Subregion A probably limited the advantage of specific breeding. Finally, it should be noted that all comparisons of strategies relied on the hypothesis of performing late selection on the same pool of material in the two subregions. The advantage of specific breeding may be much greater if it implied the early selection of breeding lines to either subregion on the basis of crucial adaptive traits, or the use of a distinct genetic base for each subregion. A previous study provided indications on relevant adaptive traits (Annicchiarico et al., 2002b). The AMMI-modeled yield responses in Fig. 2, possessing higher predictive ability than observed responses (Gauch, 1992), may be exploited for identifying parent material of specific interest for each subregion.
On the whole, our findings reinforce the potential of a specific-adaptation strategy for national breeding programs of medium-sized countries, in which it may contribute to increasing national yields as well as local yields in less favorable areas. Especially for poorer research systems, a relevant issue concerns the extent of experimental data and statistical effort that are required to obtain reliable indications on adaptation strategies. The less demanding procedure implying 2 yr of data, subregion definition by pattern analysis, and estimation of the relative size of subregions from the proportion of test sites that they accounted for (Annicchiarico et al., 2002a) showed an advantage of specific over wide adaptation based on predicted yield gains (11%) that was fairly close to that based on observed gains (7%). This finding encourages the basic analysis of relatively small, multilocational, multiyear data sets to obtain preliminary indications on the possible benefit of specific breeding.
| ACKNOWLEDGMENTS |
|---|
| NOTES |
|---|
|
|
|---|
Received for publication September 8, 2004.
| REFERENCES |
|---|
|
|
|---|
Related articles in Crop Science:
This article has been cited by other articles:
![]() |
H. G. Gauch Jr., H.-P. Piepho, and P. Annicchiarico Statistical Analysis of Yield Trials by AMMI and GGE: Further Considerations Crop Sci., May 1, 2008; 48(3): 866 - 889. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Putto, A. Patanothai, S. Jogloy, and G. Hoogenboom Determination of Mega-Environments for Peanut Breeding Using the CSM-CROPGRO-Peanut Model Crop Sci., May 1, 2008; 48(3): 973 - 982. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| The SCI Journals | Agronomy Journal | Vadose Zone Journal | |||
| Journal of Plant Registrations | Soil Science Society of America Journal | ||||
| Journal of Natural Resources and Life Sciences Education |
Journal of Environmental Quality |
||||