|
|
||||||||
a Alberta Agriculture, Food, and Rural Development, Room 300, 7000113 Street, Edmonton, AB, Canada T6H 5T6, and Dep. of Agricultural, Food, and Nutritional Sci., Univ. of Alberta, Edmonton, AB, Canada T6G 2P5
b Alberta Agriculture, Food, and Rural Development, Room 300, 7000113 Street, Edmonton, AB, Canada T6H 5T6
c Alberta Agriculture, Food, and Rural Development, RR6, 17507 Fort Road, Edmonton, AB, Canada T5B 4K3
d Alberta Agriculture, Food, and Rural Development, Brooks, AB, Canada T1R 1E6
* Corresponding author (rong-cai.yang{at}ualberta.ca).
| ABSTRACT |
|---|
|
|
|---|
Abbreviations: AR1, first-order autoregressive model LDF, loss of degrees of freedom LSS, least squares smoothing NNA, nearest neighbor adjustment RCB, randomized complete block REML, restricted maximum likelihood
| INTRODUCTION |
|---|
|
|
|---|
-design can have smaller blocks but spatial heterogeneity may persist in small blocks. Evidently, such design-based control of the error variation alone may not be sufficient to remove all spatial trends in the variety trials. Different model-based analyses that exploit the information on plot positions have been developed and applied to estimate and correct for spatial variation within and among blocks. These spatial analyses include NNA (Bartlett, 1978; Wilkinson et al., 1983; Zimmerman and Harville, 1991), LSS (Green et al., 1985), and modeling of spatial autocorrelation such as the AR1 model (Gleeson and Cullis, 1987; Gilmour et al., 1997). Efficiency of different spatial analyses relative to the analysis of the block designs has been frequently evaluated (Ball et al., 1993; Brownie et al., 1993; Clarke et al., 1994; Stroup et al., 1994; Grondona et al., 1996; Wu et al., 1998; Helms et al., 1999), but such evaluations are usually based on a limited number of field trials. On the other hand, other studies (e.g., Kempton and Howes, 1981; Cullis and Gleeson, 1989; Kempton et al., 1994) have used a large number of trials, but focused on the comparison of one spatial analysis with the conventional RCB analysis. Variety trials are often performed in a large number of test sites across many years. Patterns and extent of spatial variability may vary greatly among environments, suggesting that different spatial analyses may differ in their ability to remove spatial heterogeneity in different environments. It would be desirable to evaluate the efficiency of different spatial analyses across a large number of trials encompassing different environments.
This paper reports an evaluation of the efficiency of three spatial analyses (NNA, LSS, and AR1) relative to conventional RCB analysis based on 157 field pea variety trials tested in different growing zones across Alberta, Canada, during 1997 to 2001. These trials were part of the Alberta Field Pea Regional Variety Test Program that was established in 1987 to carry out multiyear and multisite tests for recommending registered varieties to local pea producers across the province (Park and Lopetinsky, 1999).
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
Spatial Analyses
Yield data for each trial were analyzed with a model that allowed for the incorporation of spatial variation (Brownie et al., 1993):
![]() | [1] |
k(ij) represents the mean performance of variety k in plot ij, Tij is the trend effect representing systematic spatial variation in this plot, and
ij is the random residual. Different spatial analyses were performed by modeling Tij and
ij in Eq. [1]. The baseline analysis was the RCB analysis of variance. In this case, the trend effects Tij were assumed to be constant for all plots within the same block, that is, Tij = ßi (the effect of the ith block).
For the NNA analysis, we used the iterative one-dimensional modification of Papadakis procedure (Wilkinson et al., 1983) to calculate a trend index from the neighbors on either side of each plot but the block effect (ßi) was preserved. Thus, Tij +
ij in Eq. [1] became ßi + bXij + eij in the NNA analysis, where Xij = (ei,j1 + ei,j+1)/2, b = the regression coefficient associated with the covariate Xij, and eij = Yij
ij with
ij being the variety mean in plot ij. For border plots at either end of a block, Xij was calculated as the residual for the one neighbor. Each iteration started with a new trend index that was the difference between the observed and adjusted variety means from the previous iteration. The iteration continued until the difference between the adjusted means in the two successive iterations was negligible.
For the LSS analysis (Green et al., 1985), trend effects, Tij, were estimated with the constraint that their second differences were zero (Ti,j 1 2Tij + Ti,j + 1 = 0). It was further assumed that residuals (
ij) were uncorrelated with each other and with the trend effects. An appropriate tuning constant (
) must be chosen to ensure uncorrelated residuals between neighboring plots while maintaining a smooth trend. Following Clarke et al. (1994), we searched for a
value that would give a near-zero serial correlation in the estimated residuals. The searching process was stopped when the absolute value of the serial correlation was <0.02 or when
reached a preset upper limit of 106, even though the serial correlation remained to be away from zero.
The third analysis was the direct fitting of field trends (Zimmerman and Harville, 1991; Gilmour et al., 1997) rather than by differencing or use of neighbor residuals as in the first two analyses. In this approach, the residuals (
ij) were assumed to be distributed according to spatial correlation models. A commonly used spatial correlation model is the AR1. The covariance between
ij and
ij' under the AR1 model is given by
![]() | [2] |
2 is the residual variance that would be estimated with the RCB analysis (no spatial trend). The presence of spatial trend would suggest that neighboring plots tend to be more alike than those farther apart (
> 0). We only considered spatial relationships within blocks (a one-dimensional AR1 model) because a full replication (block) was in a single field tier. In the presence of complicated covariance structure for the residuals such as AR1, the likelihood-based mixed model analysis was needed to model and estimate the covariance structure and adjusted variety means. All required calculations and analyses described above were performed with SAS/IML Workshop Version 2.0 for Release 8.2 of the SAS System (SAS Institute, 2002). One of the new features in SAS/IML Workshop Version 2.0 is its ability to call SAS procedures within the context of an IMLPlus program. For example, we performed the restricted maximum likelihood (REML) analysis of the AR1 model through calling and executing SAS PROC MIXED (SAS/STAT Release 8.2) from the IMLPlus program we developed. To carry out the mixed model analysis of the AR1 model, we included block effects in the RANDOM statement of PROC MIXED, and specified SUBJECT = BLOCK and TYPE = AR(1) as options for the REPEATED statement. The SAS source code for all calculations and analyses is available upon request.
Calculation of Efficiency
In evaluating the relative efficiency of the NNA analysis to the RCB analysis in removing field trends, Stroup et al. (1994) and Agrobase (1999) have defined the NNA weight as 1 Ea/Eu, where Ea and Eu are the error mean squares from the NNA and RCB analyses, respectively. The NNA weight of >0.3 is used to indicate the presence of moderate to large field trends (Stroup et al., 1994; Helms et al., 1999). However, it remains controversial what would be the appropriate df to the residual mean square from the NNA analysis (Binns, 1987). Even if, in some cases, the reduction in the residual df is negligible due to a large residual df from the unadjusted data, the NNA weight as defined is not necessarily bounded within the range of 0 to 1 as claimed in Agrobase (1999)(p. 281). The NNA adjustment with the analysis of covariance loses 1df in Ea compared with Eu. Thus, while the adjusted error sum of squares (SSEa) is always less than or equal to the unadjusted error sum of squares (SSEu), the condition of Ea
Eu is not guaranteed. For example, the condition of Ea
Eu did not hold in 48 of the 157 trials used in our study, leading to a negative NNA weight. For these reasons, we proposed to measure the efficiency of the NNA or LSS analysis relative to the RCB analysis by 1 SSEa/SSUu.
A different measure of the efficiency of the AR1 analysis was needed, as the REML analysis of the AR1 model directly estimated the residual variance without the need to explicitly consider df. Because the SEDs of adjusted means derived from the AR1 analysis varied among different pairs of varieties, an average SED was calculated as the square root of the average of squares of SEDs across all pairs of adjusted means [SEDAR1]. Similarly, the likelihood-based analysis of the RCB design gave the estimate of SEDRCBD that was simply equal to (2
2/b)1/2, with b being the number of blocks. Thus, the relative efficiency of the AR1 model to the RCB analysis was calculated as 1 SEDAR1/SEDRCBD.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
|
|
The efficiency of the NNA analysis (Fig. 1A) showed that about 18% (29/157) of the 157 trials had a >30% removal of the residual variation due to spatial heterogeneity, a criterion used to indicate moderate to large field trends revealed by the NNA analysis (Stroup et al., 1994; Agrobase, 1999; Helms et al., 1999). Out of those 29 trials, 11 had 30 to 40%, seven had 40 to 50%, and 11 had 50 to 85% reduction of residual variation. With the same criterion, 22% (35/157) of the 157 trials had a >30% reduction of residual variation by the LSS analysis (Fig. 1B) but only 4% (7/157) of the trials had a >30% reduction of residual variation by the AR1 analysis (Fig. 1C). The LSS analysis was slightly better than the NNA analysis in error reduction, which is consistent with the observations of Green et al. (1985) and Clarke et al. (1994). The AR1 analysis was the least effective in error reduction. The analysis based on the two-dimensional AR1 model showed no further improvement of efficiency over the one-dimensional AR1 analysis reported here, probably because each block in our RCB design occupied just a single row or column in a single field tier.
The loss of degrees of freedom (LDF) for the residual mean square from the NNA analysis is long recognized but it remains largely a controversial issue (Binns, 1987; Helms et al., 1999). On the other hand, the LSS analysis (Green et al., 1985) provides a means of estimating LDF. The estimates of LDF for the 157 field pea trials were the differences between degrees of freedom for the residual mean square from the RCB analysis and those from the LSS analysis (Table 2). On average, LDF was larger in 1997 (10.2) and 1998 (11.2) than in 1999 to 2001 (<5), suggesting again the presence of stronger spatial trends in the first 2 yr than in the latter 3 yr. The likelihood-based AR1 analysis directly estimated the residual variance without the need to explicitly consider the corresponding degrees of freedom.
While the amount of error reduction varied among the three spatial analyses (NNA, LSS, and AR1), the reduction by these analyses was consistent across the trials, as evident from high correlations among NNA, LSS, and AR1 for all 157 trials and for the 73 trials with the absolute values of serial correlation being within 0.02 (Table 3). Since LDF was indicative of error reduction, it was also highly correlated with the three analyses. The trials with large numbers of varieties per block also had high CV values as indicated by the high correlation between NVAR and CV (55.4% for all the trials and 70.7% for the 73 trials). Moderate to high correlations of the three analyses with NVAR or CV pointed to the obvious expectation that these methods should be more effective for larger blocks with greater residual variation (higher CV). Our results support the view that efficiency in removing spatial variation by spatial analyses is great when block sizes and number of varieties within blocks are large (e.g., Bartlett, 1978; Cullis and Gleeson, 1989).
|
). Following Green et al. (1985) and Clarke et al. (1994), we chose a
to ensure a near-zero serial correlation of the residuals. In all 157 trials analyzed, the
values ranged from 7 to 106 but the estimated serial correlations ranged from 0.44 to 0.02. For those trials where
already reached to 106 but the serial correlation was still away from zero, further increase in
failed to improve the serial correlation as found by Clarke et al. (1994). Consequently, 84 of the 157 trials had an absolute value of serial correlation > 0.02, the convergence criterion by Clarke et al. (1994) for the iterative search for the turning constant. Of those 84 trials, 74 had a CV (based on unadjusted data) of <15%, suggesting that the plot-to-plot variation was already low and there might be little spatial structure in such trials. Spatial trends were quite pronounced for a trial with CV = 42% (Fig. 2A) but were representative of blocking effects for a trial with CV = 7% (Fig. 2B). For those trials with high CV and high absolute values of serial correlation, on the other hand, the LSS analysis probably did not offer an adequate model to account for spatial trends.
|
Reliable identification of desirable varieties depends on whether or not the yields of check varieties show the same pattern of response across a trial as test varieties. With different yield responses due to the spatial heterogeneity, the use of checks would increase rather than decrease the error of variety comparisons (Kempton and Howes, 1981). Indirect evidence appeared to suggest the existence of such different yield responses in our field pea variety trials. For example, the average differences between RCB and NNA mean yields of check varieties across all the 157 trials were 17.3 kg ha1 in 1997, 10.1 kg ha1 in 1998, 3.9 kg ha1 in 1999, 4.1 kg ha1 in 2000, and 4.9 kg ha1 in 2001, but there were wide ranges of such differences in different years. Thus, the spatial analyses homogenized yield responses between check and test varieties in individual trials, thereby allowing for more accurate variety comparisons particularly in 1997 and 1998.
| ACKNOWLEDGMENTS |
|---|
Received for publication January 31, 2003.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
A. N. Kravchenko, G. P. Robertson, S. S. Snap, and A. J. M. Smucker Using Information about Spatial Variability to Improve Estimates of Total Soil Carbon Agron. J., May 3, 2006; 98(3): 823 - 829. [Abstract] [Full Text] [PDF] |
||||
![]() |
R.-C. Yang, S. F. Blade, J. Crossa, D. Stanton, and M. S. Bandara Identifying Isoyield Environments for Field Pea Production Crop Sci., January 1, 2005; 45(1): 106 - 113. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| The SCI Journals | Agronomy Journal | Vadose Zone Journal | |||
| Journal of Natural Resources and Life Sciences Education |
Soil Science Society of America Journal | ||||
| Journal of Plant Registrations | Journal of Environmental Quality |
The Plant Genome | |||