Crop Science Grow Your Career with CSSA
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Published online 24 February 2006
Published in Crop Sci 46:751-757 (2006)
© 2006 Crop Science Society of America
677 S. Segoe Rd., Madison, WI 53711 USA
This Article
Right arrow Abstract Freely available
Right arrow Figures Only
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via ISI Web of Science (1)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Hooks, T.
Right arrow Articles by Vogel, K. P.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Hooks, T.
Right arrow Articles by Vogel, K. P.
Agricola
Right arrow Articles by Hooks, T.
Right arrow Articles by Vogel, K. P.
Related Collections
Right arrow Sorghum
Right arrow Data Management
Right arrow Plant Genetic Resources

PLANT GENETIC RESOURCES

Variation in the U.S. Photoperiod Insensitive Sorghum Collection for Chemical and Nutritional Traits

Tisha Hooksa, J. F. Pedersenb,*, D. B. Marxa and K. P. Vogelb

a Dep. of Statistics, Univ. of Nebraska-Lincoln, Lincoln, NE 68583
b USDA-ARS, NPA Wheat, Sorghum and Forage Research, 344 Keim Hall, Univ. of Nebraska-Lincoln, Lincoln, NE 68583-0937

* Corresponding author (jfp{at}unlserve.unl.edu)


    ABSTRACT
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Screening germplasm for chemical and nutritional content can be expensive and time consuming. Near infrared spectroscopy (NIRS) and application of geostatistical models can make screening more efficient. The objectives of this study were to utilize these technologies to: (i) generate chemical and nutritional values for the U.S. photoperiod insensitive sorghum collection, (ii) describe variability for those traits, (iii) identify accessions in the highest and lowest 1% for each trait, and (iv) describe relationships among the accessions. Accessions were grown at Ithaca, NE, during 2001 and 2002. Samples of grain were scanned and NIRS equations developed for starch, fat, crude protein, acid detergent fiber, and phosphorus. The NIRS generated values for each accession can be accessed on GRIN at http://www.ars-grin.gov/cgi-bin/npgs/html/desclist.pl?69 ; verified 22 November 2005. The highest and lowest 1% of accessions was identified for each trait by best linear unbiased predictors (BLUPs). Means and standard deviations for observed values and variances due to accessions were calculated. Rank correlations between BLUPs and observed values ranged from r = 0.82 to r = 0.92. Principal component analysis showed that much of the variation is attributable to a contrast of starch with a weighted average of fat, crude protein, acid detergent fiber, and phosphorus. Cluster analyses showed clusters on the basis of canonical values, but no geographical, taxonomical, or morphological interpretation of the clusters was apparent.

Abbreviations: ADF, acid detergent fiber • BLUPs, best linear unbiased predictors • Ca, calcium • CP, crude protein • GRIN, Germplasm Resources Information Network • NIRS, near infrared spectroscopy • P, phosphorus • SEC, standard error calibration • SECV, standard error cross validation


    INTRODUCTION
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
SCREENING large germplasm collections for quantitative traits such as chemical and nutritional content can be prohibitively expensive and time consuming by traditional experimental designs and analytical techniques. For example, the current U.S. sorghum collection contains over 42 000 accessions (USDA-ARS, 2005a). Growing each sorghum accession in one single-row plot 7.6 m long spaced 76 cm apart would require over 24 ha of research plots per replication. Total per sample analysis costs for crude protein (CP), acid detergent fiber (ADF), phosphorus (P), calcium (Ca), starch, fat, and various calculated values can total $35/sample for wet chemistry (Ward Laboratories, 2005). Few research projects have the resources to undertake screening of such a large collection by traditional replicated field studies and wet chemical analysis.

Two technologies, NIRS and the use of new statistical models made possible by the increasing speed and power of computers, can be used to generate predicted values for chemical and nutritional content on the basis of a relatively small subset of samples and to account for spatial and temporal variability, reducing the need for replication of accessions or replicated check plots in augmented designs. NIRS predicts chemical and nutritional values from spectral data and is used to generate chemical and nutritional values for grain and forage samples. Time required to scan a sample is approximately 1 min, and cost per sample is low compared with the wet chemistry methods. The technology is common-place in many analytical labs and has been approved for use in testing ADF and protein in feeds for over a decade (AOAC, 1990). More recently, it was approved for testing of corn grain (Zea mays L.) for oil, protein, and starch content by the USDA Grain Inspection, Packers and Stockyards Administration (USDA-GIPSA, 1999).

Numerous statistical approaches to make large early generation or germplasm screening efforts more efficient have been developed during the past century. Augmented designs (Federer, 1956; Steel, 1958; Searle, 1965) and nearest neighbor analysis (Papadakis, 1937; Bartlett, 1937) are both examples that have been widely used. However, neither offered the needed increase in efficiency for large germplasm screening experiments, and neither specified the nature of the relationship between neighboring plots. Applications of geostatistical models to account for various correlation structures in field experiments have led to increases in accuracy and precision (Brownie et al., 1993; Cullis and Gleeson, 1991; Zimmerman and Harville, 1991). Mixed model equations developed by Henderson (1975) have become useful with the widespread availability of analytical software (Littell et al., 1996) and more powerful computers, and they have proven useful for the analysis of spatially correlated data (Henderson, 1975; Marx and Stroup, 1993).

Recent research in our laboratory using PROC MIXED (SAS, 2003) and simulated data to compare the efficiency of germplasm screening experiments with varying levels of check plots demonstrated that the use of BLUPs was superior to the use of least square means (LS means) and observed values for selecting the highest proportion of true top ranking genotypes and that BLUPs were influenced little by the proportion of check plots to experimental plots (Sebolai et al., 2005). Incorporation of the correlation structure and the use of BLUPs to select superior accessions may be especially well suited to screening germplasm collections since they were originally developed for ranking and selection (Robinson, 1991). The use of BLUPs is, however, premised on treatment (accession) effects being assumed random. In screening experiments involving a subpopulation of accessions from a collection, this would be valid if the subsetting criteria were independent of screening criteria.

A subset of the U.S. sorghum collection that are of particular interest for sorghum improvement is comprised of approximately 4000 photoperiod insensitive accessions that will flower at temperate latitudes. The objectives of this study were to (i) generate chemical and nutritional values for grain from the U.S. photoperiod insensitive sorghum collection, (ii) describe the variability for those chemical and nutritional values, (iii) identify accessions in the highest and lowest 1% for each trait after accounting for spatial and temporal variation, and (iv) describe relationships among the accessions using these values.


    MATERIALS AND METHODS
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Approximately 75% of the accessions in the U.S. photoperiod insensitive sorghum collection were grown during 2001 and 2002 at the University of Nebraska Field Laboratory, Ithaca, NE, on a predominantly Sharpsburg silty clay loam (fine, montmorillonitic, mesic Typic Arguidolls) site. Plots consisted of a single 7.6-m row of each accession spaced 76 cm apart, and individual accessions were planted in only 1 yr. The variety ‘Wheatland’ was interspaced throughout the nurseries at a density of 16% of the plots. Each plot was seeded with a precision vacuum planter calibrated to deliver 120 seeds per row (240 000 seeds ha–1). Because of limitations on field space, 1990 accessions were planted in 2001 and 1215 accessions were planted in 2002. Germination was generally adequate for plot establishment. Plots that did not germinate and those that did not reach maturity were treated as missing plots. Following the 2002 harvest, the decision was made that the 2914 accessions already grown in this experiment should adequately represent the diversity of the U.S. photoperiod insensitive sorghum collection and the remaining lines were not planted in subsequent years. Accessions grown in 2001 or 2002 were not preselected for any traits or descriptors and therefore can be considered a random subset of the photoperiod insensitive sorghum collection.

The experiments were planted 19 May 2001 and 23 May 2002. Nitrogen fertilizer was applied preplant at both locations at 157 kg ha–1. Atrazine (6-chloro-N2-ethyl-N4-isopropyl-1,3,5-triazine-2,4-diamine) was applied at 2.2 kg ha–1 immediately after planting, followed by an application of quinclorac (3,7-dichloro-8-quinolinecarboxylic acid) and atrazine at 0.37 kg ha–1 and 1.1 kg ha–1, respectively, approximately 14 d post emergence for weed control. In 2002, bentazon [3-(1-methylethyl)-1H-2,1,3-benzothiadiazin 4(3H)-one-2,2-dioxide] was added to the post emergence application at 0.28 kg ha–1 for velvetleaf [Abutilon theophrasti (Medik)] control. No supplemental irrigation was applied in 2001. Plots were irrigated by furrow irrigation on 2 July and 23 July in 2002.

Ten open pollinated panicles per plot were harvested as accessions reached maturity, bulked by row, threshed, and seed were stored at 10°C for NIRS scanning and chemical and nutritional quality analysis. All samples were ground to pass a 1-mm screen on a cyclone mill and scanned on a Model 6500 near infrared reflectance spectrometer (NIRS Systems, Silver Spring, MD).1 To cover the wide expected range in spectral diversity, 541 reference samples (50% from each year) were selected by cluster analysis of the reflectance data for wet chemistry analysis (Shenk and Westerhaus, 1991). Standard wet chemistry methods were used to determine starch (Y S I, 2000), fat (Padmore, 1990a), CP (Miller et al., 1998), ADF (ANKOM, 1999), calcium (Padmore, 1990b), and phosphorus (Padmore, 1990b) content of reference samples. All lab analyses were performed by Ward Laboratories, Kearney, NE. The wet chemistry values were then used to develop NIRS prediction equations by partial least squares (Shenk and Westerhaus, 1991) and to generate observed values for each row in each year. The calibration statistics for each trait are shown in Table 1.


View this table:
[in this window]
[in a new window]
 
Table 1. Calibration statistics for NIRS{dagger} prediction of starch, fat, CP, ADF, calcium, and phosphorus.

 
The mean and standard deviation of the observed values for each trait were calculated for each year separately and also for the combined data set. To identify the underlying spatial variation for each response variable, SAS PROC VARIOGRAM was used to calculate a semivariogram for each of the chemical and nutritional traits (assuming a no-nugget model) in both 2001 and 2002. Since the default initial values in SAS PROC MIXED often lead to unreasonable estimates for the range and sill (Marx and Stroup, 1993), the range and sill of the semivariograms were estimated visually to obtain feasible estimates for use as starting values. Then, analyses incorporating various known correlation structures were run for each year in PROC MIXED, and the model fitting information was used to determine which correlation structure best fit the data for each year separately (Littell et al., 1996). It was determined that an exponential model provided the best fit in all cases. Also, since it was anticipated that the variability would differ between the 2 yr, the geostatistical model with accessions regarded as random effects includes a separate accession variance, range, and sill for each year. The model is as follows:

Formula
where yijk represents the chemical trait of interest, µ is the overall intercept, Yk is the effect of the kth year, {tau}i(k) is the effect of the ith accession in the kth year, eijk is the random error term, dij is the distance between observation i and observation j, {sigma}ek2 is the sill of the semivariogram for Year k, and {rho}k is the range of the semivariogram for Year k.

This model has two variance components for accession, {sigma}{tau}12 and {sigma}{tau}22, which are variances of the accessions within Year 1 (2001) and Year 2 (2002), respectively. These values are a measure of the variability that exists across all possible accessions for each chemical trait in each year.

The entire data set was analyzed by incorporating the above statistical model into SAS PROC MIXED. Again, since accessions were regarded as random effects, the MIXED procedure calculates BLUPs which are adjusted for underlying spatial variability. The BLUPs were output into a separate data set, and SAS PROC RANK was used to obtain the highest and lowest 1% of the accessions on the basis of the values of the BLUPs. SAS PROC RANK was also used to obtain the highest and lowest 1% of the accessions on the basis of the observed values of the chemical and nutritional traits in order to determine if the results changed significantly after accounting for spatial variability. Since primary interest lies in the ranking of the accessions, Spearman's rank correlation between the BLUPs and the observed values was calculated for each chemical trait.

In addition to describing the variability present in this sorghum collection, it was of interest to describe relationships that exist among the accessions and also between the various chemical traits. Principal components for this particular analysis were defined as follows: S is the sample variance-covariance matrix of the following random variables: starch, fat, CP, ADF, and P. The first principal component is a linear combination a1'x = a11x1 + ... + a15x5 that maximizes the variance among all linear combinations of x subject to a1'a1 = 1. The second principal component a2'x with a2'a2 = 1 is such that it is uncorrelated with the first and its variance is highest among all the linear combinations uncorrelated with the first principal component. Similarly, the third, fourth, and fifth principal components, all uncorrelated with the others, are defined. Then {lambda}1 ≥ {lambda}2 ≥ ... ≥ {lambda}5 > 0 are the eigenvalues and a1, ..., a5 become the corresponding eigenvectors of S, where ai'ai = 1 for i = 1,2, ... 5. The values a1'x, a2'x, ..., a5'x are precisely the first, second, ..., fifth principal components of x, and the eigenvalues of S are the variances of the corresponding principal components.

The first step in determining the principal components of a vector x was to obtain the eigenvalues and the eigenvectors of the sample variance-covariance matrix S of x. It is equally valid to start with a sample correlation matrix R instead of a covariance matrix, and this is actually recommended if measurements on different variables are on different scales and the variances are of different magnitudes (Johnson and Wichern, 1998). For this reason, the principal components in this analysis were obtained from the sample correlation matrix using SAS PROC PRINCOMP.

Finally, cluster analysis was conducted with descriptor data obtained from GRIN (USDA-ARS, 2005b) to search for "natural" groupings. Descriptor data included country of origin, endosperm color, endosperm texture, endosperm type, kernel color, kernel plumpness, kernel shape, panicle shape, pericarp color, and race. Country of origin data was limited to seven countries in Africa (Botswana, Ethiopia, Niger, Nigeria, South Africa, Sudan, and Zimbabwe) with large numbers of accessions in the U.S. photoperiod insensitive collection. The accessions were clustered on the basis of their standardized BLUP values for starch, fat, CP, ADF, and P. The clusters were formed using the K-means method implemented in SAS PROC FASTCLUS. This is a nonhierarchical clustering technique designed to group items into a collection of K clusters, and this technique is particularly useful for large data sets (Johnson and Wichern, 1998).


    RESULTS
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
The NIRS chemical and nutritional values for each accession grown in this study are available on GRIN (USDA-ARS, 2005b) and can be accessed at the following URL: http://www.ars-grin.gov/cgi-bin/npgs/html/desclist.pl?69 verified 22 November 2005. Prediction equations for Ca were judged to have too low of an R2 to be useful and Ca values are not reported. The mean, standard deviation, and range of NIRS values of accessions and Wheatland are shown in Table 2. In 2001, mean accession values were similar to Wheatland. In 2002, mean accession values for ADF, CP, and Fat appeared slightly higher, and the mean accession values for other traits were similar or slightly lower than the Wheatland mean. The maximum range of values for most traits was greater in 2002, possibly due to irrigation allowing the accessions to express more of their genetic potential.


View this table:
[in this window]
[in a new window]
 
Table 2. Mean, standard deviation, and range of observed NIRS chemical and nutritional values of sorghum photoperiod insensitive accessions and Wheatland.

 
The highest and lowest 1% of accessions identified for each trait using BLUPs and using the observed NIRS values is shown in Tables 3GoGoGoGo. Variance associated with accessions after accounting for various spatial structures in the fields are shown for each year. Spearman rank correlation coefficients for BLUPS with NIRS values are also shown. The rank correlation coefficients ranged from 0.91 to 0.92 for ADF, CP, and P, and considerable redundancy existed between the accessions selected using BLUPs and observed NIRS values. The rank correlations for fat, and starch were 0.87 and 0.82, respectively.


View this table:
[in this window]
[in a new window]
 
Table 3. Sorghum photoperiod insensitive accessions identified with high and low acid detergent fiber (ADF) using BLUPS and observed NIRS values.

 

View this table:
[in this window]
[in a new window]
 
Table 4. Sorghum photoperiod insensitive accessions identified with high and low crude protein (CP) using BLUPS and observed NIRS values.

 

View this table:
[in this window]
[in a new window]
 
Table 5. Sorghum photoperiod insensitive accessions identified with high and low fat using BLUPS and observed NIRS values.

 

View this table:
[in this window]
[in a new window]
 
Table 6. Sorghum photoperiod insensitive accessions identified with high and low phosphorus (P) using BLUPS and observed NIRS values.

 

View this table:
[in this window]
[in a new window]
 
Table 7. Sorghum photoperiod insensitive accessions identified with high and low starch using BLUPS and observed NIRS values.

 
The principal component analysis is summarized in Table 8. The first and second principal components collectively account for 79.5% of the total variability. Consequently, the sample variation is adequately summarized by only two principal components. These principal components are given as follows:

Formula

Formula
Four clusters were formed in the analysis. SAS PROC CANDISC was used to compute the canonical variables for plotting the clusters, and the results are shown (Fig. 1 ). Canonical variables were also plotted on the descriptors obtained from GRIN (USDA-ARS, 2005b) for each point. No clustering was attributable to country of origin endosperm color, endosperm texture, endosperm type, kernel color, kernel plumpness, kernel shape, panicle shape, pericarp color, or race. A sample plot of canonical variables and country of origin are shown (Fig. 2 ).


View this table:
[in this window]
[in a new window]
 
Table 8. Coefficients for principal components.

 

Figure 1
View larger version (116K):
[in this window]
[in a new window]
 
Fig. 1. Clusters based on BLUPs for selected countries in Africa.

 

Figure 2
View larger version (118K):
[in this window]
[in a new window]
 
Fig. 2. Data plotted by country of origin.

 

    DISCUSSION
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
As apparent from the lists of selected accessions, the use of BLUPs causes differences in accessions identified in the highest or lowest 1% for a given trait when compared with the use of observed values. BLUPs account for spatial variation; thus, their rankings should differ from the rankings of the observed values. However, drastic differences should not exist between the two sets of rankings, resulting in high correlation coefficients between BLUPs and observed values. This is indeed what we observed in the sorghum photoperiod insensitive collection, with rank correlation coefficients ranging from r = 0.82 to 0.92.

The importance of these difference and similarities was demonstrated in simulations by Sebolai et al. (2005) who concluded that in relatively uniform fields, the use of BLUPs identified 56 to 60% of the true top ranking accessions, while the use of the observed values identified only 35 to 37% of the true top ranking accessions in a population and that both were relatively unaffected by check plot density. While it is not possible within the constraints of this field screening experiment to determine if more of the true highest and true lowest 1% of the accessions were selected for any trait using BLUPs or observed values, the results from Sebolai et al. (2005) strongly suggest that basing selection on BLUPs will at least equal selection using observed values in nonuniform field conditions and double the probability of identifying accessions with true high and low values for a given trait under uniform field conditions such as those common to research fields. We therefore recommend the use of the selections based on BLUPs (Tables 37) to identify probable "best" and "worst" accessions for chemical and nutritional traits within this population of accessions for future use by breeders.

Principal component analysis often reveals relationships that may not have been previously suspected and thus allows interpretation that would not normally result (Johnson and Wichern, 1998). In many cases, it is possible to give meaningful interpretations to the coefficients (eigenvectors) in the principal components. The analysis indicated that nearly 80% of the sample variation is described by the first two principal components. On the basis of its coefficients, the first principal component appears to contrast starch content with a weighted average of fat, crude protein, acid detergent fiber, and phosphorus content. The second principal component appears to be a contrast between fat and acid detergent fiber content (since the coefficients for the other variables are much smaller in magnitude, these variables can be ignored in the definition of the second principal component). Since starch content represents the largest chemical fraction of sorghum grain, its importance in contributing to sample variation should come as no surprise.

An alternative approach to understanding the results of the principal component analysis is to consider them on a morphological basis. Starch is found primarily in endosperm. Fat and protein are found primarily in the embryo (or "germ") fraction of the seed. Fiber is found primarily in the seed coat. One might therefore infer that the first principal component is essentially a contrast of endosperm vs. all other seed fractions and that the second component is a contrast of germ vs. seed coat. In both cases, variation is explained by contrasting the energy-rich components of sorghum grain with less energy-rich components. Understanding energy storage and compartmentalization, therefore appears key to understanding variation among accessions in this collection.

The plot of canonical values shown in Fig. 1 clearly displays clustering of the values. These clusters suggest relationships among the accessions. However, when canonical values are plotted by country of origin (Fig. 2), endosperm color, endosperm texture, endosperm type, kernel color, kernel plumpness, kernel shape, panicle shape, pericarp color, and race, no geographical, morphological, or taxonomic interpretation can be assigned to the clusters identified in Fig. 1.

In conclusion, chemical and nutritional traits in the U.S. photoperiod insensitive sorghum collection were characterized by NIRS and statistical techniques to account for the effects of spatial variability common to field screening nurseries. These combined technologies allowed us to identify the highest and lowest 1% of the accessions for five chemical and nutritional traits. Principal component analyses attributed much of the total variation among the accessions to a contrast of starch with a weighted average of fat, crude protein, acid detergent fiber, and phosphorus. Cluster analyses showed clear separation of clusters on the basis of canonical values, but no geographical, taxonomical, or morphological interpretation of the clusters was apparent.


    ACKNOWLEDGMENTS
 
The authors thank Mr. Steven Masterson for his efforts in developing NIRS prediction equations and generating the NIRS data used in this study.


    NOTES
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Joint contribution of the USDA-ARS and the Univ. of Nebraska Agric. Exp. Stn. As Paper no. 14532, Journal Series, Nebraska Agric. Exp. Stn.

1 The use of trade, firm, or corporation names in this publication is for the information and convenience of the reader. Such use does not constitute an official endorsement or approval by the United States Department of Agriculture, the Agricultural Research Service, or the Northern Plains Area of any product or service to the exclusion of others that may be suitable. Back

Received for publication May 4, 2005.


    REFERENCES
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 





This Article
Right arrow Abstract Freely available
Right arrow Figures Only
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via ISI Web of Science (1)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Hooks, T.
Right arrow Articles by Vogel, K. P.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Hooks, T.
Right arrow Articles by Vogel, K. P.
Agricola
Right arrow Articles by Hooks, T.
Right arrow Articles by Vogel, K. P.
Related Collections
Right arrow Sorghum
Right arrow Data Management
Right arrow Plant Genetic Resources


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
The SCI Journals Agronomy Journal Vadose Zone Journal
Journal of Plant Registrations Soil Science Society of America Journal
Journal of Natural Resources
and Life Sciences Education
Journal of
Environmental Quality