|
|
||||||||
a USDA-ARS, Plant Science and Entomology Research Unit, Dep. of Agronomy, 2001 Throckmorton Plant Science Center, Kansas State Univ., Manhattan, KS 66506 USA
b Pioneer Hi-Bred Intl., P.O. Box 328, Hamel, IL 62046 USA
c USDA-ARS, Plant Physiology and Genetics Research Unit, Dep. of Crop Sciences, 1101W. Peabody Dr., Univ. of Illinois, Urbana, IL 61801 USA
d CIMMYT Applied Biotechnology Center, Lisboa 27, Apdo Postal 6-641 06600 Mexico DF, Mexico
gbg{at}ksu.edu
| ABSTRACT |
|---|
|
|
|---|
Abbreviations: cM, centimorgans GD, genetic distance MG, maturity group PI, Plant Introduction PCR, polymerase chain reaction RAPD, random amplified polymorphic DNA SSR, simple sequence repeat RFLP, restriction fragment length polymorphism UPGMA, Unweighted Pair Group Method Using Arithmetic Averages
| INTRODUCTION |
|---|
|
|
|---|
The limited germplasm base of North American soybean cultivars threatens the ability of breeders to sustain genetic improvement. It also increases vulnerability of the crop to changes in pathogen and pest populations. Introgression of new genetic diversity through hybridization with introduced germplasm is one way to increase genetic variation in breeding populations, the base upon which gain from selection depends. More than 15 000 introduced accessions of G. max are maintained in the USDA soybean germplasm collection, the vast majority of which do not appear in the pedigree of any released cultivar. These introductions potentially represent a rich source of allelic variation not present in current North American soybean cultivars. At present, the use of exotic germplasm in soybean cultivar development generally has been limited to a small number of introductions that have served as sources of genes for resistance to disease and insect pests and have contributed little to overall genetic diversity.
To utilize introduced germplasm to increase productivity and provide new sources of genetic variation for future gain, selection criteria for parental stock need to consider genetic diversity as well as agronomic value. Agronomic performance of exotic germplasm in the target environment may be taken into account in parental selection; but, it is not known what effect this has on the probability of obtaining new allelic diversity. Geographic origin and plant morphology data are available for most of the introductions in the USDA collection and frequently serve as criteria for selection of genetically diverse parents. However, morphological differences usually are determined by a small number of genes and may not be representative of genetic divergence in the entire genome. Geographic origin also may not be an adequate indicator of genetic diversity. The original source of many soybean accessions introduced into the collection from secondary sources in Europe, Africa, and Asia are not known. In addition, a large number of the accessions in the USDA soybean collection are from the same regions of China and Korea as the introductions that make up the base of the North American germplasm. The genetic relationships of these materials with the major ancestors of modern U.S. cultivars are not known.
An understanding of the relationship of soybean introductions with the ancestors of North American cultivars based on selectively neutral DNA markers could be useful to breeding programs for selection of diverse parents. Using elite lines and culitvars, Kisha et al. (1997) demonstrated that greater diversity among parental lines as measured by restriction fragment length polymorphism (RFLP) makers produced greater variance for seed yield in the resulting populations. The availability of polymerase chain reaction (PCR) based molecular markers, such as randomly amplified polymorphic DNAs (RAPDs) and simple sequence repeats (SSRs), allows one to survey a large number of loci from many accessions. Thompson and Nelson (1998a) identified a set of 35 random primers that reliably produced RAPDs with gene heterozygosity scores of 0.30 or greater when a group of 35 genotypes were analyzed. Using 20 SSR loci, Diwan and Cregan (1997) observed a mean gene diversity of 0.80 in a survey of 35 major ancestors of North American soybean cultivars, much greater than that observed in similar studies using RFLP markers (Keim et al., 1992). Using the 20 SSR loci, they were able to distinguish several modern soybean cultivars considered identical on the basis of RFLPs, morphological, and pigmentation traits.
We have used 87 soybean introductions successfully in a our germplasm enhancement program to increase yield. The objective of this research was to evaluate the relationships of these introduced lines with the major ancestors of North American cultivars.
| Materials and methods |
|---|
|
|
|---|
|
|
Polymorphic DNA segments amplified with each random and microsatellite primer pair were assigned a letter and each band was scored as present (1) or absent (0). A matrix of all possible pair-wise GD was calculated by PROC IML in PC SAS (SAS Institute, 1989) on the basis of the following formula: GD = 1 - a/(n - d), where a is the number of shared bands for each pair of genotypes (1,1), n is the number of possible matches (1,1 and 0,0) and mismatches (0,1 and 1,0) between genotypes, and d is the number of (0,0) matches between genotypes. The distance measure used is equivalent to the compliment of Jaccard's coefficient of similarity (Jaccard, 1908). The 0,0 matches were not treated as information for two reasons: (i) lack of a RAPD band in two genotypes may not be due to a common evolutionary event, and (ii) the presence of multiple alleles at microsatellite loci inflates genetic similarity if 0,0 matches are treated as information. Gene diversity scores were calculated for each marker as 1-
pij2 where pij is the frequency of the jth allele of Marker i (Weir, 1990). Each RAPD fragment was considered to have two forms (present or absent) whereas multiple alleles were detected at SSR loci.
Cluster analysis was performed on the 105 x 105 distance matrix using the AVERAGE and WARD options of PROC CLUSTER of PC SAS (SAS Institute, 1989). Mean distances within and between clusters were calculated from the genetic distance matrix. An acceptable cluster was defined as a group of two or more genotypes with a within-cluster genetic distance less than the overall mean genetic distance, and between cluster distances greater than either within cluster distance of the two clusters involved.
To test the reliability of cluster assignments, resampling of the original data coupled with cluster analysis was performed using a SAS macro provided by D.Z. Skinner (1998, personal communication). The macro couples the IML and CLUSTER procedures of SAS to sample the data set, calculate a measure of genetic distance based on the sample, and perform cluster analysis. One hundred iterations of resampling and clustering were performed on the original data with the AVERAGE option of PROC CLUSTER. A 105 x 105 matrix was produced showing all possible pair-wise combinations of lines and the number of iterations for which any two lines were placed in the same cluster, which is referred to as the co-occurrence value (Vera Cruz et al., 1996). These data and the cluster assignments in the original analysis were used to determine the number of iterations that a line was grouped with all other members of a cluster. Cluster assignment by the AVERAGE and WARD options of PROC CLUSTER and average number of iterations a genotype was clustered with other members were used to place lines in a consensus cluster.
The distance matrix was subjected to MDS (Shepard, 1974) by the MDS procedure of PC SAS (SAS Institute, 1992) and criteria similar to that described by Gizlice et al. (1996) and Thompson et al. (1998). To maintain the scale of 0 to 1 used for the genetic distances, the ABSOLUTE option was used. The goodness of fit criteria (R2) between the original data and the predicted values that were derived from the MDS coordinates was used to evaluate the effectiveness of 2 to 20 dimensions. The most effective analysis was defined as the fewest dimensions resulting in an R2 > 0.95 with the original genetic distance matrix.
| Results and discussion |
|---|
|
|
|---|
Relationship of Ancestors and Plant Introductions
The UGPMA and Ward's methods both assigned genotypes into nine clusters (Table 1) that were judged reasonable on the basis of the within and between cluster distances. The cluster assignments were similar for the two algorithms although some rearrangement occurred. Thompson et al. (1998) identified 10 major clusters among 32 genotypes using the RAPD marker data included in this study. The addition of microsatellite marker data and 70 additional plant introductions resulted in some changes in the diversity patterns observed in the earlier study. Most notably, clusters were larger, and less distinction was observed between groups.
Members of Cluster A were grouped together in the majority of resampling iterations (Table 1). This cluster consisted of the ancestral line `Korean' and 16 plant introductions originating from France, Japan, northern China, Hungary, and the far eastern provinces of the former USSR (Table 1). Korean contributed less than 1% to the genetic base of northern cultivars and was not involved in the development of any cultivars in the southern growing region, so this cluster has little relationship to current U.S. varieties. Although members of the cluster were of very diverse origin, it was one of the most consistent clusters. Ten of the 17 members were placed in this cluster in more than 90% of the iterations. Cluster A also had the smallest within cluster mean GD of 0.32.
The ancestor `Mandarin (Ottawa)', which contributed over 17% to the genetic base of northern U.S. cultivars, was clustered with eight introductions from northern China, Korea, Japan, and Hungary in 85% of clustering iterations to form Cluster B (Table 1). Thompson et al. (1998) put Mandarin (Ottawa), `Richland' and `Capital' in the same cluster but that cluster was the least stable of any of their ancestral clusters. The two hierarchical clustering procedures in this analysis split these three ancestors into three different clusters.
The ancestors `Dunfield' and `Mukden', each of which account for about 3.5% of the genetic base of North American cultivars, were included in Cluster C, the largest cluster identified. Thompson et al. (1998) found that Dunfield did not fit well into any single cluster. In our study, 16 of the nineteen members of Cluster C, including Mukden and Dunfield, were grouped together in a majority of resampling iterations (Table 1). This cluster contained seven cultivars released in China in Liaoning and Jilin provinces between 1963 and 1983. Mukden and Dunfield originated from Liaoning and Jilin, respectively. Two of the Chinese lines were received with the name Jilin No. 3, one from Romania in 1971 (PI 361075) and one from Jilin province in 1978 (PI 427099). The physical appearance of these lines had been compared in the field and although the major qualitative descriptors were identical, the plant types were judged to be different enough to warrant maintaining both lines in the collection. This decision was supported by our DNA data which showed that these two lines differed at 16% of the loci compared. Pedigree information relates some of the cultivars in this cluster. Jilin No. 6 (PI 383276) shares a common parent with Jilin No. 3 and was most closely related to the two accessions named Jilin No. 3, PI 361075 and PI 427099, with GD of 0.24 and 0.20, respectively.
Cluster D was mostly MG I and II accessions developed in the former Soviet Union or originating from China. It includes three introductions (PI 68508, PI 68522, and PI 68658) that were imported into the USA at about the same time and from approximately the same region of China as many of the major ancestors of current N.A. cultivars. Also in this group were four cultivars released in Jilin and Liaoning provinces between 1970 and 1978. Some of these Chinese cultivars are very closely related to the Chinese cultivars in Cluster C. Jilin 10 in Cluster C has the same parents as Jilin 11 (PI 391584) in Cluster D. Jilin No. 6 in Cluster C has one of the same parents as Jilin No. 8 (PI 391594) in Cluster D. Jilin No. 3 in Cluster C is a parent of Jilin No. 10 (PI 391584) in Cluster D. The genetic distance between Clusters C and D was 0.50, less than the distance between most other clusters.
The MG VI and VII lines `Jackson', `Roanoke', and `Ogden', which together account for 24% of the genetic base of cultivars developed in the southern USA, were placed in the same cluster in more than 78% of iterations and also formed one of the major ancestral clusters in the analysis of Thompson et al. (1998). The UPGMA and Ward's methods clustered these genotypes in Cluster F with five and 14 introductions, respectively, and the ancestor Capital. The cluster classification of Capital was difficult to assess since it was not placed in any one cluster in greater than 50% of resampling iterations. On the basis of the mean genetic distance between Capital and PIs 291306A, 297500, and 407654 (entries 64, 65, and 66, respectively) of 0.48 and proximity of Capital to these introductions on the scatter plot of the first two dimensions of MDS (Fig. 1) , Capital (entry 63) was placed in consensus Cluster F.
|
The ancestors `Arksoy' (entry 72) and `Ralsoy' (entry 74) were always placed in the same cluster. This is consistent with the results of Thompson et al. (1998) but their analysis added `Haberlandt' to this group. These two closely related MG VI lines were clustered with the MG IV line `Perry' (entry 73), defined as a first progeny by Gizlice et al. (1994), in 77% of iterations to form Cluster G. Some overlap was observed in the MDS scatter plot between this group of genotypes and Cluster H (Fig. 2) which was defined by the ancestors Haberlandt (entry 78), and Richland (entry 84), and introductions PI 91730-1, PI 189893, PI 417510, and FC 04007B (entries 79, 80, 83, and 75, respectively) that were clustered together in approximately 75% of resampling iterations. Four other soybean introductions were included in Cluster H that were not stable members of the cluster (Table 1).
|
Clusters J and K were comprised entirely of plant introductions. Three of the most diverse genotypes in the study, PI 283331 (from Morroco through Australia in 1962), PI 87588 (from Korea in 1930), and PI 475814 (from Xinjiang, China in 1982) were placed in Cluster J in a majority of iterations. There is little soybean production in Xinjiang and the history of soybean in this province is much shorter than is most places in China. The soybean cultivars grown in Xinjaing probably originated in Northeast China. The fourth member of this cluster (PI 464887) is from Jilin province. All of the accessions in Cluster J may have originally come from Northeast China but they are genetically distinct from the other accessions from the region that were included in this research. PI 458511, which is Kai yu No. 3 released in Liaoning province in 1976 (Cui et al., 1999), had the second largest mean genetic distance from all other genotypes. It was grouped with another diverse introduction, PI 427088B, to form Cluster K. PI 427088B is an unknown Chinese cultivar obtained from a soybean crushing plant in Jilin province in 1978 (Bernard et al., 1989). Clusters J and K had the greatest mean GD with all other clusters.
Kisha et al. (1998) analyzed 9 ancestors and 12 introductions that are also a part of this study. Because of the few lines in common between the two studies it is difficult to make meaningful comparisons. We analyzed 10 of the 15 members of the Cluster 4 defined by Kisha et al. (1998). Our results placed those 10 accessions in three different clusters. We also had five members of the Cluster 6 defined by Kisha et al. (1998) and we agreed with the grouping of A.K. (Harrow), Lincoln, PI 445830, and S-100 but our results put Richland in a different cluster. Cluster formation is a comparative process so changing some of the lines in the analysis can change cluster members and Kisha et al. (1998) used only 65 RFLP alleles to classify 165 entries
Genetic Diversity in Exotic Soybean Germplasm
No discernable geographical patterns of variation were found in this research. However, lines included in the study were not selected to accurately represent the major regions of soybean production and many of the entries were products of modern breeding programs. Griffin and Palmer (1995) assumed that the long history of domestication and commerce of soybean in Asia has contributed to the dispersion of alleles throughout the region, lessening the influence of geography on patterns of variation among Asian soybean accessions. Nelson and Li (1998) did not find this to be true when comparing primitive soybean accessions. A unique allele was detected in the ancestral line CNS at the Satt 080 locus. Another unique allele was present at the Satt 006 locus in the genotype PI 87588. Satt 002 detected a rare allele in Arksoy, Ralsoy, and PI 399016. All of the lines for which unique alleles were detected originated from the Korean peninsula, except for CNS.
The level of genetic diversity observed in this study was higher than that reported in studies using RFLPs. This may be due to the selection of RAPD markers known to have high levels of gene diversity in soybeans and the inclusion of a small number of SSR loci that are more highly polymorphic than RFLPs. The analysis was able to identify related groups of genotypes and many of the soybean plant introductions included in the study were found to be genetically distinct from the founding stock of North American soybean breeding programs. For example, Clusters A and F were composed mainly of plant introductions, including only two ancestors, Korean and Capital, that have made only small contributions to the pedigrees of soybean cultivars in the northern USA. Clusters D, J, and K consisted exclusively of introduced germplasm. These results identify three new genetic groups (Clusters F, J, and K) in addition to those already found by Thompson et al. (1998) that are distinct from the ancestral base of U.S. soybean cultivars.
This study will aid breeders interested in selecting genetically diverse germplasm, both in relation to the ancestors of North American cultivars and to other plant introductions. All of the introductions analyzed have successfully contributed to a breeding program focused on increasing yield and genetic diversity. The fact that many of the genetically diverse plant introductions in this study were used to produce lines having from 25% to 100% plant introduction parentage and yields comparable to contemporary cultivars of similar adaptation (Thompson and Nelson, 1998b) illustrates the potential useful of these lines.
We have made the assumption that lines that have yields similar to commercial cultivars and that have a parent or parents that are genetically distinct from those same cultivars are good candidates for inclusion in a breeding program to expand genetic diversity and increase yield. We do not know if the plant introductions in this research have unique alleles for traits of economic importance. None of the RAPD markers used in the study have been mapped. Satt 002 is on linkage group D2 (Cregan et al., 1999) and is within 10 centimorgans (cM) of an unknown malate dehydrogenase locus (Palmer et al., 1992). Satt 006 is on Linkage Group L (Cregan et al., 1999) less than 3 cM from the Dt1 locus that controls stem termination. All genetic groups had members with both determinate and indeterminate phenotypes except for the two member K cluster that was all indeterminate and the three member E cluster that was all determinate ancestral lines. Satt 080 is on Linkage Group N (Cregan et al., 1999) and close to the Rps1 locus. It is unlikely that any of these known linkages influenced the cluster analyses of this research. At some point in the future, it is likely that we will be able be identify the exact loci and the unique alleles that can increase yield. Until we do and to help in that process, we think that identifying genetically distinct introductions that have the potential to produce high yielding lines is a productive strategy. Thompson and Nelson (1998b) demonstrated that these diverse soybean introductions can contribute genes that can increase the yield of U.S. cultivars. The literature shows that there is a relationship between marker diversity of the parents and genetic variance of the resulting progeny. Collecting data on genetic diversity in parents and progeny is time consuming and expensive. By identifying genetically diverse introductions that have the potential to produce high yielding progeny, we are making available to breeders and geneticists important germplasm resources that have a high potential to contribute not only to increasing yield but also to the process of understanding the genetic basis of yield improvement.Gizlice Carter Burton 1993; Johnson Wichern 1992; Lorenzen Boutin Young Specht Shoemaker 1995; Oard Dronavilli 1992; SAS 1989; SAS 1992
| ACKNOWLEDGMENTS |
|---|
| NOTES |
|---|
|
|
|---|
Received for publication April 15, 1999.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
Y.-B. Fu, G. W. Peterson, and M. J. Morrison Genetic Diversity of Canadian Soybean Cultivars and Exotic Germplasm Revealed by Simple Sequence Repeat Markers Crop Sci., September 1, 2007; 47(5): 1947 - 1954. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. M. Nichols, W. Lianzheng, Y. Pei, K. D. Glover, and B. W. Diers Variability among Chinese Glycine soja and Chinese and North American Soybean Genotypes Crop Sci., May 31, 2007; 47(3): 1289 - 1298. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Badu-Apraku and A. F. Lum Agronomic Performance of Striga Resistant Early-Maturing Maize Varieties and Inbred Lines in the Savannas of West and Central Africa Crop Sci., March 1, 2007; 47(2): 737 - 748. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. S. Guzman, B. W. Diers, D. J. Neece, S. K. St. Martin, A. R. LeRoy, C. R. Grau, T. J. Hughes, and R. L. Nelson QTL Associated with Yield in Three Backcross-Derived Populations of Soybean Crop Sci., January 22, 2007; 47(1): 111 - 122. [Abstract] [Full Text] [PDF] |
||||
![]() |
R.L. Nelson and E.O.C. Johnson Registration of Soybean Germplasm Lines LG97-7012, LG98-1445, and LG98-1605 Crop Sci., June 20, 2006; 46(4): 1822 - 1824. [Full Text] [PDF] |
||||
![]() |
R.L. Nelson and E.O.C. Johnson Registration of LG96-1797 Soybean Germplasm Crop Sci., April 25, 2006; 46(3): 1403 - 1403. [Full Text] [PDF] |
||||
![]() |
K. D. Bilyeu and P. R. Beuselinck Genetic Divergence between North American Ancestral Soybean Lines and Introductions with Resistance to Soybean Cyst Nematode Revealed by Chloroplast Haplotype J. Hered., September 1, 2005; 96(5): 593 - 599. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Chen and R. L. Nelson Relationship between Origin and Genetic Diversity in Chinese Soybean Germplasm Crop Sci., June 24, 2005; 45(4): 1645 - 1652. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.G. Shannon, R.L. Nelson, and J.A. Wrather Registration of S99-11509 and S99-11986 Improved Soybean Germplasm with Diverse Pedigree Crop Sci., June 24, 2005; 45(4): 1672 - 1673. [Full Text] [PDF] |
||||
![]() |
J. S. C. Smith, D. N. Duvick, O. S. Smith, M. Cooper, and L. Feng Changes in Pedigree Backgrounds of Pioneer Brand Maize Hybrids Widely Grown from 1930 to 1999 Crop Sci., November 1, 2004; 44(6): 1935 - 1946. [Abstract] [Full Text] [PDF] |
||||
![]() |
M.L. Warburton, G.L. Brown-Guedira, and R.L. Nelson Registration of LG92-4208, LG94-1128, LG94-1906, and LG94-4667 Soybean Germplasm Crop Sci., July 1, 2004; 44(4): 1501 - 1502. [Full Text] [PDF] |
||||
![]() |
G. L. Brown-Guedira, M. L. Warburton, and R. L. Nelson Registration of LG92-1255, LG93-7054, LG93-7654, and LG93-7792 Soybean Germplasm Crop Sci., January 1, 2004; 44(1): 356 - 357. [Full Text] [PDF] |
||||
![]() |
G. N. Ude, W. J. Kenworthy, J. M. Costa, P. B. Cregan, and J. Alvernaz Genetic Diversity of Soybean Cultivars from China, Japan, North America, and North American Ancestral Lines Determined by Amplified Fragment Length Polymorphism Crop Sci., September 1, 2003; 43(5): 1858 - 1867. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. A. Mohammadi and B. M. Prasanna Analysis of Genetic Diversity in Crop Plants--Salient Statistical Tools and Considerations Crop Sci., July 1, 2003; 43(4): 1235 - 1248. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. D. Burnham, D. M. Francis, A. E. Dorrance, R. J. Fioritto, and S. K. St. Martin Genetic Diversity Patterns among Phytophthora Resistant Soybean Plant Introductions Based on SSR Markers Crop Sci., March 1, 2002; 42(2): 338 - 343. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| The SCI Journals | Agronomy Journal | Vadose Zone Journal | |||
| Journal of Natural Resources and Life Sciences Education |
Soil Science Society of America Journal | ||||
| Journal of Plant Registrations | Journal of Environmental Quality |
The Plant Genome | |||