|
|
||||||||
a Dep. of Agronomy, Iowa State Univ., Ames, IA 50011-1010
b Dep. of Research and Product Development, Pioneer Hi-Bred International, Inc., Johnston, IA 50131
* Corresponding author (wfehr{at}iastate.edu).
| ABSTRACT |
|---|
|
|
|---|
Abbreviations: AFLP, amplified fragment length polymorphism p.d.f, probability density function PI, plant introduction QTL, quantitative trait loci RAPD, random amplified polymorphic DNA RFLP, restriction fragment length polymorphism SSR, simple sequence repeat
| INTRODUCTION |
|---|
|
|
|---|
Quantitative trait loci have been identified through associations with changes in molecular marker allele frequency in recurrent selection populations. Stuber et al. (1980) found that allele frequency changes at eight isozyme loci in maize (Zea mays L.) agreed with yield increases in four recurrent selection experiments. Changes in allele frequencies of restriction fragment length polymorphisms (RFLP) in the Illinois Long Term Selection Experiment in maize corresponded to QTL for increased oil concentration identified in a F2 mapping population (Sughroue and Rocheford, 1994). De Koeyer et al. (2001) measured allele frequency changes of RFLPs after seven cycles of recurrent selection for yield and other agronomic traits in oat (Avena sativa L.). They identified 13 QTL that had been detected previously in a recombinant inbred line population. Sebastian et al. (1995) compared allele frequencies for RFLP and random amplified polymorphic DNA (RAPD) markers of ancestral parents and elite soybean cultivars and lines. Changes in allele frequency were associated with 17 QTL for yield.
Whole genome scans for association of allele frequency with QTL would be expected to be especially effective in the identification of QTL in soybean. The relatively few ancestral parents that were the founders of the current elite gene pools and the self-fertilizing nature of the species favor the existence of extensive linkage disequilibrium (Nordborg et al., 2002; Rafalski, 2002a, 2002b).
A limited number of QTL for yield have been reported in soybean. Orf et al. (1999) used lines from Minsoy x Noir 1, Minsoy x Archer, and Noir 1 x Archer populations to identify four QTL for yield with RFLP and simple sequence repeat (SSR) markers. Concibido et al. (2003) used SSR and amplified fragment length polymorphism (AFLP) markers and the advanced backcross method of QTL mapping described by Tanksley and Nelson (1996) to identify a yield QTL in a HS-1 (Hartz Seed, Stuttgart, AR) x PI 407305 population. Specht et al. (2001) used the genotypic data of Orf et al. (1999) from the Minsoy x Noir 1 population to identify six QTL for yield under water stress conditions. Yuan et al. (2002) used SSRs in the Essex x Forrest and Flyer x Hartwig populations to identify four yield QTL.
Recurrent selection for yield in five populations, designated AP10, AP11, AP12, AP13, and AP14, that differed in their percentages of PI germplasm began at Iowa State University in 1979. Vello et al. (1984) found the genetic variability for yield in cycle 0 (C0) of the four populations that contained PI germplasm was twice that of the population with no PI percentage. Ininda et al. (1996) reported the genetic gain for yield after three cycles of selection among F4derived lines in the five populations was 2.5% cycle1 in AP10 (100% PI), 2.0% in AP11 (75% PI), 3.1% in AP12 (50% PI), 2.8% in AP13 (25% PI), and 5.4% in AP14 (0% PI). There were no significant differences among the five populations in genetic variability among lines for yield in cycle 4 (C4) (Narvel, 1999). Changes in marker allele frequencies associated with recurrent selection for yield in the populations may be useful to identify genomic regions important for yield in diverse soybean germplasm. The objectives of this study were to identify QTL for yield in elite and PI germplasm through their association with SSR alleles that had frequency changes in the populations AP10, AP12, and AP14 and to determine if the PIs possessed favorable alleles for yield at the QTL.
| MATERIALS AND METHODS |
|---|
|
|
|---|
The method described by Sebastian et al. (1995) was the basis for the QTL analysis. The method involves genotyping with molecular markers the improved lines from the most advanced cycle of selection and the earliest known ancestors of the improved lines. The lines used in this study were the original PI and elite parents of the C0 populations, the 20 high-yielding lines selected as parents in AP10 and AP14 to form the cycle 1 (C1) populations, the 15 highest-yielding lines from the C4 populations of AP10 and AP14, and the 13 highest-yielding lines from the C4 population of AP12 (Fig. 1) . The number of C4 lines chosen from AP12 was limited to 13 to make it possible to analyze the samples in complete 96-well plates. One ancestor of AP14 was an elite experimental line that could not be included in the study because its seed did not germinate. The data for selecting the highest-yielding lines from the C4 populations were obtained by Narvel (1999), who tested 100 randomly chosen C4 lines of each population in two replications at three Iowa locations in 2 yr.
|
A total of 184 fluorescently labeled SSRs spaced 15 cM apart on average were chosen based on their genome distribution. The map positions were derived from the USDAIowa State Univ. genetic map (Cregan et al., 1999). The PCR reaction consisted of 1.0 µL GeneAmp 10x PCR Buffer II, 0.6 µL 25 mM MgCl2, 0.2 µL 10 mM dNTP, 1.7 µL 2 µM forward/reverse primer mix, 0.06 µL AmpliTaq Gold DNA polymerase, 1.0 µL 10 ng µL1 DNA, and 5.44 µL HPLC H2O (Perkin-Elmer, Foster City, CA). The PCR program was 10 min at 95°C, then 45 cycles of the following: 50 s at 95°C, 50 s at the annealing temperature, and 85 s at 72°C. A final extension step of 10 min at 72°C was used. PCR was performed individually for each marker and genotype combination.
PCR products were multiplexed by allele size and florescence color, diluted by a SciClone Liquid Handling Workstation (Zymark Corporation, Hopkinton, MA), and separated via capillary electrophoresis with an ABI Prism 3700 DNA Analyzer (Applied Biosystems, Foster City, CA). ROX 400HD was used as the internal standard to calculate allele sizes (Applied Biosystems, Foster City, CA). Data were collected with GENESCAN Prism software (Applied Biosystems, Foster City, CA) and allele sizes estimated by GENOTYPER software (Applied Biosystems, Foster City, CA). Manual verification of the allele sizes was performed.
Allele Frequency Changes
Allele frequencies in the C4 lines were compared with the parents used to form the C0 populations of AP10, AP12, and AP14. For AP10 and AP14, the allele frequencies in the C4 lines also were compared with the 20 highest-yielding C0 lines used to form the C1 populations. The probability that each improved line inherited each allele from its ancestors was calculated and averaged over improved lines to determine the expected frequency of each allele. The observed and expected allele frequencies of the improved lines were compared to determine which alleles occurred more or less frequently than expected (De Koyer et al., 2001; Sebastian et al., 1995). The comparision of allele frequencies before and after selection must account for mutation, migration, and genetic drift, which may influence the allele frequency of the population (Falconer and Mackay, 1996). The effects of mutation were considered negligible because the breeding process consisted of only four cycles of selection. The self-fertilizing nature of soybean and the care practiced during intermating likely prevented the migration of alleles into the population. Genetic drift may have had a large influence on the allele frequencies of the lines in the C0 and C4 generations, and was accounted for through two methods.
For AP10 and AP14, analyses conducted with the parents of the C0 populations as the ancestors were compared with the analyses when the 20 highest-yielding C0 lines of each population were considered the ancestors. The comparison was used to differentiate between changes in frequency of marker alleles due to an association with a selected QTL allele versus allele frequency changes due to the restriction of alleles that occurred in forming the C1 populations. The 20 C0 lines used as parents to form the C1 populations was half the number of parents used to form the C0 populations of AP10 and AP14 and one-quarter the number of parents used to form the C0 population of AP12. The reduction in the number of parents and the inbreeding of the parents used to form the C1 populations contributed to genetic drift through a restriction in the number of alleles from the parents of the C0 populations that could be expected in the C4 lines.
The flow of each marker allele was simulated 10000 times from the ancestors to the C4 lines to construct a probability density function (p.d.f.) for each recurrent selection population structure. Given the pedigree structure for the C4 lines, and the genotype of the most distant ancestral nodes, the genotype of the C4 lines was computed by a simulation that assumed random inheritance of parental alleles, with each intermediate selfed to homozygosity. The simulation fully accounted for the dependencies between the probabilities of an allele appearing in each C4 line because of the dependencies reflected in the pedigree structure. Repeating the simulation 10000 times provided an estimate of the p.d.f. for the allele frequencies in the C4 lines under the null hypothesis of no selection. Missing ancestral genotypes were handled by randomly selecting for each simulation of a genotype based on the allele frequencies in the ancestors. The area under the tail of the p.d.f. was quantified as a P value that was the measure of the number of rounds that the simulation generated an allele frequency at least as extreme as that observed in the C4 lines (Miller and Miller, 1999). The formula was P = (Se/St), where Se was the number of rounds of simulation that produced an allele frequency equal to or more extreme than that observed in the C4 lines and St was the total number of rounds of simulation. For example, if the expected frequency of an allele in the C4 lines was 0.15, the observed frequency was 0.30, and 500 out of 10000 rounds of simulation produced an allele frequency
0.30, then Se = 500, St = 10000, and P = 0.05. A P value
0.05 was used as the significance threshold to declare that the frequency change in a marker allele was not entirely due to random genetic drift, but may be associated with selection for a linked QTL allele.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
0.05 (Table 1). The marker alleles with frequency changes were inferred to be associated with alleles that have undergone selection at yield QTL. There were 27 alleles at 25 SSR markers identified as significant in AP10, 21 alleles at 20 SSRs in AP12, and 19 alleles at 18 SSRs in AP14. There were 16 alleles at 15 SSRs unique to the PIs, 9 alleles at 9 SSRs unique to the elites, and 41 alleles at 36 SSRs in both the PI and elite parents (Table 1). The difference between the number of significant marker alleles and loci indicated that more than one allele at some loci was influenced by selection for yield. Narvel et al. (2000) measured the diversity of the original parents of the C0 populations of AP10 to AP14 with SSR markers and observed a greater number of alleles in the PI parents than in the elite parents. Our results indicated that some of those unique PI alleles remained after the fourth cycle of selection and increased appreciably in frequency in the highest-yielding C4 individuals of AP10 and AP12.
|
The results obtained from AP10, AP12, and AP14 were compared with the studies of yield QTL mapping in biparental populations. There were four yield QTL reported by Orf et al. (1999), one by Concibido et al. (2003), six by Specht et al. (2001), and four by Yuan et al. (2002). We identified a total of 54 SSR markers associated with 43 yield QTL in AP10, AP12, and AP14 (Table 1). The greater number of yield QTL identified in our study than in previous research reflected the greater number of PI and elite parents used to form the C0 populations. The larger number of accessions provided the opportunity for a greater number of QTL alleles to segregate than would be possible in any biparental population. Nine of the SSR markers associated with yield QTL in our study were in seven regions where yield QTL had been identified in previous research (Table 1). A yield QTL detected with Satt066 on B2 was identified previously by Concibido et al. (2003) with the AFLP marker U3944117. The QTL region identified with the marker Satt294 on C1 also was reported by Yuan et al. (2002). On C2, Satt277 identified a yield QTL in the same region that was reported by Orf et al. (1999) with the markers Satt277 and Satt489 and by Specht et al. (2001) with the markers Satt205 and Satt489. The marker Sat_074 on F was associated with a QTL for yield in our study and in a study by Specht et al. (2001). A QTL on H was associated with Satt469 and Specht et al. (2001) used the linked marker, Satt314, to identify the same yield QTL. Two regions containing yield QTL that were identified on K also were reported by Yuan et al. (2002). They found the first QTL on K was associated with Satt337 and Satt326 and the second QTL was associated with Satt539. The markers Satt590, Satt567, and Satt540 on M identified a yield QTL that was detected by Orf et al. (1999) using Satt150 and by Specht et al. (2001) using Satt150 and Satt567.
There was a larger reduction in AP10 than in AP14 for the number of allele frequency changes when the C4 lines were compared with the original PI or elite parents than when they were compared with the 20 highest-yielding lines used to form the C1 populations. When the 15 highest-yielding C4 lines were compared with the 40 parents of the C0 population of AP10, 58 alleles were significant at P
0.10 (Table 2). When the 20 highest-yielding C0 lines of AP10 were used as the ancestors, 29 alleles were significant. In AP14 when the 15 highest-yielding C4 lines were compared with the 40 parents of the C0 population, 29 alleles were significant. When the 15 highest-yielding C4 lines were compared with the 20 C0 lines used to form the C1 population of AP14, 24 alleles were significant.
|
The number of alleles with significant frequency changes in AP10, AP12, and AP14 did not correspond to the genetic gain for yield that had been realized during three cycles of selection in the three populations. In our study, the number of markers with significant changes in allele frequency was greatest for AP10, intermediate for AP12, and least for AP14 (Table 2). Ininda et al. (1996) reported that the percentage yield increase from the first three cycles of selection was 2.5% cycle1 in AP10, 3.1% in AP12, and 5.4% in AP14. The greater genetic gain in AP14 for the initial cycles of selection may be due to genetic asymmetry. Allele frequencies near 0.5 maximize the heritability for additive traits (Falconer and Mackay, 1996). In Table 1, the expected allele frequencies of the SSR markers represent the allele frequencies that were present in the parents of the C0 and C1 populations. The observed allele frequencies indicate the allele frequencies that were present in the highest-yielding C4 lines that would be used to form the C5 populations. The percentage of alleles that had expected frequencies between 0.2 and 0.8 was 2.7% for AP10, 4.2% for AP12, and 9.5% for AP14. The percentage of alleles with observed frequencies between 0.2 and 0.8 was 71.6% for AP10, 47.9% for AP12, and 57.1% for AP14. The increase in the percentage of alleles with frequencies near 0.5 suggested the heritability and genetic gain for yield may increase in future cycles of selection for yield in AP10, AP12, and AP14. The higher percentage of QTL alleles with intermediate frequencies in AP10 compared with AP14 indicates that AP10 might have an increased rate of genetic gain for yield over AP14 in future cycles of selection for yield.
| ACKNOWLEDGMENTS |
|---|
| NOTES |
|---|
|
|
|---|
Received for publication April 8, 2003.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
P. S. Guzman, B. W. Diers, D. J. Neece, S. K. St. Martin, A. R. LeRoy, C. R. Grau, T. J. Hughes, and R. L. Nelson QTL Associated with Yield in Three Backcross-Derived Populations of Soybean Crop Sci., January 22, 2007; 47(1): 111 - 122. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| The SCI Journals | Agronomy Journal | Vadose Zone Journal | |||
| Journal of Plant Registrations | Soil Science Society of America Journal | ||||
| Journal of Natural Resources and Life Sciences Education |
Journal of Environmental Quality |
||||