|
|
||||||||
a USDA-ARS Forage and Range Research Lab., Logan, UT 84322
b Raymond F. Baker Center for Plant Breeding, Iowa State Univ., Ames, IA 50011; E. Charles Brummer current address: Center for Applied Technologies, Univ. of Georgia, Athens, GA 30602
c USDA-ARS Soybean Genomics and Improvement Lab., Beltsville, MD 20705
d Dept. of Plant Breeding and Genetics, Cornell Univ., Ithaca, NY 14853
* Corresponding author (brummer{at}uga.edu)
| ABSTRACT |
|---|
|
|
|---|
Abbreviations: FWER, familywise error rate LG, linkage group QTL, quantitative trait locus RFLP, restriction fragment length polymorphism SSR, simple sequence repeat
| INTRODUCTION |
|---|
|
|
|---|
Medicago sativa produces highly nutritious biomass under a wide range of environmental conditions. Following removal of aboveground biomass, M. sativa regrows from crown and auxiliary buds, resulting in several flushes of biomass production each year. In addition, M. sativa plants growing in the temperate parts of the world have the ability to enter physiological dormancy in autumn, enabling them to survive winter and recommence biomass production the following spring. Thus, over a several-year period, biomass production in M. sativa will vary depending on both the environmental conditions and the developmental stage of a given plant. Unlike woody species, in which each new seasonal production of biomass extends the previous year's growth, herbaceous perennials like M. sativa have all aboveground biomass removed repeatedly so that each growth period replaces, rather than augments, the biomass produced previously. As trees age, some of the genomic regions associated with biomass production appear to change (Wu, 1998; Lerceteau et al., 2001), but the situation for alfalfa is unknown.
Extensive variation among alfalfa populations for biomass production is widely recognized by plant breeders, yet over the past 25 yr, applied alfalfa breeding efforts in the upper Midwest have not increased the biomass yield potential of commercially cultivated alfalfa substantially (Riday and Brummer, 2002). The improved alfalfa yield of newer cultivars relative to older ones appears to be attributable to increased pest resistance rather than to improved yield potential per se (Lamb et al., 2006). Two obvious methods to improve biomass production, neither of which has been extensively exploited in alfalfa, are (i) direct selection specifically for biomass production and (ii) the exploitation of heterosis through hybrid or semihybrid alfalfa cultivars (Brummer, 1999). Biomass yield heterosis has been identified in alfalfa (reviewed in Brummer, 1999; Riday and Brummer, 2002), but hybrid alfalfa cultivars have not been a focus of most breeding programs, with only a few hybrid cultivars commercially available (e.g., Hybriforce 400 from Dairyland Seeds, Inc., West Bend, WI).
The tetrasomic tetraploid genome of M. sativa, associated with an allogamous breeding system and a general intolerance of inbreeding, complicates genetic analyses. Although the theory of genetic linkage in autopolyploids is well developed (Haldane, 1930; DeWinton and Haldane, 1931; Mather, 1936), the difficulty of resolving allele dosage and linkage phases limits the information content that can be gathered from most current molecular markers. To avoid these complications, diploid relatives of cultivated polyploids have been mapped in several species, including alfalfa (e.g., Brummer et al., 1993; Kiss et al., 1993; Echt et al., 1994). While diploid mapping avoids the complexities of polysomic inheritance and works well if the synteny across ploidy levels is high, it might not be useful if the genetic control of a phenotype differs across ploidies. Evidence for differential genetic control across ploidies has been shown by gene expression profiling in yeast (Galitski et al., 1999) and by quantitative genetics in M. sativa (Groose et al., 1988). Several mapping studies have been conducted in tetraploid M. sativa (Brouwer and Osborn, 1999; Julier et al., 2003; Sledge et al., 2005), but none of these studies has investigated the underlying genetics of biomass production.
The objectives of this experiment were (1) to develop a genetic linkage map of tetraploid M. sativa, (2) to identify genomic regions in tetraploid M. sativa associated with aboveground biomass production across developmental stages of plants growing in multiple environments, and (3) to infer the loci underlying biomass production heterosis.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Genotyping and Genetic Map Construction
Genomic DNA was extracted from leaves using the method of Doyle and Doyle (1990). Extracted DNA was then used for RFLP and SSR analysis. RFLP analysis was performed using the procedures of Brummer et al. (1993) using the restriction enzymes EcoRI and HindIII. Separate sets of population blots were made for each enzyme. Probes came from the following sources: Drs. G. D. Kochert and J. H. Bouton (Botany and Crop and Soil Sciences Depts., Univ. of Georgia, Athens, GA); Dr. T. C. Osborn (Plant Breeding Dept., Univ. of Wisconsin, Madison, WI); Dr. J. J. Volenec (Agronomy Dept., Purdue Univ., W. Lafayette, IN); Dr. S. Arcioni (Istituto di Ricerche sul Miglioramento Genetico delle Piante Foraggere, Perugia, Italy); Dr. B. D. McKersie (Plant Agriculture Dept., Univ. of Guelph, Guelph, ON); and Dr. S. Laberge (Agriculture and Agri-Food Canada, Sainte Foy, QC). Primer sequences for SSR markers were acquired from several sources (Diwan et al., 2000; Thoquet et al., 2002; Julier et al., 2003; Eujayl et al., 2003) (Supplementary Table 1). Simple sequence repeat loci were identified based on the nomenclature used by Diwan et al. (2000), Thoquet et al. (2002), or the corresponding Genbank accession name. The polymerase chain reaction amplifications of SSR markers were based on the method of Diwan et al. (1997) and were detected using either a Licor 4200 DNA Analyzer or an ABI 3100 DNA Analyzer. Data were scored using the AFLP-Quantar (Keygene), Gene Scan, or Genotyper software (ABI). Each allele was scored for its presence or absence in each progeny genotype. Each allele was designated with a letter denoting the parent carrying it, with WISFAL-6 designated as "a," ABI408 as "b," and alleles present in both parents as "c." Multiple alleles produced by a single probe or primer were differentiated based on band size using numbers following the letter designations. A subset of loci that were scored on the ABI 3100 and exhibited distorted segregation was reanalyzed using the Licor 4200; the results were consistent between the two platforms.
The TetraploidMap program suite (Hackett and Luo, 2003) was used to infer the parental dosage (number of copies) of each allele and the most likely parental genotype, to identify possible double reductants, to cluster markers into linkage groups (LGs), and to calculate recombination frequencies and accompanying LOD scores (Supplementary Table 2). A consensus map representing both parental genomes, like that of Julier et al. (2003), was constructed by combining all marker data from both parents in the TetraploidMap analysis. Marker DNA fragments were considered to be allelic if the most likely genotype at a given locus contained the alleles being evaluated; otherwise, the fragments were considered to belong to duplicated loci. Using LOD values
3.0 and recombination frequencies
0.30, JoinMap 3.0 (Van Ooijen and Voorrips, 2001) was used to order the codominant marker data within LGs, and MapChart (Voorrips, 2002) was used to draw the resulting LGs. Linkage group numbering corresponds to that of M. truncatula (Thoquet et al., 2002; Choi et al., 2004).
The result of the overall analysis was eight consensus LGs representing the basic set of eight M. sativa chromosomes. Thus, each of these consensus LGs was based on marker information from the four homologous chromosomes from each parent. To visualize the linkage arrangements of markers on the individual homologs, each consensus LG was individually decomposed into the eight constituent cosegregation groups, or a total of 32 cosegregation groups for each parent (Supplementary Fig. 1). The cosegregation groups were determined by manually analyzing the recombination frequency output from the TetraploidMap program and identifying alleles linked in coupling phase. The cosegregation groups of each consensus LG were determined by systematically analyzing the linkage relationships of each marker with the other markers in the group and placing alleles linked in coupling phase in the same cosegregation group. The vast majority of alleles were placed into cosegregation groups based on coupling-phase linkages with an LOD
3.0. In a few instances, placement occurred with an LOD < 3.0 or was inferred from repulsion-phase linkages. This most often occurred with alleles that were present in multiple copies in one or both parents. These alleles typically had strong linkage support for one copy to be included in a cosegregation group but with less support for inclusion of the second (or third) copy of the allele into other cosegregation groups. The order of the markers in each cosegregation group was based on their placement on the consensus LG. Because the consensus LG included all the recombination information in the population, it offers a more realistic locus order than would be produced by remapping the individual cosegregation groups, which are more sparsely populated with markers.
Phenotyping
Experimental Design
Field experiments were planted at the Agronomy and Agricultural Engineering Research Farm west of Ames, IA, on 19 May 1998; at the Northeast Research Farm south of Nashua, IA, on 22 May 1998; and at the Snyder 5 east field adjacent to the Game Farm Road Weather Station in Ithaca, NY, on 79 June 1999. The plot design at Ames and Nashua was a quadruple
-lattice consisting of 840 total plots (each replication consisted of 15 incomplete blocks each with 14 plots) and at Ithaca was a randomized complete block design consisting of 4 blocks and 824 total plots. The difference in total plot numbers between IA and NY was due to the loss of six genotypes before transplanting and the inclusion of two additional check cultivars in NY. Plots in IA consisted of five clones of each genotype; in NY, they consisted of seven clones, but only the inner five clones were harvested. In IA, spacings were 30 cm between plants within a plot, 60 cm between plots in the same row, and 75 cm between rows. In NY, spacings were 25 cm between plants within a plot, 60 cm between plots within the same row, and 90 cm between rows. In addition, in NY the experiment was overseeded with red fescue on 9 Sept. 1999 to limit weed competition.
Phenotypic Data Collection
After the initial establishment year, biomass production was measured for 3 yr (19992001) in IA and for 2 yr (20002001) in NY. Data were collected on three harvests per year (June, July, and September) at each location with the exception of NY in 2000, when excessive rainfall resulted in no data collection in June.
Harvesting consisted of the removal of all aboveground biomass to
7.5 cm above the soil surface. Harvesting was conducted by hand, using rice sickles, during 1999 in IA and for both years in NY. A flail-type, self-propelled forage harvester (Carter Manufacturing Co., Inc., Brookston, IN) was used in IA during the 2000 and 2001 harvest years. The mass of wet forage from each plot was determined in the field. Random subsamples were taken from the harvested material in each replication, weighed, dried for 4 d at 60°C, and reweighed to compute a dry matter percentage. An average dry matter percentage across the subsamples was used to adjust plot wet mass to a dry matter basis. The number of plants present in each plot was counted after each harvest and plot dry matter production recorded as g plant1.
Phenotypic Data Analysis
Biomass from each of the three individual harvests per year was summed to produce the yearly biomass production for each plot. To keep this article to manageable length, we only consider total yearly biomass production. The complete model was analyzed with genotype, location, and their interaction being fixed effects, and replications and incomplete blocks being random effects. Data from IA were analyzed to determine the effect of both locations and genotype x environment interactions. Least-square means on a per entry basis for yearly totals were calculated for the combined data from the two IA locations and separately for NY using the MIXED procedure (SAS statistical software, Cary, NC; Littell et al., 1996). Data from IA and NY were not combined because the experiment in NY began 1 yr after the IA experiment, and the first harvest from NY (in 2000) was lost due to excessive rainfall. Due to the perennial nature of alfalfa and to heterogeneous error estimates between years, growth in each year was treated as a separate and distinct trait, which allowed more thorough investigation of changes in the genetic basis of biomass production from year to year as the plants aged.
Variance components and heritabilities with their standard errors were computed from an all random effects model (Holland et al., 2003) with the parents and check genotypes removed. The consideration of genotype as a random effect, as opposed to the previous fixed effect assumption, was necessary to calculate the variance associated with this effect. The genetic variation includes all higher-order intralocus and epistatic interactions present in a tetrasomic tetraploid (Rumbaugh et al., 1988). Because the phenotypic data came from a single full sib population with no ability to partition the additive genetic variance from the dominance genetic variance, broad-sense heritability estimates were generated (Holland et al., 2003). Phenotypic and genetic correlations were calculated using the MIXED and IML procedures (Holland et al., 2006), which also produced the corresponding standard errors. Correlations between years were calculated for IA and NY separately. All discussions of statistical significance were based on the 5% probability level, unless otherwise noted.
MarkerPhenotype Associations (QTL Analysis)
Markerphenotype associations were calculated using single-marker analysis of variance with the GLM procedure of SAS. The least-square means of individuals containing an allele were contrasted to those of individuals not containing the allele. Because this is the first study examining potential QTLs for biomass production in alfalfa, we were more concerned about identifying putative genomic regions associated with the trait than about false positive associations. For this reason, we set the cutoff value for declaring an association between a marker allele and a phenotype at
= 0.01. Although this level will lead to some spurious associations, those markers identified in more than one environment would be unlikely to arise by chance. To identify associations based on a familywise error rate (FWER), we used nonparametric permutation tests (Churchill and Doerge, 1994) based on the method of Westfall and Young (1993) to determine the FWER for the data from each year.
Multiple regression was used to develop a model that best explained the underlying phenotypic variation using the REG procedure of SAS with the stepwise selection option. All alleles identified as having an association with the trait of interest were initially included in the model, and only those alleles that remained significant at
= 0.05 were retained.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
|
|
We identified 25 putatively duplicated loci (18 SSR; 7 RFLP). With six exceptions, these loci were duplicated on the same LG. Some of the duplications may have resulted from inaccurate designations among alleles.
Biomass Production
Location and genotype-by-location interaction (GxL) variances were computed from the IA data (Table 1). The location effect was only present in 2000, and although GxL variation was present in 1999 and 2000, it was typically an order of magnitude smaller than the variance associated with genotype. Subsequent analyses of biomass production and QTL identification were based on data combined across the two IA locations and separately for NY. Therefore, five locationyear combinations (termed "environments" in the subsequent discussion) were identified (IA99, IA00, IA01, NY00, and NY01).
|
Biomass production ranged widely within the population. Transgressive segregation in both directions was present at all locations and years (Table 2) and was consistently associated with the same genotypes across years and within locations (data not shown). The parents did not differ for biomass production in any location or year (Table 2). The mean performance of the F1 population was not higher than either the high-parent (ABI408 had a numerically higher yield in most environments) or the mid-parent value (Table 2). The ability to discriminate among genotypes declined across years as the error associated with biomass determination escalated due to plant mortality and, in IA, mechanical harvesting. The increase in error variance is not uncommon for aging forage trials.
|
0.60 for both years; the correlation between IA00 and NY00 was
0.6 and between IA01 and NY01
0.4 (all with p values < 0.0001). The yield data were normally distributed with no obvious outlier points that could have unduly affected the correlations (data not shown).
|
Forty-one alleles were associated with biomass production in at least one environment (Table 4). Seven of these alleles were identified in more than one year or location, providing further support that they are actually linked to QTLs and not associated with biomass yield due to random chance. In addition, a stringent permutation test based on a FWER set to control false positive results at the 0.05 probability level also identified several marker alleles associated with biomass production (Table 4). Alleles that were associated with biomass production on a given LG were often linked (e.g., LG 7). Different alleles at the same locus were associated with biomass in different environments, as in the case of bn2_21e3v14, in which one allele had a positive effect in IA99 but another allele a negative effect in IA01. Further, in some cases, the same locus had alleles associated with biomass yield in each parent (e.g., MsaciB); this would appear to be unlikely to have arisen simply due to chance. The gene MsaciB is known to be related to winter hardiness (Monroy et al., 1993), and its association with biomass production is interesting and may suggest that biomass yield is affected by winter injury. Nevertheless, some of the alleles we identified may be falsely associated with biomass production.
|
One method to identify QTL by environment interactions is to examine trends of markerphenotype associations across environments (Lynch and Walsh, 1998). Of the seven alleles associated with biomass yield in more than one year, only two, aw7750621c1 and aw686836c1, were identified in more than two environments; none was present in all environments (Table 4). The remaining 34 alleles exhibited an association with biomass production in only one environment. Although some of these may be false positive associations, interaction of alleles with years is also a probable explanation, particularly because several of these alleles were also identified by the conservative permutation test (Table 4) and by multiple regression analyses (Table 5). The changing environmental conditions and developmental trajectory of the plants from year to year would likely impact the genetic control of biomass production. The subset of alleles identified in more than one environment indicates that at least some genomic regions are important for biomass production across various environmental conditions, in agreement with similar studies in tree species (Wu, 1998; Lerceteau et al., 2001). Possibly due to the change in harvest management in IA and/or to aging of plants at both locations, our ability to detect allelic associations with biomass QTLs declined over years, and this may be a reason that some alleles identified in one year were not detected otherwise.
|
Application of Results to Crop Breeding
The results presented here show that the M. sativa subsp. falcata genotype WISFAL-6 contains QTL alleles that increase biomass production; we can surmise that other M. sativa subsp. falcata germplasm, including that which does not produce biomass yield heterosis in interspecific progeny (Riday and Brummer, 2005), may also harbor beneficial alleles. In this experiment, LG 7 contained several alleles associated with biomass production in most of the environments. A previous genetic map constructed from an F2 diploid alfalfa population (Brummer et al., 1993) expressed very high levels of segregation distortion on LG 7 (which corresponds to group 4 on that map), with progeny ratios skewed toward excess heterozygotes, reaching 90% heterozygous progeny for some loci. The fact that LG 7 has a striking effect on fitness (biomass productivity) in a tetraploid population further supports the hypothesis that this region may be involved with heterosis for biomass productivity. Our result suggests a connection with heterosis in that positive alleles from both germplasm sources are present in the highest-yielding hybrid progeny. While the identification of heterosis in a single full sib population is confounded with yield per se, this result bears further investigation through studies in other populations.
A move toward marker-assisted selection in M. sativa faces several obstacles. Little linkage disequilibrium may be present in modern alfalfa breeding populations that have had the potential for substantial amounts of recombination during repeated rounds of recurrent selection (Williams, 1998; Kidwell et al., 1999). However, the identification of several significant markerphenotype associations in this experiment is heartening, and the important associations on LG 5 and 7 are obvious targets for fine mapping. Our analysis of the genetic determinants of biomass production in tetraploid alfalfa offers the first steps toward the use of molecular markers in an alfalfa recurrent selection program. However, to be useful in a selection program, associations would require improved localization and the identification of a tight linkage between marker and trait or, ideally, the identification of a functional marker that defines the beneficial allele (Andersen and Lübberstedt, 2003). Another concern is that the amount of linkage disequilibrium in an F1 population makes isolating the QTLs to a small interval difficult. Further exploration of biomass QTLs in other populations and environments is warranted to substantiate these results.
Perspectives
This study is the first of its kind and is necessarily exploratory. Although it presents evidence that QTLs associated with biomass production can be identified, even in complex tetraploid populations, the experiment has some limitations. First, we used clonally propagated, spaced plants. This obviously differs from a commercial field, both in mode of establishment and in intraplant competition. Thus, further research done with progeny families grown in swards or semiswards would be very useful. Second, we are only evaluating two parental genotypes, a restricted germplasm base not commonly used in alfalfa breeding. Moving to population-based methods of mapping, such as association analysis (e.g., Skøt et al., 2005), would help alleviate some of that concern. Additionally, mapping in populations with more recombination, and using a higher density of markers, particularly ones for which allele dosages can be obtained, would make the localization of QTLs more tractable. Finally, we are conducting a more detailed look at harvest-by-harvest biomass production, and further work on that area, coupled with analyses to separate the effects of plant age from environment, should be undertaken.
| ACKNOWLEDGMENTS |
|---|
Received for publication November 3, 2005.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
J. J. Ariss and G. J. Vandemark Assessment of Genetic Diversity among Nondormant and Semidormant Alfalfa Populations Using Sequence-Related Amplified Polymorphisms Crop Sci., November 7, 2007; 47(6): 2274 - 2284. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. G. Robins, G. R. Bauchan, and E. C. Brummer Genetic Mapping Forage Yield, Plant Height, and Regrowth at Multiple Harvests in Tetraploid Alfalfa (Medicago sativa L.) Crop Sci., January 22, 2007; 47(1): 11 - 18. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| The SCI Journals | Agronomy Journal | Vadose Zone Journal | |||
| Journal of Natural Resources and Life Sciences Education |
Soil Science Society of America Journal | ||||
| Journal of Plant Registrations | Journal of Environmental Quality |
The Plant Genome | |||