|
|
||||||||
a Dep. of Plant and Animal Sci., Brigham Young University, Dep. of Plant & Animal Sciences, Provo, UT 84602
b Dep. of Biology, Stephen F. Austin State Univ., Nacogdoches, TX 75962
* Corresponding author (Jeff_Maughan{at}byu.edu).
| ABSTRACT |
|---|
|
|
|---|
Abbreviations: AFLP, amplified fragment length polymorphism BAC, bacterial artificial chromosome BES, BAC-end sequence H, heterozygosity MAX, longest tandem repeat excluding half-repeats ONA, observed number of alleles PCR, polymerase chain reaction SSR, simple sequence repeat
| INTRODUCTION |
|---|
|
|
|---|
Amaranth grain is 50 to 60% starch, with higher fiber (8%) and more fat (7–8%) than the grain of most cereals (Pedersen et al., 1987; Breene, 1991). The grain amaranths are noted for their relatively high protein content and balance of essential dietary amino acids. Crude protein content from pale-seeded grain types is substantially higher than most cereal grains and has been reported to range from 12.5 to 22.5% on a dry matter basis, with an average of about 15% (Bressani, 1989; Breene, 1991). Lysine is often the limiting amino acid in other cereal grains; however, amaranth seed protein is rich in this essential amino acid, ranging from 0.73 to 0.84% of the total seed protein content (Bressani et al., 1987). Amaranth oil is also of high nutritional quality, containing a relatively high content of squalene (7–8%; Bressani et al., 1987), and is thought to be effective in reducing cholesterol levels in humans (Berger et al., 2003; Martirosyan et al., 2007). The grain amaranths have also been noted for their ability to thrive under extreme abiotic stress (Brenner et al., 2000). They are impressive producers of biomass under warm and dry conditions, an attribute likely related to their C4 photosynthesis (Kadereit et al., 2003). Thus, several researchers have suggested that amaranth may be a useful alternative crop in developing nations, especially in overpopulated and undernourished areas (Pal and Khoshoo, 1974; Sauer, 1993).
The evolutionary origin of the grain amaranths is still under debate, although two hypotheses have been proposed by Sauer (1967, 1976). The first hypothesis is based on geography and suggests that all three grain amaranths evolved independently, while the second hypothesis is based on morphological features and proposes that all three grains are descended mainly from A. hybridus. Molecular studies, including analyses with isozymes (Chan and Sun, 1997), random amplified polymorphic DNAs (Transue et al., 1994; Chan and Sun, 1997), and amplified fragment length polymorphisms (AFLPs) (Xu and Sun, 2001), have attempted to clarify the relationships among the grain amaranths and their relatives. While these studies support Sauer's second hypothesis of a monophyletic evolution of each of the three grain amaranth species from A. hybridus, they have highlighted the need for new methods with greater resolving power to clarify taxonomic relationships within the A. hybridus complex.
Microsatellites are short repeated nucleotide motifs usually one to four base pairs in length that are flanked by conserved sequences and occur ubiquitously throughout eukaryotic genomes (Tautz and Renz, 1984). They are widely considered the genetic marker system of choice because they are highly reproducible, informative, locus-specific, multiallelic, and codominant (Morgante and Olivieri, 1993; He et al., 2002). Microsatellites have been extremely useful in determining taxonomic relationships among closely related individuals and assessing diversity within a species (Ni et al., 2002; Fukunaga et al., 2005; Ellwood et al., 2006). Sun et al. (1999) noted that among probes designed from various types of repetitive sequences, a probe consisting of microsatellite and minisatellite sequence showed the highest polymorphism across the grain amaranths and their close relatives, suggesting that microsatellites may be extremely valuable for characterizing inter- and intraspecific relationships within the A. hybridus complex. While the initial cost of developing microsatellites markers is high, once developed these polymerase chain reaction (PCR)–based markers are inexpensive to use and require less technical expertise relative to other types of molecular markers. Thus, the goals of this project were (i) to develop a collection of reproducible microsatellite markers for the grain amaranths, (ii) to assess the informativeness of these microsatellite markers by screening them against a panel of grain amaranth accessions, and (iii) to use the markers to characterize the relationships of the grain amaranths and their putative ancestors. Moreover, we show the Mendelian inheritance of 37 of these microsatellites in a segregating F2 population.
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
Microsatellite Enriched Library Construction
Four libraries, enriched for microsatellites consisting of AC, AG, AAC, or AAT motifs, were produced by Genomic Identification Services (Chatsworth, CA) using genomic DNA from A. hypochondriacus cv. Plainsman. Genomic DNA was partially digested with a mixture of seven blunt-end restriction endonucleases (RsaI, HaeIII, BsrB1, PvuII, StuI, ScaI, and EcoRV). Size-separated DNA fragments ranging from 300 to 750 bp were ligated with adapters and enriched for each specific microsatellite motifs using biotinylated capture molecules (CPG, Lincoln Park, NJ). The captured fragments were then amplified and digested with HindIII to remove the adaptors and clone the fragments into pUC19. The resulting plasmids were subsequently transformed into competent Escherichia coli DH5
cells by electroporation.
Microsatellite Identification and Classification
Enriched libraries were plated on S-gal (Sigma, St. Louis, MO) agar media supplemented with 100 µg mL–1 of ampicillin for blue/white selection of recombinant clones. A total of 1457 recombinant clones were sent to the Arizona Genomics Institute (Tucson, AZ) for plasmid DNA isolation and bidirectional sequencing using M13 primers (forward: 5'-GTA AAA CGA CGG CCA GT; reverse: 5'-CAG GAA ACA GCT ATG AC) and standard ABI Prism Taq dye terminator cycle sequencing methodology. The computer program Contig Express (InforMax, Frederick, MD) was used to determine consensus sequences, eliminate redundant clones, and identify microsatellites. Microsatellites were classified as either simple or compound and perfect or imperfect using the classification system given by Weber (1990) with modifications described by Mason et al. (2005).
Primer Design
Primers flanking each unique microsatellite were designed using the Web-based computer program Primer3 version 2.0 (Rozen and Skaletsky, 2000) according to the program's default parameters, with the following exceptions: preferred product size range equal to 150 to 200 bp; melting temperature differences in forward and reverse primers of no more than 1°C; and max poly-X (maximum allowable length of a mononucleotide repeat) of 3. Oligonucleotide primers were synthesized by Integrated DNA Technologies (Iowa City, IA). Primer pairs were assigned names based on their repeat motif (e.g., AHAAT035, where AH = A. hypochondriacus, AAT = motif type, 035 = clone ID).
Bacterial Artificial Chromosome- Derived Microsatellites
Bacterial artificial chromosome (BAC)–end sequence (BES) microsatellites were identified using the Web-based computer program Tandem Repeats Finder (Benson, 1999) and 728 amaranth BAC-end sequences. The sequences were obtained from clones of an A. hypochondriacus (cv. Plainsman) BAC library developed by Maughan et al. (unpublished data). Only sequences with total repeat lengths greater than 20 bp (n = 10 for dinucleotides, n = 7 for trinucleotides, etc.) were selected for primer design using the program Primer3 version 2.0 (Rozen and Skaletsky, 2000) as previously described.
Microsatellite Amplification
Amplification of microsatellite loci was performed in 10 µL PCR reactions using HotStarTaq Master Mix (Qiagen, Germantown, MD) and 30 ng genomic DNA according to the manufacturer's recommendation. The thermocycling profile was 95°C for 15 min followed by 31 cycles of 94°C for 60 s, 56°C for 30 s, 72°C for 60 s, and a final extension at 72°C for 10 min. Polymerase chain reaction products were separated on 3% Metaphor agarose gels (Cambrex Bio Science, East Rutherford, NJ), run in 0.5X TBE at 120 V for 5 h and visualized using ethidium bromide staining with ultraviolet transillumination. Microsatellite alleles using this protocol were effectively resolved with a resolution of at least four base pairs, as evidenced by the molecular ladders run on each gel.
Data Analysis
The number of alleles and the informativeness for each microsatellite locus was determined by calculating heterozygosity (H). For a multiallele system, heterozygosity values can be estimated using the following equation:
![]() |
Phylogenetic analysis of marker data was performed using the distance (neighbor-joining) method and the full heuristic search option (TBR branch swapping, random addition with 10 replications) of the computer program PAUP*4.0 (beta version 4.0b10; Swofford, 2002). Robustness of the topology of the cladogram was evaluated by bootstrap analysis (10,000 replicates) of the data set. Only groups with frequencies > 50% were retained.
Marker segregation was analyzed for conformation to Mendelian ratios expected in an F2 population using a chi-squared test, with two and one degrees of freedom for codominant and dominant markers, respectively. Linkage groups were constructed with a minimum LOD score of 3.0 using the default mapping parameters (LOD > 1.0, recombination threshold = 0.4, ripple value = 1, jump threshold = 5, Kosambi mapping function) of the computer program JoinMap, version 3.0 (Van Ooijen and Voorrips, 2001).
Statistical Models
Statistical analysis of factors contributing to the polymorphism of individual microsatellite markers, measured as the observed number of alleles (ONA) per locus, was performed using stepwise forward selection using the computer program NCSS97 (Hintze, 1997). Microsatellites were classified as described by Mason et al. (2005) according to (i) complexity (simple or compound), (ii) type (perfect or imperfect), (iii) motif (AAC, AAT, AC, other), (iv) total complete repeats, (v) longest uninterrupted stretch of tandem repeat (excluding partial repeats) (MAX), (vi) total length of repeat including nonrepeat bases, (vii) nonrepeat and half-repeat bases, (viii) size of motif (e.g., dinucleotide, trinucleotide), (ix) number of terminal repeats, (x) number of microsatellites amplified per primer pair, and (xi) expected PCR product size. The numerical classifiers were measured in base pairs. Motif, complexity, and type were determined on the basis of the repeat with the largest MAX. Motifs observed less than five times were grouped together in the category "other."
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
Marker Characterization in the Grain Amaranths
Of the 382 microsatellite loci identified, we successfully designed flanking primer pairs for 319, including 157, 66, 76, and 20, corresponding to the AAT, AAC, AC, and BES libraries, respectively. Flanking primers could not be designed for the remaining 63 microsatellite sequences because of Tm constraints and/or the lack of flanking sequence. All primer pairs were initially screened on an exploratory panel of eight diverse amaranth lines (Table 1). Primer pairs that produced monomorphic banding patterns or failed to amplify on this panel were eliminated from further consideration. A total of 179 microsatellites produced strong amplification products that showed simple, polymorphic banding patterns among the grain amaranth accessions in the exploratory panel. All 179 microsatellite sequences have been submitted to GenBank under accession numbers EU094479 to EU094654. Among these were 97, 30, 39, and 13 markers from the AAT, AAC, AC, and BES libraries, respectively. Interestingly, 19 of the polymorphic microsatellites identified in this study amplified two distinct polymorphic bands that appear to represent two independent loci. The amplification of duplicate loci from a single microsatellite marker has been reported for several polyploid plant species (Röder et al., 1998; Han et al., 2004), where it was suggested that each locus represented orthologous loci derived from the independent ancestral genomes of the polyploid. Such loci in the amaranths, once confirmed via segregation analysis, should prove to be valuable tools in elucidating the paleo-polyploidy event that led to the evolution of allotetraploid amaranths (Pal et al., 1982; Greizerstein and Poggio, 1994, 1995).
To characterize the informativeness of these markers, we screened all 179 markers on a larger and more diverse panel of 35 grain and wild Amaranthus accessions (Table 1; Fig. 1
). Marker informativeness was quantified by calculating the ONA amplified per marker and by calculating the H value associated with each marker (Supplementary Table 1). ONA and H values were calculated for each grain species separately, for the grain species combined, and for the A. hybridus accessions alone (Table 2
). Limiting the data set to the three grain species (n = 27), a total of 731 alleles were observed with an average of 4 alleles per locus and a range of 2 to 8 alleles observed per locus. Using the H value calculated for the grain species and the thresholds given by Ott (1992), where a marker is considered polymorphic if H
0.1 and highly polymorphic if H
0.7, all 179 markers were considered polymorphic, with 59 (33%) of the microsatellite loci being highly polymorphic (H
0.7). Heterozygosity values ranged from 0.14 to 0.83, with an average H value of 0.62 per locus, and are similar to those obtained from microsatellite development studies in cultivated relatives of amaranth, including sugar beet (0.61; Rae et al., 2000) and quinoa (0.57; Mason et al., 2005).
|
|
Statistically Important Factors Affecting Microsatellite Polymorphism
To evaluate the factors that influence the informativeness of a potential marker in amaranth, we used a stepwise forward selection model and found that the factors motif (AAC, AAT, AC, etc.) and MAX (base pair length of the longest uninterrupted tandem repeat) were the most significant predictors (P < 0.01) of marker polymorphism, measured as ONA. The model explained 32% of the variation of ONA, and a t test analysis showed that AAT repeats have a significantly higher (P < 0.0001) ONA than other types of repeats, especially when the tandem repeat length is greater than 20 bp. These observations correlate well with those observed by others, including Moriguchi et al. (2003), who observed that microsatellites with high tandem repeat numbers have higher polymorphism (ONA), and Mason et al. (2005), who reported that a definite change in the percentage of polymorphic versus monomorphic markers occurs when the tandem repeat length is greater than 20 bp. The high rate of polymorphism for the AAT motif compared with the other repeat types has also been observed in other plant species. In wheat (Triticum aestivum L.), for example, Song et al. (2002, 2005) reported that among trinucleotide repeats, the AAT motif is the most polymorphic. These observations and the relative abundance of the AAT motif observed in the amaranth genome (see above) suggest that future development of microsatellite markers for amaranth species with high polymorphic content should focus on AAT repeats with a tandem repeat greater than 20 bp.
Genetic Diversity within Weedy Amaranthus Species
Amaranthus hybridus, a putative wild progenitor species of the grain amaranths, showed the most genetic diversity of all the species included in the complete screening panel. One-hundred and sixty microsatellite markers were polymorphic, and 472 total alleles were observed. That >99% of the microsatellite markers (developed from A. hypochondriacus) amplified in A. hybridus is notable because it confirms the close ancestry between the grain amaranths and A. hybridus at the DNA level. The higher genetic diversity observed among the A. hybridus accessions is consistent with an expectation that a wild progenitor species should be more diverse than a derived domesticated species due to genetic drift and selection (Hilu, 1995).
In addition to being the putative progenitor of the grain amaranths, A. hybridus (smooth pigweed) along with several other members of the Amaranthus genus, including A. retroflexus (redroot pigweed) and A. powellii (Powell amaranth), are particularly notorious weeds (Wassom and Tranel, 2005). Various studies have already demonstrated the utility of molecular markers for correcting taxonomic misclassifications among the weedy species within the Amaranthus genus (Wetzel et al., 1999; Wassom and Tranel, 2005); however, taxonomic problems still exist, especially for closely related species, and highly polymorphic markers are needed to resolve these taxonomic questions. Such markers would also be beneficial in intraspecies population studies and for establishing the first genetic maps in these species. To determine the transferability and utility of these microsatellite markers to related weedy species, we evaluated the level of amplification for the 179 polymorphic microsatellite markers in three additional weedy species (A. hybridus, A. powellii, and A. retroflexus). As previously noted, 177 (>99%) of the markers amplified in the A. hybridus accessions, while 158 (88%) and 141 (78%) of the microsatellite markers amplified in the A. powellii and A. retroflexus accessions, respectively. Between the two A. powellii accessions included in the large screening panel, 97 (52%) markers were polymorphic (Supplemental Table 1). The high transferability observed in this study demonstrates the utility of these markers as new molecular tools for use across the Amaranthus genus.
Evolutionary Origins of the Grain Amaranth Species
Neighbor-joining analysis reveals that A. caudatus, A. cruentus, and A. hypochondriacus are monophyletic, while A. hybridus is polyphyletic (Fig. 2
). Sauer (1967, 1976) proposed two hypotheses for the evolutionary origins of the grain amaranths. The first hypothesis is based on geography and suggests that all three grain amaranths evolved independently: A. caudatus from A. quitensis in the Andean region of South America; A. cruentus from A. hybridus in Central America; and A. hypochondriacus from A. powellii in Mexico. The second hypothesis is based on morphological features and proposes that A. hybridus gave rise to A. cruentus, that introgression of A. cruentus and A. powellii gave rise to A. hypochondriacus, and that introgression of A. cruentus and A. quitensis produced A. caudatus.
|
We also note that in the neighbor-joining tree, accessions of A. caudatus and A. cruentus split into monophyletic subclades corresponding to New and Old World accessions, whereas New World A. hypochondriacus are paraphyletic with respect to Old World accessions. The paraphyletic result is not unexpected considering the biogeography of the grain amaranths, which are all native to the New World and were spread to Asia, Europe, and Africa during post-Colonial American times (Sauer, 1967). With increased sampling, we expect that New World accessions will be paraphyletic and that Old World accessions will constitute a subset of the genetic diversity found in New World populations.
A larger investigation with a wider sampling of Amaranthus species, including numerous accessions of A. hybridus, needs to be conducted to further evaluate our proposed alternative hypothesis of the origins of the grain amaranths, as well as the relationship between New and Old World accessions of the grain species. The transferability and highly polymorphic nature of microsatellite markers across the genus make them ideal for such an investigation.
Mendelian Inheritance of Microsatellite Markers
To evaluate the utility of these markers for future linkage map construction, we investigated the inheritance of the microsatellite loci in a segregating F2 cross (PI 482049 x PI 477914). In total, 40 (22%) loci were polymorphic between these parents and screened on 92 individuals of the F2 population. Thirty-four (85%) of the markers were scored codominantly (1:2:1), while the remaining six (15%) were scored in a dominant fashion (3:1). Three (AHAAT143, AHAAT144, and AHAC008) of the loci deviated significantly (P < 0.01) from their expected Mendelian pattern of inheritance based on chi-squared analysis. All three distorted markers had a significantly higher than expected frequency of the paternal allele (P < 0.0001) and may reflect marker loci linked to genes affecting gametic or zygotic viability (Xu et al., 1997). Linkage analysis, performed using the program JoinMap (Van Ooijen and Voorrips, 2001), identified nine linkage groups consisting of 21 linked microsatellite loci spanning 108 cM (data not shown). While most linkage groups consisted of only two linked markers, two groups with three and four linked markers were observed. The largest linkage group spanned 29 cM. While this is only an exploratory linkage analysis, it highlights two important observations regarding the grain amaranths: (i) the disomic marker inheritance patterns observed herein suggest that if these species are of polyploid origin, they are most likely allotetraploids and are thus amenable to linkage map construction and marker assisted breeding; and (ii) the level of intraspecies polymorphism is limiting (e.g., in this cross, only 22% of the markers were polymorphic), suggesting that either additional markers will be needed to develop saturated intraspecies maps or interspecies populations will be needed to augment the level of polymorphism within a single cross. One such interspecific mapping population currently being constructed at the University of Illinois (Urbana) by Pat Tranel (personal communication, 2007) is between Plainsman (A. hypochondriacus) and 21605-16 (A. hybridus). Preliminary analysis of our data set shows that nearly a threefold increase in the total number of markers (119 microsatellite markers) should be segregating in this interspecies population compared with the intraspecific cross described above.
| CONCLUSIONS |
|---|
|
|
|---|
| ACKNOWLEDGMENTS |
|---|
| NOTES |
|---|
|
|
|---|
Received for publication August 16, 2007.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
P. J. Maughan, S. M. Yourstone, E. N. Jellen, and J. A. Udall SNP Discovery via Genomic Reduction, Barcoding, and 454-Pyrosequencing in Amaranth The Plant Genome, November 1, 2009; 2(3): 260 - 270. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| The SCI Journals | Agronomy Journal | Vadose Zone Journal | |||
| Journal of Natural Resources and Life Sciences Education |
Soil Science Society of America Journal | ||||
| Journal of Plant Registrations | Journal of Environmental Quality |
The Plant Genome | |||