|
|
||||||||
a Dep. of Plant Breeding, 252 Emerson Hall, Cornell Univ., Ithaca, NY 14853-1901
b GeneFlow Inc., 503 Mt. Vernon Ave., Alexandria, VA 22301
c Dep. of Plant Breeding, 240 Emerson Hall, Cornell Univ., Ithaca, NY 14853-1901
* Corresponding author (SRM4{at}cornell.edu)
| ABSTRACT |
|---|
|
|
|---|
Abbreviations: bp, base pair cM, centimorgan RCF, relative centrifugal force, P.C.A., principle component analysis P.I.C., polymorphism information content PCR, polymerase chain reaction RFLP, restriction fragment length polymorphism RM, rice microsatellite SSR, simple sequence repeat WTR, well-to-read
| INTRODUCTION |
|---|
|
|
|---|
Use of fluorescence-based semiautomated analysis of restriction fragments was first reported by Carrano et al. (1989). This method was then adapted and improved upon for microsatellite analysis (Edwards et al., 1991; Ziegle et al., 1992). Semiautomated methods of SSR genotyping are gradually replacing manual systems in plant breeding and genetics research. These methods facilitate the efficient application of microsatellite markers for high-throughput mapping (Rhodes et al., 1998; Ponce et al., 1999), pedigree analysis (Lexer et al., 1999), fingerprinting of accessions (Carrano et al., 1989), and assaying genetic diversity (Diwan and Cregan, 1997; Macaulay et al., 2001). The technology can also improve the efficiency of managing a germplasm collection, help deliver purity-proven seed stocks to growers, and provide the basis of intellectual property protection (Mitchell et al., 1997).
In rice, Blair et al. (2002) recently reported the use of 27 fluorescent labeled SSR markers organized into four panels for diversity analysis of Oryza species. However, to date no comprehensive, semiautomated SSR genotyping system providing whole genome coverage has been reported for Oryza, despite the fact that approximately 500 SSR markers are now publicly available for rice (Temnykh et al., 2001; http://www.gramene.org; verified June 12, 2002).
The purpose of this project was to develop and apply multiplex panels of fluorescently labeled microsatellite markers for semiautomated genotyping of O. sativa at the whole genome level. Combinations of primer pairs were assembled to accommodate the analysis of genetically diverse cultivars, providing a robust and flexible approach for detection of intra- and interspecific variability. The design and application of the multiplex panels was greatly facilitated by the use of the GeneFlow computer program (http://www.geneflowinc.com/; verified June 12, 2002). The software allowed us the assembly of panels in a semiautomated manner and facilitated the analysis of the data on SSR polymorphism among accessions representing a wide spectrum of cultivated rice germplasm.
| MATERIALS AND METHODS |
|---|
|
|
|---|
After mixing, tubes were centrifuged at 13 000 rpm (RCF = 14926 g) for 10 min and the supernatant was removed with a pipette and placed into a new Eppendorf tube, where it was mixed with 2/3 times the volume of cold isopropanol. The tubes were inverted repeatedly to precipitate the DNA, followed by another centrifugation at 13 000 rpm (RCF = 14926 g) for 12 min to pellet the DNA. The supernatant was discarded, and the pellet was washed with 800 µL of cold 70% (v/v) ethanol, precipitated by centrifugation and dried. The pellet was resuspended in 50 µL of TE buffer in a 65°C water bath.
PCR Conditions
PCR was performed with a PTC-225 thermocycler (MJ Research Inc., Watertown, MA) as described by Chen et al. (1997) except that 13 µL of reaction mixture was used and 12 ng of template DNA, 0.3 units of polymerase and 1.5 to 3.0 pmol of each primer were added per reaction. Primers, labeled with hexachloro-6-carboxyfluorescein (HEX), tetrachloro-6-carboxyfluorescein (TET), or 6-carboxyfluorescein (FAM) dye phosphoramidites, were synthesized on an Applied Biosystem 392 (Applied Biosystem, Foster City, CA) by the Cornell BioResource Center. Unlabeled reverse primers were synthesized at Research Genetics (advertised as "RicePairs" at http://www.resgen.com; verified June 12, 2002). Markers were amplified individually or in groups of two because of the different PCR profiles required by markers in a panel. A table providing information about markers, panels, primer pairs, and annealing temperatures for PCR is available as a downloadable file from (http://ricelab.plbr.cornell.edu/publications/2002/coburn/; verified June 12, 2002).
Pooling of PCR products, T4 DNA Polymerase Treatment and Electrophoresis
PCR products were pooled by combining 2.0 µL of each amplified product and adding water if necessary to bring the products to a uniform dilution of 1:12. Equal amounts of each PCR product could be pooled because the PCR profiles of individual markers were adjusted so that product concentrations of each were similar.
In a separate experiment, markers that exhibited an average standard deviation higher than 0.5 bp were subjected to a post-PCR T4 DNA polymerase treatment, as recommended by Ginot et al. (1996). T4 DNA polymerase (0.5 U) was added to 24 µL of pooled PCR product. The treated PCR product pool was then incubated at 37°C for 30 min.
Before loading of gels, samples were prepared by combining 1 µL of the pooled PCR product with 1.75 µL deionized formamide, 0.5 µL loading dye, and 0.3 µL GENESCAN 500-TAMRA size standard (Perkin Elmer/Applied Biosystems). After denaturing at 85°C for 2 min, 0.8 µL of the sample was loaded into each lane. Electrophoresis was performed with 5% (w/v) Long Ranger (FMC Bioproducts, Rockland, ME), 6 M urea, 1x TBE 24-cm WTR gels on an ABI model 373A automatic sequencer (Perkin Elmer/Applied Biosystems). Gels were run at 30 W for a minimum of 3 h.
SSR Fragment Analysis
SSR fragment sizing was performed by the "Local Southern Method" and default analysis settings of the GeneScan version 3.1 (Perkin Elmer/Applied Biosystems). Size standard peaks were defined by the user. Allele calling was performed with Genotyper software, version 2.5 (Perkin Elmer/Applied Biosystems) and the two highest peaks were labeled using the size setting of 1/100th of a base. The GeneFlow software (http://www.geneflowinc.com/) was used for panel design allele binning and data analysis. Principle component analysis (P.C.A.) was performed by the NTSYS pc software package (Rohlf, 1998).
| RESULTS |
|---|
|
|
|---|
In each panel, fluorescent labels were assigned so that no two markers with the same dye label had overlapping allele size ranges (Fig. 1) . To ensure maximum usefulness of these panels with a broad range of germplasm, a 20-bp buffer zone was maintained separating the allele molecular weight ranges of SSR loci sharing the same fluorophore. This strategy was implemented to prevent overlap of alleles from untested rice cultivars that might fall outside the allele size range detected among the 13 test cultivars. Markers with allele sizes between 56 and 387 bp (as tested on the panel of 13 cultivars) were selected. Markers with allele sizes larger than 400 bp were eliminated for reasons of gel runtime and sizing accuracy on a 5% (w/v) polyacrylamide gel. The multiplex marker set included 84 dinucleotide, 36 trinucleotide, 6 tetranucleotide, and 33 complex repeats with a PCR product size range for individual marker loci varying from 3 to 128 bp. When evaluated on our standard panel of 13 cultivars, the set of 159 SSR markers detected from two to 10 alleles/locus, with an average of five alleles per marker, and had PIC values ranging from 0.26 to 0.89, with an average PIC value of 0.67.
|
Panel Testing and Marker Quality Assessment
Fluorescent detection of alleles at all 159 loci in the 13 standard genotypes was tested for reliability on a minimum of three separate gels, with PCR products resulting from at least two different reactions. After calculating the average allele sizes and standard deviation statistics, a comparison among di-, tri-, tetranucleotide and complex repeat markers was performed. The average std. dev. (0.23 bp for the dinucleotide repeat SSRs, 0.32 bp for the trinucleotides, 0.18 for the tetranucleotide, and 0.26 bp for the complex repeat markers) indicated that there was no significant difference in marker quality among the different kinds of motifs and that an occasional single base pair polymorphism would be distinguishable for most of the markers using the system developed in this study.
T4 DNA Polymerase Treatment
When multiple PCR amplifications using the same markers and the same genotypes were sized on an ABI373, allele calling was more consistent with some markers than others. To achieve a std. dev. of <0.5 bp for each allele size estimate, 22 microsatellite markers for which the std. dev. of some or all of the allele molecular weights was consistently >0.5 bp (for details, see http://ricelab.plbr.cornell.edu/publications/2002/coburn/) were subjected to a post-PCR T4 DNA polymerase treatment. This was aimed at reducing plus-A modification of the PCR products by Taq DNA polymerase (Ginot et al., 1996). In our experiments, T4 polymerase treatments were associated with a mean reduction of 0.28 bp in the average std. dev. of allele molecular weight for the problematic markers. This represented a 50% reduction of average std. dev. and strongly suggests that the variation associated with allele size estimates at many microsatellite loci was due to the plus-A modification of PCR products by Taq DNA polymerase. After the T4 polymerase treatment, an average std. dev. of 0.24 was achieved for the 159 markers used in the panels.
Diversity Analysis
The data derived from these experiments were analyzed to evaluate the usefulness of these microsatellite panels for various applications in rice genetics and breeding. First, we determined to what extent our estimates of genetic variation obtained for the standard set of 13 rice cultivars was predictive for a larger set of germplasm. By comparing data derived from a common subset of 22 fluorescently labeled markers for the 13 cultivars with data obtained for 256 diverse rice cultivars (Y. Xu, Cornell Univ., 1998, unpublished), we found that the larger set of genotypes showed expanded allele size ranges, but in the majority of cases the differences did not exceed the 20-bp buffer zone established for the automated multiplex array. For only three markers (RM11, RM38, and RM212) did the size range exceed this buffer zone. Nonetheless, an average of only 44% of the specific alleles at any of the loci observed in the 256 cultivars were detected by the smaller set of germplasm.
The average P.I.C. value estimated for the 22 SSR markers based on the 13 cultivars (0.73) was very similar to that of the much larger collection of 256 cultivars (0.72). For 12 of the markers the P.I.C. values were almost identical for the two data sets, while for the other 10 markers, the estimates of P.I.C. differed significantly (Fig. 2) . Interestingly, there was no correlation between number of alleles detected by a marker and the degree of deviation in P.I.C. values, which suggests that the observed difference in polymorphism information content is largely due to variation in allele frequency between the two data sets. A similar trend was observed when we compared our data with the results reported by Ji et al. (1998), who examined a set of accessions comprised of 51 tropical and temperate japonicas. In this case, the average P.I.C. values estimated for japonica cultivars (51 japonicas in the Ji et al. (1998) study and five japonicas in the present study) using 30 SSR markers shared by the two studies were 0.47 and 0.45. These results demonstrate that the estimates derived from the standard set of 13 cultivars can be used as the basis for developing multiplex arrays for automated genotyping, because they provide a reasonable estimate of the polymorphism potential in O. sativa.
|
|
The genetic relationships among the 13 cultivars were examined by means of all 159 SSR markers and the results are presented in the P.C.A. diagram in Fig. 4 . The indicas all group in close proximity to each other, while the japonicas are found in two distinct clusters; tropical japonicas Lemont and Azucena in one cluster, and temperate japonicas Nipponbare, Gihobyeo, and Jing-Xi 17 in the other. These results agree with classical genetic subspecies classifications (Morishima and Oka, 1981) and with previous reports based on isozymes (Second, 1986; Glaszmann, 1987), restriction fragment length polymorphisms (RFLP) (Wang and Tanksley, 1989), intersimple sequence repeats (ISSR) (Blair et al., 1999) and SSRs (Blair et al., 2002; Ni et al., 2002). When the Tongil cultivar Milyang 23 (derived from an indica/japonica cross), was placed on this diagram to determine its relationship with the subspecies groups, it was found to cluster with the indica accessions.
|
| DISCUSSION |
|---|
|
|
|---|
The optimization of PCR reactions to ensure consistent marker concentration improved the efficiency and accuracy of gel scoring because it minimized the number of false calls by the Genotyper software (Perkin Elmer/Applied Biosystems). False allele calls are often due to "pull-up" peaks that result from excessive product concentrations or weak peaks that get lost in the background signal. Scoring markers with short amplification products (<100 bp) was improved by reducing the amount of unincorporated primer due to better utilization of the smaller peaks of the internal size standard as they were less often obscured by the dye front.
Several methods exist for increasing the throughput of semiautomated genotyping. One method involves pooling primer pairs for multiplex PCR reactions (Henegariu et al., 1997). Using the same SSRs described above, we have experimented with a duplex PCR system where marker combinations were selected by determining an average melting temperature (Tm) for the forward and reverse primers of each SSR, and pooling those that have a similar average Tm. Trials in our lab have found this simple method to be about 80% successful in amplifying two SSR markers in a single PCR reaction. If the same PCR profile is optimal for some or all markers in a panel, the time required for setting up the reactions and for pooling the PCR products prior to gel loading can be significantly reduced when multiplex PCR is employed.
To increase the accuracy of allele calling, we addressed the problem of plus-A modification using the T4 DNA polymerase treatment. This approach was advantageous because it required neither a redesign of primers, nor a significant change of our established PCR protocol. However, other studies report modifications of reverse primer sequences (Brownstein et al., 1996) or PCR protocols (Smith et al., 1995). In some cases, redesign of a primer may be helpful in adjusting allele sizes to fit in a multiplex scheme better, and/or modification of PCR conditions may enhance the ability to multiplex during the PCR amplification phase. Therefore, the advantages of the various options for minimizing plus-A modification of PCR products should be evaluated in light of the system as a whole.
One of the important aspects in any large-scale genotyping experiment is reliability and informativeness of the resulting data. We have performed a detailed analysis of the microsatellite data collected in this study in the conjunction with unpublished data kindly provided by Dr. Y. Xu and data from Ji et al. (1998) to demonstrate that the multiplex panels of microsatellite markers we designed and evaluated on a standard set of 13 rice cultivars can be reliably applied to a much wider range of rice cultivars. The employment of the "Compare Genotypes" function of the GeneFlow software helped to identify markers easily that appear to have indica or japonica specific alleles. These may prove useful for the identification of subspecific introgressions in progeny derived from indica x japonica crosses. The information can also be used to decide which markers are likely to be most informative for evaluating diversity in different gene pools. For instance, for distinguishing cultivars within the same subspecific group, SSR markers can be selected which detect a higher level of polymorphism for a given subspecies. On the contrary, markers with conserved subspecific alleles may be more informative in assigning a cultivar to a certain phylogenetic group. In our study, three distinct clusters of rice cultivars revealed by the P.C.A. were in good agreement with previous genetic subspecies classifications. As a tool for whole-genome scans, the multiplex panels based on microsatellite markers can be efficiently used for whole-genome mapping, evaluation of genetic diversity and estimation of genetic relationships between O. sativa cultivars of different origin.
| ACKNOWLEDGMENTS |
|---|
| NOTES |
|---|
|
|
|---|
Received for publication June 25, 2001.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
B. C.Y Collard and D. J Mackill Marker-assisted selection: an approach for precision plant breeding in the twenty-first century Phil Trans R Soc B, February 12, 2008; 363(1491): 557 - 572. [Abstract] [Full Text] [PDF] |
||||
![]() |
H.-S. Ji, S.-H. Chu, W. Jiang, Y.-I. Cho, J.-H. Hahn, M.-Y. Eun, S. R. McCouch, and H.-J. Koh Characterization and Mapping of a Shattering Mutant in Rice That Corresponds to a Block of Domestication Genes Genetics, June 1, 2006; 173(2): 995 - 1005. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Semon, R. Nielsen, M. P. Jones, and S. R. McCouch The Population Structure of African Cultivated Rice Oryza glaberrima (Steud.): Evidence for Elevated Levels of Linkage Disequilibrium Caused by Admixture with O. sativa and Ecological Adaptation Genetics, March 1, 2005; 169(3): 1639 - 1647. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. J. Garris, T. H. Tai, J. Coburn, S. Kresovich, and S. McCouch Genetic Structure and Diversity in Oryza sativa L. Genetics, March 1, 2005; 169(3): 1631 - 1638. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Lu, M. A. Redus, J. R. Coburn, J. N. Rutger, S. R. McCouch, and T. H. Tai Population Structure and Breeding Patterns of 145 U.S. Rice Cultivars Based on SSR Marker Analysis Crop Sci., January 1, 2005; 45(1): 66 - 76. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Xu, H. Beachell, and S. R. McCouch A Marker-Based Approach to Broadening the Genetic Base of Rice in the USA Crop Sci., November 1, 2004; 44(6): 1947 - 1959. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| The SCI Journals | Agronomy Journal | Vadose Zone Journal | |||
| Journal of Natural Resources and Life Sciences Education |
Soil Science Society of America Journal | ||||
| Journal of Plant Registrations | Journal of Environmental Quality |
The Plant Genome | |||