Crop Science Journal of Natural Resources and Life Sciences Education
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Published online 1 January 2005
Published in Crop Sci 45:66-76 (2005)
© 2005 Crop Science Society of America
677 S. Segoe Rd., Madison, WI 53711 USA
This Article
Right arrow Abstract Freely available
Right arrow Figures Only
Right arrow Full Text (PDF) Free
Right arrow Supplemental Table and Figures
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Related articles in Crop Science
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via ISI Web of Science (21)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Lu, H.
Right arrow Articles by Tai, T. H.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Lu, H.
Right arrow Articles by Tai, T. H.
Agricola
Right arrow Articles by Lu, H.
Right arrow Articles by Tai, T. H.
Related Collections
Right arrow Crop Genetics
Right arrow Rice
Right arrow Plant Genetic Resources
Published in Crop Sci. 45:66-76 (2005).
© 2005 Crop Science Society of America
677 S. Segoe Rd., Madison, WI 53711 USA

CROP BREEDING, GENETICS & CYTOLOGY

Population Structure and Breeding Patterns of 145 U.S. Rice Cultivars Based on SSR Marker Analysis

Hong Lua, Marc A. Redusc, Jason R. Coburnb, J. Neil Rutgerc, Susan R. McCouchb and Thomas H. Taid,*

a Pioneer Hi-Bred International, A DuPont Company, 7300 NW 62nd Ave., Johnston, IA 50131-1004, USA
b Dep. of Plant Breeding, Cornell Univ., Ithaca, NY 14853, USA
c USDA-ARS Dale Bumpers National Rice Research Center, Stuttgart, AR 72160, USA
d USDA-ARS Crops Pathology and Genetics Research Unit, Dep. of Agronomy and Range Science, Univ. of California, Davis, CA 95616, USA

* Corresponding author (thtai{at}ucdavis.edu).


    ABSTRACT
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 REFERENCES
 
This study was undertaken to investigate the population structure of U.S. rice (Oryza sativa L.). A total of 115 U.S. rice cultivars and 30 ancestral accessions introduced from Asia were genotyped by means of 169 simple sequence repeat (SSR) markers that are well distributed throughout the rice genome. SSR-based clustering analysis identified three groups of U.S. rice cultivars that were recognizable as temperate japonica with short to medium grains, tropical japonica with medium grains, and tropical japonica with long grains. Indica cultivars were represented among ancestral accessions, but always clustered independently. Indica germplasm has been used for cultivar improvement, but never directly in U.S. rice production. Cluster analysis of cultivars based on four time periods representing their first release date or introduction (1900–1929, 1930–1959, 1960–1979, and 1980–2000) resulted in the identification of the same three groups. This suggests that the population structure in U.S. rice was established before 1930 and remains essentially intact today, despite a large amount of controlled crossing and artificial selection as a part of the breeding process. Fifty-seven percent of U.S. rice cultivars were developed from intragroup crosses, indicating the availability of substantial genetic variability within each group. Similar results were obtained using genetic distance-based and model-based clustering methods. Information about population structure and associated phenotypic characteristics recognized by geneticists and breeders paves the way for coordinated association mapping studies in the future.


    INTRODUCTION
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 REFERENCES
 
THE DYNAMICS OF genetic diversity and population structure throughout the history of breeding plants for agricultural purposes has been a focus of attention for many crops. Plant breeders use such information to help them better understand their germplasm, guide their breeding plans, and better exploit genetic variation that is available to them. The short breeding history of U.S. rice (about 90 yr) makes it possible to examine fully the changes in genetic diversity at different periods of time and to examine shifts in population structure over the course of the breeding process. Dilday (1990) examined the pedigrees of 140 rice accessions representing U.S. cultivars and the ancestral introductions from which they were developed. He found that the pedigrees of cultivars adapted to the southern rice belt (Arkansas, Louisiana, Mississippi, Missouri, and Texas) could be traced back to 22 introductions, and the pedigrees of the western rice belt (California) could be traced back to 23 introductions. Seven parental introductions contributed to cultivar development in both regions. From this information, it was concluded that current U.S. rice cultivars are closely related. However, pedigree information is not an accurate predictor of ancestral contributions to an artificially bred line (Culp, 1998). Molecular markers facilitate a more accurate assessment of genetic structure at both individual and population levels.

Molecular markers, such as SSRs, have been widely used in rice germplasm evaluation for both international (Yang et al., 1994; McCouch et al., 1997; Ishii and McCouch, 2000; Ishii et al., 2001) and domestic U.S. (Mackill, 1995; Cao and Oard, 1997; Ni et al., 2002) collections. The use of SSRs to interpret population structure provides much greater resolution than other types of markers because of the high level of polymorphism at SSR loci (Cho et al., 2000; Akkaya et al., 1992). Previous generations of molecular markers were unable to detect enough genetic polymorphism among closely related rice cultivars such as those used in U.S. breeding programs to make them efficient tools for interpreting population structure (Mackill, 1995). However, SSR markers are well suited to the task. In rice, the highly polymorphic nature of SSR motifs is coupled with a low level of homoplasy observed in O. sativa cultivars (Chen et al., 2002), providing an appropriate tool for population genetic studies.

With the public availability of rice genome sequence information, there is growing interest in identifying and characterizing genes associated with both qualitative and quantitative forms of phenotypic variation. Of particular interest to rice breeders is the possibility of using existing germplasm resources for gene and allele discovery on the basis of association mapping strategies (Kruglyak, 1999; Jorde, 2000; Farnir et al., 2000). Understanding population structure is important to avoid identifying spurious associations between phenotype and genotype in association mapping (Pritchard and Rosenberg, 1999; Pritchard et al., 2000; Pritchard and Donnelly, 2001).

This is the most comprehensive study to date assessing population structure in U.S. rice, taking advantage of the ease and reproducibility of SSR allele calling with a high throughput capillary based system (Rhodes et al., 1998; Ponce et al., 1999; Coburn et al., 2002). The 145 accessions evaluated here represent the majority of rice cultivars that have been released in the USA during the 20th century. The specific objectives of this study were (i) to analyze population structure in U.S. rice using both genetic distance-based and model-based clustering methods, (ii) to determine whether the population structure can be attributed to modern U.S. rice breeding efforts (1930–present) or whether it predates this history, and (iii) to explore U.S. rice breeding patterns, if any, by examining the genetic relationships of rice cultivars developed at different time periods in the 20th century.


    MATERIALS AND METHODS
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 REFERENCES
 
Rice Accessions and DNA Extractions
The 145 cultivars analyzed in this study consisted of 115 cultivars that were bred in the USA and used in production during the 20th century and 30 accessions collected outside the USA that were used as parents in cultivar improvement (Table 1). This collection included more than 90% of U.S. rice cultivars registered in Crop Science between 1965 and 2000. Seeds were obtained from several sources as noted in Table 1. Leaf tissue was harvested from 15 to 30 seedlings after 3 to 4 wk of growth in the greenhouse and stored frozen at –80°C. DNA extractions were performed as described by Tai and Tanksley (1990) except frozen tissues were ground with a mortar and pestle before extraction.


View this table:
[in this window]
[in a new window]
 
Table 1. U.S. rice cultivars and collected accessions listed according to the time of their first release or introduction to the USA.

 
Phenotypic Information
Grain type information (long, medium, and short) of most accessions was retrieved from the GRIN database (http://www.ars-grin.gov/npgs; verified 1 September 2004) and Mackill and McKenzie (2003). Plant height data were obtained from registrations in Crop Science, which are listed in the GRIN database (http://www.ars-grin.gov/cgi-bin/npgs/html/csr.pl?RICE; verified 1 September 2004).

SSR Markers
A total of 169 previously developed SSR markers (Table 2) were used for genotyping (Akagi et al., 1996; Chen et al., 1997; Temnykh et al., 2000, 2001; McCouch et al., 2002). Markers were distributed along the 12 chromosomes with an average distance between markers of approximately 9 cM and an average of 14 markers per chromosome. Primer sequences for these markers can be found on the Gramene website (www.gramene.org; verified 1 September 2004).


View this table:
[in this window]
[in a new window]
 
Table 2. List of 169 SSR markers in positional order on each rice chromosome.{dagger}

 
Marker Amplification and Allele Detection
Polymerase chain reaction (PCR) amplification of markers was performed in 10- or 15-µL reaction volumes consisting of 25 to 50 ng genomic DNA, 10 mM Tris-HCl pH 8.3, 50 mM KCl, 2.5 mM MgCl2, 300 to 400 nM of each primer, 1 unit Taq DNA polymerase. For each marker, forward primers labeled with fluorescent dyes (6-FAM, VIC, HEX, and NED) were purchased from Applied Biosystems1 (Foster City, CA). Amplifications were performed with MJ Research Tetrad Thermal Cyclers (Waltham, MA) and the following PCR conditions: (i) initial denaturation at 94°C for 5 min; (ii) 25 to 35 cycles of 94°C for 1 min, 55°C to 67°C (marker dependent) for 1 min, 72°C for 2 min; (iii) 5 min final extension at 72°C; (iv) 4°C hold. Amplified products were pooled where possible (typically three markers per run along with ROX-labeled size standard) and run on an ABI Prism 3700 DNA Analyzer according to manufacturer's instructions (Applied Biosystems, Foster City, CA). SSR fragment sizing was performed with GenScan 3.1.2 software (Applied Biosystems, Foster City, CA) using the "Local Southern Method" and default analysis settings. Alleles were called with Genotyper 2.5 (Applied Biosystems, Foster City, CA), and binned manually.

Statistical Analysis
Genetic distance and cluster analyses were conducted using the PowerMarker program (http://www.powermarker.net; verified 1 September 2004). Nei's genetic distance (1972) was used to calculate pair-wise genetic distance among all accessions. The UPGMA method was used to conduct cluster analysis. The TreeView program distributed at http://taxonomy.zoology.gla.ac.uk/rod/rod.html (verified 1 September 2004) was used to construct clustering trees. Model-based cluster analysis was performed by the Structure program (Pritchard et al., 2000), which detects population structure in structured or admixed populations. The number of subpopulations (K) was set from 2 to 8, and each was run three times. Each run started with 10000 burn-ins followed by 50000 iterations. When K was set at 5, a run with the highest log likelihood was achieved and was used to produce model-based population structure. The polymorphism information content (PIC) for each marker was calculated (Anderson et al., 1993):

where pli is the frequency of the ith allele at locus l with n alleles. Polymorphic loci were defined as those whose most frequent allele had a frequency of less than 0.95.


    RESULTS AND DISCUSSION
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 REFERENCES
 
Overview of Genetic Diversity Present in U.S. Rice Germplasm
Among the 145 accessions, 136 are classified as japonica. Analysis of these japonica cultivars with the 169 markers resulted in the detection of 870 alleles, an average of 5.15 alleles per locus. Rare alleles (defined as a frequency < 0.05 in the total sample) accounted for 50%, while abundant (frequency > 0.30) and intermediate (0.05 < frequency < 0.30) alleles constituted 24 and 26%, respectively, of the 870 alleles. Inclusion of eight indica and K65, a cultivar collected from Surinam, resulted in the detection of a total of 1111 alleles with an average of 6.57 alleles per locus. Among these alleles, 59% (652) were classified as rare, 18% (198) abundant, and 23% (261) intermediate. A total of 159 (93%) SSR loci were polymorphic in the total sample. The most polymorphic loci (PIC > 0.80), distributed on chromosomes 1, 2, 4, 5, 6, 7, 10, and 11, showed no tendency to cluster in specific locations in the genome. There were 11 monomorphic loci (RM17, RM85, RM185, RM188, RM208, RM245, RM296, RM416, RM507, RM512, and OSR13) located on chromosomes 2, 3, 4, 5, 9, and 12. These loci were not located in the regions identified by Temnykh et al. (2000) as associated with low levels of diversity in a sample of 13 O. sativa cultivars. The number of alleles per locus ranged from two (at 13 loci) to 21 (at RM303 on chromosome 4), and PIC values ranged from 0.028 (RM512 on chromosome 12) to 0.881 (RM164 on chromosome 5) with an average of 0.463, which is also the average gene diversity of the total sample.

The average observed heterogeneity of the total sample across all 169 loci was 3.1%, which was as expected. However, five loci (RM44, RM144, RM221, RM341, and RM1189) across the total sample and three accessions (Delitus, Colusa, and IR659-10-8-3) across the total loci had observed heterogeneity ≥ 10%. Notably, IR659-10-8-3 (T018) had as high as 46% of observed heterogeneity, indicating that this sample was not stable or purified yet.

Groupings of USA-Developed and Important Introduced Cultivars at Different Time Periods
To explore patterns in U.S. rice breeding history, we classified the 145 rice cultivars into four groups according to the time period they were first introduced to the USA or released for production (Table 1). During the first period (early 1900s–1929), 18 cultivars were imported from Asia and crosses were initiated from these introductions. Five of these cultivars were brought directly from the Philippines, China, and Madagascar, two were from unknown sources, and the remaining 11 were selections from heterogeneous parental accessions, most of which were collected in Asia. During the second period (1930–1959), extensive crosses were made among the first generation of cultivars and an additional 31 cultivars were developed or introduced. The third period (1960–1979) was marked by the introduction of semidwarf germplasm. Although this material was widely used in rice breeding programs, few of the 44 cultivars released during this period were semidwarf. Many of the 52 cultivars released during the fourth time period (1980–2000) time were semidwarf or of short stature.

The 18 cultivars imported or released during the first period (1900–1929, T1) were classified into three groups (Fig. 1) . The first group (T1G1) consisted of one cultivar, Early Wataribune (EYWB), from California and five cultivars from China and the Philippines. They all shared the characteristic of short grains. Colusa (COLU) was selected from the cultivar Chinese (CHNA). It should be noted that CHNA used in this study and cited in Dilday (1990) may or may not be the cultivar from which COLU was developed as the reported origin of the CHNA cultivar from which COLU was selected is Italy (Johnston, 1958), while the CHNA used in our study is from China. Nevertheless, the results of our analysis suggest that the CHNA used in this study is indeed closely related to COLU. Caloro (CALO) was selected from EYWB. This group formed the foundation for rice breeding in California, which is the only region in the USA that grows temperate japonica (TMJ). The second group (T1G2) consisted of eight accessions, three of which can be traced back to Blue Rose (BROS) or its improved versions and they were grouped closely together. Edith (EDTH) was clustered closely with Honduras (HNDS), from which it was selected. Delitus (DLTS) is joined with this group at a greater genetic distance. T1G2 was selected by breeders in the southern states, mainly Louisiana, and formed the foundation for U.S. medium-grain tropical japonica (TRJ-M). The third group (T1G3) consisted of four accessions, all of which have long grains and three of which were selected by breeders in Louisiana. Nira (NIRA) is loosely joined with this group at a greater genetic distance. This group formed the foundation for U.S. long-grain tropical japonica (TRJ-L), which is the major type of rice in the southern U.S. rice belt. On the basis of SSR analysis, the three groups (TMJ, TRJ-M, and TRJ-L) were already differentiated, genetically, by 1929, which suggests that they were derived from existing subpopulations in Asia. During the first three decades of the 20th century, U.S. breeders did not use or develop any indica germplasm.



View larger version (18K):
[in this window]
[in a new window]
 
Fig. 1. Groupings of 18 U.S. rice cultivars collected or selected before 1930 based on 169 SSR markers using UPGMA method.

 
To determine the relationship among ancestral and newly developed cultivars, the first generation of 18 cultivars was included in groupings of later cultivars. The 31 cultivars developed in the second period (T2) were classified into four groups (S-Fig. 1; available online), which represented extensions of the three groups identified by the first generation of 18 cultivars and an additional group (T2G4) corresponding to the indica subspecies. The first group was composed of three collected cultivars from Japan (Koshihikari, KOSH), Taiwan (Tainan Iku 487, T487), and Bangladesh (Latisail, LTSL), and three cultivars developed by California breeders (Calady, CLDY; Calady 40, CA40; and Caloro, CALO) along with the six cultivars in T1G1. They were short or medium grained TMJ adapted to the California ecosystem. The second group (T2G2) contained three newly developed cultivars by Arkansas and Louisiana as well as all T1G2 accessions with the exception of DLTS, which was clustered to T2G3. Most of the accessions in this group were of the medium grain-type. The third group (T2G3) consisted of 23 accessions, which were all TRJ-L and were closely associated with accessions from T1G3. The rapid expansion of the third group reflected the prevalence of TRJ-L in U.S. rice production since the 1930s. The fourth group included four indica that were introduced from Bangladesh, Taiwan, and Japan. The indica group was highly differentiated from the previous three groups of japonica, reflecting the ancient distinction between japonica and indica subspecies.

The 44 cultivars in the third period were classified into four groups (S-Fig. 2; available online). Group 1 (T3G1) consisted of 11 cultivars, of which California contributed 80%. All were of the short to medium grain-type. Group 2 (T3G2) had 12 medium grain cultivars developed by southern states, primarily by Arkansas. However, DLTS was loosely joined to this group at a greater genetic distance. Group 3 (T3G3) consisted of 17 new long grain (TRJ-L) cultivars developed by southern states. NIRA stands alone in the intermediate position between TMJ and TRJ. Group 4 (T3G4) contained two collected indica and L110, a Louisiana cultivar selected from TN1/H4, where H4 was introduced from Sri Lanka as PI275451 (http://www.ars-grin.gov/cgi-bin/npgs/html/acchtml.pl?1206433; verified 1 September 2004). Cultivar K65, a collection from Surinam, stands alone between the indica and japonica, it was thus called an interspecies group.

The fourth time period (1980–2000) contained 52 new cultivars, which were clustered into four groups (S-Fig. 3; available online). Group 1 (T4G1) included 10 short to medium grain cultivars developed in California and representative of U.S. TMJ for that time period. The second group (T4G2) contained seven medium grain cultivars, all developed by southern states and belonging to TRJ-M. Group 3 (T4G3) had 34 new long grain cultivars. This group contained U.S. TRJ-L and California long grain japonica. Group 4 (T4G4) included only one indica, TeQing (TQNG), which was collected from China and has been used in recent years for introgression of its high yielding alleles. This group is substantially distinct from the other japonica groups.

Cluster analysis clearly shows that released U.S. rice cultivars can be classified into three groups: the TMJ group, short season, cold tolerant temperate japonica, mostly developed and used in California with short or medium grains; the TRJ-M group, which has medium grain length; and the TRJ-L group, which has long grain length and remains the predominant type in the southern USA. Indica germplasm has been utilized as donors of desirable genes such as those conferring semi-dwarf stature, disease resistance, and high yield to improve U.S. japonica cultivars but has never been used directly in production. The genetic structure of temperate and tropical japonica is clearly distinguished throughout the history of U.S. rice breeding in the 20th century. This difference is paralleled by the clear distinction between long and short-to-medium grain cultivars. U.S. temperate japonica are consistently associated with short to medium grains and U.S. tropical japonica are divided into two well-defined subgroups: one associated with medium grains and the other with long grains. Indica cultivars are always substantially different from japonica cultivars.

A number of papers regarding the classifications of U.S. rice cultivars have been published. For example, Cao and Oard (1997) analyzed 26 U.S. elite rice cultivars and lines with 69 RAPD (random amplified polymorphic DNA) primers. They found that cultivars with the same maturity group or grain type were generally placed together in RAPD-based cluster analysis. The present SSR-based groupings are consistent with maturity group and grain type as well. Ni et al. (2002) evaluated the genetic diversity of 38 diverse rice cultivars (O. sativa) and two wild species accessions (O. rufipogon Griffith and O. nivara Sharma et Shastry) using 111 SSR markers. They classified the japonica accessions into two groups: tropical japonica and temperate japonica. While these studies provided general information about U.S. rice cultivar classifications, the relatively small number of U.S. accessions and/or small number of markers included limit their resolution and power. In this study, examination of 169 SSR markers revealed that clear and consistent population structure exists in the 145 rice cultivars used in or developed by U.S. breeding programs.

Relationship between Genetic Distance-Based Groupings and Pedigrees
There were several subgroups (Fig. 2) within each of the three groups of U.S. japonica cultivars. Cultivars in the same subgroup usually shared a high proportion of ancestry and/or agronomic characteristics such as maturity and disease resistance. Close scrutiny of each subgroup provides useful information for cultivar development. TMJ consisted of at least two major subgroups. Sub1-1 included nine accessions, eight of which were collected or selected from Asian cultivars, with only one (Nortai, NTAI) developed in the USA. In general, they are not actively used in today's U.S. rice breeding programs, so this subgroup is referred to as "Old Cultivars." Sub1-2 had 24 cultivars, all of which were derived from CALO. Within this subgroup, CS-M3 (CSM3) is widely used as a parent in recent and contemporary breeding programs, but its pedigree can be traced back to CALO. This subgroup is therefore referred to as "Caloro & CS-M3" subgroup in Fig. 2. Cultivars in Sub1-2 shared at least 77% ancestry with TMJ (S-Table 1) and all had early season maturity. Sub3-4 and 3-5 all contain the semidwarf gene, sd1, or have very short stature, and are high yielding and early maturing. Other subgroups also were identifiable with their most common-shared ancestor, such as Sub2-2 (Lacrosse), Sub3-1 (Starbonnet), and Sub3-2 (Fortuna) (Fig. 2). All cultivars except the Old Cultivars subgroup in TRJ-M shared BROS in their pedigrees; therefore, BROS was the most important ancestor for TRJ-M group. All subgroups in TRJ-L shared Rexoro (RXOR) in their pedigrees with different percentages, and most of them also shared Fortuna (FRTA) in their pedigrees. Therefore, RXOR and FRTA were the two primary contributors to TRJ-L as reported by Dilday (1990) and Mackill and McKenzie (2003). However, more recently Starbonnet (STBN), Bluebelle (BBLE), and the long grain California cultivars L201 and L202 have been used repeatedly as parents, underwriting the subgroup identity of this group.



View larger version (45K):
[in this window]
[in a new window]
 
Fig. 2. Groupings of 145 U.S. rice cultivars based on 169 SSR markers using genetic distance-based (UPGMA) and model-based (Structure) methods.

 
Old cultivars are replaced periodically with new releases that have the same genetic architecture and fall into the same subgroups as the ones they are intended to replace. For example, cultivars in Sub2-3 are all short season, high-yielding and largely blast-resistant, and though they were developed at different times, they are closely related by pedigree. Mars (MARS), released in 1977 (Johnston et al., 1979), replaced Saturn (STRN, released in 1965; Jodon, 1965), and was selected from STG-6102515/STRN. Mercury (MERC), released in 1987 (McKenzie et al., 1988), was developed from Short MARS/NATO. Orion (Brazos/MARS) and Bengal (MARS//M201/MARS) were released in 1992 and 1993 (Moldenhauer et al., 1992; Linscombe et al., 1993), respectively, to replace MARS. This pattern was shared by many other subgroups, where the replacement cultivar was usually derived from the cultivar per se or a close relative.

This study included five sets of full sibs: Cypress-Jodon (L202/Lemont), Lemont-Gulfmont (Lebonnet//CIor 9881/IR659-10-8-3), Maybelle-Jackson (Skybonnet/L201), Nova-STG533187 (Lacrosse//Zenith/Nira), and Bluebonnet 50-Sunbonnet (selections from Bluebonnet). On the basis of their pedigrees, each pair was expected to group together; however, this was not always the case. Marker-based clustering trees provided insight into the true genetic makeup of each accession. Divergence of a pair of full sibs may reflect strong divergent selection by breeders. For example, Cypress (CPRS) clustered with the L202 subgroup, as might be expected, but Jodon (JODN) shared the BBLE subgroup with Lemont (LMNT) (Fig. 2). This could be attributed to different selection criteria imposed by Louisiana breeders that exploited inherent genetic differences between L202 and LMNT. The Structure program revealed that LMNT and L202 shared 98 and 84% of TRJ-L ancestry, respectively (S-Table 1), so they were possibly divergent for the other 18% of their ancestry. In other cases, such as Jackson (JKSN) and Maybelle (MBLE), full-sibs were closely clustered together, reflecting the high similarity between their parents (L201 and SKBT, which both shared approximately 99% ancestry of TRJ-L) (S-Table 1).

While the SSR-based classifications were consistent with cultivar pedigrees in our study, some discrepancies were observed. In some cases, reasons for the discrepancies have been proposed, as with the full-sibs as discussed earlier. However, accession T018 was classified into the indica group instead of the tropical japonica group as expected (Fig. 2). T018 was developed at IRRI, but its pedigree reveals that it was selected from backcross progenies of BBLE/TN-1 backcrossed five times to BBLE. As a result, it was expected to have approximate 99% of its genome from BBLE, a tropical japonica long grain cultivar (TRJ-L). However, the Structure program revealed that T018 had only 22% ancestry from japonica and 62% from indica, and therefore it was classified as an indica (S-Table 1). While backcrossing followed by intensive selection can shift allele frequencies from the expected percentage based on pedigree predictions, this is an extreme example and might be related to its high percentage (46%) of observed heterogeneity.

Breeding Patterns in U.S. Rice Germplasm Development
This study contained a total of 115 rice cultivars that were developed in the USA during the period 1930 to 2000. Among the 115 cultivars, 24 (21%) were TMJ, 22 (19%) were TRJ-M, and 69 (60%) were TRJ-L (Table 3). An additional 30 cultivars were collected from other countries and brought to the USA as a source of germplasm enhancement, or directly selected from those collections.


View this table:
[in this window]
[in a new window]
 
Table 3. U.S. rice breeding patterns reflected by the groupings of rice cultivars developed in the USA during 1930–2000.

 
The breeding history of U.S. cultivars revealed that the majority (63%) of TMJ were selected from intragroup crosses, 13 and 8% of cultivars were developed from crosses between 1 x 2 and 1 x 4, respectively (Table 3). There were no TMJ cultivars selected from crosses between 1 x 3. Group 3 was TRJ-L; therefore, we can infer that the U.S. TMJ have no or little TRJ-L in their pedigrees. Approximately one third (36%) of TRJ-M were developed from intragroup crosses, and almost half (45%) were developed from groups 1 x 2 and 1 x 3 (Table 3). However, no indica germplasm (group 4) was used for the development of U.S. TRJ-M. While 61% of TRJ-L group was selected from intragroup crosses, all possible combinations were represented in the remaining 39% of Group 3 cultivars (Table 3).

Interestingly, while group 2 (TRJ-M) combines the medium grain characteristic of TMJ in group 1 and many of the tropical japonica characteristics of group 3, cultivars from group 2 were most frequently used as parents for the improvement of other groups and correspondingly, group 2 cultivars were also derived from a higher proportion of crosses with other groups. Therefore TRJ-M may be regarded as providing a genetic bridge between TMJ and TRJ-L.

Comparisons between Genetic Distance-Based and Model-Based Groupings
While genetic distance-based (GD-based) clustering is powerful, easy to use, and has been widely reported in the literature, the problem with this approach is that the number of groups identified is based on an arbitrary cutoff that depends on the user's judgment, with no standard way of evaluating the statistical significance of the grouping result. Model-based methods, such as that reported by Pritchard and colleagues (Pritchard et al., 2000; Pritchard and Donnelly, 2001), use a Bayesian clustering approach in which each group or population is characterized by a set of allele frequencies at each locus along with the likelihood for each K (number of groups). This enables users to choose the number of groups with the highest log likelihood. Model-based methods have been widely used to identify population structure for association mapping in human genetics (Pritchard and Rosenberg, 1999; Pritchard and Donnelly, 2001; Pritchard and Przeworski, 2001; Rosenberg et al., 2002) and, to a lesser extent, plant genetics (Remington et al., 2001; Thornsberry et al., 2001). In this study, we used both genetic distance-based and model-based methods to assess population structure.

Groups resulting from the two methods were consistent for 139 (96%) cultivars (Fig. 2), indicating a good consensus between the two methods. In Structure analysis, comparative runs showed that the highest log likelihood was achieved when K was set at 5. Therefore, both methods classified the 145 rice cultivars into five groups as summarized in the previous section. K65, a genetically distinct cultivar, clustered independently from either indica or tropical japonica, indicating that this accession was different from both the indica and japonica subspecies or the admixed pedigrees of the two subspecies. This result agreed with GD-based analysis (Fig. 2). NIRA, an early selection from Louisiana, had 34, 26, and 40% of ancestry from TRJ-L, TRJ-M, and K65, respectively (S-Table 1), as estimated by Structure. Thus, it appeared most similar to K65, but clearly showed ancestry with both long and medium grain tropical japonica groups. The other three accessions (WC-6, Badkalamakati, and LTSL) grouped together with K65 by Structure were also old introduced cultivars and showed conflicting classifications between the GD- and model-based methods. DLTS, an accession that showed conflicting classifications, has 47% of TRJ-M, 39% of K65, and 14% of TRJ-L, and it was classified as either TRJ-M (Time 1 and 3 in Fig. 1 and S-Fig. 2) or TRJ-L (Time 2 in S-Fig. 1). Therefore, accessions sharing a high percentage (>39%) of ancestry with K65 cannot be reliably classified into either TMJ or TRJ groups, though they may share ancestry with one or both.

On the basis of our results, we conclude that both GD-based and model-based grouping methods worked equitably well; however, for a data set with clear pedigree relationships like the one in this study, the GD-based method generated results more consistent with pedigree information than the model-based method. Unlike the GD-based method, the model-based method provided each entry's percentage of ancestry, which is valuable for breeders. However, the model-based method does not provide information about subgroups, as is provided by the GD-based method.

Understanding the population structure of U.S. rice germplasm is a prerequisite for future studies aimed at association mapping where regions of the genome can be associated with phenotypes of interest. The ability to use genetic resources familiar to the rice breeding community as the foundation for establishing phenotypic-genotypic associations offers exciting opportunities for gene discovery but will only be efficient if population structure is taken into account (Flint-Garcia et al., 2003).


    ACKNOWLEDGMENTS
 
This manuscript is dedicated to M.A. Redus. Thanks to R. Gibbons, V. Johnson, and G. Miller for technical assistance in DNA preparation and marker genotyping. Seeds of some of the accessions were kindly provided by H. Bockelman, R. Dilday, J. Gibbons, F. Lee, and K. Moldenhauer. We thank K. McKenzie and K. Moldenhauer for critical reading of this manuscript. This work was supported in part by United States Department of Agriculture National Research Initiative Grant 00-35300-9216 to T.H.T., S.R.M, and J.N.R.


    NOTES
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 REFERENCES
 
This work was completed while Hong Lu was a post doc in the Dep. of Plant Breeding at Cornell University.

1 Mention of trade names or commercial products in this publication is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the U.S. Department of Agriculture. Back

Received for publication December 18, 2003.


    REFERENCES
 TOP
 NOTES
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS AND DISCUSSION
 REFERENCES
 


Related articles in Crop Science:

THIS ISSUE IN CROP SCIENCE

Crop Science 2005 45: xi. [Full Text]  



This article has been cited by other articles:


Home page
Crop Sci.Home page
S. Chao, W. Zhang, J. Dubcovsky, and M. Sorrells
Evaluation of Genetic Diversity and Genome-wide Linkage Disequilibrium among U.S. Wheat (Triticum aestivum L.) Germplasm Representing Different Market Classes
Crop Sci., May 31, 2007; 47(3): 1018 - 1030.
[Abstract] [Full Text] [PDF]


Home page
Crop Sci.Home page
L. E. Giarrocco, M. A. Marassi, and G. L. Salerno
Assessment of the Genetic Diversity in Argentine Rice Cultivars with SSR Markers
Crop Sci., March 1, 2007; 47(2): 853 - 858.
[Abstract] [Full Text] [PDF]


Home page
Crop Sci.Home page
G. C. Eizenga, H. A. Agrama, F. N. Lee, W. Yan, and Y. Jia
Identifying Novel Resistance Genes in Newly Introduced Blast Resistant Rice Germplasm
Crop Sci., July 25, 2006; 46(5): 1870 - 1878.
[Abstract] [Full Text] [PDF]


Home page
Crop Sci.Home page
F. Breseghello and M. E. Sorrells
Association Analysis as a Strategy for Improvement of Quantitative Traits in Plants
Crop Sci., April 25, 2006; 46(3): 1323 - 1330.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
F. Breseghello and M. E. Sorrells
Association Mapping of Kernel Size and Milling Quality in Wheat (Triticum aestivum L.) Cultivars
Genetics, February 1, 2006; 172(2): 1165 - 1177.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Figures Only
Right arrow Full Text (PDF) Free
Right arrow Supplemental Table and Figures
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Related articles in Crop Science
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via ISI Web of Science (21)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Lu, H.
Right arrow Articles by Tai, T. H.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Lu, H.
Right arrow Articles by Tai, T. H.
Agricola
Right arrow Articles by Lu, H.
Right arrow Articles by Tai, T. H.
Related Collections
Right arrow Crop Genetics
Right arrow Rice
Right arrow Plant Genetic Resources


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
The SCI Journals Agronomy Journal Vadose Zone Journal
Journal of Plant Registrations Soil Science Society of America Journal
Journal of Natural Resources
and Life Sciences Education
Journal of
Environmental Quality