|
|
||||||||
a Marker-assisted Rice Breeding Research Team, National Institute of Crop Science, Tsukuba 305-8518, Japan
b Dep. of Biotechnology, Kyoto Sangyo Univ., Kyoto 603-8555, Japan
* Corresponding author (yonezaw{at}cc.kyoto-su.ac.jp).
| ABSTRACT |
|---|
|
|
|---|
Abbreviations: ANP, the average number of plants tested per generation of selection BF, BL, and NB, the schedules of crossing in which the backcross with a recipient line is performed first (before crossing between the donors), last (after the crossing), and not performed, respectively CRO, the total number of crossings performed in a crossing schedule ETN, the expected total number of plants tested in a crossing schedule GEN, the number of generations used for a crossing schedule S, T, and ST, the crossing schedules with symmetric, tandem, and mixed structures, respectively SEL, the total number of generations of selection performed in a crossing schedule.
| INTRODUCTION |
|---|
|
|
|---|
The marker-based gene accumulation will be rather straightforward when two or three donor lines are involved; procedures practicable with a minimum resource expense could be easily found in such a case. When accumulating many genes from four or more donor lines, however, some guideline principles are needed to find the best procedures. To our knowledge, procedures for marker-based gene accumulation from multiple donor lines have been little discussed, although studies have been made in detail for the marker-assisted backcross strategy for gene introgression (Tanksley, 1983; Hospital et al., 1992; Tanksley and Nelson, 1996; Frisch, 2004) as well as the marker-assisted selection strategies in which the detection of markers and selection assisted by the detected markers are performed for biparentally initiated populations in outcrossing crops (Lande and Thompson, 1990; Whittaker et al., 1995; Moreau et al., 1998; Hospital et al., 2000; Liu et al., 2003) and self-fertilizing crops (van Berloo and Stam, 1998; Charmet et al., 1999; Liu et al., 2004). With more and more markers of useful trait genes becoming available every year, it will be important to establish strategies for utilizing these markers to produce high-degree, gene-pyramided lines.
Procedures in a marker-based gene accumulation program proceed in two steps. First, all target markers in the donor lines are assembled into the genome of a single plant in a heterozygous state, and second, a plant that has all the markers in a homozygous state is selected from among the progeny of the heterozygous plant produced in the first step. Different parameters determine the efficiency of the two steps; the schedule (pattern and order) of crossing between donor lines is important in the first step, and the scheme of selection, in the second step. The purpose of this paper is to propose some guidelines for optimizing the procedures of the first step.
| DEFINITIONS OF CROSSING SCHEDULES AND EFFICIENCY INDICATORS |
|---|
|
|
|---|
4) donor inbred lines in a monoecious diploid plant species. Donor line Mi (i = 1
k) has mi (
1) target markers (Fig. 1
), each of the markers being perfectly linked with a desirable gene for a qualitative or quantitative trait. The recipient inbred line V may also carry some markers to be accumulated which, however, are irrelevant for comparing the relative efficiencies of different crossing schedules and are ignored in the present discussion. The target markers are assumed to be codominant or dominant, being independently inherited. Selection in any generation is performed on the basis of the markers. To obtain a plant that carries all the target genes in a homozygous state in the genetic background of line V, a plant genotype that has all target markers in a heterozygous state must be produced in the first step (Step I in Fig. 1), followed by selection for obtaining the objective plant that has all the markers in a homozygous state (Step II). Here, we describe the optimum procedures for Step I. Several generations of backcrosses are used in Step I for recovery of the genetic background of V but are not used in the absence of any particular recipient line. We discuss both two cases. The guideline principles derived herein will apply not only to autogamous plant species but also to allogamous species when the recipient line V and donor lines Mis are inbred.
|
![]()
|
|
For each genetically segregating population, the marker test ends when a plant with the target marker genotype ("
![]()
x(1p)x1 , p being the segregation pro- portion of the target plant]. The expected total number of tested plants, ETN, is calculated as the sum of the expected plant numbers across all generations in which the marker
selection is performed, i.e.,
(1/pj), where pj stands for the segregation proportion of the target genotype in generation j. In schedule NB-4T, as the most convenient example, a plant having all target markers of donors M1, M2, and M3 in a heterozygous state occurs in a proportion of 1/2m1 + m2 in the first genetically segregating generation [Generation 3 in Fig. 2(a)], and one having all markers of M1 to M4 occurs in a proportion of 1/2m1 + m2 + m3 in the next generation and 1/2m1 + m2 + m3 + m4 in all (n) subsequent generations, giving a total expected plant number of 2m1 + m2 + 2m1 + m2 + m3 + n x 2m1 + m2 + m3 + m4 (mi stands for the number of markers possessed by donor Mi). For brevity, formulae of ETN as well as the other indicators are presented in the supplements of the tables that appear in the next section.
| COMPARISON OF THE SCHEDULES |
|---|
|
|
|---|
|
A contrastingly different trend is observed regarding superiority between schedules in category NB; a schedule with tandem structure (NB-4T) is superior to that with symmetric structure (NB-4S) in any sets of mis calculated. Generality of this trend is substantiated by the formulae of Table 1(a); NB-4T is superior to NB-4S when 2m1 + m2 + 2m1 + m2 + m3 < 2m1 + m2 + m3 + m4, simply, 1 + 2m3 < 2m3 + m4, which always holds. With the same set of donors, a disposition directed as m1
m2
m3
m4 is most efficient [Table 1(a)]. Screening is not necessary for the markers of a donor (M4) that is used in the last stage in a tandem schedule (NB-4T). This explains how NB-4T is superior to NB-4S.
When k = 5, schedules with mixed structure, BF-5ST1 and BF-5ST2, are superior to that with fully tandem structure, BF-5T, in any cases calculated [Table 1(b)]; the conditions for BF-5ST1 and BF-5ST2 to be superior to BF-5T are derived as 2m4 + m5 < 2m1 + m2 + m3 + m4 and 2m3 + m4 < 2m1 + m2 + m3, being simplified to m5 < m1 + m2 + m3 and m4 < m1 + m2, respectively. Of the two schedules BF-5ST1 and BF-5ST2, the former is superior in all cases calculated, indicating that when a tandem structure is unavoidable within a schedule, it should be used in the earliest possible stage. By the formulae given in the supplement of Table 1(b), BF-5ST1 is superior to BF-5ST2 when 2m1 + m2 + m3 + 2m4 + m5 < 2m3 + m4 + 2m1 + m2 + m3 + m4, which always holds when m5
m3. In the range of dispositions of (1, 2, 2, 3, 4) to (4, 3, 2, 2, 1), ETN is minimized (taking a value of 4368) in BF-5ST1 with disposition (1, 2, 3, 2, 4) in which the crossing is performed in two clusters, i.e., a triplet composed of M1, M2, and M3 and a pair of M4 and M5, with the same total number of markers being disposed to each cluster, i.e., 6 (1 + 2 + 3 and 2 + 4). This result coincides with the one obtained in case of k = 4 that ETN in BF-4S is minimized when each of the two pairs, i.e., M1 and M2 versus M3 and M4, is disposed with the same total number of markers, i.e., 5 (1 + 4 and 2 + 3). Of the two dispositions (1, 2, 3, 2, 4) and (1, 3, 2, 2, 4) in BF-5ST1, the former requires a fewer ETN, indicating that two plants with fewest markers should be crossed first within a triplet cluster.
The superiority among schedules in NB is contrastingly different from that among schedules in BF. In Table 1(b), the superiority occurs in the order of NB-5T > NB-5ST2 > NB-5ST1 in all cases calculated; from the formulae attached to this table, the superiority conditions for NB-5T over NB-5ST2 and for NB-5ST2 over NB-5ST1 are derived as 2m1 + m2 + 2m1 + m2 + m3 < 2m1 + m2 + m3 + m4 (simply, 1 + 2m3 < 2m3 + m4) and 2 x 2m1 + m2 + m3 + m4 < 2m1 + m2 + 2m1 + m2 + m3 + m4 + m5 (simply, 21 + m3 + m4 < 1 + 2m3 + m4 + m5), both of which hold with any sets of mis. NB-5T with disposition m1
m2
m3
m4
m5 is the best, as is confirmed with the calculations of Table 1(b).
In short, a crossing schedule with more symmetric structure as well as disposition of the donor plants is superior when heterozygous plants obtained via backcrossing with a recipient line are used as donors. Ideal-type schedules with four to eight donors are presented in Table 2 for breeder's convenience. By contrast, a schedule with tandem structure and asymmetric disposition directed as m1
m2
···
mk is superior when the inbred (homozygous) donors themselves are crossed without the backcross.
|
Complications may occur in schedules of category NB. From the formulae of ETN of NB-4S and NB-4T presented in the supplement of Table 3, the condition for NB-4T to be superior to NB-4S is derived approximately as m12 < m40 + 0.42x(m14 + m24) [derived from the inequality 2m10 + m20 + 2m10 + m20 + m30 + m12 x (4/3)m13 + m23 < 2m10 + m20 + m30 + m40 x (4/3)m13 + m14 + m23 + m24, which can be approximated to m12 < m40 + (2 ln3/ln2)x(m14 + m24)]. This condition indicates that, as confirmable with the calculations in Table 3, NB-4T is superior when the two donors carrying fewer redundant markers (m12) than intrinsic markers of the fourth donor M4 (m40) are crossed in the first stage. On the other hand, comparison of the dispositions of intrinsic markers of (1, 2, 3, 4) to (4, 3, 2, 1) shows that, with the same configuration of redundancy, ETN of NB-4T is minimized with disposition (1, 2, 3, 4). These two trends lead to a guideline that a tandem crossing schedule should be used in which two donors with the fewest intrinsic (m10 and m20) as well as redundant (m12) markers are crossed in the first stage and a donor with the greatest number of intrinsic (m40) as well as redundant (m14 and m24) markers being crossed in the last stage. Such a schedule may not be constructed when donors with the fewest intrinsic markers have relatively many redundant markers. The best schedule in such a case should be based on the calculations using the formulae. Doubly redundant markers (m34) as well as trebly and quadruply redundant markers (mijks and m1234) are irrelevant to the superiority when k = 4 because all of the plants obtained through the last crossing have these markers.
|
Influence of Linkage between the Markers
For markers located on the same homologous chromosomes of the same or different donors, linkage occurs in the genome of a plant selected in a stage. The crossing schedule should be duly modified in such a case. When compared with the case of independent inheritance, a linkage in the coupling phase produces a favorable effect; with the same number of targeted markers, the segregation proportion (pj) of the targeted genotype in the progeny raised via crossing the selected plant with a new donor plant increases in the presence of a coupling linkage, and, consequently, the number of tested plants decreases. By contrast, a linkage in the repulsion phase produces an unfavorable effect; more plants must be tested for the detection of a plant with the targeted marker genotype.
In the genome of a plant selected in each stage, markers that were newly incorporated via the latest crossing should be linked in the repulsion phase with markers that had been incorporated in any stages before the latest, whereas all markers incorporated in any stages before the latest should be linked in coupling phase. In the crossing schedules discussed in this paper, coupling linkages persist and accumulate with advance of the stages, whereas any repulsion linkage occurs in only one stage because it is converted to a coupling linkage after one round of marker selection. Although the occurrence of repulsion linkage cannot be avoided for the markers that are located on the same homologous chromosomes of different donors, its disadvantage could be minimized with a properly modified disposition of the donors.
In the presence of a repulsion linkage between two markers of a recombination value of r, the segregation portion of the targeted marker genotype decreases to 2r relative to that expected under independent inheritance of the two markers (r = 0.5) and, consequently, 1/(2r) times as many plants as tested under independent inheritance must be tested. The ratio 1/(2r) takes a value of 5.517, 3.033, and 2.216 with a map distance of 10, 20, and 30 cM, respectively, indicating that even a moderate repulsion linkage invokes a substantial disadvantage (increase in tested plants). More plants are tested in later stages because targeted markers increase with advance of stages. Therefore, when evaluated in the absolute number of tested plants instead of a ratio value as given above, a repulsion linkage with the same strength invokes a heavier disadvantage when it occurs in a later stage. On the other hand, a repulsion linkage, once converted to the coupling phase after one round of marker selection, contributes to reducing tested plants in all subsequent stages. Therefore, the disadvantage of a repulsion linkage can be minimized and best recovered when it occurs in an earliest possible stage, leading to a guideline that a crossing that invokes a strong repulsion linkage should be performed as early as possible. Markers located on the same homologous chromosome of the same donor cause no disadvantage because they are linked in the coupling phase in all stages. A pair or set of markers tightly linked in the coupling phase could be treated like a single marker.
Our calculations, though omitted here for brevity, also showed that a quite heavy disadvantage is invoked when repulsion linkages occur in duplicate in the genome of a selected plant, although the disadvantage is much mitigated when the repulsion linkages coexist with coupling ones. The occurrence of duplicate repulsion linkages can be managed via modifying the disposition of donors. A disposition with a minimum occurrence of duplicate repulsion linkages should be chosen.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
The disposition of donors should be modified when some markers are possessed by two or more donors (redundant markers) or located on the same homologous chromosomes of the same or different donors (linked markers). Guidelines for the modification are as follows.
Our discussion has been based on some assumptions for brevity. Modifications may occur if the assumptions are relaxed, some of which are as follows. The comparison among schedules in category BF was based on the assumption that five generations of crossing with a recipient line V (n = 5), i.e., one generation for the initial crossing between a donor line Mi and recipient line V and four subsequent generations of backcross, are used to recover the genetic background. Fewer generations of backcross may be sufficient when the recipient line V and donor lines Mis are relatively closely related. Calculations with fewer generations of the crossing showed that ETN does not noticeably decrease compared with those calculated before (data not presented), reflecting the fact that the number of plants tested in the course of backcross does not occupy a major part of ETN in schedules of BF. Therefore, our guidelines obtained with n = 5 need not be modified.
By selection for the background markers in the course of backcrossing (Tanksley et al., 1989; Bouchez et al., 2002; Frisch, 2004), we could save time. However, more plants need to be tested for detecting plants with minimum linkage drags as well as higher cost for genotyping many background marker loci. The advantage of background selection should depend on the circumstances. In any case, the superiority of the schedule will be the same whether or not the background selection is considered because the superiority is determined by the crossing schedule after the backcross.
ETN will be good to compare the efficiencies of different crossing schedules, but it does not necessarily indicate a sufficient number for the success of a gene-pyramiding project. The number of plants (x) tested in a generation of selection is subject to a high magnitude of chance fluctuation [x fluctuates with variance (1 ' p)/p2], and many more plants than the expected number E(x) (= 1/p) may be tested until finding a plant with the target marker genotype. Therefore, sufficiently more seeds (embryo plants) than 1/p should be taken to guarantee the success of selection in each generation, although not all of these seeds are actually used for the marker test. To ensure at least one seed carrying an embryo plant with the target marker genotype with a probability P, as many seeds as ln(1 P)/ln(1 p) must be taken; this is approximately ln(1 P) times as many as 1/p [note that ln(1 p) can be approximated to 'p when p is sufficiently small]. In some plant species and/or inappropriate cultural conditions, sufficient seeds may not be obtained when only one plant is selected, and then, two or more plants with the target marker genotype must be selected. When selecting two such plants from a population, the expected number of tested plants is 2/p [because the probability of x plants being tested until finding two plants with the target marker genotype equals (x ' 1)(1 ' p)x 2p2, the expectation of x is given by p2
x(x1)(1p)x2, which equals 2/p], indicating
that ETN and ANP are two times as large as the values calculated before. However, the superiority of the schedule is the same whether based on 1/p or 2/p, and our guideline principles derived before hold without any modification.
A useful trait allele may be linked with a null marker (de Moraes et al., 2006). Because null markers cannot be screened in a heterozygous state, at least two generations are necessary for detecting a plant carrying the target allele, which requires a longer time and higher expenses. To minimize this disadvantage, a donor carrying an allele linked with a null marker should be taken into the crossing schedule in a latest possible stage.
The efficiency of marker-based gene-pyramiding will decrease substantially unless the markers are perfectly or tightly linked with useful trait genes. Association of a marker with a trait allele and consequently the reliability of the marker-based selection decreases with increasing cycles of meioses. Very simply, with a recombination value of r between a marker and trait allele, the probability of this linkage being maintained unbroken across s cycles of meiosis is equal to (1 r)s. To keep this probability higher than a certain critical value, say P, s must not exceed lnP/ln(1 r), suggesting that a phenotypic test should be performed every s generations of selection to confirm the persistence of the initial linkage. Or, to maintain this linkage across s generations with a probability higher than P, the recombination value r must be lower than 1 P1/s. The critical linkage strength under some typical conditions of P and s is presented in Table 4 in relation to the map distance (in centimorgans) and recombination value (r), showing that a linkage as close as 1 cM can be maintained across five generations with a probability higher than 0.95, whereas the linkage must be extremely tight to be maintained with a probability higher than 0.99. The reliability of marker selection could be improved if flanking markers are considered (van Berloo and Stam, 1998; Frisch et al., 1999); this, however, requires more plants to be tested and higher cost per plant. It is imperative to explore markers that are tightly linked with useful trait genes.
|
Received for publication June 29, 2006.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
T. Ishii, T. Hayashi, and K. Yonezawa Optimization of the Marker-based Procedures for Pyramiding Genes from Multiple Donor Lines: III. Multiple-Gene Assemblage Using Background Marker Selection Crop Sci., November 24, 2008; 48(6): 2123 - 2131. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Ishii and K. Yonezawa Optimization of the Marker-Based Procedures for Pyramiding Genes from Multiple Donor Lines: II. Strategies for Selecting the Objective Homozygous Plant Crop Sci., September 1, 2007; 47(5): 1878 - 1886. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| The SCI Journals | Agronomy Journal | Vadose Zone Journal | |||
| Journal of Natural Resources and Life Sciences Education |
Soil Science Society of America Journal | ||||
| Journal of Plant Registrations | Journal of Environmental Quality |
The Plant Genome | |||