|
|
||||||||
a Dep. Genetica, ESALQ-Univ. Sao Paulo, Cx.P. 83, 13.400-970, Piracicaba, SP, Brazil
b Biometrics and Statistics Unit, International Maize and Wheat Improvement Center (CIMMYT), Lisboa 27, Apdo. Postal 6-641, 06600 Mexico, D.F., Mexico
jcrossa{at}cimmyt.mx
| ABSTRACT |
|---|
|
|
|---|
n). The importance of these two factors is enhanced as s increases. For arbitrary rates of selfing (0
s
1), under inbreeding equilibrium (IE) and with constant population size (n = N), Ne(v) can be adequately maintained through GC with a loss of
20% within accessions. For large sample size (n
), an accession loss of
33% can be recovered. For maintaining adequate Ne(v), artificial selfing followed by GC is more efficient than accession regeneration by natural reproduction. For achieving appropriate Ne(v)s, increasing the rate of self-fertilization in polymorphic materials makes collection more difficult but regeneration easier for minimal loss (
20%) within accessions.
Abbreviations: GC, female gametic control RS, random sampling IE, inbreeding equilibrium UI, unrestricted inbreeding
| INTRODUCTION |
|---|
|
|
|---|
Several kinds of effective sizes have been defined (Crow and Denniston, 1988; Frankel et al., 1995; Lindgren et al., 1996, 1997). One which is interesting in breeding programs and genetic conservation of species is the variance effective number Ne(v) which is determined primarily by the number in the offspring generation (Nt). A general equation for Ne(v) was developed by Crow and Kimura (1970). It considered the effect of drift on allele frequency when a sample of offspring is taken from a reference parental population. It was later extended by Crow and Denniston (1988). On the other hand, a simple expression that relates Ne(v) with the natural rate of self-fertilization (s) of species and that can be derived from Cockerham's (1969) intraclass correlation approach is given by Ne(v) = n(1 - 0.5s) (Gale and Lawrence, 1984). This practical expression can be applied to both infinite (germplasm collection) or finite (accession regeneration) reference population; however, it is only applicable under very specific circumstances such as, among others, those that avoid family structure of the sampled seed.
Vencovsky (1978) and Crossa and Vencovsky (1997) developed a model for computing the variance of the number of contributed gametes, V(k), and for calculating Ne(v) when allele frequency drift is viewed as occurring at two stages during sampling in monoecious species: sampling of parents from the reference population and sampling of gametes from these parents. In this model, a set of monoecious individuals is considered. Some of these contribute both female and male gametes, others contribute only male gametes, and others do not contribute any gametes at all. Using this model, Vencovsky (1987) and Crossa and Vencovsky (1994, 1997) developed formulas adapting Ne(v) to specific aspects of germplasm collection and regeneration.
One basic assumption underlying the computation of the variance of the contributed number of gametes, V(k), for two-stage sampling of monoecious species is that there is no covariance between the number of female and male gametes contributed per individual within a set of monoecious plants. This means that the model proposed by Vencovsky (1978) and Crossa and Vencovsky (1997) for estimating predictive equations of V(k) and Ne(v) excludes any artificial or natural self-fertilization above panmixia and thus covers a restrictive range of circumstances occurring in natural and experimental populations. It is well known that self-compatible species such as sorghum [Sorghum bicolor (L.) Moench], cotton (Gossypium sp.), coffee trees (Caffea arabica L.), eucalyptus (Eucalyptus sp.), several other forest species, etc. may have a sufficiently high rate of natural self-fertilization (s) to be considered in the category of species with mixed self and random mating. In nature, species may have a wide range of self-fertilization rates that vary from zero (strict allogamous) to one (strict autogamous) (0
s
1).
The genetic structure of a mixed self and random mating population is complex. Seeds collected from this type of species consist of a mixture of selfed (S1) seeds with proportion s, and outcrossed seeds with proportion 1 - s. Families collected from individual seed parents are intermediate between S1 and half-sibs families. The proportion s can be changed artificially by hand-pollination from s = 1 with self-fertilization of all plants to s = 0 when plants are intercrossed. On the basis of a one-locus model, Wright and Cockerham (1985) described the complex genetic structure of a population with a mixed mating system under inbreeding equilibrium. Plants from such a population will have different levels of inbreeding, depending on how many generations of selfing each one has in its genetic history or pedigree. The fixation index, as a population parameter, is an expected value that even with constant s for all plants, as usually assumed, will vary among individuals. This will affect the actual Ne(v) value or the exact amount of drift expected in a sample of seeds taken from a specific set of seed parents, especially when the number of seed parents is small. With molecular markers, the mating system of species can be studied and their natural rate of self-fertilization (s) and natural level of inbreeding (f) can be estimated and appropriate effective population sizes for mixed self and random mating species can be derived by considering the additional covariance between the number of contributed female and male gametes that arises from self-fertilization.
The objectives of this study were to: (i) develop a general model for obtaining predictive equations of V(k) and Ne(v) for mixed self and random mating species by extending the two-stage sampling model of Vencovsky (1978) and Crossa and Vencovsky (1997) to cases where 0
s
1; (ii) provide direct formulas for calculating V(k) and Ne(v) for mixed self and random mating; and (iii) adapt these formulas to specific aspects of germplasm collection and regeneration and discuss the effects of different levels of self-fertilization and different mating systems on the estimated values of Ne(v).
| Model for mixed self and random mating species |
|---|
|
|
|---|
|
is the total number of seeds collected from the set of P parents. When an equal number of seeds is taken from each seed parent (female gametic control) mi is no longer a variable and becomes fixed (m1 = m2 = · · · = mP = m) within the given set of P parents. The natural or artificial rate of self-fertilization (s) is assumed constant over parents, and ns and n(1 - s) are the expected totals of gametes contributed for selfed and crossed seeds, respectively. The overall total of contributed gametes is 2ns + 2n(1 - s) = 2n.
Proportions u = P/N (0 < u
1) and v = M/N (0 < v
1) correspond to seed and pollen parents, respectively. Ratio u will depend on the reference population considered and the specific circumstances of the sampling process. In germplasm regeneration, u should be near 1 but may be reduced because of poor stand caused by poor germination, insects, diseases, or other factors. When sampling from large natural populations, however, u may be near zero. In practice, it is difficult to know the number of pollen parents (M), particularly in germplasm collection, because it depends on factors such as wind, plant density, pollen dispersal, and pollen vectors. For collecting from large reference populations of panmictic, or mixed self and random mating species, it can be assumed that M
. This is an oversimplification and is considered here for the purpose of calculating an upper limit for Ne(v). For strict autogamic species (s = 1), however, M = P and u = v. In germplasm regeneration, M is finite and may be known.
Basic Equations, Notations, Definitions, and Assumptions
The basic expression of Crow and Kimura (1970) (p. 357, Eq. [7.6.3.26]) is considered, with
t - 1 = f as a measure of departure from Hardy-Weinberg equilibrium in the parental population (Generation t - 1)
![]() | (1) |
and
are the variance and mean of the number of contributed gametes in Generation t - 1. Wright (1969) ignored the Gaussian correction
and considered the ratio
instead of
. The use of s2k is relevant only when N is small. In the derivations given here the Gaussian correction is also ignored.
Coefficient f is the level of fixation (inbreeding coefficient) of the parental population (Generation t - 1), such that the expected genotypic frequencies (Q) for homozygous A1A1 and heterozygous A1A2 individuals are
and
, respectively (p1 and p2 are the frequencies of alleles A1 and A2, respectively).
If a population is in inbreeding equilibrium, the relationship between the natural rate of self-fertilization (s) and the natural level of inbreeding (f) is
(Haldane, 1924; Wright and Cockerham, 1985). Inbreeding equilibrium is meant for populations with zygotic proportions remaining constant over generations according to Wright's Equilibrium Law. It implies that the population has a past history of reproduction by a combination of outcrossing and selfing, under a constant rate. It also implies that the population is left to reproduce naturally, that is, no hand pollination or any interference is made. The inbreeding equilibrium condition in normal circumstances of germplasm collection and regeneration is a reasonable assumption for neutral alleles. As will be described later, however, inbreeding equilibrium is not a necessary condition in the mathematical derivation of V(k) and Ne(v) given here for mixed self and random mating.
For the derivation of V(k), means, variances, and covariances involving Variables m, a, c, and d (Table 1) are necessary. These quantities are herein represented by the following symbols. If x denotes any of these variables, then
, and V(x), E(x) =
are the variance and mean of x measured in relation to the set of P seed parents or in relation to the whole set of N initial individuals, respectively. Also V(x/mi), E(x/mi) are the variance and expected value of x, conditional to the number of seeds of the ith parent. For covariances the corresponding symbols are covp(x,y) or cov(x,y) when relative to P or N, respectively. Concerning the distribution and expectation of these variables, the following is assumed.
; when seeds are sampled randomly. This is a simplified assumption. The actual variance of m may be larger because of variation in fertility; also
.
, and
. For the total number of contributed gametes, the average is
relative to the initial set of N plants. It is also assumed that, within a set of P parents, there is no covariance between the number of male gametes (d) contributed for outcrossed seeds and the number of female gametes (a) contributed for selfed seeds and also no covariance between d and the number of female gametes (c) contributed for outcrossed seeds, or that covP(a,d) = cov(c,d) = 0. With female gametic control VP(m) = 0.
| Derivation of v(k) and ne(v) under mixed self and random mating for random sampling of seeds or female gametic control |
|---|
|
|
|---|
It can be shown that the variance of the number of contributed gametes divided by
, for random sampling of seeds (RS) under mixed self and random mating, is as follows
![]() |
![]() |
![]() |
Thus, for random sampling of seed (RS) and unrestricted inbreeding (UI), Eq. [1] becomes
![]() | (2) |
![]() |
Assuming inbreeding equilibrium (IE),
, Eq. [2] can be written as
![]() | (3) |
![]() |
For female gametic control (GC), that is taking an equal number of seeds from each plant, the basic scheme and the component terms of V(k) are similar to that shown for random sampling of seeds except that now m1 = m2 = · · · = mP = m. Consequently, the ratio
becomes
![]() |
Thus, Eq. [1] for female gametic control (GC) and unrestricted inbreeding (UI) becomes
![]() | (4) |
![]() |
Assuming inbreeding equilibrium (IE),
, Eq. [4] reduces to
![]() | (5) |
![]() |
Equations [2] and [4] are unrestricted with respect to s and f. They are, for instance, applicable when gametes are manipulated through hand pollination in regeneration practices. For a set of individuals with a given f value, s = 1 corresponds to selfing all plants to obtain S1 offspring while s = 0 corresponds to crossing plants with avoidance of selfing. If inbreeding equilibrium (IE) may be assumed, then Eq. [3] and [5] are adequate.
Equations [2] through [5] for computing Ne(v) are general; however, under more restricted conditions they can be adapted and evaluated for some practical situations encountered in germplasm conservation.
| Adaptation of ne(v) for germplasm collection and regeneration for arbitrary selfing rate |
|---|
|
|
|---|
Germplasm Collection
In germplasm collection the reference population is here considered to be of infinite size (N
) and the number of pollinator plants potentially large (M
N), such that v
1. The number of seed parents (P), from which the sample of n seeds is taken, is considered much smaller than the entire population (N) implying that u = P/N
0. Incorporating these assumptions into Eq. [2] through [5] leads to simplified expressions, as can be seen next.
For the case of random sampling of seeds (RS) and unrestricted inbreeding (UI), (RS - UI), Eq. [2] can now be written as
![]() | (6) |
Similarly, Eq. [3] (RS - IE) simplifies to
![]() | (7) |
With female gametic control (GC) and unrestricted f(UI), Eq. [4] becomes
![]() | (8) |
For the GC sampling scheme and with inbreeding equilibrium (GC-IE), Eq. [5] reduces to
![]() | (9) |
Comparisons of Eq. [6] with Eq. [8] and Eq. [7] with Eq. [9] indicate that effective size attained in germplasm collection with female gametic control is equal to or larger than the corresponding Ne(v) expected when seeds are sampled randomly. Some effective population numbers for seed collection obtained from Eq. [7] and [9] and assuming a reference population of infinite size under inbreeding equilibrium are given in Table 2 . Ne(v) is clearly reduced as s increases. In absolute values, the advantage of GC over RS of seed is also smaller with higher rates of s and becomes negligible when a larger number of seeds is collected per seed parent (n/P = 10 versus n/P = 50 in Table 2). With higher levels of self-fertilization, natural heterozygosity is reduced; consequently, a larger number of seed parents would need to be sampled to reach adequate effective sizes in germplasm collection.
|
varies from 2.86 to 0.46 under RS and from 3.08 to 0.50 with GC for s = 0 and s = 1, respectively. The highest possible average effective number
expected for panmictic species in Hardy-Weinberg equilibrium (s = f = 0) was not attained in this example because of the relatively small number of seeds assumed to have been sampled per parent. On the other hand, for s = 1, all parental individuals being homozygotes,
with female gametic control, which corresponds to the sampling of a single gamete from an idealized random mating population. Expressions [6] through [9] can be further adapted to the conditions of specific mating systems. Keeping the assumptions already mentioned for germplasm collection, expressions were obtained from Eq. [7] (RS-IE) and Eq. [9] (GC-IE), for panmictic, mixed self and random mating, and self-fertilizing populations.
Panmictic Species
When s = f = 0 expressions for effective number are for RS
and for GC
(from Eq. [7] and [9], respectively). These equations were also given by Crossa and Vencovsky (1994) for monoecious species.
Mixed Self and Random Mating (Intermediate) Species
Considering, for illustration, an intermediate rate of self-fertilization, s = 0.5, such that f = 1/3, expressions for effective number are for RS
and for GC
.
Self-Fertilizing Species
When s = f = 1 effective number are for RS
and for GC Ne(v) = 0.5P. This set of equations, for s = 0.0, 0.5, and 1.0, among other aspects, can be used for quantifying in what circumstances the advantage of GC over RS becomes negligible as sample size (n) increases. Note that for autogamic species (s = 1) and with GC, Ne(v) depends only on the number of seed parents (P), and not on the number of seeds collected (n). With sufficiently large sample sizes (n), Ne(v) becomes dominated by P for all mating systems. In fact as n
, the effective size tends to
(from Eq. [7] or [9], where
is the coancestry among sibs within maternal families for equilibrium populations (Cockerham and Weir, 1984). For s = 0.0, 0.5, and 1.0, this coancestry is
, and 1.0, respectively, and the corresponding Ne(v) limits are Ne(v) = 4P, (4/3)P and 0.5P. On the other hand, when only one seed is taken per parent (n/P = 1) the effective numbers obtained from Eq. [9] for s = 0.0, 0.5, and 1 are Ne(v) = P, (3/4)P, and 0.5P, respectively. Thus, for a fixed P, it is possible to increase Ne(v) by increasing n/P (n/P = 1 versus n/P
) when s < 1. However, for s = 1, increasing n/P does not increase Ne(v). Hence, in germplasm collection, the values of Ne(v) depend primarily on the number of parents (P) but also on the number of seeds collected per parent (n/P) when s < 1.
Note that for germplasm collection from an infinite base population (u
0), Ne(v) from Eq. [7] and [9] is equivalent to Ne(v) expressed as a function of intraclass correlations (Appendix 2) as given by Cockerham (1969).
Germplasm Regeneration
With N individuals of a given accession planted in the field, all potentially producing both male and female gametes, the reference population is of finite size. For the regeneration process, it may be assumed that only a fraction u (0 < u
1) of these N individuals contribute gametes to the next generation, such that P = M = uN and u = v. This condition may simulate the loss of seeds of the original accession because of germination problems or to any other factor affecting the viability of plants in the field. Some situations with practical applications will be discussed.
Inbreeding Equilibrium (IE) and Arbitrary Sample Size (n)
As already mentioned, when seeds are produced without interference in the regeneration process and the natural rate of self-fertilization remains unchanged, the assumption of inbreeding equilibrium is reasonable. Consequently, for the P = M = uN scheme and seeds sampled randomly (RS), Eq. [3] reduces to
![]() | (10) |
Similarly, for P = M = uN and female gametic control (GC) Eq. [5] becomes
![]() | (11) |
More optimistically, it may be assumed that the integrity of the original accession is maintained such that M = N (0
s < 1) or v = 1, but seeds are collected from P = uN plants only. Now, with random sampling of seeds (RS) Eq. [3] leads to
![]() | (12) |
![]() | (13) |
In the process of seed regeneration, with preclusion of hand pollinations, rate s is inherent to a given accession. Since accession size N is also given in advance, only sample size (n) and proportions u and v can be manipulated or monitored in a regeneration plan.
Examining Eq. [10] through [13], it is seen that Ne(v) increases linearly with sample size (n) when an accession is regenerated prior to any loss of viability and all original plants (N) can potentially contribute female and male gametes to the offspring (u = 1). In addition, for autogamic species (s = 1) and also u = 1, Ne(v) =
with GC, for any sample size (n).
For the P = M = uN scheme (Eq. [10] and [11]), allowing n to become large enough (n
), effective size, at this limit, is
for both sampling procedures (RS and GC). Hence, even when the quantity of regenerated seeds is very large, Ne(v) can be small and much less than the original census number N, the determining factor being the proportion of functional parents u or, more precisely, the ratio
. This ratio varies from
0 to infinity, for 0 < u
1, being, for instance, equal to 1 and 9 for u = 0.5 and u = 0.9. A nine-fold gain in representativeness of the regenerated seeds is, therefore, expected in this example (u = 0.9 vs u = 0.5). Because in many situations u can be equated with the germination rate of an accession, results point clearly toward the extreme importance of monitoring this rate in the management of germplasm banks. In terms of Ne(v) the reward for maintaining the proportion u at adequate levels is highly compensating and more than might be initially expected.
For the M = N and P = uN scheme, u is the proportion of plants chosen as seed parents. From Eq. [12] and [13] the upper limit of effective size, as n increases (n
), is now
, which is higher than that expected from Eq. [10] and [11] (for 0
s < 1) because of the larger pollen pool available when M = N. However, the ratio
plays the same role as that discussed for the P = M = uN scheme.
Equations [10] through [13] can be applied to answer questions that may arise in the management of germplasm banks. Consider, for instance, an accession originally containing N seeds. If this material is to be regenerated only when the germination rate drops to 80%, what sample size (n) would be needed to assure that effective size is maintained at least at a level of Ne(v) = N? This will depend on the natural rate of self-fertilization and the sampling procedure. Assuming inbreeding equilibrium and female gametic control, and taking u = v = 0.8; P = M = 0.8N, with Ne(v) = N, Eq. [11] furnishes n = N - 0.42
N, for panmictic species (s = 0). A sample of n = N offspring has an effective size of Ne(v) = N despite the loss of 20% of seeds of the original accession. This loss in the parental generation is being compensated for by female gametic control. Taking seeds at random from the P = 0.8N parents would require a larger sample, as given by Eq. [10], namely n = (4/3)N - 0.83
1.33N (for s = 0) for attaining Ne(v) = N.
Considering a mixed self and random mating species, the same situation discussed above would require n = 0.875N - 0.31
0.875N for s = 0.5 and female gametic control. Hence, the mixed mating system (intermediate s) favors regeneration, as a correspondingly smaller sample would be necessary for maintaining Ne(v) = N. Under s = 1 the value given by Eq. [11] is Ne(v) = 2N, regardless of sample size n. This last outcome, which is relative to a polymorphic accession of an autogamic species, shows that effective size is always kept at Ne(v) = 2N, even when the germination rate is reduced to 80% in the original accession. However, the condition is the practice of female gametic control and the collection of at least one seed from each one of the P = 0.8N parental plants. Note that, in this case, what is assumed is the regeneration of a polymorphic (heterogeneous) accession with complete natural self-fertilization (e.g., land race of an autogamic species with s = 1). This situation is entirely different from regeneration of pure lines (homogeneous accession, with s = 1) where the theory and application of the Ne(v) measure is not necessary.
Inbreeding Equilibrium (IE) and Constant Population Size (n = N)
In the maintenance of germplasm bank accessions through regeneration, the census number of the regenerated sample of seeds (n) may remain the same as the original number (N) for a given accession. That is, population size is kept constant from one generation to the next (n = N).
With n = N, expressions adapted for regeneration become simpler. Table 3 shows four such expressions (Eq. [14] through [17]) that assume inbreeding equilibrium. Equation [14] is obtained from Eq. [11] and the negative term from the denominator -(1 - s)/P has been ignored. Equation [15] is derived from Eq. [13] and the negligible negative term from the denominator that has been ignored is -(1 - s)/N. Equation [16] is calculated from Eq. [10] and the neglected term is -[(0.5s(1 + s) + 1]/P. Equation [17] is from Eq. [12] and the neglected term is -[(1 + s)2/(2P) + (1 - s)/(2N)]. The value of each neglected terms is near zero for large values of P and/or N that may be expected in practice. In Eq. [14] through [17], Ne(v) depends on three parameters: the natural rate of self-fertilization (s), accession size (N), and the proportion of functional parents (u) contributing gametes to the seeds.
|
1) and GC, panmictic or close to panmictic species (s
0) attained much lower values of Ne(v) than those expected for more autogamic species. However, panmictic species are less affected when accessions drastically deteriorate (low values of u; e.g., u = 0.5).
|
Because certain degrees of accession deterioration or loss can be recovered by gametic control, searching for a range of acceptable u values is meaningful. Consider the mating scheme where parents are P = M = Nu such that u can be taken as the germination rate. Now, if apart from the condition of constant population size, it is also required that Ne(v) = N = n, Eq. [14] can be solved for u, and the result being
assuming GC. An evaluation of this quantity gives u* = 0.80, 0.76, and 0.67 for s = 0.0, 0.5 and 1.0, respectively. Hence, a loss of 20% of seeds within accessions of species in general, seems to be acceptable if a standard procedure of female gametic control is adopted for regeneration. Accessions of monoecious species can also be regenerated through both female and male gametic control (plant to plant hand pollinations and equal number of seeds taken per plant) (Crossa and Vencovsky, 1994). Now, with n = N and Ne(v) = n, the resulting proportion is u* = 0.67 as for self-fertilizing species. Note that for strictly autogamic species, female gametic control implies control over both types of gametes. Another way to recover losses within accessions is through increments in sample size (n > N). As discussed in the preceding section, however, there are limits to this strategy.
The alternative mating scheme in which P = uN but M = N (Eq. [15] and [17]) is considered here for the purpose of investigating how the sampling of seed parents (P out of N) can affect effective size over the range of s values. Since, by assumption, no loss has taken place in the original accession, the potential contribution of the pollen pool is not negligible and is maximized when s = 0. However, Table 4 shows that when seeds are sampled randomly, maintaining Ne(v) = n requires very high u values, even for this extreme case (s = 0). Discarding, for instance, only 10% of the seed parents during seed regeneration (u = 0.9), the effective number will be Ne(v) = 97 for a sample of 100 seeds. These numbers are considerably reduced as the contribution of the male gametic pool is diminished when s increases. It is only with female gametic control that a certain quantity of seed parents can be neglected during seed harvesting in the regeneration process. In fact, with the requirement that Ne(v) = n, and with constant population size, Eq. [15] furnishes
. For s = 0.0, 0.5, 0.8, and 1.0 this proportion is u* = 0.5, 0.64, 0.66, and 0.67, respectively. Hence, under this specific mating structure (P = uN and M = N) and with female gametic control, discarding at most 1/3 of the seed parents when seeds are harvested will maintain Ne(v) as large as or greater than the census number, for species in general.
As already mentioned, these two mating schemes, P = uN and M = N versus P = M = uN, are the same when the species is autogamic. Therefore, Eq. [14] and [15] are equivalent when s = 1 as are Eq. [16] and [17] (Table 4).
Ne(v) for Germplasm Regeneration through Selfing
Selfing can be an option for germplasm regeneration, for species that naturally self-fertilize, and more generally, when field isolation in time or space is not possible. For species with high rates of cross-fertilization, this alternative has one obvious and serious restriction, namely inbreeding depression, although it still can be recommended, in such cases, for retaining useful alleles carried by specific accessions.
With artificial selfing, Ne(v) can be computed by considering that in a base population (accession) of size N, P = Nu plants are selfed and n seeds are subsequently collected. Assuming, for simplicity, a constant population size (n = N) and that an equal number of seeds is collected in each S1 progeny, Eq. [4] can be applied, considering s = 1. The resulting expression is then
![]() | (18) |
As u approaches 1, Ne(v) (Eq. [18]) tends to 2N, 4N and
for f = 0.0, 0.5, 1.0, respectively. For comparison with previous results, consider a reference accession under Hardy-Weinberg equilibrium (f = 0). Equation [18] now simplifies to
. This now refers to n = NS1 offspring and is identical to that shown by Crossa and Vencovsky (1994)(Table 1) for biparental crosses and gametic control. This equivalence is expected when effective number is derived on the basis of gene frequency variations due to random sampling. With selfing and collecting progenies of equal size, an equal number of both female and male gametes is contributed by parents to the selfed offspring as with full-sib families of equal size. Hence, when control is exerted on both types of gametes, the genotypic constitution of the offspring generation is irrelevant for measuring Ne(v).
A particular feature of Eq. [18] is detected when the condition Ne(v) = n is imposed. This condition is met when u = 0.67, meaning that selfing 0.67N or more plants of an accession, regardless of the mating system of the species, will assure that Ne(v)
N at the offspring generation when population size is kept constant (n = N) and an equal number of seeds is collected per family.
The effectiveness of regeneration through artificial selfing may be questioned for species with a high rate of natural self-fertilization. This suggests comparing this alternative (Eq. [18]) with regeneration by natural reproduction (NR) or open pollination. Expressions [18] and [14] can be used for an insight but recalling that both equations are for constant population size and gametic control. Selfing is expected to be superior to the NR alternative. However, this advantage becomes negligible as u decreases, but it is pronounced for higher values of u (Table 5)
. It is noteworthy that under inbreeding equilibrium a f = 0.9 value, for instance, corresponds to a natural rate of self-fertilization of
, although selfing accessions is expected to be a better option than allowing plants to reproduce naturally for regeneration under constant population size. However, decisions regarding this alternative will primarily depend on anticipated depression from selfing, the viability of the procedure, and costs incurred.
|
| Discussion |
|---|
|
|
|---|
, Eq. [7] and [9] lead to
. This ratio is not affected by the level of inbreeding of the collected seeds because, for sufficiently large n, Ne(v) is dominated by the within families coancestry (
m). Assuming s = 0.5 and s = 1, Ps/P0 = 3 and 8, respectively. This result indicates that to attain the same effective size as expected for a panmictic species (s = 0), three or eight times more seed parents would be necessary in the sampling process, respectively for these two levels of s.
In practice, it is not always reasonable to assume a very large n or a high number of seeds per parent (n/P). Hence consider, alternatively, that the set of n seeds is sampled to maximize the number of parents, such that n/P = 1. In this circumstance the family structure of the seeds is avoided because a single seed is sampled from each seed parent. Now Eq. [9] reduces to
![]() |
m and 1 + f depending on the structure of the sample or, basically, on the n/P mean number. Expression Ne(v) = n(1 - 0.5s), or its corresponding Ne(v) = n/(1 + f) which can be derived from Cockerham's (1969) intraclass correlation approach (Appendix 2), assuming m = 1 and P = n and inbreeding equilibrium, is only applicable, however, under very specific circumstances, as can be shown by imposing further restrictions to some of the more general equations developed in this study. To illustrate this the following three cases are examined.
Case 1
Germplasm is collected from an infinite base population and a random sample of n seeds is obtained from a potentially large number of seed parents. In this case, if P
and M
(which excludes populations with s = 1 for which M = P; M, however, now becomes negligible since 1 - s = 0), Eq. [3] can be written, under inbreeding equilibrium, as
![]() |
Case 2
Similar to Case 1, but assuming that only M
, P is finite. With u
0, Eq. [5] becomes
![]() |
Case 3
Consider regeneration of germplasm. A random sample of n seeds is taken from a finite reference population and population size is kept constant (n = N). Now, with P = M = uN scheme (Table 3, Eq. [16]), Ne(v)
nu(1 - 0.5s), which is equivalent to the previous equations (Cases 1 and 2) when no losses took place in the original accession (u = 1).
It is interesting to note that under these specific conditions an equivalent amount of change in gene frequency due to drift is expected to occur in both germplasm collection and accession regeneration. In other words, Ne(v) attained when a sample is regenerated assuming u = 1 and random sampling (Case 3) is the same as that expected when germplasm is collected and seeds are sampled from an infinitely large number of seed and pollen parents (Case 1), or when sampling of seeds is with female gamete control, the set of seed parents (P) is finite, and a single seed is taken from each one (Case 2). However, despite the different applications of
, it should be used only under the conditions previously discussed. Equations developed in this paper, for the two-stage mixed self and random mating model are more general and have sufficient flexibility to cover a wider array of situations.
Plants from a population with a mixed mating system will have different levels of inbreeding (Wright and Cockerham, 1985). A given plant x, having t generations of natural selfing in its pedigree, will have
and its occurrence in the population is given by the probability Px = (1 - s)st. Even with constant s for all plants, as assumed, fx will vary among individuals with variance
(Cockerham and Weir, 1984). With s = 0.5 this variance is
with a standard deviation of
, which are quite large values. Within such a population, heterogeneity of the fixation index will certainly affect the actual amount of drift expected in a sample of seeds taken from a specific set of P seed parents (especially when P is small). Equations derived in this study for measuring Ne(v), therefore, should be taken as expected values with a reliability that increases as P becomes larger.
Genetic markers have been used extensively to study the mating systems of species. This allows estimation of parameters that are required for calculating Ne(v) in connection with conservation and preservation activities. In the estimation process, sampling errors inherent to
,
, or other quantities will affect the precision of computed
e(v) values. Data provided by Santos (1991) on oil palm (Elaeis guineensis Jacq.) are used here to illustrate this point. These data refer to a set of n = 97 trees, randomly sampled from a large local population and genotyped for 9 isozyme loci. Estimates of f were inconsistent among loci, ranging from
= -0.069 to
= 0.381, with an average, pooled over loci, of
= 0.145. This last estimate leads to
, for the sample of n = 97 trees. This estimate is valid only if it can be assumed that all sampled trees are genetically unrelated. The error associated with
, using bootstrapping over loci, is
, giving 95% confidence limits for Ne(v) of 79.3 and 91.5. The heterogeneity of
among loci is responsible for this relatively large confidence interval. Thus, it is evident that estimates of f based on individual loci are not very informative, and that for more precise estimation of Ne(v), a larger number of loci would be needed.
| Conclusions |
|---|
|
|
|---|
For germplasm collection, results showed that Ne(v) is reduced as s increases. This increase in difficulty for collecting from more autogamic species applies to both GC and RS. Even though GC always provides higher Ne(v) values than RS, this difference becomes negligible when seeds are collected from a smaller number of seed parents (higher n/P). Thus, the important factor in seed collection is primarily the number of seed parents (P). The number of seeds collected per parent n/P is less important and becomes irrelevant as s approaches 1. On the other hand, as s approaches zero, increasing n/P, for a given value of P, will increase Ne(v) up to a limit of 4P (n/P
).
In accession regeneration, the two major factors for maintaining high values of Ne(v) are (i) practicing female gametic control, and (ii) maintaining the integrity of the accession (avoiding loss) at acceptable levels. These factors become even more important as s increases. As opposed to germplasm collection, it is easier to regenerate more autogamic accessions, assuming GC, IE, and high u because values of Ne(v) can be maintained above N. However, when seed accessions deteriorate severely or experience losses (low u), Ne(v) decreases more when the accession becomes increasingly autogamic. That is, more panmictic species are less affected by drastic decreases in the proportion of functional parents. Thus, for achieving appropriate Ne(v)'s, increasing autogamy in polymorphic materials makes seed collection more difficult, but accession regeneration much easier, provided excessive loss of accessions is avoided (u
0.80).
In germplasm regeneration, loss of up to 20% within accessions can be recovered by practicing female gametic control. Thus, a seed viability of at least 80% is a safe lower limit for u. This study showed that artificial selfing followed by female gametic control maintained higher Ne(v) than natural reproduction. This approach can therefore be considered an option for regenerating accessions.
The equations developed in this study for estimating V(k) and Ne(v) of mixed self and random mating species should be taken as approximations; their accuracy depends on the number of seed parents sampled. Other factors, not considered in the model developed in this study, that can give rise to inaccurate estimates of Ne(v) are (i) plant-to-plant variation of the rate of natural self-fertilization caused by uncontrolled factors, (ii) different plant fertility and female and male gametes reproductive success (Morgan, 1998), and (iii) differential mating rates between plants inside the local neighborhood and outside the local neighborhood. Further investigation, based on simulation or on Bayesian statistics, are possible means for clarifying some of these issues.
It is worth noting that two very distinct approaches for estimating Ne(v), namely Eq. [7] and [9] for germplasm collection and Cockerham's (1969) intraclass correlations led to the same effective size expressions, for sampling from an infinite base populations in inbreeding equilibrium. This equivalence reinforces the adequacy of the model developed in this study.1
| NOTES |
|---|
|
|
|---|
Received for publication May 12, 1998.
| Appendix 1 |
|---|
|
|
|---|
Crossa and Vencovsky (1997) demonstrated that this splitting was adequate to represent the two stage sampling process.
![]() |
and dividing by
![]() |
V(a). Variable a is not necessarily binomial, within set P. However, it is binomial when conditional to mi. Therefore,
![]() |
This is not a binomial variance, except when s = 1.
The variance of a relative to
![]() |
and dividing by
![]() |
V(d). The fact that variable d is not correlated with a and c within set P means that it is not correlated with the number of seeds sampled (m). This allows the assumption that d is a binomial random variable within the set of M parents. This assumption of no correlation is realistic; it is reasonable to assume that the outcome of taking a smaller or larger number of seeds from a given parent has no relationship to the number of contributed male gametes that plants spread over a field and that ultimately generate crossed seeds produced by other plants in the area. Therefore, under these assumptions
![]() |
![]() |
![]() |
and dividing by
![]() |
2cov(m,a). 2cov(m,a) = 2cov[(a + c),a] = 2V(a) + 2cov(a,c). The first term of the right hand side is already known. For cov(a,c) consider that mi = ai + ci, such that within set P 2covP(a,c) = VP(m) - VP(a) - VP(c). Recalling that
and that
, then
.
Observe that a and c are negatively correlated within P (for 0 < s < 1). The covariance between a and c relative to N is
With
and
![]() |
With
and dividing by
![]() |
Adding the term
![]() |
![]() |
![]() |
Because it was assumed that covP(a