|
|
||||||||
a Colegio de Postgraduados, Km 36.5, Carretera México-Texcoco, Montecillo, Edo. de México, México
b Biometrics and Statistics Unit, International Maize and Wheat Improvement Center (CIMMYT), Apdo. Postal 6-641, México DF, México
c Universidad Autónoma de Chapingo, CP 56230, Km 38.5, Carretera México-Texcoco, Chapingo, Edo. de México, México
d Colegio de Postgraduados, Km 36.5, Carretera México-Texcoco, Montecillo, Edo. de México, México
* Corresponding author (j.crossa{at}cgiar.org)
| ABSTRACT |
|---|
|
|
|---|
| INTRODUCTION |
|---|
|
|
|---|
, is the regression of g on p. This regression should be a good predictor of g, so that individuals with the highest values of
can be selected (Bulmer, 1980). A function of the observed phenotypic values p such as E(g/p), which is used to rank and select the candidate individuals, is called a selection index (SI). Selection indices were originally defined by Smith (1936) as a linear combination of the observed phenotypic values of the expression of traits of interest and are generally used to discriminate among selection units by taking into account both the genetic and statistical structure of the population from which the genotypes originated, as well as the economic importance of the trait(s). Thus, when evaluated, only those individuals predicted to have progeny of superior economic value are reproduced (Quinton and McMillan, 1995).
Selection index applications are of two types. One is single trait improvement, where it is possible to increase selection efficiency by incorporating information into the SIs about traits related to the trait of interest (Wei et al., 1996; Falconer and Mackay, 1997). The other is multiple trait improvement, which requires assignment of relative economic weights to the different traits. Determination of appropriate economic weights for different traits can be difficult; therefore, modified indices, such as the base index, modified base index, and nonweighted multiplicative index, have been proposed (Tallis, 1962; Williams, 1962; Elston, 1963; James, 1968; Baker, 1986).
The Smith index (1936), also called the optimum SI (Bulmer, 1980; Van Vleck, 1993), takes into account both heritability and genetic correlation between traits when assigning economic weights. Among its major advantages are (i) it assigns higher weights to traits whose differences are genetic; and (ii) it is relatively simple to analyze. Its disadvantages are (i) it requires large amounts of information; (ii) economic weights are difficult to assign; and (iii) sampling error can be large (Bulmer, 1980; Bernardo, 2002). Also, the statistical sampling properties of the Smith SI and its response to selection (R) are unknown except in the case of two traits (Haye and Hill, 1980). Even for two traits, the sampling properties of Smith's SI and its R, found by Harris (1964) using the delta method, are not easy to evaluate.
To generalize Smith's methodology (1936), Kempthorne and Nordskog (1959) proposed an index that set restrictions based on a predetermined improvement level. Henderson (1963) proposed a method that not only eliminates economic weights but also uses the SIs in the context of noninbred populations. On the basis of a linear mixed model, Henderson (1963) demonstrated that it is possible to predict the selection criteria, which he defined as an SI's particular realization. One of the problems with Henderson's procedure is the large amounts of information required to estimate the biometric parameters of interest and the complexity of its application in the context of plant breeding. None of the above-mentioned studies considers the possibility of applying Smith's (1936) basic SI concepts to develop a method that would be simple to implement while maintaining mathematical rigor.
Recently, Cerón-Rojas and Sahagún-Castellanos (2005) developed an SI based on principal components analyses of the phenotypic variancecovariance matrix of the traits, where the first principal component is used as the only SI criterion and no assignation of economic weights is required. The elements of the eigenvector associated with the first eigenvalue determine the proportion of the trait that contributes to the SI. However, the specific assumptions that were made led to inconsistencies that produced an overestimation of the selection response.
The objectives of this study were (i) to propose a methodology for constructing a SI based on eigenanalysis of the phenotypic variance-covariance (or correlation) matrix of the traits of interest (hereafter called ESIM, for eigen selection index method) and show that ESIM does not require economic weights and genotypic variances-covariances; (ii) to demonstrate that although the methods of Cerón-Rojas and Sahagún-Castellanos (2005) and ESIM use the first eigenvalue and its associated eigenvector to construct an SI, they have different assumptions and therefore produce different estimates of the selection response; (iii) to show the statistical properties of ESIM estimators and the selection response (R) obtained from ESIM; and (iv) to use ESIM for constructing two functions, one for estimating selection gains (or losses) between selection cycles and the second for predicting R in future selection cycles. The main theoretical results are applied to three data sets for practical evaluation of the properties of ESIM estimators and for predicting the selection response and estimating the gains between selection cycles obtained on the basis of ESIM.
| MATERIALS AND METHODS |
|---|
|
|
|---|
![]() | [1] |
' = [
1...
q] is the vector of economic weights of the genotypic values, which, according to Smith (1936), breeders could determine on the basis of their experience and is considered a vector of constants.
Smith (1936) modeled the pj (j = 1, 2,..., q) phenotypic values as pj = gj +
j where gj is the genotypic value of the jth trait and
j is the environmental component affecting that trait. Smith (1936) assumed that the interactions between gj and
j can be considered a random effect and that gj represents only additive effects such that Z =
'g denotes the breeding value (Hazel, 1943; Kempthorne and Nordskog, 1959). Under these assumptions, selection based on Y = ß'p leads to a selection response (R) equal to ßYZD (Kempthorne and Nordkog, 1959), that is,
![]() | [2] |
'
ß/ß'Sß is the proportion of D that is expected to be realized when selection is applied (Holland et al., 2003);
and S are the variancecovariance matrices of genotypic and phenotypic values, respectively;
'
ß is the covariance between Y and Z [Cov(Z, Y)], and ß'Sß is the variance of Y (
Y2). Another way to write R is
![]() | [3] |
Y is the standardized selection differential,
Z2 =
'
is the variance of Z, and
YZ is the correlation between Y and Z;
and ß have been defined in Eq. 1; and
, S, ß'Sß and
'
ß were defined in Eq. 2.
On the basis of the observed vector p, Smith (1936) proposed maximizing R from Y = ß'p by finding partial derivatives of the natural logarithm (ln) of
YZ [ln(
ZY)] with respect to ß, such that
![]() |
![]() | [4.1] |
YZ as
![]() | [4.2] |
'
ß was considered to be constant and the coefficient of correlation, invariant to changes in scale (Kempthorne and Nordskog, 1959). Equation [4.1] is Smith's fundamental result and provides the basis of standard SI theory. Note that because ß'Sß/
'
ß is a constant, it can take any value within the domain of the real numbers including 1.0 (Henderson, 1963). Thus, the assumption ß'Sß =
'
ß and its consequences are valid.
Estimating ßS, 
, and RS of the Smith's Method
The Smith (1936) method estimates ßS directly from Eq. [4.2] as
S =
1
, where
1 is the inverse of
,
is an estimate of
, and
depends on the experience of the researcher. The estimate of RS is
S = k
. Since the probability densities of
S, and
S are unknown their sampling properties cannot be determined.
Eigenanalysis Selection Index Method (ESIM)
Assuming that ß'Sß =
'
ß, Cerón-Rojas and Sahagún-Castellanos (2005) showed that Eq. [3] can be written as R = k
and that maximizing R is equivalent to maximizing ß'Sß. This led to the equation (S
I)ß = 0 where ß and
are the eigenvector and eigenvalue of S, respectively. Therefore, ß'Sß =
, and R = k
. In contrast, the ESIM does not make the assumption that ß'Sß =
'
ß, but it only assumes that the genotypic variancecovariance matrix multiplied by the vector of economic weights is equal to the eigenvector of the phenotypic variance covariance matrix, that is, 
= ß. This assumption implies (under the restriction ß'ß = 1.0) that
=
and, therefore, Eq. [4.1] takes the form Sß =
ß, from which
![]() | [5] |
are the eigenvector and eigenvalue of S, respectively. Another way of obtaining the previous result is as follows. If 
= ß,
=
(or ß'Sß = 
'
ß), and
YZ =
=
, then by Eq. [3],
![]() | [6] |
Therefore, maximizing R is the same as maximizing the SI variance, ß'Sß. Following Anderson (2003), Cerón-Rojas and Sahagún-Castellanos (2005) derived ß'Sß with respect to ß and
, subject to the usual restriction used in eigenanalysis that the normalized eigenvectors are unit length (ß'ß = 1.0) and found that
![]() | [7] |
The only assumption made for deriving ESIM, 
= ß and its implication
=
, makes the approach of Cerón-Rojas and Sahagún-Castellanos (2005) for SI inconsistent because they imply that ß'Sß = 1.0, which makes the selection response equal to the selection differential D, that is, R = k
= k
= k = D (where k = D/
y).
The aim of the assumption 
= ß is to facilitate the theoretical development of ESIM. However,
and 
are point estimates of ß and 
, respectively. While
is the maximum likelihood estimate of ß, 
is an empirical estimate of 
because
depends on the experience of the researcher. Thus, the assumption 
= ß does not imply that 
=
but rather that 
.
Relation between the Smith SI and ESIM
Consider Eq. [3] and denote the response to selection and the SI of the Smith method as RS and YS, respectively. Using Eq. [4.2], the variance of YS is ßS'SßS = (S1
)'SS1
= (S1
)'I
, but since S and
are symmetrical matrices, ßS'SßS =
'
S1
. Similarly, the covariance between YS and Z is
'
ßS =
'
S1
. Therefore, the variance of YS and the covariance between YS and Z are the same and the response to selection of the Smith SI is RS = k
. If 
= ß (with ß
ßS), then ß'S1ß =
, where 1/
and ß are the eigenvalue and eigenvector of S1, respectively. Thus
![]() | [8] |
On the other hand, since ß'Sß =
, Eq. [6] is R =
, but because ß = 
, the response to selection of ESIM is
![]() | [9] |
The equality
= k
is valid because ß is the same eigenvector regardless of whether it is obtained from S or from S1; this is because Sß =
ß, then S1ß =
ß. Thus, when ß is computed from S it is associated with the first eigenvalue of S (
), but when it is obtained from S1, it is associated with the smallest eigenvalue of S1 (1/
).
Equations [8] and [9] show that the response to selection from the Smith's method and ESIM are the same. Also, because ßS = S1
, for 
= ßESIM, then ßESIM = SßS. Results from Eq. [8] and [9] can also be more formally derived by the Cauchy-Schwartz inequality (see Appendix).
Estimating Eigenvalues, Eigenvectors, ESIM, and the Response to Selection of ESIM
ß'Sß and ß'ß can be derived for ß'ß = 1.0 and (S
I)ß = 0 as long as (S
I) is a singular matrix, that is, the determinant |S
I| must be a function of
that equals zero. The equation |S
I| = 0 generates a polynomial with q roots
1,
2,...,
q, which are its potential solutions. The maximum likelihood estimators of the q eigenvalues and q eigenvectors are
i's and
i's, for which |
iI|
i = 0, i = 1,2,...,q, where
is an estimator of S (Anderson, 2003).These results allow writing the estimate of ESIM as
i =
i'p and the estimate of response to selection of ESIM as
ESIM =
.
Properties of
i,
i, and the estimator of ESIM (
i)
The maximum likelihood estimators of
i's and
i's are asymptotically consistent and unbiased, such that E(
i) =
i and, according to Anderson (2003)
![]() | [10] |
i) = ßi and for j
i,
![]() | [11] |
![]() | [12] |
;1 is Var(
1)
}
ßjßj' = 
and the covariance of
1 and
3 is Cov(
1,
3)
ß3ß1'.
Concerning the properties of the estimator of ESIM,
i =
i'p because Var(Y) = ß'Sß, an estimator of Var(
i) is
ar(
i) =
i'
i =
i. Furthermore, since
i is a maximum likelihood estimator, E(
i) = E(
i'p) = ßi'p = Yi (Mardia et al., 1982).
The correlations between Yi and the elements of the phenotypic vector p, according to Mardia et al. (1982), are
![]() | [13] |
i is the eigenvalue associated with ßi and sj2 is the jth variance in the diagonal of S.
Yipj thus allowing us to estimate the contribution of each trait to ESIM.
Properties of the Estimator of the Selection Response of ESIM (
)
To determine the properties of the estimator of the selection response obtained from ESIM, consider the asymptotic expected value [E(
YZ)] and variance [Var(
YZ)] of
YZ. If Y and Z have a joint normal distribution (Rahman, 1968), then
![]() | [14] |
Z are fixed, according to Eq. [9]
=
= k
Z
YZ. Thus, from Eq. [14], the expected value and variance of the estimator of the selection response are given in Eq. [15.1] and [15.2], respectively:
![]() | [15.1] |
![]() | [15.2] |
. The E(
) and Var (
) indicate that
is an asymptotically unbiased as well as a consistent estimator of R.
Additional criteria that can be used to characterize the quality of
are the sampling error (SE), the coefficient of variation (CV), the mean-square error (MSE), and accuracy (AC). The SE is the square root of Var (
). The CV is defined as the ratio of the SE to the expected value of the estimator: CV(
) =
= 
. The MSE is defined as the sum of the variance of the estimator and the square of the bias of this estimator. Thus MSE(
) = Var(
) +
. The AC is the bias divided by the sampling error and indicates how close the expected value of the estimator is to the population parameter. For
the expression of AC is AC(
) =
.
Estimating Selection Gains between Selection Cycles from ESIM
The following question may be of considerable interest to breeders: what is the gain (or loss) from one selection cycle to another or for a period of time involving several selection cycles? This implies estimating the selection gain (or loss) between two or more consecutive selection cycles as a function of the selection response in each cycle. Within the framework of ESIM, this can be answered as follows. Suppose f(
) = k/
is a continuous function of the random variable
in the interval [
j,
j+1] (for
j >
j+1), where
j and
j+1 denote the estimated eigenvalue of selection cycles j and j+1 and
j+1
j. A function F, such that
![]() | [16] |
) value indicates the importance of the change from one cycle to the next. Positive numbers denote selection gains, and negative numbers represent selection losses. Since the selection process will tend to reduce (at least partially) the traits' phenotypic variability, it is expected that the eigenvalue, which captures that variability, will also be reduced, the eigenvalues will be more similar, and Fj,j+1 (
) will tend to zero.
A Function for Predicting the ESIM Selection Response in Future Selection Cycles
A problem that has not yet been considered within the context of Smith's SIs (1936) is how to construct a function to predict the selection response based on results obtained at a given stage of the breeding process. Let us consider the estimator of the selection response
ESIM = k/
. Since k is constant, the only term that determines the variation of
ESIM is
. When 1.0 <
<
, suppose R0 = k1
ESIM, then R0 = 1/
, and we can construct the stochastic matrix T =
. A function with which to predict future realizations of
using
can be constructed on the basis of T.
Another way of writing T is T = Q
Q1, where
1* and
2* are eigenvalues of T, and Q is a matrix of eigenvectors ß1* and ß2* of T. The eigenvalues of T are obtained from the determinant |T
*I| = 0 as
![]() |
![]() |
The elements of eigenvectors ß1* and ß2*, associated with
1* and
2*, are
![]() |
According to the above results, the Q matrix is Q =
. In this case, Q1 = Q. Thus, for N generations,
![]() |
ESIM = 1/
, the function for predicting the selection response is
![]() | [17] |
In
(
), k is the standardized selection differential defined in Eq. [3]. It is expected that one or two selection cycles will be predicted with more precision than more distant selection cycles because as the value of
decreases in magnitude, the precision of the prediction will decrease. Note that k should be the same in selections cycles.
The Sign of the Scores of Eigenvectors of ESIM
The ESIM might assign a negative weight to an important trait such as grain yield; thus, individuals with low grain yield will be selected. In this case, the sign of the score of the first principal component (weight) must be changed so that individuals with high yield performance are selected. Consider any two phenotypic traits, then S =
and the eigenvalues are obtained of S as
![]() | [18] |
The elements of the first eigenvector (ß1) associated to the first eigenvalue (
1) are obtained from 
= 0 where 
= 0. The equation ß1 +
ß2 = 0 has an infinite number of solutions for ß1 and ß2, thus it is necessary to fix ß1 or ß2. For example, assume that ß2 = 1, then ß1 =
=
, and thus the numerator of ß1 can be positive or negative. If ß1 =
and ß2 = 1, then
![]() | [19.1] |
![]() | [19.2] |
However, if ß1 =
and ß2 = 1 then ß1 =
and ß2 =
. These results show that the sign of the ESIM can be arbitrarily changed without affecting the ESIM (however, the response to selection will be affected).
Data Sets
Data Set 1
This data set was used for checking the admissibility of the assumption made when deriving ESIM, that is, 
= ß. The data were taken from Becker (1985), where two traits measured in pigs (Sus domesticus) are considered: conversion of feed into weight (p1) and area of eye muscle (p2). Economic weights were given by the author. The vector of economic weights given by Becker (1985) is
' = [50 12], and the corresponding estimates of the variance-covariance matrices of phenotypic and genotypic values are
=
and
=
; thus, 
=
.
Data Set 2
This data set (from Manning, 1956) was used for computing the estimators of R from Smith's SI (1936) and determining the properties of
i's,
i's, and R from ESIM. It was also used for predicting R, for computing the total selection gains between consecutive selection cycles based on ESIM, and to exam the assumption of ESIM, 
= ß. The three traits measured in cotton (Gossypium hirsutum L.) were number of cotton balls per plant (V1), number of seeds per ball (V2), and number of lint per seed (V3), evaluated in seven annual selection cycles during 1949 through 1955 (Manning, 1956). Estimates of the phenotypic and genotypic (in parenthesis) variances of the three traits are given in Table 1.
|
) based on ESIM, the data from a CIMMYT maize (Zea mays L.) trial comprising 144 genotypes evaluated in one environment was used. The following 19 traits were measured: germination (ger); number of plants (plt); anthesis date (an); silking date (si); plant height (ph); ear height (eh); moisture (mo); root lodging (rl); stalk lodging (sl); % erect plants (pe); foliar disease ratio (fdr) measures the general health condition of the leaves on a scale of 1 to 5; number of plants harvested (phv); number of ears harvested (ehv); field weight (we); % ear rot (ert) measured on a scale of 1 to 5; ear rot (erot) measured in percentage; ear appearance (easp), agronomic scale (ags), which indicates the general agronomic condition of the plant measured on a scale of 1 to 5; and grain yield (yld). Ten percent selection pressure was used to estimate the ESIM selection response. | RESULTS |
|---|
|
|
|---|

= ß and the Estimators
and 

1 =
, and the normalized vector 
is (
)' = [
].
The maximum likelihood estimate of the first eigenvalue of
(Eq. [18]) is
1 = 9.025 and the elements of the eigenvector (Eq. [19.1] and [19.2]) associated with
1 are b1 = 0.054 and b2 = 0.998. In this case, 
indeed equals
ESIM.
For data set 2, Manning (1956) did not provide economic weights, but they can be estimated from
S =
1
as
=
1
S. Then the estimator of 
, in this case, is 
= 
1
S = 
S. Normalized and nonnormalized coefficients and estimated values of ßS are shown in Table 2. Differences between 
and
ESIM are small in some years and large in other years. The greatest discrepancy between 
and
ESIM occurred in years 1950 and 1953, where the estimate 
assigned the largest weight to variable V2 (and not to variable V1, as
ESIM did).
|
S =
1
, therefore the assumption 
= ß implies that the estimate of the eigenvector of S (
ESIM) and ßS has the relationships
ESIM = 
S and
S = S1
ESIM. This shows that the coefficients from Smith SI can be transformed to the coefficients of ESIM and vice versa. Therefore, if the previous implications are true this suggests that the assumption 
= ß is realistic. To demonstrate the validity of these relationships, consider data set 1 first. For 
=
and
1 =
then
S =
1
=
and therefore before normalization
ESIM* = 
S = 
=
so that
S =
1
ESIM*. A similar procedure can be used for data set 2 where the normalized eigenvector from ESIM can be transformed to the Smith SI coefficients and vice versa. Results from Table 2 indicate that the relationships are true, that is, values of 
(in parenthesis) of the first three columnsare similar to those obtained when transforming the Smith's SI to ESIM coefficients and the normalized Smith coefficients of the last three columns are similar to those obtained when transforming the ESIM coefficients to the Smith SI coefficients. The advantages of being able to transform the coefficients of ESIM into the Smith SI and vice versa are two-fold. First, the economic weights are not necessary for calculating
S (
S =
1
ESIM*). Second, it is unnecessary to estimate the genotypic variance-covariance matrix(
).
Estimating the Selection Response from ESIM
Denote the response to selection using the eigenvector of
and 
as
ESIM and
ESIM*, respectively. Using data set 1 and a selection pressure of 5% (k = 2.063), we obtain
ESIM =
=
= 0.687 and RESIM* =
=
= 0.687
Estimates of the selection response from Smith's SI (
S) and ESIM (
ESIM), and their proportions (
ESIM/
S), for data set 2 are shown in Table 3.
S was computed as
S = k
(with
S being a normalized vector) and ESIM,
ESIM =
(where
is the estimate of the largest eigenvalue of
). In 3 yr (1951, 1953, and 1954), the estimated selection response obtained from ESIM was greater than the response based on Smith's SI. In 1950, 1952, and 1955, the Smith's SI responses were greater than the ESIM response. Estimates of the selection response from
ESIM and <
ESIM* and their proportions (
ESIM/
ESIM*) show that except for year 1953, the ratio
ESIM/
ESIM* tends to fluctuate around 1.
|

= ß (and
=
) is generally reasonable, that is,
and 
are eigenvalues and eigenvectors, respectively, of the S phenotypic variancecovariance matrix. The assumption 
= ß places Smith's result (1936) within the context of the eigenvalues and eigenvectors of the phenotypic variancecovariance matrix, and it facilitates the estimation of SI without the need to estimate the genotypic variancecovariance matrix and the economic weights.
Estimating the ESIM parameters
and ß
From Table 1 we calculated the phenotypic variance-covariance matrix for the three traits in 1949. In this case, the first, second, and third eigenvalues are
1949 = 7.301,
1949 = 0.933 and
1949 = 0.040, respectively, and the three associated eigenvectors are
1' = [0.999 0.0190.013],
2' = [0.0200.955 0.096], and
3' = [0.0110.0960.955]. From Eq. [10] and [11], the estimated variance of the first eigenvalue
1949 is
ar(
1949) =
, while the estimated variance-covariance matrix of its associated eigenvector
1 is
ar(
1)

= 
.
Note that as n increases, the variance of the first eigenvalue
1949 and the variance of its associated first eigenvector
1' tend to zero.
Estimating Selection Gain Fj,j+1(
) between Selection Cycles
The eigenvalues and corresponding selection responses for the 7 yr are in Table 4. The selection gain between 1949 and 1950 can be calculated from Eq. [16] using a selection pressure of 5% (k = 2.063): F19491950(
) = 2(2.063)[
] = 2.30. However, a negative Fj,j+1(
) quantity would indicate a selection loss in selection cycle j+1 with respect to selection cycle j. For example, the selection gain from 1951 to 1952 is F19511952(
) = 2(2.063)[
] = 4.31, which is negative because
1952 >
1951 (Table 4). This means that selection in 1952 was less effective than in 1951. The total selection gain from 1949 to 1955 is Ft(
) = 6.41. The cumulated selection gain from 1949 to 1955 can be found as follows: F19491955(
) = 2(2.063)[
] = 6.41, which means that in 1955 there was 6.41 more selection advance than in 1949.
|
Predicting the Selection Response of ESIM in Future Cycles
To illustrate the use of Eq. [17], which was derived to predict the selection response in future selection cycles, we used the eigenvalue for the 1949 cycle (
= 7.301) to predict the value of the selection response in 1950, that is,
(7.301) = 
= 0.96, which is the same as the selection response from the ESIM of that cycle. The prediction for 1951 using
(4.600) (from 1950) was 1.03, and the actual value was 1.33. Note that in 1953, the eigenvalue was smaller than 1.0; thus, no prediction was possible. The low eigenvalue estimated in 1954 (1.84) did not allow
(1.84) to be a good predictor of 0.80, the value obtained in 1955.
It should be pointed out that the restriction 1.0 <
<
is necessary for computing
(
). Cases where 0 <
1.0 should be rare and will occur only when phenotypic variability has decreased, and thus selection will not be effective. Also, when
= 4.0,
(
) = k/2, and no prediction is possible.
Evaluating the Properties of the Selection Response of ESIM (
)
To measure, in practice, the quality of the estimator of the selection response (
), data set 3 was used. With a selection pressure of 10% (k = 1.755), the number of selected individuals with the highest ESIM values was n = 15 (Table 5). The estimated eigenvalue was 5.008 and the estimated selection response was 0.784. Using the 15 selected individuals, the expected values, variance, sampling error, coefficient of variation, mean square error, and accuracy of the selection response were computed as Ê(
) = 0.7836,
ar (
) = 1.9552,
E(
) = 1.3893,
V(
) = 1.120,
SE(
) = 1.9552, and ÂC(
) = 0.02, respectively. The bias of the selection response estimator was 0.03. Note how small the values of the estimated bias and the accuracy are, despite the fact that the sample includes only 15 individuals. This suggests that the selection response estimator is very accurate. To estimate
Z2 =
'
, we assumed that
was the eigenvector of
, so that
Z2 = 
, where 
= 9.082 is the eigenvalue of
.
|
|
| DISCUSSION |
|---|
|
|
|---|
The Sign of the Coefficients of ESIM
The contribution of the jth attribute to ESIM is given by the magnitude of the coefficient in ß, whereas the sign indicates the direction (positive or negative). The sign of the elements of ß does not affect ß'ß = 1.0 because this is a sum of square. The change in sign will not affect the variance of
ESIM.
Phenotypic and Genotypic Variances of the Traits and the ESIM
The variability of the traits directly affects ESIM because the method assigns more weight to traits with larger phenotypic variance (Tables 1 and 2). In the case of data set 2, large phenotypic variability is directly related to large genotypic variability, and therefore, ESIM gives more weight to traits with larger genotypic variability. In Table 1, variable V3 has very low phenotypic and genotypic variance; therefore, ESIM will assign less weight to this trait.
Heritability of the Traits and the ESIM
Traits with low genotypic and phenotypic variances can have high heritability. For example, heritability for V1 are 0.633, 0.2656, 0.537, 0.121, and 0.203 for years 1949, 1951, 1952, 1954, and 1955, respectively, and for V3 are 0.58, 0.84, 0.86, 0.37, and 0.344 for years 1949, 1951, 1952, 1954, and 1955, respectively. Except for the first case, heritability values of V1 are smaller than those of V3. However, the genotypic and phenotypic variances of V3 remains low and unchanged throughout the years. Because V3 has very low genetic variance, weights of the SI are low.
Effect of Genetic Correlations among Traits on ESIM
Assume that high phenotypic correlation indicates a high genotypic correlation and consider two traits. Then, S =
, where
is the phenotypic correlation, s1 and s2 are the standard deviations of the traits, and s12 and s22 the variances. The eignevalues of S are obtained from
= 
. When
= 0, S is a diagonal matrix and thus the eigenvalues are
1 = s12 and
2 = s22. From Eq. [19.1] and [19.2], the elements of the eigenvector associated to
1 (ß1) are b11 = 1, b12 = 0 and those associated to
2 (ß2) are b21 = 0 and b22 = 1. Therefore, when traits are not correlated, no selection index is required because genotypes are selected for each individual trait according to their phenotypic values. When
= 1,
1 = s12 + s22 and
2 = 0; therefore, all the information is associated to eigenvector ß1. Thus, indirect selection is recommended. When 0 <
< 1, a SI can be used and ESIM seems to be a good option.
Phenotypic Covariance Matrix versus Phenotypic Correlation Matrix in ESIM
The ESIM can be used with the covariance matrix or with the correlation matrix. When the covariance matrix is used, ESIM gives more weight to the traits with larger genotypic variances. When the correlation matrix is used, ESIM puts more weight on the traits that are correlated. Since eigenvectors obtained from a covariance matrix are not invariant to scale changes of the trait, ESIM obtained from the covariance matrix will be different than those obtained from the correlation matrix. From a practical point of view some traits are correlated; therefore, it will be appropriate to develop ESIM using the correlation matrix. Since eigenvectors are not invariant to the different scale of the measurements, using the correlation matrix is preferred to avoid these scale problems.
| CONCLUSIONS |
|---|
|
|
|---|

= ß and its implication
=
are admissible and lead to an accurate estimator of the selection response, assuming Y and Z have a normal distribution. The ESIM is easy to compute and does not require assigning economic weights to the traits. However, although the assumption 
= ß places Smith's result (1936) within the context of the eigenvalues and eigenvectors of the phenotypic variancecovariance matrix, when economic values are assigned on the basis of some (objective or subjective) criteria, the coincidences between the values of 
and ß cannot be expected to be consistent. Another advantage of ESIM is the possibility of computing selection gains between selection cycles and predicting selection response in future cycles. This gives the breeder a highly useful tool for planning a breeding program by providing relevant information on the potential genetic gains that will be achieved as the breeding process advances.
When evaluating the selection response, the highest eigenvalue should be used to estimate the response, and the eigenvector associated with this value should be used to develop the selection index. The eigenvalue should be higher than 1.0 for the prediction of the selection response to be effective. Finally, as the eigenvalue approaches zero, the expected values, bias, variance, mean square error, and accuracy of the selection response estimator will increase.
| APPENDIX |
|---|
|
|
|---|
(v'Av)(u'A1u) holds. Kempthorne and Nordskog (1959) proved that maximizing
ZY2 =
also maximizes R. Using equation Eq. [3], R can be written as R2 = k2
, such that maximizing R2 is equivalent of maximizing
. If we let 
= u, ß = v, and A = S, by the Cauchy-Schwarz inequality
'
S1
. This implies that the maximum is reached when
=
'
S1
, at which point R = k
. Received for publication November 16, 2005.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
J. J. Ceron-Rojas, F. Castillo-Gonzalez, J. Sahagun-Castellanos, A. Santacruz-Varela, I. Benitez-Riquelme, and J. Crossa A Molecular Selection Index Method Based on Eigenanalysis Genetics, September 1, 2008; 180(1): 547 - 557. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| The SCI Journals | Agronomy Journal | Vadose Zone Journal | |||
| Journal of Natural Resources and Life Sciences Education |
Soil Science Society of America Journal | ||||
| Journal of Plant Registrations | Journal of Environmental Quality |
The Plant Genome | |||