|
|
||||||||
a Facultad de Agronomia, Univ. de la Republica Oriental del Uruguay, Garzon 780, Montevideo, Uruguay
b Biometrics and Statistics Unit, CIMMYT, Apdo. Postal 6-641, 06600 Mexico DF, México
* Corresponding author (j.crossa{at}cgiar.org)
When evaluating genetic resources, data on continuous and categorical attributes of accessions is collected. The Ward method and the modified location model (MLM) (Ward-MLM strategy) classify genetic resources using a mixture of categorical and continuous variables and combine all the categorical variables into one multinomial variable (W) that is assumed to be statistically independent from the continuous variables. The main objective of this study was to examine the robustness of the MLM for recovering underlying true subpopulations when independence between variable W and the continuous variables does not hold. In addition, several scenarios, based on different degrees of overlap of the continuous and discrete variables in simulated data sets, were generated. Results showed that when the subpopulations were well-differentiated, the Ward-MLM strategy effectively predicted the true number of subpopulations and fully recovered their structure, regardless of the level of dependence between the W variable and the vector of continuous variables. When the subpopulations showed unclear boundaries and a high degree of overlap in the W variable and in the continuous variables, the Ward-MLM strategy predicted a different number of subpopulations but fully recovered the composition of the subpopulations. In this case, new groups are formed that show a balanced and consistent structure in their composition as compared with the subpopulations. The MLM proved to be a robust model under medium and strong dependence between the variable W and the vector of continuous variables and under various kinds of overlapping between subpopulations with respect to the continuous and discrete variables.
Abbreviations: Can-r, canonical correlation EM, expectation maximization GM, Gaussian mixture model LM, location model MLM, modified location model
This article has been cited by other articles:
![]() |
J. Franco, J. Crossa, and S. Desphande Hierarchical Multiple-Factor Analysis for Classifying Genotypes Based on Phenotypic and Genetic Data Crop Sci., December 30, 2009; 50(1): 105 - 117. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Franco, J. Crossa, M. L. Warburton, and S. Taba Sampling Strategies for Conserving Maize Diversity When Forming Core Subsets Using Genetic Markers Crop Sci., February 24, 2006; 46(2): 854 - 864. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Franco, J. Crossa, S. Taba, and H. Shands A Sampling Strategy for Conserving Genetic Diversity when Forming Core Subsets Crop Sci., May 6, 2005; 45(3): 1035 - 1044. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Franco, J. Crossa, S. Taba, and S. A. Eberhart The Modified Location Model for Classifying Genetic Resources: II. Unrestricted Variance-Covariance Matrices Crop Sci., September 1, 2002; 42(5): 1727 - 1736. [Abstract] [Full Text] [PDF] |
||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| The SCI Journals | Agronomy Journal | Vadose Zone Journal | |||
| Journal of Natural Resources and Life Sciences Education |
Soil Science Society of America Journal | ||||
| Journal of Plant Registrations | Journal of Environmental Quality |
The Plant Genome | |||