Statistical Implications of Zero Imputation for Missing Data in Cowpea (Vigna unguiculata L. Walp) Mutation Breeding: Evidence for Systematic Bias in Yield Traits

Authors

DOI:

https://doi.org/10.24925/turjaf.v14i1.148-154.8243

Keywords:

Missing data analysis, Zero substitution bias, Mutation breeding, Statistical validity, Cowpea improvement, Drought tolerance

Abstract

Zero imputation for missing data in plant breeding experiments represents a widespread but statistically invalid practice that threatens the reliability of genetic evaluations, especially mutation breeding study with potential high rate of plant mortality. This study quantified the statistical consequences of zero substitution versus appropriate missing data handling (listwise deletion) in a factorial cowpea experiment involving five accessions, four ethyl methane sulfonate (EMS) concentrations, and three water withdrawal regimes. Six yield-related traits were analyzed using both approaches. Zero imputation systematically biased statistical inferences, reducing trait means by 65-85% and eliminating crucial genotype × treatment interactions (accession × EMS interactions, P = 0.0008 under proper handling, inestimable under zero imputation). Correlation magnitudes between yield components were reduced by 15-35%. Proper missing data handling identified TVU17315 as consistently superior (39.74 seeds/plant ± 7.23 SE, 33.11g hundred-seed weight ± 4.18 SE) and revealed 0.75% EMS as optimal, increasing pod production by 37.9% and seed production by 113.0% compared to controls. The 24-hour water withdrawal treatment yielded optimal results across traits. Zero imputation conflates biological failure with quantitative measurement, leading to incorrect breeding decisions and potential loss of valuable germplasm. Plant breeding programmers must adopt statistically valid missing data methodologies to ensure reliable genetic evaluations and accelerate development of improved varieties.

Author Biography

Olawale M. Aliyu, Department of Crop Production, Faculty of Agriculture, Kwara State University, Malete, Nigeria

Department of crop production

kwara state university, malete

References

Ahmad, S., Abbas, G., Ahmed, M., Fatima, Z., Anjum, M. A., Rasul, G., Khan, M. A., & Hoogenboom, G. (2025). Climate change and agriculture: Impacts and adaptation strategies. Advances in Agronomy, 169, 1-50.

Aliyu, O. M., Tiamiyu, A. O., Usman, M., & Abdulkareem, Y. F. (2021). Variance components, correlation and path analyses in cowpea (Vigna unguiculata L., Walp). Journal of Crop Science and Biotechnology, 24, 445-454.

Aliyu, O. M., Lawal, O. O., Wahab, A. A., & Ibrahim, U. Y. (2019). Evaluation of advanced breeding lines of cowpea (Vigna unguiculata L. Walp) for high seed yield under farmers’ field conditions. Plant Breeding and Biotechnology, 7(1), 12-23. https://doi.org/10.9787/PBB.2019.7.1.12

Bernardo, R. (2020). Breeding for quantitative traits in plants (3rd ed.). Stemma Press.

Bewley, J. D., Bradford, K. J., Hilhorst, H. W., & Nonogaki, H. (2013). Seeds: Physiology of development, germination and dormancy (3rd ed.). Springer.

Burgueño, J., de los Campos, G., Weigel, K., & Crossa, J. (2012). Genomic prediction of breeding values when modeling genotype × environment interaction using pedigree and dense molecular markers. Crop Science, 52(2), 707-719.

Calabrese, E. J., & Mattson, M. P. (2017). How does hormesis impact biology, toxicology, and medicine? Aging Research Reviews, 35, 1-17.

Crossa, J., Pérez-Rodríguez, P., Cuevas, J., Montesinos-López, O., Jarquín, D., de los Campos, G., Burgueño, J., González-Camacho, J. M., Pérez-Elizalde, S., Beyene, Y., Dreisigacker, S., Singh, R., Zhang, X., Gowda, M., Roorkiwal, M., Rutkoski, J., & Varshney, R. K. (2017). Genomic selection in plant breeding: Methods, models, and perspectives. Trends in Plant Science, 22(11), 961-975.

de Mendiburu, F. (2021). agricolae: Statistical Procedures for Agricultural Research. R package version 1.3-5.

Farooq, M., Wahid, A., Kobayashi, N., Fujita, D., & Basra, S. M. A. (2009). Plant drought stress: Effects, mechanisms and management. Agronomy for Sustainable Development, 29(1), 185-212. https://doi.org/10.1051/agro:2008021

Greene, E. A., Codomo, C. A., Taylor, N. E., Henikoff, J. G., Till, B. J., Reynolds, S. H., Enns, L. C., Burtner, C., Johnson, J. E., Odden, A. R., Comai, L., & Henikoff, S. (2003). Spectrum of chemically induced mutations from a large-scale reverse-genetic screen in Arabidopsis. Genetics, 164(2), 731-740.

Jankowicz-Cieslak, J., Tai, T. H., Kumlehn, J., & Till, B. J. (2017). Biotechnologies for plant mutation breeding: Protocols. Springer.

Joswig, J. S., Bönisch, E., Kattge, J., Aalto, J., Bodesheim, P., Jochum, M., Jung, V., Lorenz, M., Mears, M., Pennekamp, F., Roscher, C., Scharfenberg, S. M., Wuest, J. T., & Wirth, C. (2023). Imputing missing data in plant traits: A guide to improve gap‐filling. Journal of Ecology, 111(8), 1642-1663.

Little, R. J. A., & Rubin, D. B. (2019). Statistical analysis with missing data (3rd ed.). John Wiley & Sons.

Martin, T. N., Storfer, A., & Spear, S. F. (2005). Performance of missing data approaches under varying sample sizes. Molecular Ecology Notes, 5(4), 733-736.

Oladosu, Y., Rafii, M. Y., Abdullah, N., Hussin, G., Ramli, A., Rahim, H. A., Miah, G., & Usman, M. (2016). Principle and application of plant mutagenesis in crop improvement: A review. Biotechnology & Biotechnological Equipment, 30(1), 1-16.

Piepho, H. P., Büchse, A., & Emrich, K. (2003). A hitchhiker’s guide to mixed models for randomized experiments. Journal of Agronomy and Crop Science, 189(5), 310-322.

R Core Team. (2023). R: A language and environment for statistical computing. R Foundation for Statistical Computing. https://www.R-project.org/

Schafer, J. L., & Graham, J. W. (2002). Missing data: Our view of the state of the art. Psychological Methods, 7(2), 147-177.

Smith, A. B., Gaynor, R. C., Grondona, M., Cullis, B. R., & Thompson, R. (2024). Statistical methods for plant breeding data analysis with missing environmental information. Theoretical and Applied Genetics, 137(4), Article 78.

VSN International Ltd. (2019). GenStat for Windows (19th ed.). VSN International.

Downloads

Published

11.01.2026

Issue

Section

Research Paper