A cautionary note on phylogenetic signal estimation from imputed databases

Nenhuma Miniatura disponível

Data

2021

Título da Revista

ISSN da Revista

Título de Volume

Editor

Resumo

Given the prevalence of missing data on species’ traits – the Raunkiaeran shortfall-, several methods have been proposed to fill sparse databases. However, analyses based on these imputed databases can introduce several biases. Here, we evaluated potential estimation biases caused by the use of imputed databases. In the evaluation, we considered the estimation of descriptive statistics, regression coefficient, and phylogenetic signal for different missing and imputing scenarios. We found that percentage of missing data, missing mechanisms and imputation methods were important in determining estimation errors. Imputation errors are not linearly related to estimate errors. Adding phylogenetic information provides better estimates of the evaluated statistics, but this information should be combined with other variables such as traits correlated to the missing data variable. Using an empirical dataset, we found that even traits that are strongly correlated to each other, such as brain and body size of primates, can produce biases when estimating phylogenetic signal from missing data datasets. We advise researchers to share both their raw and imputed data as well as to consider the pattern of missing data to evaluate methods that perform better for their goals. In addition, the performance of imputation methods should be mainly based on statistical estimates instead of only in imputation error.

Descrição

Palavras-chave

Statistical bias, Multiple imputation, Trait databases, Phylogenetic eigenvector maps, Phylogenetic signal, Phylogenetic comparative methods

Citação

JARDIM, Lucas et al. A cautionary note on phylogenetic signal estimation from imputed databases. Evolutionary Biology, Berlim, v. 48, p. 246-258, 2021. DOI: 10.1007/s11692-021-09534-0. Disponível em: https://link.springer.com/article/10.1007/s11692-021-09534-0. Acesso em: 15 jun. 2023.