Epistasis-based FSA: two versions of a novel approach for variable selection in multivariate calibration

Abstract

Variable selection in large datasets is a common procedure in multivariate calibration, a field of study within chemometrics. Selecting the most informative variables is an important step in building mathematical models, through statistical techniques, that predict a property of interest from an analyzed sample. Recombination-based search methods such as Genetic Algorithms (GAs) have been widely used as variable selection techniques to solve several optimization problems. However, previous works in the literature have emphasized the schemata disruption problem caused by genetic operators. Therefore, this paper proposes two versions of an epistasis-based implementation (EbFSA) as a novel approach for variable selection in multivariate calibration problems, where each version is deterministic and follows a different strategy. Epistasis concepts are important here for assessing the interdependence among genes (variables). Our experimental results indicate that EbFSA can select the most informative variables and outperform some state-of-the-art algorithms.

Citation

PAULA, Lauro C. M. de et al. Epistasis-based FSA: two versions of a novel approach for variable selection in multivariate calibration. Engineering Applications of Artificial Intelligence, Amsterdam, v. 81, p. 213-222, 2019. DOI: 10.1016/j.engappai.2019.01.016. Available at: https://www.sciencedirect.com/science/article/abs/pii/S0952197619300168?via%3Dihub. Accessed: 14 Jun. 2023.