Accelerating the construction of hash tables for textual data with applications
Date
2020-11-17
Publisher
Universidade Federal de Goiás
Abstract
Text mining is the extraction of information from textual data in a wide variety of
formats, with the aim of producing knowledge from, classifying, clustering, or translating
that information, among other uses. For text mining to be efficient, the data are first
processed so that they contain only content relevant to the intended analysis and are
structured in a format that is easier to manipulate computationally. Several pre-processing
tasks must be performed on the data to achieve the desired quality and representation. To
that end, this work proposes a hash table implementation that efficiently exploits the high
parallelism available in GPUs, as a way to increase the performance of pre-processing tasks.
Beyond presenting more efficient algorithms, this work also demonstrates the feasibility of
their use in applications such as generating the co-occurrence matrix and representing text
using embeddings.
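To make the two building blocks named in the abstract concrete, the following is a minimal sequential sketch, not the dissertation's GPU implementation. It assumes an open-addressing table with linear probing and an FNV-1a hash, and counts unordered token pairs inside a sliding window; the names `LinearProbingTable`, `get_or_insert`, and `cooccurrence`, as well as the `window` and `capacity` parameters, are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


class LinearProbingTable:
    """Fixed-capacity open-addressing table mapping tokens to integer ids.

    Illustrative assumption: a GPU version would claim slots with atomic
    operations; here everything runs sequentially on the CPU.
    """

    def __init__(self, capacity=64):
        self.capacity = capacity
        self.keys = [None] * capacity   # stored tokens (None marks an empty slot)
        self.ids = [-1] * capacity      # id assigned to the token in each slot
        self.size = 0                   # number of distinct tokens stored

    @staticmethod
    def _fnv1a(token):
        # 32-bit FNV-1a hash over the UTF-8 bytes of the token.
        h = 2166136261
        for byte in token.encode("utf-8"):
            h ^= byte
            h = (h * 16777619) & 0xFFFFFFFF
        return h

    def get_or_insert(self, token):
        """Return the id of `token`, inserting it if it is not yet present."""
        slot = self._fnv1a(token) % self.capacity
        for _ in range(self.capacity):
            if self.keys[slot] is None:
                # Empty slot: the token is new, so assign the next id.
                self.keys[slot] = token
                self.ids[slot] = self.size
                self.size += 1
                return self.ids[slot]
            if self.keys[slot] == token:
                return self.ids[slot]
            slot = (slot + 1) % self.capacity  # linear probing
        raise RuntimeError("table is full")


def cooccurrence(tokens, window=2, capacity=64):
    """Count unordered token-id pairs that appear within `window` positions."""
    table = LinearProbingTable(capacity)
    ids = [table.get_or_insert(t) for t in tokens]
    counts = defaultdict(int)
    for i, center in enumerate(ids):
        for j in range(i + 1, min(i + 1 + window, len(ids))):
            pair = tuple(sorted((center, ids[j])))
            counts[pair] += 1
    return table, dict(counts)


if __name__ == "__main__":
    table, counts = cooccurrence("the cat sat on the mat".split(), window=2)
    print(table.size, counts)
```

In this sketch the hash table only assigns a dense integer id to each distinct token, so the co-occurrence counts can be keyed by small integer pairs instead of strings. Both loops are independent per token position, which is the kind of work that could be distributed across GPU threads.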
Citation
BARROS, C. C. Acelerando a construção de tabelas hash para dados textuais com aplicações. 2020. 99 f. Dissertação (Mestrado em Ciência da Computação) - Universidade Federal de Goiás, Goiânia, 2020.