Cross-cancer survival prediction using machine learning models

dc.creatorCardoso, Lucas Buk
dc.creatorEgydio, Jones Eduardo
dc.creatorToporcov, Tatiana Natasha
dc.creatorUtida, Nanci Yumi
dc.creatorCurado, Maria Paula
dc.creatorFernandes, Gisele Aparecida
dc.creatorRibeiro, Adeylson Guimarães
dc.creatorChin, Bryan Gilvaz
dc.creatorParro, Vanderlei Cunha
dc.date.accessioned2026-05-11T11:21:07Z
dc.date.available2026-05-11T11:21:07Z
dc.date.issued2026
dc.description.abstractAmong the many challenges faced by healthcare systems, cancer remains one of the most urgent. This makes the application of artificial intelligence a critical tool for enhancing early detection and optimizing treatment strategies, especially given the growing volume of patient data being collected. In this study, machine learning models trained on data for a specific type of cancer were employed to predict three-year survival after diagnosis for other cancer types. Two groups were considered: the most frequent cancers and those related to the digestive system. The data were extracted from the Hospital Based Cancer Registries of São Paulo, covering 2000 to 2019, with a consistent selection protocol across all cancer types to enable cross-prediction. XGBoost and LightGBM algorithms were used, choosing the best-performing model for predictions across different topographies. Using a combined dataset of oral cavity, esophageal, and stomach cancers, the model achieved a balanced accuracy of 80.18%, compared with 79.92% for the stomach-specific model. Statistical testing showed no significant difference between these values, suggesting comparable predictive performance. These results illustrate the potential of cross-prediction, especially for rare cancer types where data scarcity represents a significant challenge.
dc.identifier.citationCARDOSO, Lucas Buk et al. Cross-cancer survival prediction using machine learning models. Scientific Reports, London, v. 16, n. 1, e9623, 2026. DOI: 10.1038/s41598-025-34133-w. Disponível em: https://www.nature.com/articles/s41598-025-34133-w. Acesso em: 5 maio 2026.
dc.identifier.doi10.1038/s41598-025-34133-w
dc.identifier.issne- 2045-2322
dc.identifier.urihttps://repositorio.bc.ufg.br//handle/ri/30340
dc.language.isoeng
dc.publisher.countryGra-bretanha
dc.publisher.departmentInstituto de Patologia Tropical e Saúde Pública - IPTSP (RMG)
dc.rightsAcesso Aberto
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subjectSurvival prediction
dc.subjectCancer
dc.subjectMachine learning
dc.subjectCross-prediction
dc.subjectXGBoost
dc.titleCross-cancer survival prediction using machine learning models
dc.typeArtigo

Arquivos

Pacote Original

Agora exibindo 1 - 1 de 1
Carregando...
Imagem de Miniatura
Nome:
Artigo - Lucas Buk Cardoso - 2026.pdf
Tamanho:
1.65 MB
Formato:
Adobe Portable Document Format

Licença do Pacote

Agora exibindo 1 - 1 de 1
Carregando...
Imagem de Miniatura
Nome:
license.txt
Tamanho:
1.71 KB
Formato:
Item-specific license agreed upon to submission
Descrição: