Episode 184: High-Accuracy Multiethnic XGBoost for Skin Cancer Identification

In this episode of PaperCast Base by Base, we explore a large-scale study that builds a risk factor–based XGBoost model using the All of Us cohort to accurately identify patients with skin cancer across diverse ancestries.

Study Highlights: Analyzing more than 400,000 participants, the authors quantify independent associations between genetic ancestry, lifestyle, social determinants of health, prior cancer history, and use of PDE5A inhibitors with skin cancer risk. They compare traditional logistic regression against gradient-boosted trees and show that logistic models have low precision for case identification, motivating a non-linear approach. The resulting multiethnic XGBoost model achieves high accuracy for identifying patients with any skin cancer, with F1 scores of 0.903 in individuals of European ancestry and 0.810 in non-European groups. SHAP importance and interaction analyses reveal strong non-linear effects of age and genotype principal components, and suggest that genetic and socioeconomic factors contribute more heavily to predictions in younger individuals.

Conclusion: A multiethnic, non-linear model that integrates genetics, lifestyle, social determinants, and medication exposures can substantially improve early identification of skin cancer patients across ancestries, offering a precision-medicine tool to help reduce outcome disparities.

Reference: D’Antonio M, Gonzalez Rivera WG, Greenes RA, Gymrek M, Frazer KA. A highly accurate risk factor–based XGBoost multiethnic model for identifying patients with skin cancer. Nature Communications. 2025;16:9542. https://doi.org/10.1038/s41467-025-64556-y

License: This episode is based on an open-access article published under the Creative Commons Attribution 4.0 International License (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/

Support: If you'd like to support Base by Base, you can make a one-time or monthly donation here: https://basebybase.castos.com/