Predicting Colorectal Cancer Survival: How Machine Learning Combines Clinical and Biological Clues
Mar 25, 2026
A deep dive into using machine learning to predict colorectal cancer survival by merging clinical records with molecular markers. They cover which data sources and preprocessing steps matter and how features like genes and non-coding RNAs are identified. The conversation reviews model choices, accuracy results, and the challenges of validation and population bias.
08:36
forum Ask episode
web_stories AI Snips
view_agenda Chapters
auto_awesome Transcript
info_circle Episode notes
insights INSIGHT
Integrating Clinical And Molecular Data Boosts Prediction
Combining clinical features like pathological stage, age, and lymph node status with molecular markers improves survival prediction for colorectal cancer.
The study used TCGA data from 545 patients and preprocessed clinical and biological features, including differential expression and CRNA network analysis.
volunteer_activism ADVICE
Address Missing Data With Multiple Strategies
Handle missing clinical and demographic data carefully because exclusions or imputation change model size and performance.
The study tested three cases: filter missing core features, exclude demographic-missing patients, or impute with most frequent category.
insights INSIGHT
Lasso Plus SHAP Creates Interpretable Feature Pipeline
The modeling pipeline combined lasso feature selection, SHAP interpretability, and ensemble classifiers like SVM, Random Forest, AdaBoost, and stacking.
Lasso ranked features; SHAP explained each feature's impact before training multiple classifiers for robust prediction.
Get the Snipd Podcast app to discover more snips from this episode
Colorectal cancer (CRC) ranks among the most common and lethal cancers worldwide, accounting for approximately 10% of all cancer diagnoses. While advances in prevention and treatment have improved outcomes, predicting which patients will survive remains a complex challenge—one that depends on an intricate interplay between molecular biology and clinical factors.
A research paper, titled “Machine learning-based survival prediction in colorectal cancer combining clinical and biological features” was published in Volume 16 of Oncotarget by an international team of researchers, demonstrating how machine learning can integrate these two domains to achieve highly accurate survival predictions.
The team’s investigation demonstrates that combining clinical features—such as pathological stage, age, and lymph node status—with biological markers—including the E2F8 gene and hsa-miR-495-3p—can significantly improve the ability to predict patient survival.
Full blog - https://www.oncotarget.org/2026/03/25/predicting-colorectal-cancer-survival-how-machine-learning-combines-clinical-and-biological-clues/
Paper DOI - https://doi.org/10.18632/oncotarget.28783
Correspondence to - Lucas M. Vieira - lvieira@health.ucsd.edu
Abstract video - https://www.youtube.com/watch?v=cy7UL5ZUKuI
Sign up for free Altmetric alerts about this article - https://oncotarget.altmetric.com/details/email_updates?id=10.18632%2Foncotarget.28783
Subscribe for free publication alerts from Oncotarget - https://www.oncotarget.com/subscribe/
Keywords - cancer, colorectal cancer, machine learning, feature selection, non-coding RNAs, genes
To learn more about Oncotarget, please visit https://www.oncotarget.com and connect with us on social media:
Facebook - https://www.facebook.com/Oncotarget/
X - https://twitter.com/oncotarget
Instagram - https://www.instagram.com/oncotargetjrnl/
YouTube - https://www.youtube.com/@OncotargetJournal
LinkedIn - https://www.linkedin.com/company/oncotarget
Pinterest - https://www.pinterest.com/oncotarget/
Reddit - https://www.reddit.com/user/Oncotarget/
Spotify - https://open.spotify.com/show/0gRwT6BqYWJzxzmjPJwtVh
MEDIA@IMPACTJOURNALS.COM