Biologically informed Deep Learning graph-based framework for unveiling critical lncRNAs in CLL
Authors/Creators
-
1.
Centre for Research and Technology Hellas
-
2.
Democritus University of Thrace
- 3. Institute of Applied Biosciences at the Centre for Research and Technology Hellas (INAB|CERTH)
- 4. University Hospital Brno
-
5.
Central European Institute of Technology
-
6.
Alexander Fleming Biomedical Sciences Research Center
- 7. BioInnovation Greece
- 8. Ancilia Biosciences
- 9. Ηellenic Army Academy
- 10. University of Athens Medical School
Description
Transcriptomic analyses elucidate complexities in diseases like Chronic Lymphocytic Leukemia (CLL), revealing specific molecules, cellular traits, and signalling patterns influencing clinical outcomes. The status of the variable region of IGHV in leukemia cells is crucial for classifying CLL cases, providing insights into their clinical condition and prognosis. Patients can be stratified based on the mutational status of the IGHV genes, with mutated CLL (M-CLL) exhibiting a more indolent course than the more aggressive unmutated CLL (U-CLL) cases. Long non-coding RNAs (lncRNAs) play diverse roles in cancer but are often overlooked due to limitations in computational approaches. To address this, we propose a biologically informed Deep Learning model-based framework for supervised patient classification, instructed by lncRNA-mRNA-pathways interactions with eXplainable Artificial Intelligence (XAI) read-outs. This pipeline utilizes NetBID2 for network-based integration of omics data, and PASNet for the deep learning analysis, further building on lncrnalyzer and EnrichR knowledge graphs as input for the biological priors. Complementing these, we deploy custom code integrating SHAP and heatmaps for interpretability, by combining a bespoke graph architecture representing lncRNA-mRNA-pathway interactions, facilitating a comprehensive analysis of regulatory mechanisms. The proposed framework is being tested by applying it into two extensive CLL transcriptomic datasets; data from ICGC-CLLES project (257 samples) are used as the training dataset, with the produced model validated on the BloodCancerMultiOmics2017 dataset (113 samples). Preliminary results showed that U-CLL cases are easily differentiated from M-CLL cases as expected, additionally revealing both biological ground-truth pathways, as well as insofar unknown but potential prognostic lncRNAs in disease.
Files
5.Poster_ECCB2024_VV.pdf
Files
(5.0 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:bd0db8f5c5931eb1f1ae4df6c6aa03dd
|
5.0 MB | Preview Download |
Additional details
Funding
Software
- Repository URL
- https://github.com/BiodataAnalysisGroup/lncAPNet_ECCB2024