Introducing a Novel Simulation Tool for Interconnected Differential Expression Signatures and Its Application to Benchmarking
Creators
Description
Pharmaceutical research has long used differential gene expression signatures to study external stimuli like pathogenic determinants or small molecule treatments. These signatures measure expression values for multiple tags and are often compared using the concept of connectivity. Despite the scientific community's efforts to produce unbiased datasets for evaluating connectivity-based methods for drug identification and repurposing, the limits of benchmarking data hinder their effectiveness.
To address this, we developed a simulation method to generate pairs of connected differential expression signatures, that is based on a three layers decomposition and relies on a statistical framework with different levels of parametrization. We benchmarked seven connectivity scores methods from the literature using our simulated signatures. We then evaluated the capacity of each method to retrieve the most connected signatures for a specific query, using the area under the precision-recall curves (AUPRC). Moreover, we introduced a novel application perspective by training a Siamese Neural Network with our simulated data to predict the connectivity score.
Overall, our method is a significant advance in pharmaceutical research, providing a reliable way to simulate connected differential expression signatures. It will help develop and evaluate algorithms for comparing signatures to find the most connected or reversed, leading to more effective drug repurposing. An open-source version of the package will be released at the end of 2023.
Files
ISMBECCB2023_1885_Gonzalez_Gomez_poster.pdf
Files
(990.2 kB)
Name | Size | Download all |
---|---|---|
md5:1ef5cc61b560ae32eb5602e16191a408
|
990.2 kB | Preview Download |
Additional details
Dates
- Other
-
2023-07-26Oral presentation ISCB-ECCB 2023
References
- J. Lamb et al., "The Connectivity Map: Using Gene-Expression Signatures to Connect Small Molecules, Genes, and Disease," Science, vol. 313, p. 8, 2006.
- A. Subramanian et al., "A Next Generation Connectivity Map: L1000 Platform and the First 1,000,000 Profiles," Cell, vol. 171, no. 6, pp. 1437-1452.e17, Nov. 2017, doi: 10.1016/j.cell.2017.10.049.
- J. Cheng and L. Yang, "Comparing gene expression similarity metrics for connectivity map," in 2013 IEEE International Conference on Bioinformatics and Biomedicine, Shanghai, China: IEEE, Dec. 2013, pp. 165–170. doi: 10.1109/BIBM.2013.6732481
- K. Samart, P. Tuyishime, A. Krishnan, and J. Ravi, "Reconciling multiple connectivity scores for drug repurposing," Briefings in Bioinformatics, vol. 22, no. 6, p. bbab161, Nov. 2021, doi: 10.1093/bib/bbab161
- J. Davis and M. Goadrich, "The relationship between Precision-Recall and ROC curves," in Proceedings of the 23rd international conference on Machine learning - ICML '06, Pittsburgh, Pennsylvania: ACM Press, 2006, pp. 233–240. doi: 10.1145/1143844.1143874
- M. I. Love, W. Huber, and S. Anders, "Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2," p. 21, 2014.
- O. J. Wouters, M. McKee, and J. Luyten, "Estimated Research and Development Investment Needed to Bring a New Medicine to Market, 2009-2018," JAMA, vol. 323, no. 9, p. 844, Mar. 2020, doi: 10.1001/jama.2020.1166.
- J. Cheng, L. Yang, V. Kumar, and P. Agarwal, "Systematic evaluation of connectivity map for disease indications," Genome Med, vol. 6, no. 12, p. 95, Dec. 2014, doi: 10.1186/s13073-014-0095-1.
- C. Yang et al., "A survey of optimal strategy for signature-based drug repositioning and an application to liver cancer," eLife, vol. 11, p. e71880, Feb. 2022, doi: 10.7554/eLife.71880.
- K. Lin et al., "A comprehensive evaluation of connectivity methods for L1000 data," Briefings in Bioinformatics, vol. 21, no. 6, pp. 2194–2205, Dec. 2020, doi: 10.1093/bib/bbz129.
- J. Cheng et al., "EVALUATION OF ANALYTICAL METHODS FOR CONNECTIVITY MAP DATA," in Biocomputing 2013, Kohala Coast, Hawaii, USA: WORLD SCIENTIFIC, Nov. 2012, pp. 5–16. doi: 10.1142/9789814447973_0002.