Published November 10, 2024 | Version v1
Dataset Open

SpaMask: Dual Masking Graph Autoencoder with Contrastive Learning for Spatial Transcriptomics

Description

Understanding the spatial locations of cell within tissues is crucial for unraveling the organization of cellular diversity. Recent advancements in spatial resolved transcriptomics (SRT) have enabled the analysis of gene expression while preserving the spatial context within tissues. Spatial domain characterization is a critical first step in SRT data analysis, providing the foundation for subsequent analyses and insights into biological implications. Graph neural networks (GNNs) have emerged as a common tool for addressing this challenge due to the structural nature of SRT data. However, current graph-based deep learning approaches often overlook the instability caused by the high sparsity of SRT data. Masking mechanisms, as an effective self-supervised learning strategy, can enhance the robustness of these models. To this end, we propose SpaMask, dual masking graph autoencoder with contrastive learning for SRT analysis. Unlike previous GNNs, SpaMask masks a portion of spot nodes and spot-to-spot edges to enhance its performance and robustness. SpaMask combines Masked Graph Autoencoders (MGAE) and Masked Graph Contrastive Learning (MGCL) modules, with MGAE using node masking to leverage spatial neighbors for improved clustering accuracy, while MGCL applies edge masking to create a contrastive loss framework that tightens embeddings of adjacent nodes based on spatial proximity and feature similarity. We conducted a comprehensive evaluation of SpaMask on eight datasets from five different platforms. Compared to existing methods, SpaMask achieves superior clustering accuracy and effective batch correction.

Files

human_breast_cancer.zip

Files (15.2 GB)

Name Size Download all
md5:e04c29b82aecccd12020cfe1bf6f3156
1.6 GB Preview Download
md5:e62b1b7b1cbc60742403e374e90cae20
9.1 GB Preview Download
md5:bcd0629551eda7333a6e283cd059e407
46.1 MB Preview Download
md5:c7d5e709b8be08771886302d34726fe5
549.9 MB Preview Download
md5:adeeeaf154e150319f668d22227d7506
1.1 GB Preview Download
md5:d715f9a8e75de7cb7c56bffe0521ed9a
1.8 GB Preview Download
md5:36df1275aca33e82128d44310755f9cb
11.1 MB Preview Download
md5:d8c1cc230dcc484bd8eb85e6fa034ee8
989.7 MB Preview Download

Additional details

Software

Repository URL
https://github.com/wenwenmin/SpaMask
Programming language
Python