CrosSplice: A Pipeline for Identifying Rare Splice-Site Creating Variants from Cross-Tissue Transcriptome Data
Abstract
Background Despite their profound impact on patients' lives, most rare and intractable diseases still lack established treatments. Genomic variants that disrupt normal splicing by creating novel splice sites (splice-site creating variants, SSCVs) substantially contribute to the pathogenesis of those conditions. Deep intronic SSCVs are particularly amenable to antisense oligonucleotide (ASO)-mediated splice modulation, yet many of them remain undetected by conventional genomic analyses. Existing approaches to identify SSCVs, including splicing quantitative trait loci (sQTL) analyses and machine learning-based methods, each show limitations in sensitivity or accuracy, hindering the comprehensive identification of clinically actionable SSCVs. Methods We developed CrosSplice, a novel pipeline that integrates machine learning-based predictions with statistical association testing to robustly identify SSCVs. By leveraging cross-tissue transcriptome data and aggregating splicing signals across multiple tissues, CrosSplice enables comprehensive detection of SSCVs, including rare and tissue-specific variants often missed by conventional methods. Results Applying CrosSplice to the GTEx dataset (8,656 transcriptomes of 54 tissues from 479 postmortem donors), we identified 1,743 significant SSCVs, 65% of which were deep intronic. Among these, 185 SSCVs were listed in ClinVar, including five pathogenic or likely pathogenic variants. CrosSplice also discovered a novel deep intronic SSCV in PLA2G6, the gene responsible for infantile neuroaxonal dystrophy. We experimentally confirmed that ASOs successfully corrected the aberrant splicing pattern induced by this variant. Conclusion CrosSplice substantially extends the detectable landscape of SSCVs by capturing rare and tissue-specific variants, uncovering pathogenic and therapeutically actionable SSCVs that are frequently overlooked by existing methods. The resulting SSCV catalogue provides a platform for systematic discovery of ASO targets and advances opportunities for precision therapies in rare and intractable diseases.
Links & Resources
Authors
Cite This Paper
Y., Y., A., O., M., O., N., M. R., T., N., Y., S. (2025). CrosSplice: A Pipeline for Identifying Rare Splice-Site Creating Variants from Cross-Tissue Transcriptome Data. arXiv preprint arXiv:10.64898/2025.12.21.25342246.
Yano, Y., Okada, A., Ono, M., Mateos, R. N., Nakayama, T., and Shiraishi, Y.. "CrosSplice: A Pipeline for Identifying Rare Splice-Site Creating Variants from Cross-Tissue Transcriptome Data." arXiv preprint arXiv:10.64898/2025.12.21.25342246 (2025).