Benchmarking long-read RNA-seq across modalities, methods, and sequencing depth in iNeurons

Botta, Gianfranco; Wissel, David; Higgins, Samuel; Chelmicki, Tomasz; Popescu, Alexander; Radoynovska, Kalina; Probst, Seraphin; Ramírez Cuéllar, Julieta; Schneider, Kim; Zeynab-Mitra, Nayernia; Bryne, Ashley; Nelson, Christopher D.; Modrusan, Zora; Chamberlain, Stormy J.; Stephenson, William; Robinson, Mark D.; Schubert, Rajib

doi:10.5281/zenodo.19209645

Published March 24, 2026 | Version v1

Publication Open

Benchmarking long-read RNA-seq across modalities, methods, and sequencing depth in iNeurons

Abstract: Long-read RNA sequencing (lrRNA-seq) provides advantages for transcript discovery and quantification through the sequencing of full-length transcripts. Although recent benchmarks have evaluated long-read technologies and quantification tools, to the best of our knowledge, no study to date has jointly compared sequencing technology, quantification choice, and depth across both bulk and single-cell platforms. Here, we generate a matched dataset using NGN2-induced neurons derived from Fragile X syndrome and isogenic rescue lines, profiled with bulk and single-cell Illumina, Oxford Nanopore Technologies (ONT), and Pacific Biosciences (PB) Kinnex technologies. All platforms and technologies capture the expected FMR1 reactivation signal. We find that PB bulk under-detects and under-quantifies short transcripts (less than 1.25 kb), ONT bulk under-detects and under-quantifies long transcripts (greater than 5 kb), and single-cell long-read technologies tend to falsely detect a large number of transcripts due to a lack of full-length reads. Across six bulk and four single-cell long-read quantification tools, Isosceles, Miniquant, and Oarfish provide the best compromise between computational efficiency and performance in terms of quantification accuracy as measured by spike-ins, comparisons to Illumina, and on consensus based downstream tasks such as differential transcript expression (DTE). Depth-equivalency analyses reveal that PB single-cell sequencing requires approximately three- to four-fold greater depth than bulk to reach comparable power for transcript discovery and differential transcript expression. Our work aims to offer practical guidance for study design, including the choice of technology, sequencing depth, and quantification method. In addition, we hope our data may serve a reference dataset to evaluate emerging long-read transcriptomic protocols and methods as well as more closely investigate FMR1 biology.

Description: The Zenodo contains the primary processed data outputs of our work, corresponding to the quantifications of all technologies and platforms, for both Illumina, PB and ONT. Specifically, we provide all considered methods for both single-cell and bulk for our downsampled depths, and provide our recommended methods (Isosceles for bulk and Oarfish for single-cell) for the full-depth files.

Files

Files (13.6 GB)

Name	Size	Download all
zenodo_counts.tar.gz md5:1bc363cf1ff34785b45d813fab7b5659	13.6 GB	Download

	All versions	This version
Views	37	37
Downloads	4	4
Data volume	54.3 GB	54.3 GB

Benchmarking long-read RNA-seq across modalities, methods, and sequencing depth in iNeurons

Authors/Creators

Description

Files

Files (13.6 GB)