Published April 9, 2024 | Version v1.0.0
Software Open

bianchini88/monitoring_sra-samples_EBIsearch: Monitoring the submission to ENA/SRA of sample data from Norwegian institutions

Description

The purpose of this code is to monitor the submissions of Norwegian sequencing data to domain repositories. This is achieved by querying the "sra-samples" endpoint of EBI search, containing metadata of the samples deposited in the Sequence Read Archive (SRA) which is also synchronised with the European Nucleotide Archive (ENA) as part of the International Nucleotide Sequence Data Collaboration (INSDC).

The code performs a query based on the country name Norway. Note that this identifies all the samples collected in Norway, not necessarily by Norwegian institutions or organisations. Extensive filtering is used to isolate the relevant data due to the lack of standardisation across the centres' names. The results are then plotted in two graphs, one for the BOTT (Bergen, Oslo, Trondheim, and Tromsø) universities and one for the Norwegian Institute of Public Health (NIPH) (Norwegian: Folkehelseinstituttet; FHI). A non-updated reference for these plots is available in the /plots4reference folder.

Powered by EBI Search.

Files

bianchini88/monitoring_sra-samples_EBIsearch-v1.0.0.zip

Files (38.4 kB)

Additional details

Funding

BioMedData - an infrastructure for data sharing and management 295932
The Research Council of Norway
ELIXIR3 - Strengthening the Norwegian Node of ELIXIR 322392
The Research Council of Norway