Report Open Access

Digitisation infrastructure design for national open science clouds

Wu, Zhengzhe; Saarenmaa, Hannu; Riihikoski, Ville-Matti; Nieva de la Hidalga, Abraham; Hardisty, Alex; Dillen, Mathias; Groom, Quentin

This report describes the feasibility and potential role of national data infrastructures in large-scale digitisation of natural history collections, especially in the case of Finland. The descriptions of services, capacities, costs, and data flows from digitisation facilities to these national level systems and further to European systems are studied in the context of Finland. Discussion of use and possible designs are also included. The report is structured in five parts:

  1. The Introduction part describes the background of this study and the general status of national data infrastructures in Europe. The case of Finland is emphasized and the available computing and data services are introduced.
  2. The Infrastructure chapter describes FinBIF (Finnish Biodiversity Information Facility), CSC (the Finnish national IT centre for science) Pouta cloud computing services, and the Finnish Fairdata services.
  3. In chapter 3, the data module used in FinBIF is presented, including the identifier, metadata, and API (Application Programming Interface).
  4. The design chapter describes the overall architectural view and data flows of the digitisation process with the use of Finnish national data infrastructures.
  5. In the last chapter, the feasibility and potential role of national data infrastructures for digitisation facilities are discussed. We concluded that national data infrastructures tailored for biodiversity data, such as FinBIF, are highly necessary for the digitisation facility of natural history collections.

Files (944.8 kB)
  • Ariño AH (2010) Approaches to estimating the universe of natural history collections data. BiodiversityInformatics 7: 81-92.

  • Blagoderov V, Smith V (2012) No specimen left behind: mass digitization of natural history collections. ZooKeys 209.

  • Lahti K (2017) FinBIF – Finnish Biodiversity Information Facility. Retrieved from

  • Oever JP, Gofferje M (2014) From pilot to production: Large scale digitization project at Naturalis Biodiversity center. ZooKeys 209, 87-92.

  • Tegelberg R, Mononen T, Saarenmaa H (2014) High-performance digitization of natural history collections: Automated imaging lines for herbarium and insect specimens. Taxon 63 (6) 1307-1313.

  • Tegelberg R, Kahanpää J, Karppinen J, Mononen T, Wu Z, Saarenmaa H 2017. Mass digitization of individual pinned insects using conveyor-driven imaging. In: Hereld M (Editor) High throughput digitization for natural history collections. 2017 IEEE 13th International Conference on e-Science (e-Science 2017). Auckland, New Zealand, 24-27 October 2017. 5 p.

  • Wu Z, Kahanpää J, Sihvonen P, Koivunen A, Saarenmaa H (2019). Automated Methods in Digitisation of Pinned Insects. Biodiversity Information Science and Standards, 3, e38260.

All versions This version
Views 3939
Downloads 3737
Data volume 35.0 MB35.0 MB
Unique views 3636
Unique downloads 3131


Cite as