Dataset Open Access

Web collection NL-blogosfeer

P. de Bode; I. Geldermans; K. Teszelszky

In 2018 I, Iris Geldermans, worked as an intern at the KB, National Library of the Netherlands, to build a collection of Dutch weblogs for the web archive, named the NL-blogosfeer. For more information about the collection please visit


In 2018 a collection of websites was compiled, with some metadata added manually. This Excel document contains the following metadata (in Dutch):

  • indexnr
  • Date added to the WCT [Web Curator Tool, the harvesting tool we use]
  • Date in case the website was already in the WCT
  • Selected via (Geselecteerd via)
  • Source file (bronbestand)
  • Web address (URL)
  • Webpage (URI)
  • Website sold / link removed [please note that the dataset has not been updated since 2018, so this information may be outdated]
  • Name
  • Theme / Genre
  • Multiple authors
  • Format
  • Date blog started
  • Special collection
  • Other special collection(s)

For more information about the metadata columns, see chapters 5 and 8 of the collection description.

In 2018 a collection description was written in Dutch. It was updated slightly for publication in 2020-2021.

In 2020 the collection of NL-blogosfeer websites (URLs) was extracted from the web archive, with some metadata added automatically. This Excel document contains the following metadata (in Dutch):

  • Selection date
  • Name
  • URL
  • Status (is it still online)
  • Web collection(s) it is a part of

Files (2.0 MB)

  • NL Blogosfeer web collection 2018 with manual metadata.xlsx (171.3 kB)
  • NL blogosfeer web collection 2020_1.xlsx (30.9 kB)
  • NL-blogosfeer collectiebeschrijving 2018 (Dutch).pdf (1.8 MB)

