Published February 14, 2018 | Version 1.1
Project deliverable Open

Technical documentation of the infrastructure supporting the E-ARK Faceted Query Interface and Application Programming Interface (API).

  • 1. Austrian Institute of Technology
  • 2. University of Portsmouth
  • 3. National Archives of Estonia
  • 4. University of Brighton

Description

The E-ARK Work package 6 (WP6) - Archival Storage, Services, and Integration, is developing a scalable open-source reference implementation for ingesting, searching, and accessing E-ARK information packages. A major task in this context is the development of a faceted query interface for searching archived content which can be utilized by end-users as well as external software components.

The reference implementation aims at providing an archiving and search prototype that is flexible in regard to the type and volume of the ingested payloads. The reference implementation is designed to scale from a single host out to a cluster deployment by employing technologies like Apache Hadoop, Solr, and the Lily repository, supporting different types of input data ranging from text-based files and structured records to office documents and binary content.

This report provides technical documentation of the infrastructure supporting the E-ARK Faceted Query Interface and Application Programming Interface (API). It provides a description of the underlying software components utilized for the development of the search functionality of the E-ARK reference implementation and discusses the required interactions to work as an integrated solution. Furthermore, technical documentation of the developed software and system configuration is provided. The document describes also methods to customize the faceted query interface and provides examples for its utilization.

Files

E-ARK D6.1.pdf

Files (707.0 kB)

Name Size Download all
md5:29ba6080a2f1a64752f91ea09a8bb8e8
707.0 kB Preview Download