Project deliverable Open Access
The report describes a set of software components that have been developed in order to realize the E-ARK Integrated Platform Reference Implementation Prototype for storing, searching, and accessing E-ARK Information Packages on a scalable infrastructure.
The E-ARK Integrated Platform Reference Implementation Prototype integrates:
(a) an Information Package creation and management system,
(b) a repository for content search and access, and
(c) a scalable storage and execution environment based on Apache Hadoop.
The report focuses on the software components that have been individually developed and deployed on top of the core infrastructure components (Hadoop, Lily, SolR). This complements deliverable D6.1 which describes the set-up and configuration of the repository and indexing frameworks. This report accompanies a number of software development results which have been developed in order to realize the Integrated Platform Reference Implementation Prototype. Besides the individual software components, the report provides an overview of the overall system architecture, the integration approach and utilized interfaces.