Presentation Open Access

Static Search: An Archivable and Sustainable Search Engine for the Digital Humanities

Holmes, Martin; Takeda, Joseph


JSON-LD (schema.org) Export

{
  "inLanguage": {
    "alternateName": "eng", 
    "@type": "Language", 
    "name": "English"
  }, 
  "description": "<p>Abstract:The end-goal of the Endings Project&mdash;a collaboration between project leaders, programmers, and librarians to address long term sustainability of digital humanities resources&mdash;is to have completely static sites: websites composed of only HTML, CSS, and Javascript that have no reliance on server-side processing. Now in the last phase of the grant cycle, the Endings Project successfully converted its major projects&mdash;like The Map of Early Modern London, The Robert Graves Diaries, and other HCMC-hosted projects&mdash;into static sites, but one final problem remained: search. Most search engines (like Solr) require the use of a server to index content and while Javascript search engines, such as Lunr, do exist, they cannot feasibly handle the vast amounts of data that comprise the standard digital edition.<br>\n<br>\nThis presentation outlines the creation of Static Search: an open-access codebase for creating a completely client-side search engine for static websites. Built using XSLT3, Saxon, and Ant, Static Search creates a JSON file for every distinct stem in a document collection and harvests metadata from each document containing that stem, providing a rapid mechanism for querying a document collection. In its current version, Static Search provides boolean searches (CAN CONTAIN, CANNOT CONTAIN, MUST CONTAIN) as well as exact phrase searching alongside faceted search filters based on document metadata. While currently implemented only for modern English, we are developing methods for querying early modern English as well as early modern and contemporary French as well as adding mechanisms for wildcard searches. Our presentation thus demonstrates both the feasibility of creating a static site with robust search capabilities and the advantages of Static Search for complex digital humanities projects.</p>", 
  "license": "https://creativecommons.org/licenses/by/4.0/legalcode", 
  "creator": [
    {
      "affiliation": "University of Victoria", 
      "@id": "https://orcid.org/0000-0002-3944-1116", 
      "@type": "Person", 
      "name": "Holmes, Martin"
    }, 
    {
      "affiliation": "Simon Fraser University", 
      "@type": "Person", 
      "name": "Takeda, Joseph"
    }
  ], 
  "url": "https://zenodo.org/record/3883150", 
  "datePublished": "2020-06-06", 
  "version": "1.0", 
  "keywords": [
    "static websites", 
    "archivability", 
    "search engines"
  ], 
  "@context": "https://schema.org/", 
  "identifier": "https://doi.org/10.5281/zenodo.3883150", 
  "@id": "https://doi.org/10.5281/zenodo.3883150", 
  "@type": "PresentationDigitalDocument", 
  "name": "Static Search: An Archivable and Sustainable Search Engine for the Digital Humanities"
}
16
17
views
downloads
All versions This version
Views 1616
Downloads 1717
Data volume 104.7 MB104.7 MB
Unique views 1616
Unique downloads 1414

Share

Cite as