Presentation Open Access

Static Search: An Archivable and Sustainable Search Engine for the Digital Humanities

Holmes, Martin; Takeda, Joseph

MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="">
  <datafield tag="041" ind1=" " ind2=" ">
    <subfield code="a">eng</subfield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">static websites</subfield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">archivability</subfield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">search engines</subfield>
  <controlfield tag="005">20200607221820.0</controlfield>
  <controlfield tag="001">3883150</controlfield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Simon Fraser University</subfield>
    <subfield code="a">Takeda, Joseph</subfield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">32616014</subfield>
    <subfield code="z">md5:212320c36448535c1e0568ab8607c31b</subfield>
    <subfield code="u"></subfield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">490985</subfield>
    <subfield code="z">md5:7ba31c68b50aedf4544f618ad9399fd1</subfield>
    <subfield code="u"></subfield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2020-06-06</subfield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire</subfield>
    <subfield code="o"></subfield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">University of Victoria</subfield>
    <subfield code="0">(orcid)0000-0002-3944-1116</subfield>
    <subfield code="a">Holmes, Martin</subfield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Static Search: An Archivable and Sustainable Search Engine for the Digital Humanities</subfield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u"></subfield>
    <subfield code="a">Creative Commons Attribution 4.0 International</subfield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2"></subfield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;Abstract:The end-goal of the Endings Project&amp;mdash;a collaboration between project leaders, programmers, and librarians to address long term sustainability of digital humanities resources&amp;mdash;is to have completely static sites: websites composed of only HTML, CSS, and Javascript that have no reliance on server-side processing. Now in the last phase of the grant cycle, the Endings Project successfully converted its major projects&amp;mdash;like The Map of Early Modern London, The Robert Graves Diaries, and other HCMC-hosted projects&amp;mdash;into static sites, but one final problem remained: search. Most search engines (like Solr) require the use of a server to index content and while Javascript search engines, such as Lunr, do exist, they cannot feasibly handle the vast amounts of data that comprise the standard digital edition.&lt;br&gt;
This presentation outlines the creation of Static Search: an open-access codebase for creating a completely client-side search engine for static websites. Built using XSLT3, Saxon, and Ant, Static Search creates a JSON file for every distinct stem in a document collection and harvests metadata from each document containing that stem, providing a rapid mechanism for querying a document collection. In its current version, Static Search provides boolean searches (CAN CONTAIN, CANNOT CONTAIN, MUST CONTAIN) as well as exact phrase searching alongside faceted search filters based on document metadata. While currently implemented only for modern English, we are developing methods for querying early modern English as well as early modern and contemporary French as well as adding mechanisms for wildcard searches. Our presentation thus demonstrates both the feasibility of creating a static site with robust search capabilities and the advantages of Static Search for complex digital humanities projects.&lt;/p&gt;</subfield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.3883149</subfield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.3883150</subfield>
    <subfield code="2">doi</subfield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">presentation</subfield>
All versions This version
Views 1616
Downloads 1717
Data volume 104.7 MB104.7 MB
Unique views 1616
Unique downloads 1414


Cite as