There is a newer version of this record available.

Dataset Open Access

OpenITI: a Machine-Readable Corpus of Islamicate Texts

Lorenz Nigst; Maxim Romanov; Sarah Bowen Savant; Masoumeh Seydi; Peter Verkinderen


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nmm##2200000uu#4500</leader>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Arabic; Classical Arabic; Corpus</subfield>
  </datafield>
  <controlfield tag="005">20220708131836.0</controlfield>
  <controlfield tag="001">4075046</controlfield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">University of Vienna</subfield>
    <subfield code="a">Maxim Romanov</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Aga Khan University</subfield>
    <subfield code="a">Sarah Bowen Savant</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">University of Leipzig</subfield>
    <subfield code="a">Masoumeh Seydi</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Aga Khan University</subfield>
    <subfield code="a">Peter Verkinderen</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">4063843696</subfield>
    <subfield code="z">md5:491639bebaa364ca766221615039c32b</subfield>
    <subfield code="u">https://zenodo.org/record/4075046/files/OpenITI-v2020.2.3.zip</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2020-10-06</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire_data</subfield>
    <subfield code="o">oai:zenodo.org:4075046</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">Aga Khan University</subfield>
    <subfield code="a">Lorenz Nigst</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">OpenITI: a Machine-Readable Corpus of Islamicate Texts</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode</subfield>
    <subfield code="a">Creative Commons Attribution Non Commercial Share Alike 4.0 International</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;&lt;strong&gt;Co-PIs&lt;/strong&gt;: Matthew Thomas Miller (University of Maryland, College Park), Maxim G. Romanov (University of Vienna), Sarah Bowen Savant (Aga Khan University&amp;mdash;ISMC, London).&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Open Islamicate Texts Initiative&lt;/em&gt; (&lt;strong&gt;OpenITI&lt;/strong&gt;, see &lt;a href="https://iti-corpus.github.io/"&gt;https://iti-corpus.github.io/&lt;/a&gt;)&amp;nbsp;is a multi-institutional effort to construct the first machine-actionable scholarly corpus of premodern Islamicate texts. Led by researchers at the Aga Khan University, Institute for the Study of Muslim Civilisations (AKU-ISMC), University of Vienna (UV), Leipzig University (LU), and the Roshan Institute for Persian Studies at the University of Maryland (College Park) and an interdisciplinary advisory board of leading digital humanists and Islamic, Persian, and Arabic studies scholars, &lt;strong&gt;OpenITI&lt;/strong&gt; aims to provide the essential textual infrastructure in Arabic, Persian and other Islamicate languages for new forms of textual analysis and digital scholarship. In the process, OpenITI will enable new synergies between Digital Humanities and the inter-related Islamicate fields of Islamic, Persian, and Arabic Studies. In addition to support from the researchers&amp;rsquo; home institutions, it is supported by funding from the &lt;a href="https://erc.europa.eu/"&gt;European Research Council&lt;/a&gt; under the European Union&amp;rsquo;s Horizon 2020 research and innovation programme, awarded to the &lt;a href="http://kitab-project.org/"&gt;KITAB&lt;/a&gt; project (Grant Agreement No. 772989, PI Sarah Bowen Savant) and the &lt;a href="https://www.qnl.qa/en"&gt;Qatar National Library&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;Currently, &lt;strong&gt;OpenITI&lt;/strong&gt; contains almost exclusively Arabic texts, which were first assembled into a corpus within the &lt;strong&gt;OpenArabic&lt;/strong&gt; project, developed first at Tufts University (at &lt;em&gt;The Perseus Project&lt;/em&gt;, 2013&amp;ndash;2015) and then at Leipzig University (at the Alexander von Humboldt Chair for Digital Humanities, 2015&amp;ndash;2017)&amp;mdash;in both cases with the support and under the patronage of Prof. Gregory Crane. The much more limited number of Persian texts were compiled during 2015&amp;ndash;2016 in the Persian Digital Library (PDL) pilot (see &lt;a href="https://persdigumd.github.io/PDL/"&gt;Persian Digital Library by PersDigUMD&lt;/a&gt;) at Roshan Institute for Persian Studies at the University of Maryland. These texts have not been made fully compatible with OpenITI mARkdown yet and will be made fully available in next releases.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Note on Release Numbering&lt;/strong&gt;: Version &lt;strong&gt;2019.1.1&lt;/strong&gt;&amp;mdash;where &lt;strong&gt;2019&lt;/strong&gt; is the year of the release, the first dotted number&amp;mdash;&lt;strong&gt;.1&lt;/strong&gt;&amp;mdash;is the ordinal release number in 2019, and the second dotted number&amp;mdash;&lt;strong&gt;.1&lt;/strong&gt;&amp;mdash;is the overall release number; the first dotted number will reset every year, while the second one will continue on increasing.&lt;/p&gt;

&lt;p&gt;For more details: &lt;a href="https://github.com/OpenITI/RELEASE"&gt;https://github.com/OpenITI/RELEASE&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">url</subfield>
    <subfield code="i">isSupplementTo</subfield>
    <subfield code="a">https://github.com/OpenITI/RELEASE/tree/v2019.1.1</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.3082463</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.4075046</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
</record>
5,824
1,672
views
downloads
All versions This version
Views 5,824731
Downloads 1,67261
Data volume 6.4 TB247.9 GB
Unique views 4,529619
Unique downloads 76255

Share

Cite as