Dataset Open Access

Mars Target Encyclopedia - LPSC abstracts labeled data set

Raymond Francis; Kiri Wagstaff


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nmm##2200000uu#4500</leader>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Kiri L. Wagstaff, Raymond Francis, Thamme Gowda, You Lu, Ellen Riloff, Karanjeet Singh, and Nina Lanza. "Mars Target Encyclopedia: Rock and Soil Composition Extracted from the Literature."  Proceedings of the Thirtieth Annual Conference on Innovative Applications of Artificial Intelligence, 2018.</subfield>
  </datafield>
  <datafield tag="041" ind1=" " ind2=" ">
    <subfield code="a">eng</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Mars</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">information extraction</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">named entity recognition</subfield>
  </datafield>
  <controlfield tag="005">20171115083007.0</controlfield>
  <controlfield tag="001">1048419</controlfield>
  <datafield tag="711" ind1=" " ind2=" ">
    <subfield code="d">Feb. 2-7, 2018</subfield>
    <subfield code="g">IAAI</subfield>
    <subfield code="a">Thirtieth Annual Conference on Innovative Applications of Artificial Intelligence</subfield>
    <subfield code="c">New Orleans, LA</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Jet Propulsion Laboratory</subfield>
    <subfield code="0">(orcid)0000-0003-4401-5506</subfield>
    <subfield code="a">Kiri Wagstaff</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">567483</subfield>
    <subfield code="z">md5:fb31dc47cfaf6627e4492bb4325aa848</subfield>
    <subfield code="u">https://zenodo.org/record/1048419/files/lpsc-annotated.zip</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="y">Conference website</subfield>
    <subfield code="u">https://aaai.org/Conferences/AAAI-18/iaai-18/</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2017-11-14</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire_data</subfield>
    <subfield code="p">user-zenodo</subfield>
    <subfield code="o">oai:zenodo.org:1048419</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">Jet Propulsion Laboratory</subfield>
    <subfield code="a">Raymond Francis</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Mars Target Encyclopedia - LPSC abstracts labeled data set</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-zenodo</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">https://creativecommons.org/licenses/by-sa/4.0/</subfield>
    <subfield code="a">Creative Commons Attribution Share-Alike 4.0</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;This data set contains annotated text versions of 2-page abstracts published at the Lunar and Planetary Science Conference in 2015 and 2016.&lt;/p&gt;

&lt;p&gt;The original PDF abstracts are available at:&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;https://www.hou.usra.edu/meetings/lpsc2015/programAbstracts/view/&lt;/li&gt;
	&lt;li&gt;https://www.hou.usra.edu/meetings/lpsc2016/programAbstracts/view/&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The text files in this archive were extracted using the Apache Tika PDF parsing tool.  The text is provided here so that the annotations can be viewed.  The text content remains copyright of the original abstract authors.&lt;/p&gt;

&lt;p&gt;The annotations (entities and relations) are provided in the format used by the brat annotation tool.  To view the annotations in a web-based graphical form, install the brat tool (http://brat.nlplab.org/).  These annotations were generated using brat v1.3.  The annotation files are also human-readable and can be parsed in to be used directly in code.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Contents&lt;/strong&gt;:&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;lpsc15/: 62 abstracts&lt;/li&gt;
	&lt;li&gt;lpsc16/: 55 abstracts&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Each directory contains a .txt and .ann file for each abstract.  The .ann file is in brat standoff format (http://brat.nlplab.org/standoff.html).&lt;/p&gt;

&lt;p&gt;Additional .conf files are provided to generate color highlighting and keyboard shortcuts.  These are used by the brat tool.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Attribution&lt;/strong&gt;:&lt;/p&gt;

&lt;p&gt;If you use this data set in your own work, please cite this DOI:&lt;/p&gt;

&lt;p&gt;10.5281/zenodo.1048419&lt;/p&gt;

&lt;p&gt;Please also cite this paper, which provides additional details about the data set.&lt;/p&gt;

&lt;p&gt;Kiri L. Wagstaff, Raymond Francis, Thamme Gowda, You Lu, Ellen Riloff, Karanjeet Singh, and Nina Lanza. "Mars Target Encyclopedia: Rock and Soil Composition Extracted from the Literature."  &lt;em&gt;Proceedings of the Thirtieth Annual Conference on Innovative Applications of Artificial Intelligence&lt;/em&gt;, 2018.&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isPartOf</subfield>
    <subfield code="a">10.5281/zenodo.1048418</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.1048419</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
</record>

Share

Cite as