Conference paper Open Access

Connecting Resources: Which Issues Have to be Solved to Integrate CMC Corpora from Heterogeneous Sources and for Different Languages?

Beißwenger, Michael; Wigham, Ciara; Etienne, Carole; Grumt Suárez, Holger; Herzberg, Laura; Darja Fišer; Hinrichs, Erhard; Horsmann, Tobias; Karlova-Bourbonus, Natali; Lemnitzer, Lothar; Longhi, Julien; Lüngen, Harald; Ho-Dac, Lydia-Mai; Parisse, Christophe; Poudat, Céline; Schmidt, Thomas; Stemle, Egon W.; Storrer, Angelika; Zesch, Torsten


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nam##2200000uu#4500</leader>
  <datafield tag="041" ind1=" " ind2=" ">
    <subfield code="a">eng</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">corpora</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">research infrastructures</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">annotation</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">anonymisation</subfield>
  </datafield>
  <controlfield tag="005">20171104025317.0</controlfield>
  <controlfield tag="001">1041877</controlfield>
  <datafield tag="711" ind1=" " ind2=" ">
    <subfield code="d">3-4 October 2017</subfield>
    <subfield code="g">cmccorpora17</subfield>
    <subfield code="a">5th Conference on CMC and Social Media Corpora for the Humanities</subfield>
    <subfield code="c">Bolzano, Italy</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Université Clermont Auvergne, France</subfield>
    <subfield code="a">Wigham, Ciara</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">ICAR Laboratory Lyon, France</subfield>
    <subfield code="a">Etienne, Carole</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Justus-Liebig-Universität Gießen, Germany</subfield>
    <subfield code="a">Grumt Suárez, Holger</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">University of Mannheim, Germany</subfield>
    <subfield code="a">Herzberg, Laura</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Faculty of Arts - University of Ljubljana, Slovenia</subfield>
    <subfield code="a">Darja Fišer</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Eberhard-Karls-Universität Tübingen, Germany</subfield>
    <subfield code="a">Hinrichs, Erhard</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">University of Duisburg-Essen, Germany</subfield>
    <subfield code="a">Horsmann, Tobias</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Justus-Liebig-Universität Gießen, Germany</subfield>
    <subfield code="a">Karlova-Bourbonus, Natali</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Berlin-Brandenburg Academy of Sciences, Germany</subfield>
    <subfield code="a">Lemnitzer, Lothar</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Université de Cergy-Pontoise, France</subfield>
    <subfield code="a">Longhi, Julien</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Institute for the German Language, Germany</subfield>
    <subfield code="a">Lüngen, Harald</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Université Toulouse 2, France</subfield>
    <subfield code="a">Ho-Dac, Lydia-Mai</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Université Paris Nanterre, France</subfield>
    <subfield code="a">Parisse, Christophe</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Université Nice Côte d'Azur, France</subfield>
    <subfield code="a">Poudat, Céline</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Institute for the German Language, Germany</subfield>
    <subfield code="a">Schmidt, Thomas</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Eurac Research, Italy</subfield>
    <subfield code="0">(orcid)0000-0002-7655-5526</subfield>
    <subfield code="a">Stemle, Egon W.</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">University of Mannheim, Germany</subfield>
    <subfield code="a">Storrer, Angelika</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">University of Duisburg-Essen, Germany</subfield>
    <subfield code="a">Zesch, Torsten</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">248895</subfield>
    <subfield code="z">md5:5d82d31da8d4c6ac337663119b4536d7</subfield>
    <subfield code="u">https://zenodo.org/record/1041877/files/cmccorpora17-28.pdf</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="y">Conference website</subfield>
    <subfield code="u">https://cmc-corpora2017.eurac.edu</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2017-09-30</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire</subfield>
    <subfield code="p">user-cmccorpora17</subfield>
    <subfield code="o">oai:zenodo.org:1041877</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">University Duisburg-Essen, Germany</subfield>
    <subfield code="a">Beißwenger, Michael</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Connecting Resources: Which Issues Have to be Solved to Integrate CMC Corpora from Heterogeneous Sources and for Different Languages?</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-cmccorpora17</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">http://creativecommons.org/licenses/by/4.0/legalcode</subfield>
    <subfield code="a">Creative Commons Attribution 4.0 International</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;The paper reports on the results of a scientific colloquium dedicated to the creation of standards and best practices which are needed to facilitate the integration of language resources for CMC stemming from different origins and the linguistic analysis of CMC phenomena in different languages and genres. The key issue to be solved is that of interoperability – with respect to the structural representation of CMC genres, linguistic annotations metadata, and anonymization/pseudonymization schemas. The objective of the paper is to convince more projects to partake in a discussion about standards for CMC corpora and for the creation of a CMC corpus infrastructure across languages and genres. In view of the broad range of corpus projects which are currently underway all over Europe, there is a great window of opportunity for the creation of standards in a bottom-up approach.&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isPartOf</subfield>
    <subfield code="a">10.5281/zenodo.1040713</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.1041876</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="b">cmc-corpora conference series</subfield>
    <subfield code="t">Proceedings of the 5th Conference on CMC and Social Media Corpora for the Humanities (cmccorpora17)</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.1041877</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">publication</subfield>
    <subfield code="b">conferencepaper</subfield>
  </datafield>
</record>
40
30
views
downloads
All versions This version
Views 4040
Downloads 3030
Data volume 7.5 MB7.5 MB
Unique views 3838
Unique downloads 2727

Share

Cite as