Poster Open Access

Leveraging Open Access publishing to fight fake news

Sylvain Massip; Charles Letaillieur


JSON Export

{
  "files": [
    {
      "links": {
        "self": "https://zenodo.org/api/files/2b68d1fc-2de3-436c-bcff-ee368f2966d4/OSC2020_14-1_Poster.pdf"
      }, 
      "checksum": "md5:af071b8dfa807b6598798a60ab090ee2", 
      "bucket": "2b68d1fc-2de3-436c-bcff-ee368f2966d4", 
      "key": "OSC2020_14-1_Poster.pdf", 
      "type": "pdf", 
      "size": 331692
    }, 
    {
      "links": {
        "self": "https://zenodo.org/api/files/2b68d1fc-2de3-436c-bcff-ee368f2966d4/OSC2020_14-2_Abstract.pdf"
      }, 
      "checksum": "md5:770433da2aa7974cd4595f9fdb583f83", 
      "bucket": "2b68d1fc-2de3-436c-bcff-ee368f2966d4", 
      "key": "OSC2020_14-2_Abstract.pdf", 
      "type": "pdf", 
      "size": 104153
    }
  ], 
  "owners": [
    75040
  ], 
  "doi": "10.5281/zenodo.3776797", 
  "stats": {
    "version_unique_downloads": 63.0, 
    "unique_views": 359.0, 
    "views": 382.0, 
    "version_views": 382.0, 
    "unique_downloads": 63.0, 
    "version_unique_views": 359.0, 
    "volume": 21833973.0, 
    "version_downloads": 72.0, 
    "downloads": 72.0, 
    "version_volume": 21833973.0
  }, 
  "links": {
    "doi": "https://doi.org/10.5281/zenodo.3776797", 
    "conceptdoi": "https://doi.org/10.5281/zenodo.3776796", 
    "bucket": "https://zenodo.org/api/files/2b68d1fc-2de3-436c-bcff-ee368f2966d4", 
    "conceptbadge": "https://zenodo.org/badge/doi/10.5281/zenodo.3776796.svg", 
    "html": "https://zenodo.org/record/3776797", 
    "latest_html": "https://zenodo.org/record/3776797", 
    "badge": "https://zenodo.org/badge/doi/10.5281/zenodo.3776797.svg", 
    "latest": "https://zenodo.org/api/records/3776797"
  }, 
  "conceptdoi": "10.5281/zenodo.3776796", 
  "created": "2020-04-30T06:11:50.455106+00:00", 
  "updated": "2020-04-30T08:20:22.179767+00:00", 
  "conceptrecid": "3776796", 
  "revision": 2, 
  "id": 3776797, 
  "metadata": {
    "access_right_category": "success", 
    "doi": "10.5281/zenodo.3776797", 
    "description": "<p>Since the very first experiences in Open Access publishing at the end of 20 th century,<br>\n(arXiv and PLOS, two pioneers of open access distribution of academic articles were<br>\ncreated in 1991 and 2001, respectively), Open Access has developed tremendously.</p>\n\n<p>Today, a significant fraction of research is published open access. Evaluation estimates<br>\nit to be as high as 28% [Piwowar, 2018] and it occupies an ever-growing position in the<br>\nscientific debate with the adoption, in 2018 of the plan S which creates an European<br>\nlevel mandate for Open Access.</p>\n\n<p>In addition to being ethically desirable per se, there are many academic, economic and<br>\nsocietal arguments in favor of open access. These arguments, based on an improvement<br>\nof the exploitation and reuse of research results, are well described theoretically in the<br>\nlitterature [Tennant, 2017]. Nevertheless, the practical demonstration of the use of Open<br>\nAccess outside research communities are not common, and we have not many reports of<br>\nthese. The objective of our project is to illustrate the possible uses of Open Access<br>\noutside of academia.</p>\n\n<p>In this study, we will examine how open access combined with the right machine<br>\nlearning tools can help fight fake news.</p>\n\n<p>Natural Language processing has been revolutionized these last years, by the use of<br>\nneural networks based language models such as word2Vec [Mikolov, 2013] and Bert<br>\n[Devlin, 2018].</p>\n\n<p>By building space representation of the words and concepts used in texts, these models<br>\nare able to take into account the meanings of studied texts. These methods have been<br>\nshown to be of use to create knowledge bases from corpus of texts [Petroni, 2019] in a<br>\nunsupervised manner. More specifically, [Tshitoyan, 2019] has shown that these<br>\nmethods, applied to a scientific corpus in an unsupervised manner, were able to retrieve<br>\nthe links between concepts that exists in the texts.</p>\n\n<p>This study will investigate how these principles will be used to build a text-mining<br>\npipeline that indicates whether a scientific claim is backed by the scientific literature or<br>\nnot.</p>\n\n<p>In this exploratory phase, the following methods will be applied:</p>\n\n<ul>\n\t<li>data from Euro Pubmed Central database will be used to train a Word2Vec model.</li>\n\t<li>claims will be restricted to health-related questions of the pattern &ldquo;Does X cure/cause/prevent Y?&rdquo;.</li>\n\t<li>Claims will then be classified by exploring the links between X, Y and the concept of cure / cause / prevent as learned in the language model.</li>\n</ul>\n\n<p>The pipeline will be evaluated with claims taken from expert-based scientific<br>\nfact-checking network such as metafact.io or sciencefeedback.co.</p>\n\n<p>By validating the principle of fact-checking scientific claims with Open Access<br>\nliterature, we hope to pave the way to improved automatic fact-checking tools, which<br>\nwill allow an increased understanding of research results by the broad public and to<br>\nshow a strong impact of open science in society.</p>", 
    "language": "eng", 
    "title": "Leveraging Open Access publishing to fight fake news", 
    "license": {
      "id": "CC-BY-4.0"
    }, 
    "relations": {
      "version": [
        {
          "count": 1, 
          "index": 0, 
          "parent": {
            "pid_type": "recid", 
            "pid_value": "3776796"
          }, 
          "is_last": true, 
          "last_child": {
            "pid_type": "recid", 
            "pid_value": "3776797"
          }
        }
      ]
    }, 
    "communities": [
      {
        "id": "osc2020"
      }
    ], 
    "keywords": [
      "Open Access", 
      "Text-mining", 
      "Fake News", 
      "Fact-checking", 
      "Word2Vec"
    ], 
    "publication_date": "2020-04-30", 
    "creators": [
      {
        "affiliation": "Opscidia", 
        "name": "Sylvain Massip"
      }, 
      {
        "affiliation": "Opscidia", 
        "name": "Charles Letaillieur"
      }
    ], 
    "meeting": {
      "url": "https://www.open-science-conference.eu", 
      "dates": "11-12 March 2020", 
      "place": "Berlin, Germany", 
      "title": "Open Science Conference 2020"
    }, 
    "access_right": "open", 
    "resource_type": {
      "type": "poster", 
      "title": "Poster"
    }, 
    "related_identifiers": [
      {
        "scheme": "doi", 
        "identifier": "10.5281/zenodo.3776796", 
        "relation": "isVersionOf"
      }
    ]
  }
}
382
72
views
downloads
All versions This version
Views 382382
Downloads 7272
Data volume 21.8 MB21.8 MB
Unique views 359359
Unique downloads 6363

Share

Cite as