Conference paper Open Access

Automatic detection of inadequate claims in biomedical articles: first steps

Koroleva Anna; Paroubek Patrick


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nam##2200000uu#4500</leader>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Aronson A. Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program. In: Proc. AMIA Symposium (2001).</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Boutron I., Altman D.G., Hopewell S., Vera-Badillo F., Tannock I., Ravaud P. Impact of spin in the abstracts of articles reporting results of randomized controlled trials in the field of Cancer: the SPIIN randomized controlled trial. J Clin Oncol, 32, 4120–4126 (2014).</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Boutron I., Dutton S., Ravaud P., Altman D.G. Reporting and interpretation of randomized controlled trials with statistically nonsignificant results for primary outcomes. JAMA, 303, 2058–2064 (2010).</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Chung G.Y. Towards identifying intervention arms in randomized controlled trials: extracting coordinating constructions. J Biomed Inform, 42(5), 790-800 (2009).</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Cohen A.M., Smalheiser N.R., McDonagh M.S., Yu C., Adams C.E., Davis J.M., Yu P.S. Automated confidence ranked classification of randomized controlled trial articles: an aid to evidence-based medicine. J Am Med Inform Assoc., 22(3):707–17 (2015).</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">De Bruijn B., Carini S., Kiritchenko S., Martin J., Sim I. Automated information extraction of key trial design elements from clinical trial publications. In: Proceedings of the AMIA Annual Symposium, 141-145 (2008).</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Friburger N., Maurel D. Finite-state transducer cascade to extract named entities in texts. Theoretical Computer Science, 313, 94-104 (2004).</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Glanville J. M., Lefebvre C., Miles J. N., Camosso-Stefinovic J. How to identify randomized controlled trials in MEDLINE: 10 years on. Journal of the Medical Library Association, 94, 130–136 (2006).</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Hall M., Frank E., Holmes G., Pfahringer B., Reutemann P., Witten I. H. The WEKA data mining software: An update. SIGKDD Explorations, 11(1) (2009).</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Haneef R., Lazarus C., Ravaud P., Yavchitz A., Boutron I. Interpretation of results of studies evaluating an intervention highlighted in Google Health News: a cross-sectional study of news. PLoS ONE, 10(10) (2015).</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Higgins J.P., Green S., eds. Cochrane handbook for systematic reviews of interventions. Wiley &amp; Sons Ltd., West Sussex (2008).</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Hirohata K., Okazaki N., Ananiadou S., Ishizuka M. Identifying sections in scientific abstracts using conditional random fields. In: Proceedings of the Third International Joint Conference on Natural Language Processing. Hyderabad, 381–388 (2008).</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Kim S.N., Martinez D., Cavedon L., Yencken L. Automatic classification of sentences to support evidence based medicine. BMC Bioinformatics, 12(Suppl 2):S5 (2011).</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Kiritchenko S., De Bruijn B., Carini S., Martin J., Sim I. ExaCT: automatic extraction of clinical trial characteristics from journal publications. BMC Med Inform Decis Mak., 10:56, doi:10.1186/1472-6947-10-56 (2010).</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Kouylekov M., Magnini B. Tree Edit Distance for Textual Entailment. RANLP (2005).</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Lazarus C., Haneef R., Ravaud P., Boutron I. Classification and prevalence of spin in abstracts of non-randomized studies evaluating an intervention. BMC Med Res Methodol., 15:85 (2015).</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Maurel D., Friburger N., Antoine J.-Y., Eshkol-Taravella I., Nouvel D. Cascades autour de la reconnaissance des entités nommées. TAL 52-1 (2011).</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">McKibbon K.A., Wilczynski N.L., Haynes R.B. Retrieving randomized controlled trials from MEDLINE: a comparison of 38 published search filters. Health Information and Libraries Journal, 26(3), 187-202 (2009).</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">McKnight L., Srinivasan P. Categorization of sentence types in medical abstracts. In: AMIA Annu. Symp. Proc., 440–444 (2003).</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Paumier S. (2016). Unitex 3.1 User Manual. http://unitexgramlab.org/releases/3.1/man/Unitex-GramLab-3.1-usermanual-en.pdf, last accessed 2017/07/12.</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Raja K., Dasot N., Tech B., Goyal P., Jonnalagadda S.R. Towards evidence-based precision medicine: extracting population information from biomedical text using binary classifiers and syntactic patterns. In: AMIA Jt Summits Transl Sci Proc, 203-212 (2016).</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Summerscales R.L., Argamon S., Bai S., Hupert J., Schwartz A. Automatic summarization of results from clinical trials. In: The 2011 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 372–377 (2011).</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Summerscales R.L., Argamon S., Hupert J., Schwartz A. Identifying treatments, groups, and outcomes in medical abstracts. In: The Sixth Midwest Computational Linguistics Colloquium (MCLC 2009) (2009).</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Xu R., Garten Y., Supekar K.S., Das A.K., Altman R.B., Garber A.M. Extracting subject demographic information from abstracts of randomized clinical trial reports. In: Proceedings of the 12th World Congress on Health (Medical) Informatics, 550-554 (2007).</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Yamamoto Y., Takagi T. A sentence classification system for multi biomedical literature summarization. In: Proceedings of the 21st International Conference on Data Engineering Workshops (2005).</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Yavchitz A., Boutron I., Bafeta A., Marroun I., Charles P., Mantz J., et al. Misrepresentation of randomized controlled trials in press releases and news coverage: a cohort study. PLoS Med, 9:e1001308 (2012).</subfield>
  </datafield>
  <datafield tag="999" ind1="C" ind2="5">
    <subfield code="x">Yavchitz A., Ravaud P., Altman D.G., Moher D., Hrobjartsson A., Lasserson T., Boutron I. A new classification of spin in systematic reviews and meta-analyses was developed and ranked according to the severity. Journal of Clinical Epidemiology, 75, 56-65 (2016).</subfield>
  </datafield>
  <datafield tag="041" ind1=" " ind2=" ">
    <subfield code="a">eng</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Inadequate Reporting</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Spin</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Biomedical Articles</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Text Classification</subfield>
  </datafield>
  <datafield tag="653" ind1=" " ind2=" ">
    <subfield code="a">Entity Extraction</subfield>
  </datafield>
  <controlfield tag="005">20200120173029.0</controlfield>
  <controlfield tag="001">1164161</controlfield>
  <datafield tag="711" ind1=" " ind2=" ">
    <subfield code="d">12 - 13 September 2017</subfield>
    <subfield code="g">MEDA</subfield>
    <subfield code="a">WORKSHOP on Curative Power of MEdical DAta</subfield>
    <subfield code="c">Constanța, Romania</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">LIMSI-CNRS</subfield>
    <subfield code="a">Paroubek Patrick</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">236588</subfield>
    <subfield code="z">md5:fee735171a73a2adc2cb7dbafdc3ff86</subfield>
    <subfield code="u">https://zenodo.org/record/1164161/files/MEDA-2017_paper_4.pdf</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="y">Conference website</subfield>
    <subfield code="u">http://eurolan.info.uaic.ro/2017/details.html#Satellite</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2017-09-12</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire</subfield>
    <subfield code="p">user-miror</subfield>
    <subfield code="o">oai:zenodo.org:1164161</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">LIMSI-CNRS</subfield>
    <subfield code="a">Koroleva Anna</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">Automatic detection of inadequate claims in biomedical articles: first steps</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">user-miror</subfield>
  </datafield>
  <datafield tag="536" ind1=" " ind2=" ">
    <subfield code="c">676207</subfield>
    <subfield code="a">Methods in Research on Research</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">http://creativecommons.org/licenses/by/4.0/legalcode</subfield>
    <subfield code="a">Creative Commons Attribution 4.0 International</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;In this article we present the first steps in developing an NLP algorithm for the automatic detection of inadequate reporting of research results (known as spin) in biomedical articles. Inadequate reporting consists of presenting the experimental treatment as having a greater beneficial effect than the research results show. We propose a scheme for an algorithm that would automatically identify important claims in article abstracts, extract possible supporting information from the article, and check the adequacy of the claims. We present the state of the art and our first experiments for three tasks related to spin detection: classification of articles according to the type of clinical trial reported; classification of sentences in the abstracts, aimed at identifying mentions of the Results and Conclusions of the experiment; and extraction of some trial characteristics. For each task, we outline possible directions for further work.&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.1164160</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.1164161</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">publication</subfield>
    <subfield code="b">conferencepaper</subfield>
  </datafield>
</record>