Text Mining Scholarly Publications using APIs

Sarraf, Ishita; Fu, Yuanxi; Schneider, Jodi

doi:10.5281/zenodo.10581542

Published October 27, 2023 | Version v1

Conference paper | Open access

1. Grinnell College
2. University of Illinois Urbana-Champaign

Researchers often build custom datasets for their work rather than using whole corpora of scholarly publications. In this extended abstract, I describe a pipeline I am constructing to make creating these custom datasets easy. The pipeline is designed to be reusable: given the Digital Object Identifiers (DOIs) of scholarly papers, it extracts their full texts where available, so that researchers can assemble their own datasets for analysis. It uses the Crossref, Elsevier, and Wiley text and data mining (TDM) APIs to handle licensing and other access issues in full-text extraction, letting researchers focus on their analysis.
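
The first step of such a pipeline, looking up a DOI in Crossref and finding full-text links meant for text mining, might be sketched as follows. This is a minimal illustration and not the code in the authors' repository. It assumes the public Crossref REST API endpoint `https://api.crossref.org/works/{doi}` and the `link` and `license` fields of its JSON response. The function names, the `tdm-sketch/0.1` User-Agent string, and the `mailto` parameter are hypothetical choices made for this example.

```python
import json
import urllib.parse
import urllib.request

# Assumption: the public Crossref REST API works endpoint.
CROSSREF_WORKS = "https://api.crossref.org/works/"


def crossref_url(doi: str) -> str:
    """Build the Crossref lookup URL for one DOI, percent-encoding unsafe characters."""
    return CROSSREF_WORKS + urllib.parse.quote(doi)


def text_mining_links(message: dict) -> list[dict]:
    """Return the full-text links that the publisher flagged for text mining."""
    return [
        {"url": link["URL"], "content_type": link.get("content-type", "unspecified")}
        for link in message.get("link", [])
        if link.get("intended-application") == "text-mining" and "URL" in link
    ]


def license_urls(message: dict) -> list[str]:
    """Return the license URLs attached to the record, if any."""
    return [lic["URL"] for lic in message.get("license", []) if "URL" in lic]


def fetch_crossref_message(doi: str, mailto: str) -> dict:
    """Fetch Crossref metadata for a DOI. This performs a network request."""
    request = urllib.request.Request(
        crossref_url(doi),
        # Hypothetical client name; Crossref asks clients to identify themselves.
        headers={"User-Agent": f"tdm-sketch/0.1 (mailto:{mailto})"},
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)["message"]
```

A caller would pass the result of `fetch_crossref_message` to `text_mining_links` and `license_urls`, then decide per DOI whether a full text can be downloaded. Retrieving the text itself from Elsevier or Wiley requires publisher-issued credentials and is not shown here.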

Files

Name	Size
ASIST_METSTI2023_poster_Sarraf_et_al.pdf (md5:f5dfbbbb2887f22d4ff54fe516bfea18)	343.3 kB

Additional details

Is version of: Presentation: 2142/120049 (Handle)

U.S. National Science Foundation
Sustainable Diversity in the Computing Research Pipeline 1246649
U.S. National Science Foundation
CAREER: Using network analysis to assess confidence in research synthesis 2046454

Repository URL: https://github.com/infoqualitylab/text-mining-scholarly-API
Programming language: Python

Views: 540
Downloads: 127
Data volume: 67.1 MB

Resource type

Conference paper

Publisher

Zenodo

Conference

METSTI 2023: Workshop on Informetric, Scientometric, and Scientific and Technical Information Research, London, UK, October 27, 2023

Languages

English

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited.

Technical metadata

Created: March 5, 2024
Modified: July 7, 2024
