There is a newer version of the record available.

Published December 2, 2023 | Version MEDLINE2019-S22020-MAG2019
Dataset Open

An enhanced author name dataset for PubMed/MEDLINE

Authors/Creators

  • 1. Wuhan University

Contributors

Data collector:

Description

Incomplete author names are a well-known issue in the MEDLINE database. Full author names have been systematically indexed in MEDLINE only since 2002. Although many full author names have since been added, we still found a significant number of abbreviated names in papers published after 2002.

Here we built an enhanced author name dataset for MEDLINE, called EAN, by linking all of PubMed to other large literature databases and then conducting a large-scale name comparison and restoration using the author names obtained from these multiple sources. Our evaluation shows that more than 90% of author names in EAN are complete, compared with ~60% in MEDLINE.
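The name comparison and restoration step could look roughly like the sketch below. It is not the pipeline used to build EAN; it only illustrates the general idea under simple assumptions: each name is a hypothetical (last name, forename) pair, a candidate from another source is accepted only when its last name matches and every abbreviated forename part is a prefix of the corresponding full part, and the most complete compatible candidate wins. The function names `normalize`, `split_parts`, `is_compatible`, and `restore_name` are illustrative, not taken from the dataset.

```python
import unicodedata


def normalize(token):
    """Lowercase a name token and drop accents, spaces, and punctuation."""
    decomposed = unicodedata.normalize("NFKD", token)
    return "".join(ch for ch in decomposed if ch.isalpha()).lower()


def split_parts(fore_name):
    """Split a forename such as 'J. A.' or 'Jean-Paul' into normalized parts."""
    cleaned = fore_name.replace(".", " ").replace("-", " ")
    parts = [normalize(p) for p in cleaned.split()]
    return [p for p in parts if p]


def is_compatible(abbrev_fore, full_fore):
    """True if each abbreviated part is a prefix of the matching full part."""
    short = split_parts(abbrev_fore)
    full = split_parts(full_fore)
    # Assumption: be conservative and refuse to restore from an empty forename.
    if not short or len(short) > len(full):
        return False
    return all(f.startswith(s) for s, f in zip(short, full))


def restore_name(medline_name, candidates):
    """Pick the most complete compatible forename among the candidates.

    medline_name: (last_name, fore_name) as recorded in MEDLINE.
    candidates:   list of (last_name, fore_name) from other linked sources.
    """
    last, fore = medline_name
    best = fore
    for cand_last, cand_fore in candidates:
        if normalize(cand_last) != normalize(last):
            continue
        longer = len(normalize(cand_fore)) > len(normalize(best))
        if is_compatible(fore, cand_fore) and longer:
            best = cand_fore
    return (last, best)


# Example: an abbreviated MEDLINE name with candidates from two other sources.
print(restore_name(("Smith", "J A"), [("Smith", "John"), ("Smith", "John Andrew")]))
```

A real pipeline would also need to link records reliably across databases and handle name-order and transliteration variants, which this sketch leaves out.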

Files

Files (1.9 GB)

md5:5c12eec5886f2aa387d1b44b1f4bf3f3
Size: 1.9 GB
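
After downloading, the file can be checked against the MD5 checksum listed above. The following is a minimal sketch using only the Python standard library; the file name `ean_dataset.zip` is a placeholder, since the actual file name is not shown on this page.

```python
import hashlib

# Checksum as listed on this record page.
EXPECTED_MD5 = "5c12eec5886f2aa387d1b44b1f4bf3f3"


def file_chunks(path, chunk_size=1 << 20):
    """Yield the file contents in 1 MiB chunks to keep memory use low."""
    with open(path, "rb") as handle:
        while True:
            chunk = handle.read(chunk_size)
            if not chunk:
                break
            yield chunk


def md5_hex(chunks):
    """Return the hexadecimal MD5 digest of an iterable of byte chunks."""
    digest = hashlib.md5()
    for chunk in chunks:
        digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """True if the file at `path` matches the expected MD5 checksum."""
    return md5_hex(file_chunks(path)) == expected.lower()


# Usage (the path is hypothetical; substitute the downloaded file's name):
# print(verify_download("ean_dataset.zip"))
```

Reading in chunks matters here because the file is about 1.9 GB and should not be loaded into memory at once.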

Additional details

Dates

Submitted
2023-12-01