There is a newer version of the record available.

Published November 29, 2021 | Version 2.0
Dataset Open

Modern China Geospatial Database - Main Dataset

  • 1. Aix-Marseille University


MCGD_Data_V2 contains all the data that we have collected on locations in modern China. Altogether there are 466,162 entries. The data include the name of locations and their variants in Chinese, pinyin, and any recorded transliteration; the name of the province in Chinese and in pinyin; Province ID; the latitude and longitude; the Name ID and Location ID. The Name IDs all start with N followed by seven digits, except for locations in Taiwan that start with "T" (data from Geonames). This is the internal ID system of MCGD. Locations IDs that start with "DH" are data points extracted from China Historical GIS (Harvard University); those that start with "D" are locations extracted from the data points in Geonames; those that have only digits (8 digits) are data points we have added from various map sources.

One of the main features of the MCGD Main Dataset is the systematic collection and compilation of place names from non-Chinese language historical sources. Locations were designated in transliteration systems that are hardly comprehensible today, which makes it very difficult to find the actual locations they correspond to. This dataset allows for the conversion from these obsolete transliterations to the current names and geocoordinates.



Files (57.9 MB)

Name Size Download all
30.4 MB Preview Download
27.5 MB Download