3382057
doi
10.1145/3202662
oai:zenodo.org:3382057
user-cutler-h2020
user-eu
Giorgos Kordopatis-Zilos
Information Technologies Institute, CERTH, Greece
Symeon Papadopoulos
Information Technologies Institute, CERTH, Greece
Yiannis Kompatsiaris
Information Technologies Institute, CERTH, Greece
Location Extraction from Social Media: Geoparsing, Location Disambiguation and Geotagging
Stuart E. Middleton
Electronic and Computer Science, University of Southampton, UK
info:eu-repo/semantics/openAccess
Creative Commons Attribution 4.0 International
https://creativecommons.org/licenses/by/4.0/legalcode
Linguistic processing
Natural Language Processing
Text analysis
Location Extraction
Toponym Extraction
Information Extraction
Geoparsing
Geocoding
Geotagging
<p>Location extraction, also called toponym extraction, is a field covering geoparsing, extracting spatial representations from location mentions in text, and geotagging, assigning spatial coordinates to content items. This paper evaluates five ‘best of class’ location extraction algorithms. We develop a geoparsing algorithm using an OpenStreetMap database, and a geotagging algorithm using a language model constructed from social media tags and multiple gazetteers. Third party work evaluated includes a DBpediabased entity recognition and disambiguation approach, a named entity recognition and Geonames gazetteer approach and a Google Geocoder API approach. We perform two quantitative benchmark evaluations, one geoparsing tweets and one geotagging Flickr posts, to compare all approaches. We also perform a qualitative evaluation recalling top N location mentions from tweets during major news events. The OpenStreetMap approach was best (F1 0.90+) for geoparsing English, and the language model approach was best (F1 0.66) for Turkish. The language model was best (F1@1km 0.49) for the geotagging evaluation. The map-database was best (R@20 0.60+) in the qualitative evaluation. We report on strengths, weaknesses and a detailed failure analysis for the approaches and suggest concrete areas for further research.</p>
Stuart E. Middleton, Giorgos Kordopatis-Zilos, Symeon Papadopoulos, Yiannis Kompatsiaris, "Location Extraction from Social Media: Geoparsing, Location Disambiguation and Geotagging", in Proc. 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, Paris, France, July 2019.
Zenodo
2019-08-30
info:eu-repo/semantics/conferencePaper
3382056
user-cutler-h2020
user-eu
1.0
award_title=Coastal Urban developmenT through the LEnses of Resiliency; award_number=770469; award_identifiers_scheme=url; award_identifiers_identifier=https://cordis.europa.eu/projects/770469; funder_id=00k4n6c32; funder_name=European Commission;
1606850838.100164
434372
md5:74f9c999a8f03646cf75219ea05575c4
https://zenodo.org/records/3382057/files/Location Extraction from Social Media.pdf
public
ACM Transactions on Information Systems
36
4
2019-08-30