Software Open Access
Biester, Laura;
Banea, Carmen;
Mihalcea, Rada
Code for building and evaluating location embeddings for the 2020 AACL-IJCNLP paper "Building Location Embeddings from Physical Trajectories and Textual Representations."
Abstract: Word embedding methods have become the de-facto way to represent words, having been successfully applied to a wide array of natural language processing tasks. In this paper, we explore the hypothesis that embedding methods can also be effectively used to represent spatial locations. Using a new dataset consisting of the location trajectories of 729 students over a seven month period and text data related to those locations, we implement several strategies to create location embeddings, which we then use to create embeddings of the sequences of locations a student has visited. To identify the surface level properties captured in the representations, we propose a number of probing tasks such as the presence of a specific location in a sequence or the type of activities that take place at a location. We then leverage the representations we generated and employ them in more complex downstream tasks ranging from predicting a student's area of study to a student's depression level, showing the effectiveness of these location embeddings.
Contact: Please contact Laura Biester (lbiester@umich.edu) with questions.
Name | Size | |
---|---|---|
Location-Embeddings.zip
md5:1f2671cb1cc2fdf33b4429e4eceea890 |
196.7 kB | Download |
All versions | This version | |
---|---|---|
Views | 157 | 157 |
Downloads | 23 | 23 |
Data volume | 4.5 MB | 4.5 MB |
Unique views | 149 | 149 |
Unique downloads | 20 | 20 |