Published August 6, 2024 | Version v2
Dataset Restricted

Human Mobility Prediction Challenge 2024: Multi-City Prediction

  • 1. Massachusetts Institute of Technology
  • 2. LY Corporation

Description

The challenge takes place in 4 metropolitan areas (cities A, B, C, D), somewhere in Japan. Each area is divided into 500 meters x 500 meters cells, which span a 200 x 200 grid. The human mobility datasets contain the movement of individuals across a 75-day period, discretized into 30-minute intervals and 500-meter grid cells.

The task is to predict the movement of 3000 individuals each in cities B, C, and D, during days 61 to 75, using movement data of individuals in city A (100,000 individuals' full trajectories from day 1 to 75) and cities B, C, D (total of 25,000, 20,000, and 6,000 individuals each). The locations (x,y) that need to be predicted are masked with cell numbers '999'. 

Not all cities’ data are required to be used for prediction. For instance, to predict city B’s movement from days 61 to 75, one can just use the movement patterns in city B between 1 to 60. Using data from other cities (e.g., city A) may or may not improve the prediction accuracy!  

While the name or location of the city is not disclosed, the participants are provided with points-of-interest (POIs; e.g., restaurants, parks) data for each grid cell (~85 dimensional vector) for the four cities as supplementary information (e.g., POIdata_cityA). The list of 85 POI categories can be found in POI_datacategories.csv. 

For more details of the HuMob Data Challenge 2024, see https://wp.nyu.edu/humobchallenge2024/ 

If you want to participate in the challenge and use this dataset, please 'Request Access' with your teamname, team leader's name (please choose 1 person from your team), and email address. 

Researchers may use this dataset for publications and reports, as long as: 1) Users shall not carry out activities that involve unethical usage of the data, including attempts at re-identifying data subjects, harming individuals, or damaging companies, and 2) The Data Descriptor paper of an earlier version of the dataset (citation below) needs to be cited when using the data for research and/or commercial purposes. Downloading this dataset implies agreement with the above two conditions. 

  • Yabe, T., Tsubouchi, K., Shimizu, T., Sekimoto, Y., Sezaki, K., Moro, E., & Pentland, A. (2024). YJMob100K: City-scale and longitudinal dataset of anonymized human mobility trajectories. Scientific Data11(1), 397. https://www.nature.com/articles/s41597-024-03237-9  

Files

Restricted

The record is publicly accessible, but files are restricted to users with access.

Request access

If you would like to request access to these files, please fill out the form below.

You need to satisfy these conditions in order for this request to be accepted:

If you want to participate in the challenge and use this dataset, please 'Request Access' with the following information:

  1. your teamname
  2. team leader's name (please choose 1 person from your team)
  3. number of members on team
  4. email address for contact 

Researchers may use this dataset for publications and reports, as long as: 1) Users shall not carry out activities that involve unethical usage of the data, including attempts at re-identifying data subjects, harming individuals, or damaging companies, and 2) The Data Descriptor paper of an earlier version of the dataset (citation below) needs to be cited when using the data for research and/or commercial purposes. Downloading this dataset implies agreement with the above two conditions. 

  • Yabe, T., Tsubouchi, K., Shimizu, T., Sekimoto, Y., Sezaki, K., Moro, E., & Pentland, A. (2024). YJMob100K: City-scale and longitudinal dataset of anonymized human mobility trajectories. Scientific Data11(1), 397. https://www.nature.com/articles/s41597-024-03237-9  

You are currently not logged in. Do you have an account? Log in here