Published August 18, 2022 | Version v1
Software Open

National impacts of e-commerce growth: Development of a spatial demand based tool

Authors/Creators

  • 1. University of California, Davis

Description

This project studies the impacts of e-commerce on shopping behaviors and the related externalities. The work is divided into five major tasks. Methods include Weighted Multinomial Logit (WMNL) models, time series forecasting, and Monte Carlo (MC) simulations. The American Time Use Survey (ATUS) and the National Household Travel Survey (NHTS) databases are used to identify the independent and dependent variables for behavioral modeling. We also collected population data for all metropolitan statistical areas (MSAs) from the U.S. Census Bureau and combined them with the shares of each variable from ATUS to generate a synthesized population, which, together with the behavioral model, serves as input to the MC simulation framework. The framework generates shopping travel parameters and calculates negative externalities, which lets us estimate e-commerce demand and impacts for every decade until 2050. The results and analyses provide the inputs the MC simulation needs to generate shopping travel and estimate a series of negative externalities: shopping travel parameters, last-mile delivery parameters, and emission rates per person. For each parameter, an MSA-specific probability distribution or regression relation is obtained and fed into the subsequent MC simulation. Finally, we simulated shopping behaviors for the synthesized populations (until 2050) and estimated the expected negative externalities. The MC simulation produces aggregate average vehicle miles traveled (VMT) and emissions (negative externalities) for different shopping activities, by planning year and MSA.
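The pipeline described above (behavioral model, synthesized population, MC draws, aggregate VMT) can be illustrated with a minimal sketch. It is not the project's code: the alternatives, coefficients, person attributes, and the exponential tour-length distribution are all hypothetical placeholders, and it uses a plain multinomial logit for prediction (the survey weighting in WMNL affects estimation, which is not shown here).

```python
import math
import random

# Hypothetical shopping alternatives and utility coefficients.
# These are illustrative values, NOT the estimated WMNL parameters.
ALTERNATIVES = ["no_shopping", "in_store", "online", "both"]
COEFFS = {
    "no_shopping": {"const": 0.0, "age": 0.0, "income": 0.0},
    "in_store": {"const": -0.5, "age": 0.01, "income": 0.1},
    "online": {"const": -1.5, "age": -0.01, "income": 0.3},
    "both": {"const": -2.5, "age": 0.0, "income": 0.3},
}


def choice_probabilities(person):
    """Multinomial logit: P(j) = exp(V_j) / sum_k exp(V_k)."""
    utilities = {
        alt: c["const"] + c["age"] * person["age"] + c["income"] * person["income"]
        for alt, c in COEFFS.items()
    }
    v_max = max(utilities.values())  # subtract the max for numerical stability
    exp_v = {alt: math.exp(v - v_max) for alt, v in utilities.items()}
    total = sum(exp_v.values())
    return {alt: e / total for alt, e in exp_v.items()}


def simulate(population, n_draws, mean_tour_miles, rng):
    """Return average in-store shopping VMT per person per simulated day."""
    total_vmt = 0.0
    for _ in range(n_draws):
        for person in population:
            probs = choice_probabilities(person)
            weights = [probs[a] for a in ALTERNATIVES]
            choice = rng.choices(ALTERNATIVES, weights=weights)[0]
            if choice in ("in_store", "both"):
                # Assumed exponential tour length; the project instead fits
                # MSA-specific distributions or regressions from NHTS.
                total_vmt += rng.expovariate(1.0 / mean_tour_miles)
    return total_vmt / (n_draws * len(population))


rng = random.Random(42)
# Toy synthesized population with two made-up attributes.
population = [
    {"age": rng.randint(18, 80), "income": rng.uniform(1.0, 10.0)}
    for _ in range(200)
]
avg_vmt = simulate(population, n_draws=50, mean_tour_miles=6.0, rng=rng)
print(f"Average shopping VMT per person-day: {avg_vmt:.2f}")
```

In a fuller version, the population would be built from Census totals and ATUS variable shares, and the loop would be repeated per MSA and planning year.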

Notes

These data come from multiple sources and support the project titled "National Impacts of E-commerce Growth: Development of a Spatial Demand Based Tool", funded by the National Center for Sustainable Transportation (NCST). The purpose of this project is to study the impacts of e-commerce on consumers' shopping behaviors and the related externalities. Methods used include Weighted Multinomial Logit (WMNL) models, time series forecasting, and Monte Carlo (MC) simulations.

This project makes use of the following six datasets:

1. American Time Use Survey (ATUS)

The project uses the 2004-2020 ATUS data to analyze shopping behaviors. The ATUS data are used mainly to specify the shopping behavior models and to extract variables for the six chosen metropolitan areas.

The ATUS data can be accessed at: https://timeuse.ipums.org/ 

2. National Household Travel Survey (NHTS)

The project uses the 2009 and 2017 NHTS data, which come from trip-based surveys, to extract shopping travel parameters and last-mile delivery parameters for the six chosen metropolitan areas. Extracting shopping tours from NHTS requires identifying trip chains; we developed scripts to convert the raw trip-based data to tour-based data (refer to the code related to this project).

The NHTS data can be accessed at: https://nhts.ornl.gov/ 
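To illustrate what converting trip-based records to tour-based records involves, here is a minimal sketch of trip chaining. It is not the project's script: the `Trip` fields and the place labels ("home", "shop", "work") are hypothetical simplifications and do not match the actual NHTS variable names or purpose codes.

```python
from dataclasses import dataclass


@dataclass
class Trip:
    person_id: str
    seq: int          # order of the trip within the travel day
    origin: str       # place type at the origin (hypothetical labels)
    destination: str  # place type at the destination
    miles: float


def build_tours(trips):
    """Group one person's trips into home-based tours (home -> ... -> home)."""
    tours, current = [], []
    for trip in sorted(trips, key=lambda t: t.seq):
        if trip.origin == "home" and current:
            # The previous chain never returned home; close it as an open tour.
            tours.append(current)
            current = []
        current.append(trip)
        if trip.destination == "home":
            tours.append(current)
            current = []
    if current:  # day ended away from home
        tours.append(current)
    return tours


def shopping_tours(tours):
    """Keep tours with at least one shopping stop; report stops and mileage."""
    return [
        {
            "stops": [t.destination for t in tour],
            "miles": sum(t.miles for t in tour),
        }
        for tour in tours
        if any(t.destination == "shop" for t in tour)
    ]


trips = [
    Trip("p1", 1, "home", "work", 10.0),
    Trip("p1", 2, "work", "shop", 2.0),
    Trip("p1", 3, "shop", "home", 9.0),
    Trip("p1", 4, "home", "shop", 3.0),
    Trip("p1", 5, "shop", "home", 3.0),
]
result = shopping_tours(build_tours(trips))
for tour in result:
    print(tour)
```

The example day yields two shopping tours: a multi-stop tour with a shopping stop on the way home from work, and a dedicated home-shop-home tour. Distinguishing these cases is what makes tour-level shopping mileage differ from a simple sum of shopping trips.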

3. Population Projections

The population projection data are produced in five-year increments from 2020 through 2100. These files are provided in .csv format. This project uses the data from 2020 through 2050. 

The projections are provided as totals, or segmented by age category (in four-year increments), race/ethnicity, and sex.

Note that the population projections are publicly available at the following DOI: https://doi.org/10.17605/OSF.IO/9YNFC 
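As a rough illustration of restricting the projection files to the 2020-2050 window used here, the sketch below sums population by year from a CSV. The column names (`year`, `county`, `sex`, `population`) and the sample rows are invented for the example; the actual files may use a different layout.

```python
import csv
import io
from collections import defaultdict

# Made-up sample rows standing in for a projection file.
SAMPLE = """year,county,sex,population
2020,06113,F,100
2020,06113,M,110
2050,06113,F,130
2055,06113,F,135
"""


def totals_by_year(csv_file, first_year=2020, last_year=2050):
    """Sum population per projection year, keeping only the study window."""
    totals = defaultdict(float)
    for row in csv.DictReader(csv_file):
        year = int(row["year"])
        if first_year <= year <= last_year:
            totals[year] += float(row["population"])
    return dict(totals)


totals = totals_by_year(io.StringIO(SAMPLE))
print(totals)
```

With a real file, `io.StringIO(SAMPLE)` would be replaced by an open file handle, for example `open(path, newline="")`.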

4. Individual Income Tax ZIP Code Data

The Individual Income Tax ZIP Code data show selected income and tax items classified by State, ZIP Code, and size of adjusted gross income. Data are based on individual income tax returns filed with the IRS. 

The Tax ZIP Code data can be accessed at: https://www.irs.gov/statistics/soi-tax-stats-data-by-geographic-area 

5. MOVES Emission Rates

The emission rates are compiled from EPA's Motor Vehicle Emission Simulator (MOVES) model for the six chosen metropolitan areas and planning years (2020-2050). 

The emission estimates are stored in MOVES_Emission_Rates.xlsx. 
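A minimal sketch of how per-mile emission rates might be combined with simulated VMT follows. The rate values, the MSA label, the pollutant names, and the grams-per-mile unit are assumptions made for illustration; they are not taken from MOVES_Emission_Rates.xlsx, whose layout is not described here.

```python
# Hypothetical emission rates in grams per mile, keyed by (MSA, planning year).
# Real values would be read from MOVES_Emission_Rates.xlsx.
RATES_G_PER_MILE = {
    ("MSA_A", 2020): {"CO2": 400.0, "NOx": 0.30},
    ("MSA_A", 2050): {"CO2": 250.0, "NOx": 0.05},
}


def emissions_kg(msa, year, vmt_miles):
    """Convert vehicle miles traveled into kilograms emitted per pollutant."""
    rates = RATES_G_PER_MILE[(msa, year)]
    return {
        pollutant: rate * vmt_miles / 1000.0  # grams -> kilograms
        for pollutant, rate in rates.items()
    }


print(emissions_kg("MSA_A", 2020, 1000.0))
```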

6. ZIP code level geographic data

These data contain geographic boundaries for the six chosen metropolitan areas, as well as the socio-demographic data for each of the ZIP codes. 

These data are stored in multiple files in different formats (specified below) in the folder geographic_data/. 

Funding provided by: National Center for Sustainable Transportation
Award Number: USDOT Grant 69A3551747114

Files

Files (23.7 MB)

Name: code.zip
Size: 23.7 MB
md5:dadcdc646b5f4fa03c90ad2dd2a2c83a

Additional details

Related works

Is cited by
10.7922/G25B00SM (DOI)
Is source of
10.25338/B89H0F (DOI)