#brexit tweets collected in May and June 2016

brexit_timeline.svg

Files

Collecting the data

You can get the data using twarc and poultry:

# Set up the credentials
export CONSUMER_KEY=...
export CONSUMER_SECRET=...
export ACCESS_TOKEN=...
export ACCESS_TOKEN_SECRET=...

mkdir t

# Hydrate the tweets using twarc and group them with poultry by day.
time gzcat brexit_tweets_ids.csv.gz | twarc.py --hydrate - | poultry group -t 't/%Y-%m-%d.gz'
t/2016-05-06.gz
t/2016-05-07.gz
...


# Half a million tweets are collected in about 4 hours.

Software

Attribution

The tweets were collected with Poultry using the infrastructure of the School of Electronic Engineering and Computer Science at Queen Mary University of London.

Poultry was created as a part of my master thesis where it was used to collect tweets about music festivals in Europe and the London Olympics. Later it resulted in a paper presented at the WWW 13 workshop RAMSS in Rio.