Dataset Open Access

#Élysée2017fr: The 2017 French Presidential Campaign on Twitter

Fraisier; Cabanac; Pitarch; Besançon; Boughanem

# README

This archive contains the #Élysée2017fr dataset.

(Initially published at https://web.archive.org/web/20200530171644if_/https://dataverse.mpi-sws.org/dataverse/icwsm18 on June 24, 2018. This dataverse being defunct now, we repost on Zenodo)


## Content

### keywords.csv
The keywords used to collect the initial dataset, each presented with the start and stop dates of use (date format: YYYY-MM-DD).


### profiles_annotations.csv
The manual profiles annotations. The file contains the following columns:

#### FROM_USER_ID
The profile's id used by Twitter

#### PROFILE_NATURE
"**individual**" if the profile is managed by a single person, else "**non individual**".
The "**non individual**" label is itself divided in 3 subcategories:
  - "**political**" for profiles of political parties or associations, and profiles representing groupes of militants.
  - "**media**" for profiles of media outlets.
  - "**other**" for profiles not included in the previous categories.

#### PARTY
The profile's political affiliation(s), indicated as the shortcut for the political party:
  - "**fi**": France Insoumise (far-left)
  - "**ps**": Parti Socialiste (left)
  - "**em**": En Marche ! (center)
  - "**lr**": Les Républicains (right)
  - "**fn**": Front National (far-right)
  - **null**: no political affiliation

When a profile has 2 affiliations, they are separated by a slash (ex: "ps/fi"). 

#### MEDIA_PROFESSIONAL
*For individual profiles only.*
Indicates if the profile's owner self-identify as a media professional (journalist, editorialist, ...)

#### SEX
*For individual profiles only.*
Indicates the sex of the profile's owner:
 - "**m**": male
 - "**f**": female
 - **null**: undetermined or other


### posts_ids_*
Files containing the tweets and retweets ids, divided according to the political affiliation of their authors for more flexibility.
  - **posts_ids_fi.csv**: Tweet ids for profiles affiliated to France Insoumise (far-left)
  - **posts_ids_ps.csv**: Tweet ids for profiles affiliated to Parti Socialiste (left)
  - **posts_ids_em.csv**: Tweet ids for profiles affiliated to En Marche ! (center)
  - **posts_ids_lr.csv**: Tweet ids for profiles affiliated to Les Républicains (right)
  - **posts_ids_fn.csv**: Tweet ids for profiles affiliated to Front National (far-right)
  - **posts_ids_multi_affiliations.csv**: Tweet ids for profiles affiliated to more than one party
  - **posts_ids_indetermined.csv**: Tweet ids for profiles not affiliated to any party

Each file contains one tweet id per line.


### networks_*
Files containing the mention and retweet networks, in NCOL and GraphML format.

The NCOL files contains the directed weighted edges between profiles, one per line, in the following format:
profile1_twitter_id profile2_twitter_id edge_weight

The GraphML files contains the directed weighted edges between profiles, as well as all the profiles annotations presented in *profiles_annotations.csv*. They can be opened using a graph visualisation software like Gephi.

 

## How to get tweets from ids
You can use various tools to help you get tweets from their ids, we suggest the following:
- DMI-TCAT: https://github.com/digitalmethodsinitiative/dmi-tcat
- Twarc: https://github.com/DocNow/twarc

 

## How to cite this work
Fraisier Ophélie, Cabanac Guillaume, Pitarch Yoann, Besançon Romaric, Boughanem Mohand. 2018. #Élysée2017fr: the French Presidential Election on Twitter. In International Conference on Weblogs and Social Media. https://aaai.org/ocs/index.php/ICWSM/ICWSM18/paper/view/17821 (https://hal.archives-ouvertes.fr/hal-02319715)

Initially published at https://web.archive.org/web/20200530171644if_/https://dataverse.mpi-sws.org/dataverse/icwsm18 on June 24, 2018. This dataverse being defunct now, we repost on Zenodo.
Files (582.3 MB)
Name Size
keywords.csv
md5:dbd2fe12cfd0f132fad31da32f2dcb90
5.8 kB Download
networks_mention.graphml
md5:0a010c79f3ca8c879336e2bc714cfcbe
176.7 MB Download
networks_mention.ncol
md5:b9e88689d66553da2500e3d1d938a952
50.4 MB Download
networks_retweet.graphml
md5:177bd8cec4a06e39b2228c09a69bce2f
125.7 MB Download
networks_retweet.ncol
md5:0acd39f9b7699576e273398c51735a02
35.3 MB Download
posts_ids_em.csv
md5:3e72f38634b3d7609fee847b2963daf6
28.9 MB Download
posts_ids_fi.csv
md5:4b0a85288ceaafc8cc7144d9ebf63078
41.7 MB Download
posts_ids_fn.csv
md5:4835693a33b903ca4f10ee99906e3057
35.3 MB Download
posts_ids_indetermined.csv
md5:4bd4cb0bf871264a47239668db12a6bd
16.3 MB Download
posts_ids_lr.csv
md5:925ac0b68b5fa6d8216f1b020be898a5
50.9 MB Download
posts_ids_multi_affiliations.csv
md5:3b3bf4a5dfe68c3c199ca9b2f0bccc4c
7.1 MB Download
posts_ids_ps.csv
md5:f7b7c481f824e8f943b29eb2b9ae3103
13.2 MB Download
profiles_annotations.csv
md5:f27610a391f89944a90114f503a50169
865.2 kB Download
README.md
md5:ad4686ae5b1e51a7ad01ebbfb0c3a309
3.6 kB Download
87
87
views
downloads
All versions This version
Views 8787
Downloads 8787
Data volume 3.3 GB3.3 GB
Unique views 7171
Unique downloads 1818

Share

Cite as