Dataset Open Access

A Large Open Dataset from the Parler Social Network

Max Aliapoulios; Emmi Bevensee; Jeremy Blackburn; Barry Bradlyn; Emiliano De Cristofaro; Gianluca Stringhini; Savvas Zannettou


MARC21 XML Export

<?xml version='1.0' encoding='UTF-8'?>
<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nmm##2200000uu#4500</leader>
  <controlfield tag="005">20210117232518.0</controlfield>
  <controlfield tag="001">4442460</controlfield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">SMAT</subfield>
    <subfield code="a">Emmi Bevensee</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Binghamton University</subfield>
    <subfield code="a">Jeremy Blackburn</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">University of Illinois at Urbana-Champaign</subfield>
    <subfield code="a">Barry Bradlyn</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">University College London</subfield>
    <subfield code="a">Emiliano De Cristofaro</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Boston University</subfield>
    <subfield code="a">Gianluca Stringhini</subfield>
  </datafield>
  <datafield tag="700" ind1=" " ind2=" ">
    <subfield code="u">Max Planck Institute for Informatics</subfield>
    <subfield code="a">Savvas Zannettou</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">34162171212</subfield>
    <subfield code="z">md5:14e618167b499e717292351169e544d8</subfield>
    <subfield code="u">https://zenodo.org/record/4442460/files/parler_data.zip</subfield>
  </datafield>
  <datafield tag="856" ind1="4" ind2=" ">
    <subfield code="s">1084547274</subfield>
    <subfield code="z">md5:f3744f15740684f4100c7a33fafc4c33</subfield>
    <subfield code="u">https://zenodo.org/record/4442460/files/parler_users.zip</subfield>
  </datafield>
  <datafield tag="542" ind1=" " ind2=" ">
    <subfield code="l">open</subfield>
  </datafield>
  <datafield tag="260" ind1=" " ind2=" ">
    <subfield code="c">2021-01-15</subfield>
  </datafield>
  <datafield tag="909" ind1="C" ind2="O">
    <subfield code="p">openaire_data</subfield>
    <subfield code="o">oai:zenodo.org:4442460</subfield>
  </datafield>
  <datafield tag="100" ind1=" " ind2=" ">
    <subfield code="u">New York University</subfield>
    <subfield code="a">Max Aliapoulios</subfield>
  </datafield>
  <datafield tag="245" ind1=" " ind2=" ">
    <subfield code="a">A Large Open Dataset from the Parler Social Network</subfield>
  </datafield>
  <datafield tag="540" ind1=" " ind2=" ">
    <subfield code="u">https://creativecommons.org/licenses/by/4.0/legalcode</subfield>
    <subfield code="a">Creative Commons Attribution 4.0 International</subfield>
  </datafield>
  <datafield tag="650" ind1="1" ind2="7">
    <subfield code="a">cc-by</subfield>
    <subfield code="2">opendefinition.org</subfield>
  </datafield>
  <datafield tag="520" ind1=" " ind2=" ">
    <subfield code="a">&lt;p&gt;There are two main zip files in this dataset. Each zip file contains multiple &lt;a href="http://ndjson.org/"&gt;newline delimited JSON files&lt;/a&gt;. The JSON objects in each file contain all the key/value pairs returned from their respective Parler API endpoints.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;NOTE:&lt;/strong&gt; All identifying data has been redacted by removing the &amp;quot;name&amp;quot; field from posts, comments and user profiles.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://arxiv.org/abs/2101.03820"&gt;Current paper&lt;/a&gt; for more details and preferred citation information.&lt;/p&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Parler Users (parler_user.zip)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Returned by the &lt;em&gt;/v1/profile&lt;/em&gt; endpoint. Each line consists of an individual user profile object.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Parler Posts and Comments (parlerpostcomments.zip)&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Returned by the &lt;em&gt;/v1/post[comment] &lt;/em&gt;endpoints. Each line consists of an individual post or comment object.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;How to extract the data:&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The data provided is compressed in zip format. See the instructions below on how to extract it:&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;&lt;strong&gt;Linux and Mac&lt;/strong&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Navigate to the file in a terminal window or file browser.&lt;/p&gt;

&lt;p&gt;Run the following command (or typically you can double click if you are in a file browser):&lt;/p&gt;

&lt;p&gt;User profiles:&lt;/p&gt;

&lt;pre&gt;&lt;code&gt;unzip parler_users.zip&lt;/code&gt;&lt;/pre&gt;

&lt;p&gt;Posts and comments:&lt;/p&gt;

&lt;pre&gt;&lt;code&gt;unzip parler_data.zip&lt;/code&gt;&lt;/pre&gt;

&lt;p&gt;The unzipped version contains multiple files for each data type.&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;&lt;strong&gt;Windows&lt;/strong&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Microsoft provides instructions on to extract a zip file. The authors cannot recommend a particular method or software.&lt;/p&gt;</subfield>
  </datafield>
  <datafield tag="773" ind1=" " ind2=" ">
    <subfield code="n">doi</subfield>
    <subfield code="i">isVersionOf</subfield>
    <subfield code="a">10.5281/zenodo.4442459</subfield>
  </datafield>
  <datafield tag="024" ind1=" " ind2=" ">
    <subfield code="a">10.5281/zenodo.4442460</subfield>
    <subfield code="2">doi</subfield>
  </datafield>
  <datafield tag="980" ind1=" " ind2=" ">
    <subfield code="a">dataset</subfield>
  </datafield>
</record>
6,521
4,088
views
downloads
All versions This version
Views 6,5216,521
Downloads 4,0884,088
Data volume 102.4 TB102.4 TB
Unique views 5,7475,747
Unique downloads 2,0002,000

Share

Cite as