Published June 29, 2024 | Version 1.0
Dataset Open

Reddit r/cryptocurrency posts and comments January 2021 - December 2022

  • 1. ROR icon Aristotle University of Thessaloniki

Description

The dataset comprises data from two primary sources: Bitcoin market data and user activity data from the r/cryptocurrency subreddit. The time period covered by the dataset spans from January 1, 2021, to December 31, 2022.

Bitcoin Market Data

The Bitcoin market data includes the following metrics:

  • Open: The opening price of Bitcoin recorded daily.
  • High: The highest price of Bitcoin recorded daily.
  • Low: The lowest price of Bitcoin recorded daily.
  • Close: The closing price of Bitcoin recorded daily.
  • Volume: The daily trading volume of Bitcoin.

These metrics were collected from CoinMarketCap (https://coinmarketcap.com/).

Reddit Activity Data

The Reddit activity data consists of posts and comments from the r/cryptocurrency subreddit, focusing on the most popular content. The data includes:

  • Posts: 770 of the most popular posts from the specified time period, selected based on upvotes and engagement.
  • Comments: 14,886 comments associated with the collected posts, representing the most popular comments in terms of upvotes and responses.

For each post, the following attributes were recorded:

  • Title: The post title
  • Score: The upvote score of the post or comment, indicating its popularity.
  • URL: The URL of the post
  • Number of comments: The total number of comments received by each post.
  • Body: The text posted with the post submission
  • Date: The date and time of the post submission

For each comment, the following attributes were recorded:

  • Date: The date the comment was posted
  • Comment: The content of the comment

The combined dataset aims to provide a comprehensive view of both market and social media activity related to Bitcoin, enabling a detailed analysis of the interplay between market dynamics and user sentiment.

Files

BTC-USD.csv

Files (3.9 MB)

Name Size Download all
md5:25b3696828994deb4d23a535602e0475
63.7 kB Preview Download
md5:22d261ab511944043aa35ee72149cb19
2.6 MB Preview Download
md5:eab2541e15d295d8b2d4363620a83330
1.3 MB Preview Download