Reddit r/cryptocurrency posts and comments January 2021 - December 2022
Description
The dataset combines two sources: daily Bitcoin market data and user activity data from the r/cryptocurrency subreddit. It covers the period from January 1, 2021, to December 31, 2022.
Bitcoin Market Data
The Bitcoin market data includes the following metrics:
- Open: The opening price of Bitcoin recorded daily.
- High: The highest price of Bitcoin recorded daily.
- Low: The lowest price of Bitcoin recorded daily.
- Close: The closing price of Bitcoin recorded daily.
- Volume: The daily trading volume of Bitcoin.
These metrics were collected from CoinMarketCap (https://coinmarketcap.com/).
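A quick sanity check on the daily metrics can catch rows where the prices are mutually inconsistent. The following is a minimal sketch, assuming the market data is available as a CSV with the column names `Date`, `Open`, `High`, `Low`, `Close`, and `Volume`. The `Date` column, the exact header spelling, and the function name `find_inconsistent_rows` are assumptions made for illustration, and the sample rows are placeholder values, not actual market data.

```python
import csv
import io


def find_inconsistent_rows(rows):
    """Return the dates of rows where Low <= Open, Close <= High fails
    or where Volume is negative."""
    bad = []
    for row in rows:
        o, h, l, c = (float(row[k]) for k in ("Open", "High", "Low", "Close"))
        prices_ok = l <= min(o, c) and max(o, c) <= h
        volume_ok = float(row["Volume"]) >= 0
        if not (prices_ok and volume_ok):
            bad.append(row["Date"])
    return bad


# Placeholder values for illustration only, not actual market data.
# The second row is deliberately inconsistent (Open is above High).
SAMPLE_CSV = """Date,Open,High,Low,Close,Volume
2021-01-01,100.0,110.0,95.0,105.0,1000
2021-01-02,105.0,104.0,99.0,101.0,1200
"""

sample_rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
bad_dates = find_inconsistent_rows(sample_rows)
print(bad_dates)
```

To run the same check on a real file, the `io.StringIO` wrapper can be replaced with an opened file handle.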
Reddit Activity Data
The Reddit activity data consists of posts and comments from the r/cryptocurrency subreddit, focusing on the most popular content. The data includes:
- Posts: 770 of the most popular posts from the specified time period, selected based on upvotes and engagement.
- Comments: 14,886 comments associated with the collected posts, representing the most popular comments in terms of upvotes and responses.
For each post, the following attributes were recorded:
- Title: The title of the post.
- Score: The upvote score of the post or comment, indicating its popularity.
- URL: The URL of the post.
- Number of comments: The total number of comments received by each post.
- Body: The body text of the post submission.
- Date: The date and time of the post submission.
For each comment, the following attributes were recorded:
- Date: The date the comment was posted.
- Comment: The content of the comment.
The combined dataset pairs Bitcoin market data with r/cryptocurrency activity over the same period, so that the interplay between market dynamics and user sentiment can be analyzed in detail.
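One way to relate the two sources is to aggregate Reddit activity per calendar day and join it to the daily closing price. The sketch below assumes posts are available as records with `date` and `score` fields and market rows as records with `date` and `close` fields. These field names, the timestamp format, and the helper names `daily_post_activity` and `join_with_market` are hypothetical, and the sample records are placeholders rather than values from the dataset.

```python
from collections import defaultdict
from datetime import datetime


def daily_post_activity(posts):
    """Count posts and sum their scores per calendar day."""
    daily = defaultdict(lambda: {"posts": 0, "score": 0})
    for post in posts:
        # Assumes ISO-like timestamps such as "2021-01-01 09:30:00".
        day = datetime.fromisoformat(post["date"]).date().isoformat()
        daily[day]["posts"] += 1
        daily[day]["score"] += post["score"]
    return dict(daily)


def join_with_market(daily_activity, market_rows):
    """Attach the daily close to each day's activity (inner join on date)."""
    close_by_day = {row["date"]: row["close"] for row in market_rows}
    return [
        {"date": day, **stats, "close": close_by_day[day]}
        for day, stats in sorted(daily_activity.items())
        if day in close_by_day
    ]


# Placeholder records for illustration only.
posts = [
    {"date": "2021-01-01 09:30:00", "score": 10},
    {"date": "2021-01-01 18:00:00", "score": 5},
    {"date": "2021-01-03 12:00:00", "score": 7},
]
market = [
    {"date": "2021-01-01", "close": 105.0},
    {"date": "2021-01-02", "close": 101.0},
]

joined = join_with_market(daily_post_activity(posts), market)
print(joined)
```

The inner join drops days that appear in only one source. Since the Reddit data contains only the most popular posts, many days may have no posts at all, so a left join from the market data with zero-filled activity may be a better fit for time-series analysis.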