RedScrap: Python Tool for Netnographic Data Collection
Authors/Creators
Description
This is the first stable release of RedScrap, a lightweight and customizable Python tool designed for netnographic research using Reddit data.
RedScrap allows researchers to extract Reddit comments and metadata from posts and subreddits, with filtering options by keyword, post type, and date range. It supports both command-line and graphical user interface (GUI) modes, and outputs data in structured formats (CSV or JSON) suitable for qualitative analysis.
This release (v1.0.0) includes:
-
CLI and GUI interfaces
-
Support for subreddit and post-level scraping
-
Filtering by keyword and date range
-
Export to CSV or JSON
-
Basic error handling and documentation
Developed as part of a PhD research project at Universiti Teknologi MARA (UiTM), this version was submitted to the Journal of Open Source Software (JOSS) for peer review and citation.
Files
saprimad/RedScrap-v1.0.0.zip
Files
(8.4 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:61af99026f079a6fe563724e1850407f
|
8.4 kB | Preview Download |
Additional details
Related works
- Is supplement to
- Software: https://github.com/saprimad/RedScrap/tree/v1.0.0 (URL)
Dates
- Other
-
2025-10-07Publication date
Software
- Repository URL
- https://github.com/saprimad/RedScrap
- Programming language
- Python
- Development Status
- Active
References
- Cherecheș, M., Finta, H., Prisada, R., & Rusu, A. (2024). Pharmacists' Professional Satisfaction and Challenges: A Netnographic Analysis of Reddit and Facebook Discussions. Pharmacy, 12, 1–33. https://doi.org/10.3390/pharmacy12050155
- Chi, Y., & Chen, H. Y. (2023). Investigating Substance Use via Reddit: Systematic Scoping Review. Journal of Medical Internet Research, 25, Article e48905. https://doi.org/10.2196/48905
- Jeacle, I. (2020). Navigating netnography: A guide for the accounting researcher. Financial Accountability & Management. https://doi.org/10.1111/faam.12237
- Kozinets, R. V., & Gretzel, U. (2024). Netnography evolved: New contexts, scope, procedures and sensibilities. Annals of Tourism Research, 104, 103693. https://doi.org/10.1016/j.annals.2023.103693
- Rocha-Silva, T., Nogueira, C., & Rodrigues, L. (2024). Passive data collection on Reddit: A practical approach. Research Ethics, 20(3), 453–470. https://doi.org/10.1177/17470161231210542
- Strand, M. (2022). Attitudes towards disordered eating in the rock climbing community: A digital ethnography. Journal of Eating Disorders, 10(1), 96. https://doi.org/10.1186/s40337-022-00619-5