Published March 23, 2026
| Version 0.2.0
Dataset
Open
Meneame Media Bias Dataset: Interaction Features and Bias Labels
Authors/Creators
- 1. UNED
Description
A processed dataset of news articles submitted to Meneame (Spanish social news aggregator) with automatic media bias labels and rich interaction features derived from user comments.
Contents
- articles_with_features.parquet: 14,995 articles with 38 columns including bias labels (from DistilBERT trained on MBBMD), interaction features (karma statistics, comment engagement metrics), and metadata (outlet, tags, timestamp).
- articles_labeled.parquet: Articles with bias probability scores.
- karma_features.parquet: Advanced karma distribution features per article (entropy, Gini, bimodality, skewness) for 183K+ articles.
- comments_with_sentiment.parquet: 20K comment sample with sentiment (POS/NEG/NEU) and emotion (joy, anger, sadness, fear) scores from pysentimiento/robertuito.
- user_profiles.parquet: User-level bias exposure metrics.
- user_outlet_interactions.parquet: Bipartite graph data (user-outlet comment counts).
Pipeline
Data was collected from meneame.net (2005-2021), processed through a 5-step pipeline: ingestion, filtering, automatic bias labeling (franfj/fdtd_media_bias_E), interaction feature extraction, and statistical analysis. See the GitHub repository for full reproducibility.
Key Statistics
- 14,995 articles from 2,868 media outlets
- 13.2M comments from 96K unique users
- 61.5% articles labeled as biased (automatic labeling)
- Timespan: 2005-2021
Notes
Files
articles_preview.csv
Files
(43.8 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:334d9e1cdae65d4e69258e07a55822f6
|
7.9 MB | Download |
|
md5:6c98129e737defb5311ded02217e5469
|
33.0 kB | Preview Download |
|
md5:c95f98bcf25f41f9702e9cb8ebea9c68
|
8.7 MB | Download |
|
md5:b2b0d71da5d5758fc58b0b8593179238
|
4.8 MB | Download |
|
md5:fd2672337a75b7438bc0cb117b49783d
|
2.6 kB | Preview Download |
|
md5:8756588930d0f9dc16003ecd888ee445
|
19.8 MB | Download |
|
md5:1271f8247395575f472b70a593922620
|
1.9 MB | Download |
|
md5:b049d7913a60b30bf1f6131ff2061d59
|
773.9 kB | Download |
Additional details
Related works
- Is derived from
- 10.5281/zenodo.14806064 (DOI)
- Is supplement to
- https://github.com/franfj/media-bias-mining-crowd (URL)