Published March 23, 2026 | Version 0.2.0
Dataset Open

Meneame Media Bias Dataset: Interaction Features and Bias Labels

Description

A processed dataset of news articles submitted to Meneame (Spanish social news aggregator) with automatic media bias labels and rich interaction features derived from user comments.

Contents

  • articles_with_features.parquet: 14,995 articles with 38 columns including bias labels (from DistilBERT trained on MBBMD), interaction features (karma statistics, comment engagement metrics), and metadata (outlet, tags, timestamp).
  • articles_labeled.parquet: Articles with bias probability scores.
  • karma_features.parquet: Advanced karma distribution features per article (entropy, Gini, bimodality, skewness) for 183K+ articles.
  • comments_with_sentiment.parquet: 20K comment sample with sentiment (POS/NEG/NEU) and emotion (joy, anger, sadness, fear) scores from pysentimiento/robertuito.
  • user_profiles.parquet: User-level bias exposure metrics.
  • user_outlet_interactions.parquet: Bipartite graph data (user-outlet comment counts).

Pipeline

Data was collected from meneame.net (2005-2021), processed through a 5-step pipeline: ingestion, filtering, automatic bias labeling (franfj/fdtd_media_bias_E), interaction feature extraction, and statistical analysis. See the GitHub repository for full reproducibility.

Key Statistics

  • 14,995 articles from 2,868 media outlets
  • 13.2M comments from 96K unique users
  • 61.5% articles labeled as biased (automatic labeling)
  • Timespan: 2005-2021

Notes

v0.2.0: Updated bias labels with high-confidence threshold (P75, ~25% biased). Added CSV preview file and dataset description for quick inspection. Bias labels produced by DistilBERT fine-tuned on MBBMD (Spanish media bias corpus).

Files

articles_preview.csv

Files (43.8 MB)

Name Size Download all
md5:334d9e1cdae65d4e69258e07a55822f6
7.9 MB Download
md5:6c98129e737defb5311ded02217e5469
33.0 kB Preview Download
md5:c95f98bcf25f41f9702e9cb8ebea9c68
8.7 MB Download
md5:b2b0d71da5d5758fc58b0b8593179238
4.8 MB Download
md5:fd2672337a75b7438bc0cb117b49783d
2.6 kB Preview Download
md5:8756588930d0f9dc16003ecd888ee445
19.8 MB Download
md5:1271f8247395575f472b70a593922620
1.9 MB Download
md5:b049d7913a60b30bf1f6131ff2061d59
773.9 kB Download

Additional details

Related works