Data for Managing Retractions and Their Afterlife: A Tripartite Framework for Research Datasets Contact: Renata Curty (rcurty@ucsb.edu)

This repository contains supplementary data for the paper "Managing Retractions and Their Afterlife: A Tripartite Framework for Research Datasets," authored by Renata Curty (UCSB). The paper has been accepted for presentation at the International Digital Curation Conference (IDCC) 2025 in The Hague, Amsterdam. The data was collected in November 2024.
Data Collection Methodology Google Dataset Search was used to identify retracted papers by querying various terms, including:
	•	"retracted"
	•	"[retracted]"
	•	"retracted dataset"
	•	"retracted data"
	•	"retracted article"
	•	"retracted paper"
	•	"data for: retracted"
	•	"data from: retracted"
	•	"[retracted] data from"

These terms were selected based on common labels used by scientific publishers for retracted papers. Only records in English were included. After identifying relevant datasets, we filtered out records where "retracted" appeared in studies analyzing retractions (e.g., citation trends for retracted papers).
Dataset Files This repository contains two CSV files:
	1	retracted-records.csv This file includes a list of 40 records retrieved from Google Dataset Search, with columns for:
	◦	Title
	◦	DOI (or URL)
	◦	Perma.cc links
	2	selected-sample.csv This file includes 13 deposit records with actual data files, which were analyzed. It includes the following columns:
	◦	ID: Assigned two-digit number
	◦	Title: Title of the record
	◦	DOI/URL: Digital object identifier or URL of the record
	◦	Perma.cc: Permalink assigned to capture a screenshot of the record and preserve the version used in the analysis
	◦	Pub-date: Publication date in ISO format (YYYY-MM-DD)
	◦	Last update date: Date of the last update to the record, indicating retraction (ISO format)
	◦	Lifespan-br: Time (in days) between first publication and last update (retraction)
	◦	Version: Record version, if applicable
	◦	Repository: Repository or hosting service of the record
	◦	Publisher: Publisher of the associated manuscript, if applicable
	◦	Journal: Journal title for the associated manuscript, if applicable
	◦	Title-change: 0 (No) or 1 (Yes) indicating if the title changed upon retraction
	◦	Files-accessible: 0 (No) or 1 (Yes) indicating if files are still accessible for download after retraction
	◦	Files-renamed: 0 (No) or 1 (Yes) indicating if files were renamed after retraction
	◦	Files-added: 0 (No) or 1 (Yes) indicating if additional files were added after retraction
	◦	Retrac-note: 0 (No) or 1 (Yes) indicating if retraction information was provided
	◦	Citations: Total number of citations
	◦	Downloads: Total number of downloads
	◦	Views: Total number of views
	◦	Retraction-note: Text or notice detailing the retraction status
Missing data is marked as "NA."
License This dataset is available under the CC0 License.
