Published October 5, 2023 | Version 2.0.0
Dataset Open

Credit scoring with class imbalance data: An out-of-sample and out-of-time perspective

  • 1. University of KwaZulu-Natal

Description

The raw datasets provided here are intended for use in a Data in Brief article. These comprehensive files, sourced from the Freddie Mac website, offer quarterly snapshots of mortgage loans that have been originated in the USA since 1999, along with details of their subsequent repayment behaviours. This data remains current and is updated every three months. Specifically, the loan origination data present here encompasses amortized fixed-rate mortgage loans from 1999 up to June 2022. In contrast, the performance data is presented on a monthly basis, detailing loan repayment profiles from 1999 until September 30, 2022. Both the origination and performance datasets feature a unique loan ID, which can be utilized to integrate the data on loan originations with that of loan repayments.

Files

originations data.zip

Files (31.5 GB)

Name Size Download all
md5:769e8c26cfa26089bbe0f461768719af
928.1 MB Preview Download
md5:c22c27d17a9094b7c379e8f1739e9764
10.0 GB Preview Download
md5:5be4e04697c4af36c474415122d6f514
6.1 GB Preview Download
md5:cd9f89638f97558783e96331df729518
8.0 GB Preview Download
md5:696def212475909a67f0d6306964161f
4.5 GB Preview Download
md5:8db4b550294438054562c137ce653695
2.0 GB Preview Download

Additional details

Related works

Is cited by
Journal article: 10.1016/j.eswa.2023.121876 (DOI)
Is described by
Journal article: 10.1016/j.eswa.2022.117233 (DOI)
Is previous version of
Dataset: https://www.freddiemac.com/research/datasets/sf-loanlevel-dataset (URL)