Credit scoring with class imbalance data: An out-of-sample and out-of-time perspective
Description
The raw datasets provided here are intended for use in a Data in Brief article. The files, sourced from the Freddie Mac website, offer quarterly snapshots of mortgage loans originated in the USA since 1999, along with details of their subsequent repayment behaviour. The source data is kept current and is updated every three months. The loan origination data presented here covers amortized fixed-rate mortgage loans from 1999 up to June 2022. The performance data is reported monthly and details loan repayment profiles from 1999 until September 30, 2022. Both the origination and performance datasets include a unique loan ID, which can be used to link the data on loan originations with the data on loan repayments.
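The linkage on the loan ID described above can be sketched as follows. This is a minimal illustration, not the authors' processing code: the pipe delimiter, the column names (`loan_id`, `credit_score`, `period`, `status`) and the toy records are all assumptions made for the example, and the real files may use a different layout.

```python
import csv
import io
from collections import defaultdict


def join_on_loan_id(orig_rows, perf_rows, orig_key, perf_key):
    """Attach each loan's monthly performance rows to its origination row."""
    perf_by_loan = defaultdict(list)
    for row in perf_rows:
        perf_by_loan[row[perf_key]].append(row)
    # One output record per origination row; loans without any
    # performance history get an empty list.
    return [
        {"origination": row, "performance": perf_by_loan.get(row[orig_key], [])}
        for row in orig_rows
    ]


# Toy, made-up records standing in for the real files (hypothetical layout).
orig_text = "loan_id|credit_score\nA001|720\nA002|680\n"
perf_text = "loan_id|period|status\nA001|202201|0\nA001|202202|1\nA002|202201|0\n"

orig_rows = list(csv.DictReader(io.StringIO(orig_text), delimiter="|"))
perf_rows = list(csv.DictReader(io.StringIO(perf_text), delimiter="|"))

joined = join_on_loan_id(orig_rows, perf_rows, "loan_id", "loan_id")
```

Grouping the performance rows by loan ID first keeps the join to a single pass over each file. Given the size of the archives, reading them in chunks or by quarter is likely to be necessary in practice.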
Files
originations data.zip
Total size: 31.5 GB
| MD5 checksum | Size |
|---|---|
| 769e8c26cfa26089bbe0f461768719af | 928.1 MB |
| c22c27d17a9094b7c379e8f1739e9764 | 10.0 GB |
| 5be4e04697c4af36c474415122d6f514 | 6.1 GB |
| cd9f89638f97558783e96331df729518 | 8.0 GB |
| 696def212475909a67f0d6306964161f | 4.5 GB |
| 8db4b550294438054562c137ce653695 | 2.0 GB |
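Because the archives are large, it may be worth checking a download against the MD5 checksums listed above before unpacking it. The sketch below uses only Python's standard `hashlib`. The function names and the file path in the usage note are hypothetical.

```python
import hashlib


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, read in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read in chunks so multi-gigabyte archives never sit fully in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, expected_md5):
    """Compare a file's MD5 digest with a published checksum string."""
    return md5_of_file(path) == expected_md5.strip().lower()
```

For example, `matches_checksum("downloaded_archive.zip", "769e8c26cfa26089bbe0f461768719af")` would return `True` only if the local file (a placeholder name here) is byte-identical to the first listed file.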
Additional details
Related works
- Is cited by
  - Journal article: 10.1016/j.eswa.2023.121876 (DOI)
- Is described by
  - Journal article: 10.1016/j.eswa.2022.117233 (DOI)
- Is previous version of
  - Dataset: https://www.freddiemac.com/research/datasets/sf-loanlevel-dataset (URL)