Published March 26, 2024 | Version v1
Computational notebook Open

Data and Code to reproduce results in paper "Network centrality and credit risk: A comprehensive analysis of peer-to-peer lending dynamics"

Authors/Creators

  • 1. University of Twente
  • 2. BFH Bern University of Applied Sciences
  • 3. ROR icon Bern University of Applied Sciences

Description

Data and Code to reproduce results in paper "Network centrality and credit risk: A comprehensive analysis of peer-to-peer lending dynamics"

This repository contains the necessary codes to reproduce results in the paper:

Yiting Liu, Lennart John Baals, Jörg Osterrieder, Branka Hadji-Misheva,
Network centrality and credit risk: A comprehensive analysis of peer-to-peer lending dynamics,
Finance Research Letters,
Volume 63,
2024,
105308,
ISSN 1544-6123,
https://doi.org/10.1016/j.frl.2024.105308.
(https://www.sciencedirect.com/science/article/pii/S1544612324003386)
Abstract: This letter analyzes credit risk assessment in the Peer-to-Peer (P2P) lending domain by leveraging a comprehensive dataset from Bondora, a leading European P2P platform. Through combining traditional credit features with network topological features, namely the degree centrality, we showcase the crucial role of a borrower’s position and connectivity within the P2P network in determining loan default probabilities. Our findings are bolstered by robustness checks using shuffled centrality features, which further underscore the significance of integrating both financial and network attributes in credit risk evaluation. Our results shed new light on credit risk determinants in P2P lending and benefit investors in capturing inherent information from P2P loan networks.
Keywords: Peer-to-Peer lending; Credit-default prediction; Machine Learning; Network centrality

 

Raw data:

The raw dataset was downloaded on April 22nd, 2022, as a part of Bandora’s daily updated public report.3 Loan starting dates span from June 16th, 2009, to April 21st, 2022. The original dataset covers 231,039 individual borrowers characterized through 112 categorical and continuous variables. Among these loans, 79,424 have been recorded with delayed interest payments according to the platform, while 151,615 loans have no recorded delay on interest payments before the download date of the data. Specifically, the dataset details borrower demographics, financial attributes, and past credit market interactions.

The raw dataset cannot be made public due to the restrictions of the Bondora platform (https://bondora.com/en/terms/): 
13.4 The Portal, Portal's website and the copyright of the contents thereof belong to the Company. The User does not have the right to save, copy, change, transfer, forward or disclose the pages of the Portal for a purpose other than personal use.

 

Data cleaning:

Bondora.R

further data cleaning.qmd

Thes two files cleans the data, as described in the paper Section 2.2.

Data metadata:

Description cleaned.xlsx

This file describe the meaning of features in the cleaned dataset.

The cleaned dataset, as second-hand data, can also not be made public due to the restriction of the platform.

 

Modeling:

3rd paper code.qmd

3rd paper code (2).qmd

These two notebooks generate the results presentated in the paper.

 

Files

Files (96.2 kB)

Name Size Download all
md5:a86e96b33edcdd75c841d162160f77e9
32.0 kB Download
md5:923b04ffb72d7d1263242e70406c3913
6.2 kB Download
md5:89d34a85ed3e6ae78f61b2d252889620
29.5 kB Download
md5:7a79ddebd1f75985722deecd5fc88d76
24.6 kB Download
md5:c41a3b81a7128f047567c74076163a68
3.8 kB Download

Additional details

Funding

Swiss National Science Foundation
100019E-205487
European Cooperation in Science and Technology
COST Action CA19130