The neglog transformation and quantile regression for the analysis of a large credit scoring database.

Research output: Contribution to Journal/Magazine › Journal article › peer-review

Published

Standard

The neglog transformation and quantile regression for the analysis of a large credit scoring database. / Whittaker, Joseph; Somers, Mark; Whitehead, Chris.
In: Journal of the Royal Statistical Society: Series C (Applied Statistics), Vol. 54, No. 5, 11.2005, p. 863-878.

Harvard

Whittaker, J, Somers, M & Whitehead, C 2005, 'The neglog transformation and quantile regression for the analysis of a large credit scoring database.', Journal of the Royal Statistical Society: Series C (Applied Statistics), vol. 54, no. 5, pp. 863-878. https://doi.org/10.1111/j.1467-9876.2005.00520.x

APA

Whittaker, J., Somers, M., & Whitehead, C. (2005). The neglog transformation and quantile regression for the analysis of a large credit scoring database. Journal of the Royal Statistical Society: Series C (Applied Statistics), 54(5), 863-878. https://doi.org/10.1111/j.1467-9876.2005.00520.x

Vancouver

Whittaker J, Somers M, Whitehead C. The neglog transformation and quantile regression for the analysis of a large credit scoring database. Journal of the Royal Statistical Society: Series C (Applied Statistics). 2005 Nov;54(5):863-878. doi: 10.1111/j.1467-9876.2005.00520.x

Author

Whittaker, Joseph ; Somers, Mark ; Whitehead, Chris. / The neglog transformation and quantile regression for the analysis of a large credit scoring database. In: Journal of the Royal Statistical Society: Series C (Applied Statistics). 2005 ; Vol. 54, No. 5. pp. 863-878.

Bibtex

@article{a00c51eefa194ac99af6052b2233e1a7,
title = "The neglog transformation and quantile regression for the analysis of a large credit scoring database.",
abstract = "Summary. A statistical analysis of a bank's credit card database is presented. The database is a snapshot of accounts whose holders have missed a payment on a given month but who do not subsequently default. The variables on which there is information are observable measures on the account (such as profit and activity), and whether actions that are available to the bank (such as letters and telephone calls) have been taken. A primary objective for the bank is to gain insight into the effect that collections activity has on on-going account usage. A neglog transformation that highlights features that are hidden on the original scale and improves the joint distribution of the covariates is introduced. Quantile regression, a novel methodology to the credit scoring industry, is used as it is relatively assumption free, and it is suspected that different relationships may be manifest in different parts of the response distribution. The large size is handled by selecting relatively small subsamples for training and then building empirical distributions from repeated samples for validation. In the application to the database of clients who have missed a single payment a substantive finding is that the predictor of the median of the target variable contains different variables from those of the predictor of the 30% quantile. This suggests that different mechanisms may be at play in different parts of the distribution.",
author = "Joseph Whittaker and Mark Somers and Chris Whitehead",
note = "RAE_import_type : Journal article RAE_uoa_type : Statistics and Operational Research",
year = "2005",
month = nov,
doi = "10.1111/j.1467-9876.2005.00520.x",
language = "English",
volume = "54",
pages = "863--878",
journal = "Journal of the Royal Statistical Society: Series C (Applied Statistics)",
issn = "0035-9254",
publisher = "Wiley-Blackwell",
number = "5",

}

RIS

TY - JOUR

T1 - The neglog transformation and quantile regression for the analysis of a large credit scoring database.

AU - Whittaker, Joseph

AU - Somers, Mark

AU - Whitehead, Chris

N1 - RAE_import_type : Journal article RAE_uoa_type : Statistics and Operational Research

PY - 2005/11

Y1 - 2005/11

AB - Summary. A statistical analysis of a bank's credit card database is presented. The database is a snapshot of accounts whose holders have missed a payment on a given month but who do not subsequently default. The variables on which there is information are observable measures on the account (such as profit and activity), and whether actions that are available to the bank (such as letters and telephone calls) have been taken. A primary objective for the bank is to gain insight into the effect that collections activity has on on-going account usage. A neglog transformation that highlights features that are hidden on the original scale and improves the joint distribution of the covariates is introduced. Quantile regression, a novel methodology to the credit scoring industry, is used as it is relatively assumption free, and it is suspected that different relationships may be manifest in different parts of the response distribution. The large size is handled by selecting relatively small subsamples for training and then building empirical distributions from repeated samples for validation. In the application to the database of clients who have missed a single payment a substantive finding is that the predictor of the median of the target variable contains different variables from those of the predictor of the 30% quantile. This suggests that different mechanisms may be at play in different parts of the distribution.

U2 - 10.1111/j.1467-9876.2005.00520.x

DO - 10.1111/j.1467-9876.2005.00520.x

M3 - Journal article

VL - 54

SP - 863

EP - 878

JO - Journal of the Royal Statistical Society: Series C (Applied Statistics)

JF - Journal of the Royal Statistical Society: Series C (Applied Statistics)

SN - 0035-9254

IS - 5

ER -