Volume 4, Issue 6, December 2016, Page: 289-297
Modeling Loan Defaults in Kenya Banks as a Rare Event Using the Generalized Extreme Value Regression Model
Stephen Muthii Wanjohi, Department of Statistics and Actuarial Science, Jomo Kenyatta University of Agriculture and Technology, Nairobi, Kenya
Anthony Gichuhi Waititu, Department of Statistics and Actuarial Science, Jomo Kenyatta University of Agriculture and Technology, Nairobi, Kenya
Anthony Kibira Wanjoya, Department of Statistics and Actuarial Science, Jomo Kenyatta University of Agriculture and Technology, Nairobi, Kenya
Received: Oct. 4, 2016; Accepted: Oct. 25, 2016; Published: Nov. 16, 2016
DOI: 10.11648/j.sjams.20160406.17
Extreme value theory is the study of the extremal properties of random processes; it models and measures events that occur with very small probability. It provides a robust framework for analyzing the tail behavior of distributions and has been applied extensively in hydrology, climatology, insurance and finance. The probability of customer default is key information when analyzing credit risk in banks. The logistic regression model has been used extensively to model the probability of loan default. However, it has limitations when modeling rare events. First, it underestimates the default probability, which could be very risky for the bank. Second, the logit link is symmetric about 0.5, which means that the response curve π(x_i) approaches one at the same rate as it approaches zero. To overcome these limitations, the study implemented a regression method for binary data based on extreme value theory. The objective of the study was to model loan defaults in banks in Kenya using the generalized extreme value (GEV) regression model. The results of the GEV model were compared with those of the logistic regression model. The study found that for rare events such as loan defaults, the GEV model performed better than the logistic regression model. As the percentage of defaulters in a sample became smaller, the ability of the GEV model to identify defaults improved, whereas that of the logistic regression model deteriorated.
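The symmetry limitation of the logit link can be illustrated with a minimal Python sketch. The abstract does not state the GEV response curve, so the form used here, π(x) = exp{-[1 + τ(β'x)]^(-1/τ)}, is an assumption taken from the usual GEV regression formulation, and the shape value τ = -0.25 and the function names are hypothetical choices made only for illustration.

```python
import math


def logit_response(eta):
    """Logistic response curve; symmetric about 0.5."""
    return 1.0 / (1.0 + math.exp(-eta))


def gev_response(eta, tau=-0.25):
    """Assumed GEV response curve exp(-(1 + tau*eta)^(-1/tau)).

    tau is the shape parameter; -0.25 is a hypothetical value,
    not an estimate reported by the study.
    """
    if tau == 0.0:
        # Gumbel limit of the GEV distribution function
        return math.exp(-math.exp(-eta))
    base = 1.0 + tau * eta
    if base <= 0.0:
        # Outside the support the GEV cdf equals 0 (tau > 0) or 1 (tau < 0)
        return 0.0 if tau > 0 else 1.0
    return math.exp(-base ** (-1.0 / tau))


# A symmetric link satisfies pi(eta) + pi(-eta) = 1 for every eta.
for eta in (0.5, 1.0, 2.0):
    logit_sum = logit_response(eta) + logit_response(-eta)
    gev_sum = gev_response(eta) + gev_response(-eta)
    print(f"eta={eta}: logit sum={logit_sum:.4f}, GEV sum={gev_sum:.4f}")
```

Under these assumptions the logit sums equal one, while the GEV sums do not, which reflects a response curve that approaches zero and one at different rates.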
Logistic, Generalized Extreme Value Regression, Extreme Value Theory, Confusion Matrix
To cite this article
Stephen Muthii Wanjohi, Anthony Gichuhi Waititu, Anthony Kibira Wanjoya, Modeling Loan Defaults in Kenya Banks as a Rare Event Using the Generalized Extreme Value Regression Model, Science Journal of Applied Mathematics and Statistics. Vol. 4, No. 6, 2016, pp. 289-297. doi: 10.11648/j.sjams.20160406.17
Copyright © 2016 Authors retain the copyright of this article.
This article is an open access article distributed under the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.