Scoring
The Project Team chose to score the model using a scorecard method by running a new regression to get the ten
most important data points: Loan Amount, Checking Acct, Age, Loan Duration, Credit History, Employment, Savings
Account, Installment Rate, Present Residence, and Job.
Next, the variables, with equal weight levels, were binned using the weight of evidence (WOE) binning technique.
The WOE is a simple formula that calculates the importance of an independent variable to the dependent variable.
The charts below show how well the model performed. It appears that the test and traininĀ¬g data are very similar.
Both the training and test data proved that the model can discern from a good-risk or bad-risk customer data 80%
of the time.
Performance Evaluation
The performance evaluation ( perf_psi ) calculates the stability for the score and variables. It is used to run
ongoing tests on the model to ensure the data received from the population does not change. This is important
because overtime the data will change and the model should be tested and calibrated often. The formula is as
follows:
When the PSI is less than 0.1 there is no real change in new data. When the PSI reaches 0.1 - 0.25 there is some
minor change, and the model should be evaluated. Anything greater than 0.25 signifies a major change and model
will need to be changed.
The PSI score for this model is fine at this time, however as previously stated, this should be tested often.
Risk Charts
The use of a risk chart is most helpful when evaluating models of fraud. As seen below: