The Data Analytics Blog

Our news and views relating to Data Analytics, Big Data, Machine Learning, and the world of Credit.

All Posts

How Quick-Step Machine-Learning Models will help you through COVID-19

July 23, 2020 at 11:30 AM

In a previous blog, we looked at assessing your credit models and the challenge of building and deploying models representative of the COVID-19 crisis. At the crux of the challenge was the fact that:

  1. Current models are not going to be working that well, due to the current environment.
  2. Standard new models will also not cut it as models with the standard development data periods will not represent the current credit environment and so these models will most likely not work as well as needed.

The solution: Short outcome or strict performance models

A solution to this is short outcome or strict performance models. Whilst there are some limitations, these will be models that best represent the COVID-19 economic upheaval period. As I said, you can read more about this in my previous blog.

Whilst these models may help in the short term, during a tumultuous period, it may be necessary to redevelop your models more regularly. Scorecard building projects can be costly in both time and effort. The deployment of a machine learning environment can reduce costs and ensure the models are most representative of the consumers in the current economic environment.

“Over the last four years, Principa have been creating a machine learning environment where scorecards can be developed, deployed, monitored, and redeveloped seamlessly."

Principa’s Quick-Step machine learning models

Principa’s Quick-Step machine learning models allow lenders and collectors to adapt and deploy rapidly. Models can be built and rebuilt at regular intervals considering all relevant data observation and outcome periods.


Develop, deploy, monitor and redevelop scorecards seemlessly

Over the last 4 years, Principa have been creating a machine learning environment where scorecards can be developed, deployed, monitored and redeveloped seamlessly. We’ve deployed this solution at South Africa’s leading collections recovery agency.  The components of the machine learning environment are as follows:

1.  Data management platform (DMP) – this platform allows us to:

(a) Ingest data from a variety of sources, clean the data and then combine the data.

(b) Create features (essentially these are variables that can be used for scoring, for example aggregated variables across a time series or/and at customer level). We use Spark SQL.

(c) Create a data asset. The data asset is a data set from which reports can be pulled and the data is also modelling-ready.

2.  ML Retraining Environment (Python– this environment allows us to:

(a) Retrain models using a variety of methods. Our methodology is to build multiple scorecards and then to choose the scorecard that offers the greatest lift.

3.  Stratus ML Model Execution Engine– this platform allows:

(a) Scores to be calculated – models are migrated directly from the ML Retraining Environment to Stratus and then tested.

4.  Monitoring Reports – Populations and models need to be monitored.

(a) Our reporting component allows for daily reporting (execution control reports and scorecard monitoring report) which can help determine when data shifts have occurred.

Modelling methodologies

With our quick-step models you are not stuck to one type of model. The modelling approach involves preparing the data, sampling the data, selecting an outcome definition, and then modelling. The advantages of our ML retraining environment are that we build models utilising multiple methodologies concurrently. Some of the methodologies utilised include:

The model strengths can then be analysed, and the best model selected for deployment into Stratus – our machine learning execution engine.

Modelling for the future with machine learning

Once the machine learning model has been deployed, the data will accumulate within the data asset. The data is modelling ready and extraction and sampling is all that is required prior to model retraining. A traditional model build typically takes 3 months with another 3 months for testing and deployment (sometimes longer). In adopting Quick-Step machine learning you will be able to rebuild models within a matter of days and then deploy and test within a matter of weeks thus reducing the time to market by up to 90%.

If you would like to learn how Principa’s Quick-Step machine learning models can assist your business, please get in touch with us on 

Contact Us to Discuss Your data analytics Business Requirements

Thomas Maydon
Thomas Maydon
Thomas Maydon is the Head of Credit Solutions at Principa. With over 17 years of experience in the Southern African, West African and Middle Eastern retail credit markets, Tom has primarily been involved in consulting, analytics, credit bureau and predictive modelling services. He has experience in all aspects of the credit life cycle (in multiple industries) including intelligent prospecting, originations, strategy simulation, affordability analysis, behavioural modelling, pricing analysis, collections processes, and provisions (including Basel II) and profitability calculations.

Latest Posts

Solving the Credit Unaware Challenge with Psychometrics

At Principa, we engage with clients and organisations across the entire credit lifecycle and track the focus of the South African credit industry. For nearly ten years the focus has consistently been in the collection space, but recently (since early 2021) this has changed and a large number of our clients are focused on acquisitions and originations.

Predicting Customer Behaviour (PART 2)

In Part One of this two-part blog, we started providing a short overview of just some of the propensity models that Principa has developed. In this Part Two, we continue to look at different types of propensity models available across the customer engagement lifecycle that are used to predict behaviour and solve business problems. 

PART 2: How to Cure the Post Pandemic “Collections” Symptoms

In PART 1 of this two-part series, we explored how the current socio-economic climate resulting from the lingering financial hangover caused by the pandemic is negatively impacting the consumer's ability to settle a debt.