Implementation of an overdraft scoring model

Migration of an overdraft scoring model into the Big Data infrastructure for the prediction of default probabilities.

Implementation of a Random Regression Forest in Spark and H20 (Sparkling Water)

Automated retraining of the model with current data possible

Fulfilment of all risk management requirements

Challenge

A German direct bank faces the challenge of migrating the prediction of loan defaults into the new Big Data infrastructure. The prototype of a scoring model for predicting loan default probabilities from a previous project is to be implemented in Spark.

Solution

In the first step, several possible Big Data technologies are evaluated for the implementation. After the evaluation, the model is implemented using the combination of Spark and H20 (Sparkling Water) using a random regression forest.

Result

The implemented model can be used within the new Big Data infrastructure for online scoring of customers. The implementation makes it possible to re-train the model with current data at any time and thus optimise the bank's default risks. The risk management requirement is also met by historising and storing the models.

Are you interested in your own use cases?

Our Case Studies

- Get even more detailed insights into our customer projects -

Smart cooking with Thermomix

Smart cooking with the Thermomix

Download
Case Study AI at Munich Re

Data Operations at Munich Re

Download

Data & AI Knowledge

Creating added value from data & AI together

Blog

Discover professional articles on Data & AI as well as the latest industry news.

Webinars

Dive into our Best Practices and Industry Exchanges. Discover new dates and recordings of past webinars.

Whitepaper

Learn more about the use of Data & AI in your industry with our white papers, case studies and research.