Actuarial Analytics with clustering algorithms (in Python)

D. Hainaut, PhD

Description

Clustering algorithms aim to uncover hidden structure in data sets.  Objects are grouped in such a way that the created groups are as much as possible heterogeneous between each-others.

The aim of this session is to cover unsupervised learning techniques for visualizing and analyzing a dataset. This course focuses on actuarial applications of clustering methods. By adapting these algorithms to manage categorical features, we can detect dominant sub-populations of policies. The analysis of their claims allows a posteriori to draw a synthetic map of insured risks.

The course is illustrated with examples in Python provided to participants.

Program

  1. Principal components in a nutshell.

  2. Treatment of categorical variables with a chi-square distance.

  3. Factorial components analysis of categorical datasets.

  4. K-means, K-means++, batch K-means for categorical datasets

  5. Fuzzy clustering.

  6. Graph representation of data and spectral clustering.

Speaker

Donatien Hainaut

Donatien Hainaut

Scientific Advisor, Detralytics
Professor, UCLouvain

Date : On-Demand

Duration : 3h

Accreditation : 3CPD | 18PPC

Level : All

Acquired skills

At the end of the training, participants will be able to:

  • To visualize objects and features of categorical data sets
  • To identify clusters of insured risks and their dominant features
  • To convert the dataset in a graph to uncover non-convex shaped clusters.

About our Speaker

Donatien Hainaut