Data Sets
The following file is supposed to provide actuaries with links to publicly available data that can be used to apply and learn data science methods. The list of datasets is maintained by the Data Science Working Group of the Swiss Actuarial Association.
Actuarial Data Science Tutorials
Below we present all the tutorials that have been prepared by the working party. We are intensively working on additional ones.
All tutorials consist of an article and the corresponding code. In the article, we describe the methodology and the statistical model. By providing you with the code you can easily replicate the analysis performed and test it on your own data.
Case Study 15: Privacy-preserving Machine Learning
Case Study 14: SHAP for Actuaries: Explain any Model
Case Study 13: Gini Index and Friends
Data Simulator: Individual Claims Generator for Claims Reserving Studies: data simulation.R
Case Study 12: Actuarial Applications of Natural Language Processing Using
Transformers: Case Studies for Using Text Features in an Actuarial Context
Article on arXiv ; Article in British Actuarial Journal
Code on GitHub ; Notebook (Part 1) ; Notebook (Part 2) ; Notebook (Part 3)
Code on GitHub ; Notebook (Mortality) ; Notebook (Digits) ; Notebook (Image)
Code on GitHub ; Notebook (Pip) ; Notebook (ML) ; Notebook (RNN)
Case Study 6: Lee and Carter go Machine Learning: Recurrent Neural Networks
Case Study 2: Insights from Inside Neural Networks
Code on GitHub ; R Notebook (desciptive) ; R Notebook (GLM) ; Python Notebook (descriptive) ; Python Notebook (GLM)