SOA ASA Exam: Predictive Analysis (PA) – 4.3. Generalized Linear Models Case Study 2
[mathjax] Case Study 2: GLMs for Binary Target Variables Learning Objectives Compared to GLMs for numeric target variables, GLM-based classifiers enjoy some subtly unique features, which will be revealed in the course of this case study. At the completion of this section, you should be able to: Combine factor levels to reduce the dimension of the data. Select appropriate link functions for binary target variables. Implement different kinds of GLMs for binary target variables in R. Incorporate an offset into a logistic regression model. Interpret the results of a fitted logistic regression model. Background In this case study, we will examine the dataCar dataset in the insuranceData package. This dataset is based on a total of n = 67,856 one-year vehicle insurance policies taken out in 2004 or 2005. The variables in this dataset pertain to different characteristics of the policyholders and their vehicles. The target variable is clm, a binary variable equal to 1 if a claim occurred over the policy period and 0 otherwise. Stage 1: Define the Business Problem Objective Our objective here is to construct appropriate GLMs to identify key factors associated with claim occurrence. Such factors will provide insurance companies offering vehicle insurance …