Thank you for Subscribing to CIO Applications Weekly Brief
Predictive Analytics in the Insurance Industry
Hongyuan Wang, PhD & MAS, Director of Predictive Analytics, Data and Analytics, Employers
PA can help insurance companies to reduce the Loss and Expense Ratio and grow organically through different applications. For example, predicting the individual policy’s claim frequency and severity, targeting the right policy, quantifying the potential lifetime value of a policy, and smoothing the operational processes.
Business Requirements and Understanding the Data
A PA application always comes from business needs. Understanding what business is required and the related historical data are most important. For example, a sales team likes to know how the retention ratio could impact both growth and profit, two important goals of the insurance company.
Building a consistent and high-quality PA process requires a complete understanding of business, data, modeling, and tools for the project leader
Data Preparation and Development of Predictive Model
There are a variety of analytics tools which can be used for data preparation and predictive model development. For example, SAS, R, or Python. Which tool to use depends on the company’s IT infrastructure and the analyst’s coding and modeling knowledge. But one thing is for sure, the tool should be able to connect to all internal databases, be able to easily import available external data, be able to efficiently handle large data manipulation and run smoothly and fast during model development. Some external data may be very complicated, with different formats or structures which need to borrow other specific tools to grab the useful data.
After the data preparation is done, analysts may continue to use the tool to do further data transformations and start model development. Other modeling tools can be used to automate the model tuning and variable selection from many different model methodologies with a very high computational speed, such as a black-box modeling tool. One of the challenges is to determine which model methodology (for example, GLM vs GBM) should be used. Business requirements, regulations or IT Infrastructure may restrict you from using many different advanced model methodologies, however, if very high quality and relevant data is available for the PA, different methodologies may not make a big difference.
Implementation of the Model
It is crucial to create an implementation system or API to automatically pull the data from different data resources, run the model scoring process, and integrate with the other business operation systems to make the final business decisions based on the model scores. It can be a real-time process or can be scheduled for a daily, weekly or monthly run. The bigger challenge here is creating an implementation system extensive enough to integrate future IT infrastructure changes (Data Warehouse changes, Operation System changes, or Cloud Environment changes, etc.).
A successful application of PA must be satisfied at every step of the process. Building a consistent and high-quality PA process requires a complete understanding of business, data, modeling, and tools for the project leader. It is a collaboration of different teams which may include Business Operation, Data and Analytics, and IT.