Last Updated: 2/19/2025
Issue: As insurers collect more granular data about insurance consumers from a variety of external sources, state insurance regulators need greater insight into what data is being used by the industry, how it is being used, and whether it should be used by insurers. Big data is a term that refers to large amounts of structured data (which can be organized into tables and defined fields) and unstructured data (which refers to text and data from images, videos, or sounds, possibly sourced from social media postings, reports, and recorded interviews). While the use of big data can aid insurers’ underwriting, rating, marketing, anti-fraud, and claim settlement practices, the challenge for insurance regulators is to examine whether it is beneficial or harmful to consumers considering the source of the data and use in making decisions. Additional consumer concerns include how collected data is safeguarded and how consumer privacy is maintained. Beyond financial and market conduct data collected today, state insurance regulators may need to collect more useful data to allow for greater insight into insurers’ use of big data in the development of machine learning models to further enhance regulation.
Background: The digital revolution, also known as the Third Industrial Revolution, is considered to have begun with the invention of the transistor in 1947. These smaller, faster, and more reliable replacements for vacuum tubes were crucial for the development of personal computers, which rose to prominence in the late 1970s and early 1980s to meet the increasing need for data processing and computation. Since that time, massive technological advancements from the shift to digital electronics have transformed the way we live, work, and interact with each other by allowing for the processing and storage of large and diverse amounts of information from a variety of sources.
Insurance companies use such big data to develop statistical and machine learning models to influence underwriting, pricing, marketing, and claims handling decisions in a number of ways:
• To more accurately underwrite, price risk and incentivize risk reduction. Telematics, for example, allows insurers to collect real-time driver behavior and usage data to provide premium discounts and usage-based insurance.
• To enrich customer experience by quickly resolving service issues.
• To improve marketing effectiveness by tailoring products to individual preferences.
• To create operating efficiencies by streamlining the application process. An example of this is a pre-filled homeowners application.
• To facilitate better claims processing by applying machine learning algorithms to estimate outcomes.
• To reduce fraud through better identification techniques. For example, text analytics can identify potential “red flag” trends across adjusters’ reports.
• To improve solvency by more accurately assessing outstanding liabilities.
According to YFS Magazine, “How Big Data Impacts The Insurance Industry And Beyond,” the use of big data in modeling has resulted in “30% better access to insurance services, 40-70% cost savings, and 60% higher fraud detection rates.” However, all disruptive technologies bring challenges. The concerns regarding the use big data include:
• The complexity and volume of data may present hurdles for smaller-sized insurers with limited resources.
• The availability of insurance regulatory resources for reviewing complex rate filings is limited.
• The lack of transparency and potential for bias in algorithms which use big data.
• The potential for the collection of sensitive consumer information violating consumer privacy concerns or potentially resulting in discriminatory actions.
• Cyberthreats to stored data.