AI Fraud Detection

How Zurich’s Canadian Operation Cut Suspected Fraud Losses by 35% Using AI

How Zurich’s Canadian Operation Cut Suspected Fraud Losses by 35% Using AI

In 2021, Zurich Canada’s claims team processed more than 40,000 auto and property claims. Like most insurers, they knew fraud was inflating loss ratios but had no systematic way to quantify it.

Historical estimates pegged Canada’s annual property & casualty fraud bill at C$3 billion, or roughly 10% of total claims payouts. Zurich Canada’s internal fraud analytics—mostly rule-based flagging and investigator hunches—captured only the most obvious red flags. That left a blind spot covering soft fraud (exaggerated injuries, staged incidents), which makes up the bulk of undetected claims leakage.

Challenge: From Overwhelming Volume to Hidden Leakage

Fraud detection at Zurich Canada relied on a patchwork of legacy systems and manual review. Investigators spent 40% of their time on cases that ultimately closed as legitimate. The combined ratio for Canadian P&C had crept up to 96.7 in 2021, crimping profitability, and the CFO wanted a clear ROI play, not another “pilot that may scale someday.”

I saw claims teams drowning in unstructured data—doctor’s notes written in shorthand, repair estimates with suspiciously round numbers, adjuster notes that read like detective novels. We needed to convert narrative text, images, and telemetry into signals we could act on before claims paid out.

Solution: A Production-Grade AI Pipeline in Six Months

Zurich Canada partnered with Frame AI, a New York–based insurtech specializing in conversational AI and claims fraud detection. The goal wasn’t to replace investigators—it was to surface the highest-likelihood fraud cases for prioritized review.

Architecture overview:

  • Ingest: Real-time feed of adjuster notes, repair invoices, police reports, and vehicle telematics from Zurich’s Guidewire ClaimCenter.
  • NLP Layer: Frame’s model, trained on 1.2 million anonymized claims narratives across North America, tagged 47 fraud indicators (e.g., “pain and suffering” without medical corroboration, “pre-existing condition” flagged in prior claims).
  • Scoring: Each claim received a fraud probability score from 0 to 1. Anything ≥0.85 triggered an automatic escalation to a dedicated SIU (Special Investigative Unit) queue.
  • Feedback Loop: Investigators’ final determinations on 15,000 claims were fed back into Frame’s model every month, maintaining accuracy as fraud tactics evolved.

Key decision: We chose an explainable model (Frame uses a distilled BERT variant) so adjusters could understand why a claim was flagged. That transparency was critical for avoiding regulatory pushback and for training new investigators.

Trade-off: The model’s precision was 82%, meaning roughly 18% of flagged claims were false positives. Accepting that noise level was necessary to catch the top 5% of high-value fraud rings. Zurich Canada’s SIU team absorbed the extra workload because the alternative—letting 35% of suspected fraud slip through—was financially untenable.

Results: 35% Drop in Suspected Fraud Payouts, ROI in Month 9

Within nine months of go-live (April 2022 to December 2022), Zurich Canada’s suspected fraud payouts fell from C$18.4 million to C$11.9 million—a 35% reduction. The combined ratio for the book improved by 1.8 points, contributing roughly C$6.5 million to the bottom line.

Metric Baseline (2021) Post-Implementation (2022) Change
Suspected Fraud Payouts C$18.4 M C$11.9 M -35%
Investigator Productivity (cases reviewed per FTE) 42 68 +62%
Combined Ratio (P&C Canada) 96.7 94.9 -1.8
Average Claim Settlement Time (flagged cases) 38 days 26 days -32%

Fraud rings targeting windshield glass replacement collapsed quickly. The AI picked up on identical invoices from the same repair shop, all with the same suspiciously high “administrative fee” line item. Zurich Canada’s SIU team shut down three regional operations within six weeks, saving an estimated C$2.1 million in potential leakage.

On the cost side, the Frame AI license and integration ran C$180,000 per year, plus one full-time data scientist at Zurich to maintain the feedback loop. The payback period was nine months—faster than the internal hurdle rate of 18 months for analytics projects.

Lessons Learned: Where AI Hits Its Limits

1. Data Quality Beats Algorithms

Early in the project, we fed the model adjuster notes littered with acronyms like “SOB” (subjective/objective/billing) and “PT” (physical therapy). Precision dropped to 65%. Cleaning the text corpus—expanding abbreviations, standardizing injury codes—boosted precision to 82%. Lesson: garbage in, gospel out.

2. Investigator Buy-In Was the Bottleneck, Not Code

Some adjusters resisted the AI flagging because it disrupted their workflow. We solved it by giving them veto power: if an adjuster manually downgraded a flagged claim, they had to add a reason code. That simple transparency improved adoption from 60% to 92% within three months.

3. Fraudsters Adapt Faster Than Models

By Q3 2023, we saw a spike in claims where the AI score hovered just below the 0.85 threshold—deliberate “softening” of language in adjuster notes. The model’s precision dipped to 76%. We retrained with updated adversarial examples, but the cat-and-mouse game underscores that AI is a supplement, not a substitute, for human intuition and network analysis.

What’s Next: From Detection to Prevention

Zurich Canada is now piloting a second use case: using the same NLP pipeline to predict litigation propensity before a claim is filed. Early data shows that claims with adjuster notes containing phrases like “pre-litigation” or “legal review” have a 4.2x higher chance of ending in a lawsuit. The goal is to intervene earlier—perhaps offering a structured settlement before the plaintiff’s attorney gets involved.

We’re also testing a parametric trigger for auto glass claims. If a telematics ping shows no impact event but a glass claim is filed within 48 hours, the claim auto-denies and triggers a SIU review. The false-positive rate is 12%, but the savings on clear-cut fraud outweigh the noise.

Fraud detection isn’t a one-time win. It’s a continuous arms race. The lesson from Zurich Canada’s experience is that the real ROI isn’t in the algorithm—it’s in the operational discipline to act on the signals before claims pay out.