The Challenge
A mid-size US property and casualty insurance company processing 8,000+ claims monthly required claims adjusters to manually extract data from submitted documents — photos, repair estimates, medical records, and police reports. Each claim required an average of 45 minutes of manual data entry across 3 different systems. Error rates ran at 12%, causing downstream processing delays and customer complaints. The company was losing $2.1M annually in labor costs for manual data entry alone. Previous attempts with off-the-shelf OCR solutions achieved only 68% accuracy on their document types (many were handwritten, photographed at angles, or poor quality scans), requiring more manual review than they saved. The VP of Claims needed a solution that could handle their specific document types with 95%+ accuracy while integrating with their existing Guidewire ClaimCenter system.
Our Approach
We assembled a 4-person offshore team: 1 senior ML engineer specializing in computer vision, 1 NLP engineer, 1 backend developer, and 1 QA/data annotation specialist. Phase 1 (weeks 1-4): Data analysis of 5,000 historical claims documents across 12 document types. Built a custom annotation pipeline and labeled 3,200 documents for training. Established baseline accuracy metrics for each document type. Phase 2 (weeks 5-10): Developed a multi-model pipeline — document classification using a fine-tuned Vision Transformer (ViT), text extraction using PaddleOCR for printed text and a custom CNN for handwritten content, and entity extraction using a fine-tuned BERT model trained on insurance-domain language. Implemented confidence scoring to auto-route low-confidence extractions to human review. Phase 3 (weeks 11-14): Built a FastAPI backend with async processing, deployed on AWS Lambda for auto-scaling. Integrated with Guidewire ClaimCenter via REST API — extracted data populates claim fields automatically with human-in-the-loop approval for fields below 90% confidence. Phase 4 (weeks 15-18): Production deployment with A/B testing — running AI-assisted claims alongside manual processing for 4 weeks to validate accuracy and measure time savings. Iterative model improvements based on production corrections.
Project Timeline
Data Analysis & Annotation
Weeks 1-4Analyzed 5,000 historical claims across 12 document types. Built annotation pipeline and labeled 3,200 documents for training.
Model Development
Weeks 5-10Multi-model pipeline: Vision Transformer for classification, PaddleOCR + custom CNN for text extraction, fine-tuned BERT for entity extraction with confidence scoring.
Backend & Integration
Weeks 11-14FastAPI backend with async processing on AWS Lambda. Guidewire ClaimCenter integration via REST API with human-in-the-loop approval.
Production & Validation
Weeks 15-18A/B testing alongside manual processing for 4 weeks. Iterative model improvements based on production corrections.
Key Outcomes
The Results
Document processing accuracy reached 97.3% across all 12 document types (up from 68% with off-the-shelf OCR). Average claim intake time dropped from 45 minutes to 87 seconds for fully automated claims (72% of volume) and 8 minutes for human-in-the-loop claims (28% of volume). Claims adjusters were redeployed from data entry to higher-value investigation and customer communication roles. Error rates dropped from 12% to 2.7%. The system processes 8,000+ claims monthly with auto-scaling handling peak periods (natural disaster claims surges) without degradation. First-year ROI was 340% when accounting for labor reallocation, error reduction, and faster claim resolution. Customer satisfaction scores improved by 18 points as average claim processing time decreased from 12 days to 5 days.
"We evaluated three enterprise IDP platforms before finding Offshore1st. Their AI team didn't just build a document extraction tool — they built a system that actually understands insurance documents. The accuracy numbers are remarkable, and the human-in-the-loop design gives our adjusters confidence in the output."
Tech Stack Used
Ready to Achieve Similar Results?
Tell us your project requirements and we'll build the right offshore team to deliver exceptional outcomes.