Key Points
Research suggests AWS Textract is widely used for extracting data from documents like invoices and medical records, saving time and reducing errors.
It seems likely that industries like healthcare, insurance, and lending benefit most, with real-world examples including processing claims and loan applications.
The evidence leans toward major clients like Change Healthcare and Pennymac using it, with case studies showing significant efficiency gains.
An unexpected detail is its application in public sector, like digitizing historical weather data for the Met Office.

Overview
AWS Textract is a machine learning service that extracts text and data from documents, such as scanned PDFs and images, making it easier for businesses to automate document processing. It’s particularly useful for industries needing to handle large volumes of paperwork efficiently.
Real-Life Use Cases
AWS Textract is applied in various sectors to streamline operations:
Healthcare: Used to extract information from medical documents, helping organizations like Change Healthcare manage millions of documents compliantly, and Roche for processing medical PDFs for NLP.
Insurance: Automates claims and policy processing, with Symbeo reducing document processing time from 3 minutes to 1 minute per document, achieving 68% automation.
Lending: Streamlines loan applications, with Pennymac cutting processing time from hours to minutes, and Biz2Credit seeing an 80% reduction in human effort.
Public Sector: Digitizes records, such as the NHS processing 54 million prescriptions monthly and the Met Office handling historical weather data.
Other uses include invoice processing, compliance documents, and legal forms, enhancing efficiency across various business functions.
Clients Using AWS Textract
Many organizations across industries rely on AWS Textract, including:
Healthcare: Change Healthcare, Roche
Insurance: Symbeo, Elevance Health, Healthfirst, nib Group, Wrapped Insurance
Lending: Pennymac, Black Knight, Sun Finance, Biz2Credit
Public Sector: NHS, Business Services Authority, Met Office
Software & Internet: Alfresco, Cox Automotive
Others: BlueVine, Kabbage, Paymerang, Assent Compliance, and many more, with detailed examples like Filevine for legal document management.
For more insights, check out case studies on Amazon Textract Customers and Indecomm
Survey Note: Comprehensive Analysis of AWS Textract Use Cases and Clients
This note provides a detailed examination of Amazon Web Services (AWS) Textract, focusing on its real-life applications and the clients utilizing this service. AWS Textract is a machine learning service designed to extract text and data from various document types, including scanned PDFs, images, and forms, leveraging advanced optical character recognition (OCR) and natural language processing (NLP) capabilities. It is particularly valuable for automating document processing, reducing manual effort, and enhancing operational efficiency across multiple industries. The analysis is based on available documentation, customer case studies, and industry-specific implementations, current as of February 27, 2025.
Real-Life Use Cases by Industry
AWS Textract’s versatility is evident in its adoption across diverse sectors, each with specific needs for document analysis and data extraction. Below, we categorize the use cases by industry, highlighting key examples and benefits:
Healthcare:
Change Healthcare: Utilizes Textract to unlock information from millions of documents, ensuring compliance with HIPAA regulations. This facilitates efficient management of patient records and medical data, reducing manual processing time.
Roche: Employs Textract to extract text from medical PDFs for natural language processing, enabling a comprehensive view of patient data for research and clinical purposes.
The service’s ability to handle sensitive medical documents with high accuracy supports better data-driven decision-making and patient care.
Insurance:
Symbeo, a CorVel Company: Processed 16 million pages using Textract, reducing document processing time from 3 minutes to 1 minute per document, achieving 68% automation. This significantly speeds up claims processing and enhances operational efficiency.
Elevance Health: Uses OCR capabilities to extract and index claims data, improving data accessibility and reducing manual errors.
Healthfirst: Analyzed over 50,000 charts, achieving revenue savings 10-20 times more than usual downstream operations, and referred around 5,000 members for care management, demonstrating cost-effectiveness and scalability.
nib Group: Speeds up claims processing, enhancing customer experience by automating receipt submissions via mobile apps.
Wrapped Insurance: Automatically reads insurance policies from different providers, streamlining policy management and comparison.
These cases highlight Textract’s role in reducing processing times and improving accuracy in high-volume document environments.
Lending:
Pennymac: Reduced document processing time from hours to minutes, accelerating loan approvals and enhancing customer satisfaction.
Black Knight: Leverages Textract through AIVA, driving efficiency in loan processing, and collaborates with Amazon ML Solutions Lab for advanced implementations.
Sun Finance: Automates Know Your Customer (KYC) processes, processing loan requests every 0.63 seconds, showcasing real-time document analysis capabilities.
Biz2Credit: Achieved an 80% reduction in human effort with a near 0 error rate, utilizing the Textract API for loan document processing, demonstrating significant labor savings.
The lending sector benefits from Textract’s ability to handle complex financial documents, reducing turnaround times and operational costs.
Public Sector:
NHS, Business Services Authority: Processes 54 million paper prescriptions per month, leveraging Amazon Augmented AI with Textract for efficient digitization, supporting public health initiatives.
Met Office: Digitizes millions of historical weather observations, enhancing data accessibility for climate research and forecasting, an unexpected application in environmental science.
These use cases illustrate Textract’s role in managing large-scale public records, improving service delivery and archival efficiency.
Software & Internet:
Alfresco: Automates data extraction, improving data integrity and ensuring security compliance, integrating Textract into document management systems.
Cox Automotive: Captures data from loan applications and vehicle titles, streamlining processes for automotive financing and sales.
This sector uses Textract to enhance application functionality, particularly in document-centric software solutions.
Others (Miscellaneous):
Rekeep: Automates 75% of the document pipeline, clearing backlogs and improving workflow efficiency in facility management.
BlueVine: Achieved high automation for Paycheck Protection Program (PPP) loans, saving 400,000 jobs, and collaborated with the Textract team for implementation, as detailed in a case study (BlueVine Case Study).
Kabbage: Automated 80% of PPP applicants, reducing approval time to a median of 4 hours, serving 297,000 businesses and preserving 945,000 jobs, showcasing rapid response capabilities.
Paymerang: HIPAA eligible, extracts data from invoices, standardizing fields for financial operations, ensuring compliance in healthcare billing.
Assent Compliance: Processes compliance documents, using Amazon Comprehend and Amazon A2I alongside Textract, saving hundreds of hours in manual review, as seen on their website (Assent Compliance).
Foresight Group: Automates invoicing with 90% accuracy, saving 15-20 minutes per invoice, enhancing financial reporting.
Baker Tilly: Reads digital forms, leveraging handwriting recognition, integrating with AWS S3 and RDS for seamless data storage and retrieval.
Hnry: Reduces manual transcription, increasing accuracy by 80%, processing thousands of documents daily for accounting purposes.
HelloSign, a Dropbox Company: Increased user engagement, with 83% finding it useful, achieving 26% month-over-month growth and tripling form ratio, detailed in a case study (Dropbox HelloWorks Textract).
HighIQ Robotics Inc.: Extracts data from invoices and contracts, improving straight-through-processing in supply chain management.
Arq Group: Implements a hybrid solution, reducing downtime by 22% and maintenance costs by 18%, enhancing operational resilience.
BDO: Developed an Intelligent Document Processing (IDP) solution, identifying errors in source documents, saving time and cost in auditing.
The Washington Post: Reveals structured data from documents, aiding journalists in reporting, enhancing investigative journalism.
Informed.IQ: Automates verifications, analyzing millions of documents annually, compliant with SOC and ISO standards, for fraud detection.
Eliiza: Achieved 97% labor reduction for Personally Identifiable Information (PII) redaction and 70% man-hours saved for data entry, supporting paperless workflows.
Belle Fleur: Detects text for variety, velocity, and volume, enhancing solutions for medical, legal, and real estate sectors.
PitchBook: Gains 60% process improvement, enhancing data collection from PDFs for financial research.
BGL: Saves 100-150 hours per year per fund, automating bank statements, tax statements, and contracts for fund management.
Lumiq: Reduces 97% PII redaction labor and 70% man-hours for data entry, enabling end-to-end paperless workflows.
Filevine: Offers fast, accurate, and scalable document processing, meeting legal organization requirements for case management.
Perfios Software: Tests Textract to transform the Banking, Financial Services, and Insurance (BFSI) industry, reducing turnaround time for document processing.
QL Resources: Digitizes handwritten forms, completing production data digitization for manufacturing operations.
The Globe and Mail: Extracts table data from PDFs, achieving 10x efficient access for journalists, enhancing newsroom productivity.
Vidado: Provides template-less form recognition, automating workflows and reducing production time in document-intensive industries.
ClearDATA: Extracts medical data from PDFs, integrating with Electronic Health Records (EHR), improving patient experience in healthcare IT.
Inforuptcy: Automates data entry, unlocking insights from bankruptcy documents, increasing business value in legal services.
Kablamo: Reduces labor and time, integrating paper documents, processing hundreds in minutes for various business operations.
MSP Recovery: Handles various document types scalably, automating reading of thousands of documents for healthcare recovery audits.
Camelot: Extracts text, forms, and tables, reducing post-processing efforts and quickly adding new document types for retail operations.
Tekstream: Automates document processing, with Textract Queries improving flexibility and accuracy for enterprise solutions.
Envase Technologies: Simplifies novel document types with Textract Queries, capturing data points efficiently for environmental management.
Client Overview and Detailed Table
The client base for AWS Textract is extensive, spanning multiple industries, each leveraging the service for specific operational needs. Below is a table summarizing key clients, their industries, and notable use cases, extracted from available customer pages and case studies:
Customer | Industry | Key Use Case | Notable Outcome |
Change Healthcare | Healthcare | Unlocks info from millions of docs, HIPAA compliant. | Efficient management of medical records. |
Roche | Healthcare | Extracts text from medical PDFs for NLP. | Comprehensive patient view for research. |
Symbeo, a CorVel Company | Insurance | Processed 16M pages, reduced time from 3 min to 1 min, 68% automation. | Faster claims processing. |
Elevance Health | Insurance | Extracts and indexes claims data using OCR. | Improved data accessibility. |
Healthfirst | Insurance | Analyzed 50,000+ charts, revenue savings 10-20x, referred 5,000 members. | Cost-effective operations. |
nib Group | Insurance | Speeds up claims, enhances customer experience. | Better mobile app integration. |
Wrapped Insurance | Insurance | Reads policies from different providers automatically. | Streamlined policy management. |
Pennymac | Lending | Reduced doc processing from hours to minutes. | Faster loan approvals. |
Black Knight | Lending | AIVA drives efficiency, works with Amazon ML Solutions Lab. | Enhanced loan processing. |
Sun Finance | Lending | Automates KYC, processes loan request every 0.63 seconds. | Real-time document analysis. |
Biz2Credit | Lending | 80% reduction in human effort, near 0 error rate. | Significant labor savings. |
NHS, Business Services Authority | Public Sector | Processes 54M prescriptions/month, uses Amazon Augmented AI. | Efficient public health operations. |
Met Office | Public Sector | Digitizes millions of historical weather observations. | Enhanced climate research. |
Alfresco | Software & Internet | Automates data extraction, improves data integrity, security compliance. | Better document management systems. |
Cox Automotive | Software & Internet | Captures data from loan apps/vehicle titles. | Streamlined automotive financing. |
BlueVine | Others | High automation for PPP, saved 400,000 jobs. | Rapid small business relief. |
Kabbage | Others | 80% PPP applicants automated, reduced approval to 4 hours, served 297,000 businesses. | Preserved 945,000 jobs. |
Paymerang | Others | Extracts data from invoices, HIPAA eligible. | Standardized financial operations. |
Assent Compliance | Others | Processes compliance docs, saves hundreds of hours. | Enhanced regulatory compliance. |
HelloSign, a Dropbox Co. | Others | Increased engagement, 83% found useful, 26% month-over-month growth. | Improved form processing efficiency. |
This table is not exhaustive but represents a subset of the extensive client list, showcasing the breadth of adoption across industries. For a complete list, refer to Amazon Textract Customers.
Additional Insights and Unexpected Applications
An unexpected application of AWS Textract is its use in the public sector for digitizing historical records, such as the Met Office’s work on weather observations, which extends beyond typical business document processing into environmental science. This highlights Textract’s flexibility in handling diverse document types, including handwritten and archival materials.
Case studies, such as Indecomm Case Study, provide concrete metrics, showing Indecomm reduced mortgage document processing time from 30 minutes to 5–7 minutes for a 100-page document, achieving 100% data classification accuracy and 97% data extraction accuracy, with a cost per page processed at 2 cents on average. Such detailed outcomes underscore the service’s impact on operational efficiency and cost savings.
Conclusion
AWS Textract is a robust tool for automating document processing, with real-life use cases spanning healthcare, insurance, lending, public sector, software, and beyond. Clients like Change Healthcare, Pennymac, and Symbeo demonstrate significant benefits, including time savings, cost reductions, and improved accuracy. The service’s adoption across industries reflects its versatility, with unexpected applications like historical data digitization adding to its value proposition.
Key Citations

Comments