Whether your documents are medical claims, genealogy records or invoices, the Microtask Digest service provides an automated pipeline for processing them and extracting the relevant data in the format you need. The pipeline combines human labor with machine intelligence to digest the scanned material efficiently and deliver high-quality results quickly.
Sometimes you already have all the documents you need processed, and sometimes processing is an ongoing activity. The cloud-based Digest service handles both cases by providing a customized pipeline for your project.
Contact us for more information and a quote for your project!
Most OCR solutions are intended to capture the full text of a document and produce a Word or Excel file as a result. When you need to extract specific data in a specific format, these products are of little help. Digest OCR is a unique solution that uses human-assisted machine intelligence to extract just the data you need with high quality.
Unlike regular OCR software that operates on individual pages, Digest OCR looks at all the pages at once, performing statistical analysis, dynamic language modelling and custom dictionary building. This allows it to reconstruct broken words and determine the correct characters even when the original image is damaged or poorly scanned.
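To make the idea of a document-wide dictionary concrete, here is a minimal sketch in Python. It is not Digest OCR's actual implementation: the function names `build_dictionary` and `correct_page`, the `min_count` and `cutoff` parameters, and the use of simple string similarity from the standard library are all assumptions chosen for illustration. Words that recur across pages are treated as trusted, and a damaged word on any one page is replaced by its closest trusted neighbour.

```python
from collections import Counter
import difflib


def build_dictionary(pages, min_count=2):
    """Collect words seen at least min_count times across ALL pages (hypothetical helper)."""
    counts = Counter(word for page in pages for word in page.lower().split())
    return {word for word, count in counts.items() if count >= min_count}


def correct_page(page, dictionary, cutoff=0.75):
    """Replace unknown words with the closest dictionary word, if one is similar enough."""
    candidates = sorted(dictionary)  # sorted for deterministic results
    fixed = []
    for word in page.split():
        if word.lower() in dictionary:
            fixed.append(word)
            continue
        match = difflib.get_close_matches(word.lower(), candidates, n=1, cutoff=cutoff)
        # Keep the original word when nothing in the dictionary is close enough.
        fixed.append(match[0] if match else word)
    return " ".join(fixed)


# Made-up sample pages; the third one simulates a poor scan.
pages = ["invoice total 100", "invoice total 250", "inv0ice t0tal 300"]
dictionary = build_dictionary(pages)
print(correct_page(pages[2], dictionary))
```

A single-page approach would have no way to know that "inv0ice" is a misread, whereas pooling all pages lets the clean occurrences vouch for the damaged one.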
Download our whitepaper comparing Digest OCR to industry-leading OCR products to see the difference!
For handwritten material, Digest provides an ICR solution based on machine-assisted human keying. Digest breaks the document down into small segments, and these microtasks are then completed by a cloud-based workforce. The ICR system provides assistance such as dictionaries, dynamic rules and reference information to ensure high-quality, efficient data entry.
For documents containing both typewritten and handwritten text, Digest will automatically use both OCR and ICR to deliver the best results cost-effectively.
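The microtask approach can be sketched roughly as follows. This is an illustrative Python example only, not the Digest ICR system itself: the `Microtask` class, the `make_microtasks` and `validate` functions, and the sample field names are hypothetical. It shows a document being split into one small keying task per field per page, with an optional dictionary used to check what a worker keys in.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Microtask:
    """One small keying job: a single field on a single page (hypothetical structure)."""
    page: int
    field: str
    allowed: Optional[frozenset] = None  # optional dictionary of accepted values


def make_microtasks(num_pages, fields, dictionaries):
    """Split a document into one microtask per (page, field) pair."""
    return [
        Microtask(page, field, dictionaries.get(field))
        for page in range(1, num_pages + 1)
        for field in fields
    ]


def validate(task, keyed):
    """Accept a keyed value if it is non-empty and passes the field's dictionary, if any."""
    value = keyed.strip()
    if not value:
        return False
    return task.allowed is None or value.lower() in task.allowed


# Made-up example: a two-page form with a free-text field and a dictionary-backed field.
tasks = make_microtasks(2, ["surname", "state"], {"state": frozenset({"ny", "ca"})})
print(len(tasks))                  # number of microtasks created
print(validate(tasks[1], "NY"))    # dictionary-backed field, known value
print(validate(tasks[1], "Nx"))    # dictionary-backed field, unknown value
```

Keeping each task to a single field means a worker never needs the whole document, and a failed dictionary check can send just that one segment back for re-keying.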