Amazon Textract, a machine learning service that extracts text and structured data from any document or image, now offers specialized support for invoices and receipts. Until today, these important documents were difficult to process at scale because they do not follow set design rules, and often require context to interpret correctly. For example, customers might need to extract the vendor name from the Amazon logo at the top of an invoice even though it is not labeled “Vendor: Amazon”. Now with Textract, customers can extract explicitly labeled data, implied data, and line items from itemized list of goods or services from almost any invoice or receipt without any templates or configuration.
Source:: Amazon AWS