National Institute of Electronics & Information Technology: Gorakhpur Center
National Institute of Electronics & Information Technology: Gorakhpur Center
National Institute of Electronics & Information Technology: Gorakhpur Center
Gorakhpur Center
Ministry of Electronics & Information Technology (MeitY), Government of India
Shubhra Dubey
Project Engineer
In this pre-processing step, you can add multiple document types and the
fields you are interested in extracting. For example, you can work with
Invoices, wanting to extract the vendor and the total amount, and with
medical forms, wanting to extract insured ID number and patient name.
• Extraction is getting just the data you are interested in. For
example, extracting specific data from a 5-page document is quite
troublesome if you want to do it with string manipulation. In this
framework, you can use different extractors, for the different
document structures, in the same scope application. The
extraction results are passed further for validation.
• Once you have your validated information, you can use it as it is, or
save it in a DataTable format that can be converted very easy into
an Excel file.