

Here we've loaded a 17 page scan and processed the splits in accordance with our newly defined delivery notes document type. Step 5 - Processing the splits - Once set up, we can now load individual files or set up folder watching to process files. Step 4 - File Naming and paths we can finish the document type configuration by defining how the file is to be named using barcode, system or text extraction keywords, and defining where the files will be coming from and go to. All text will utilize the character definitions defined earlier and will use the regular expression to extract just the specific text we desire. Step 3 Define our Zone - Now select the icon to define a rectangular zone from which we will perform the OCR extraction of text. In this script we are looking for "" at the end of the string and extracting everything in between. This offers a way to pinpoint the exact text patterns we want to extract.

Step 2 Regular Expressions - We can also take advantage of the uniqueness of the text and Regular Expression. You can enter any specific characters or use the built in character types found in the OCR Settings panel. All text identified in the region of interest will only result in these characters. Step 1 OCR Fine Tuning - One of the first things to do is to set the OCR engine to only recognize specific characters.

We will walk through the steps of ensuring the highest degree of automation with ImageRamp in this article.
Ocr automator using pdfscanner how to#
How to extract Zonal OCR text to name and split files?Įxtracting content from existing scanned documents can save significant time when done right.
