Workflow Assistant – Recognize Images

Layout Description: Choose a layout description for your incoming pages to influence how the program auto-zones pages. Select Automatic to let the program decide; this is best in most cases or if the pages in the document have different layouts. When all pages are similar, choose the setting that most closely matches your document.

Optimize the OCR process for Speed / Accuracy: Choose whether to optimize recognition for speed or for accuracy. Choosing Speed for good-quality documents can still yield acceptable accuracy.

Languages and dictionaries

Languages in document: Select the languages that appear on the page(s) that you are going to recognize. For better recognition, select only languages that are on the page(s). Your most recent choices appear at the top of the list; below them, the languages are listed alphabetically. Type a letter to jump to that letter in the list.
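The list ordering described above can be sketched in a few lines of Python. This is only an illustration, not part of OmniPage: the function name order_languages and the sample language names are hypothetical, and the sketch assumes that recently chosen languages are not repeated in the alphabetical part of the list.

```python
def order_languages(all_languages, recent):
    """Return recent choices first (in the order given), then the rest A-Z.

    Assumption: a language shown among the recent choices is not
    listed again in the alphabetical part.
    """
    # Keep only recent entries that are actually available.
    recent_first = [lang for lang in recent if lang in all_languages]
    # Everything else is sorted alphabetically, ignoring case.
    rest = sorted(
        (lang for lang in all_languages if lang not in recent_first),
        key=str.casefold,
    )
    return recent_first + rest


ordered = order_languages(
    ["German", "English", "French", "Spanish"],
    recent=["Spanish", "English"],
)
print(ordered)
```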

User dictionary: Select the user dictionary that you want to use for proofreading OCR results. An entry named Custom may appear; this is Microsoft Word’s default user dictionary.

Professional dictionaries: Use this to choose specialized dictionaries for certain professions. These dictionaries are consulted in addition to the standard and user dictionaries.

Verify language choices: Select this to be warned of conflicts between the selected languages and the languages detected in incoming documents. It differentiates between Japanese, Chinese, Korean and non-Asian languages. See Asian language recognition.

Detect single language: This removes the need to choose languages beforehand; it can be useful for unattended processing when documents in different languages may be encountered. It assigns one language to each incoming page. See OCR options. Select the checkbox to see which languages can be handled; while this option is selected, individual language choices cannot be made.

Choose language group: This lets you define which language groups are enabled for single-language detection. You can choose the languages with dictionary support that use the Latin alphabet, the four Asian languages, or both groups.

Fonts and characters

Font Matching…: Click this to select how fonts are matched to font types detected in your document. It applies only to non-Asian languages.

Reject character: Type in the character that you want OmniPage to substitute for characters that it cannot recognize. The reject character, a tilde <~> by default, appears in red in the recognized document.
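If recognized text is exported as plain text, the red coloring is lost, but the reject characters themselves can still be located. The following Python sketch is an illustration only and does not use any OmniPage interface; the function name find_rejects and the sample text are hypothetical. It assumes the default tilde is in use and that the text contains no genuine tildes, since those cannot be told apart from reject characters in plain text.

```python
REJECT_CHAR = "~"  # default reject character; change if another one was set


def find_rejects(text, reject_char=REJECT_CHAR):
    """Return 1-based (line, column) pairs for every reject character."""
    positions = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        for col, ch in enumerate(line, start=1):
            if ch == reject_char:
                positions.append((line_no, col))
    return positions


# Hypothetical exported text with three unrecognized characters.
sample = "Inv~ice total: 42\nDue d~te: 1 M~y"
for line_no, col in find_rejects(sample):
    print(f"Unrecognized character at line {line_no}, column {col}")
```

Choosing a reject character that never occurs in your documents makes this kind of check more reliable.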

Additional characters: Enter accented letters here that you want validated for recognition in addition to those already validated by your language choice. Enter characters from your keyboard or from the character map. The button with three dots opens the character map.

Retain features

Look for headers and footers: Select this to have headers and footers detected and available for export. Deselect this to have headers and footers treated as ordinary text.

Look for hyperlinks: Select this to have hyperlinks detected, displayed as functioning links in the Text Editor and available for export.

Retain text and background color: Select this to have colored text and backgrounds detected, displayed in the Text Editor and available for export. This option is grayed out if Optimize the OCR process for Speed is turned on.

Retain inverted text: Select this to retain inverted text (white or pale letters on a black or dark background). Deselect this to have inverted text transformed to normal text. This option is grayed out if Optimize the OCR process for Speed is turned on.

Use PDF fonts: Select this to override your font matching instructions, whenever possible, when opening PDF files. OmniPage can read the font list in PDF files; it searches for these fonts on your system and uses them for the converted document.

 
