Creating tagged PDF files
The PDF output format in OmniPage uses predefined MarkedContent object types defined in PDF 1.4.
Human reading object is the original object type in OmniPage.
Tagged object is a MarkedContent object type used in a PDF StructTree, the Attributes column shows the standard attributes of the Tagged object. If an attribute is not filled in a PDF file, the value of the attribute is the default (leftIndent, 0 spacebefore, …) according to the PDF 1.4 reference.
Human reading object |
Tagged object |
Attributes |
Document |
Document |
|
Section, |
Sect |
|
Box, |
Div |
Position |
Picture |
Figure |
Position, |
Table |
Table |
Spacebefore |
Table Row |
TR |
|
Table Cell |
TD |
Width |
Para |
P |
Spacebefore |
TextRun |
Span |
Underline xor |
To create a tagged PDF
-
Select a PDF output format except “PDF Searchable Image” in the Save to File dialog box.
-
Click Options….
-
In the Converter Options dialog box, under PDF compatibility, select PDF 1.4 or PDF 1.5 or Optimize for quality according to your needs.
-
Check that Create tagged PDF is selected. If not, select it.
A non-tagged PDF will be created in the following cases:
-
if Optimize for size is selected under PDF compatibility, or
-
if Show background image layer is selected under Processing mode, or
-
if pages are appended to a tagged PDF file, the appended pages will not have tag information.