OCR PDF with ABBYY

From xBio:D Wiki
Revision as of 20:04, 4 June 2013 by Jcora (talk | contribs)
Jump to navigation Jump to search

Introduction

This section contains instruction on how to OCR text from a PDF in Abbey FineReader 9.0. In order to be accurately OCR'd, the resolution of the PDF should be at 600 dpi. Abbey provides an interface to verify the interpreted text against the original PDF scan.

Open PDF

Start Abbey, then click the Open button under the "1 Document" heading.

Open PDF

Navigate to the PDF you wish to OCR, and select. The first stage of the OCR process within Abbey is to analyze the document. Analysis will identify the various elements within the PDF like inline, tables, figures, and headers. Make sure to change "Document Languages" to the proper value(s).

OCR Text

After the document has been loaded and analyzed, the text will need to be OCR'd, or "Read" as is the vernacular within Abbey. Under the "2 Image" heading, press the "Read Document" button to initialize the OCRing process. This may take some time for large documents.