PDF OCR X

A Product of Web Lite Solutions

Using PDF OCR X

PDF OCR X is a very simple application. It has a single dialog box, where you choose your input and output settings. This dialog appears after you drag a PDF onto the PDF OCR X icon to convert it.

About these options:

  • Language: The language of the source document. Some languages include special characters, so telling PDF OCR X the language of your source document helps it achieve maximum accuracy. Additional language packs for PDF OCR X are available for download.
  • Layout: If your document is a single column of flowing text, select the "Single Column" option, which is faster than the multi-column option. If your document is laid out in multiple columns or sections, select the "Multi-Column" option, which instructs PDF OCR X to try to guess the structure of the document and detect where columns begin and end.
  • Text Wrap:
    • Soft wrap: Assume that the text is meant to flow from one line to the next in most cases.
    • Hard wrap: Force a line break at the end of each line, even if the break falls mid-sentence.
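
As a rough illustration of the difference between the two Text Wrap options, here is a minimal Python sketch. It is not PDF OCR X's actual code: the function names `hard_wrap` and `soft_wrap`, the sample lines, and the rule that a blank line separates paragraphs are all assumptions made for this example.

```python
# Illustrative sketch only; PDF OCR X's real wrapping logic is not documented here.

def hard_wrap(lines):
    """Keep a line break at the end of every recognized line."""
    return "\n".join(line.strip() for line in lines)


def soft_wrap(lines):
    """Let text flow from one line to the next.

    Assumption: a blank line marks the end of a paragraph.
    """
    paragraphs, current = [], []
    for line in lines:
        stripped = line.strip()
        if stripped:
            current.append(stripped)
        elif current:
            paragraphs.append(" ".join(current))
            current = []
    if current:
        paragraphs.append(" ".join(current))
    return "\n\n".join(paragraphs)


# Hypothetical lines as they might come off a scanned page.
ocr_lines = [
    "PDF OCR X converts scanned",
    "pages into plain text.",
    "",
    "Each page is processed in turn.",
]

hard_text = hard_wrap(ocr_lines)
soft_text = soft_wrap(ocr_lines)
```

With these sample lines, `hard_text` keeps the break between "scanned" and "pages", while `soft_text` joins the first two lines into one sentence and keeps only the paragraph break.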

 

Disclaimer
PDF OCR X uses OCR (optical character recognition) to convert images of text into text. While the technology is quite good at deciphering legible text, it has limitations, and some text may not be extracted correctly.