Advanced OCR
with OmniPage and FineReader
Overview
Optical character recognition
Structural recognition
Options
Loading
Zoning
OCR
Editing
Optical Character
Recognition (OCR)
OCR turns pictures of text into e-text
Does well unless…
– The
– The
– The
– The
– The
picture is fuzzy
contrast is poor
font is unusual
font is too small or too large
material has unusual characters
Structural Recognition
Analyzes the layout of the page
– Columns
– Headings
– Graphics
– Tables
Usually does fairly well, unless the
layout is non-standard
Programs that Run OCR
Programs for consumers
– Kurzweil 1000, 3000
– OpenBook
– Intel Reader
– Many others…
Programs for production
– ABBYY FineReader
– Nuance OmniPage
Consumer Programs
Highly automated
Designed for individuals who have
print disabilities
Are not good production tools
– Do not provide flexibility
– Do not allow much overriding
– Interfaces not designed for editing
Production Programs in
General
A good program for production allows
you to…
– Control the zones (areas or blocks of text
and graphics)
Add, delete, change
– Edit easily
– Improve recognition
Preferred Programs
ABBYY FineReader
– Relatively easy to learn
– Fairly intuitive
– Good structural recognition
Nuance OmniPage
– Less intuitive but more accessible
– Often does better with technical materials
Both Good Tools
If you can afford to have both, it’s
nice, but not absolutely necessary.
If you have both, run a couple test
pages through each to see which is
doing better on a particular job.
Under the Hood
For best results with a program, set up
your options before you begin!
Tools > Options
Lots of Languages
FineReader and OmniPage handle
multiple languages.
For foreign language, turn on all the
languages in the book.
– It will recognize the diacritical marks.
– Turn on what you need, but only what
you need.
Math
If you are running OCR on math, try
turning on Greek.
– Greek will allow the program to recognize
alphas, deltas, sigmas, etc.
Another Decision
Detect page orientation or not?
– Does not always get it right
– Try it if you have many pages turned
Considerations
You may or may not want to keep
headers and footers.
– I generally keep them to pull the page
numbers.
You may want to keep the page
breaks.
– Retaining page breaks helps to maintain
one-to-one page correspondence with the
book.
Fitting Everything
In some cases, you may need to work
with a custom paper size to fit
everything onto one page.
This feature can be helpful when you
are retaining everything on the page
but not the layout.
Loading Files
“Open”
– Opens saved program files
“Load”
– Loads image files to process
Note that this same issue comes up
with saving!
Wizards Are Evil…
Do not rely on the automation
Load the image file and choose the
processes you want
Workspace
The program has three primary areas
Pages Pane
– Either thumbnails or details
– Allows simple navigation of pages
Image Pane
– Your graphic
Text Pane
– Area where the text from OCR will show
More Accessible
Both programs have a detail view.
– Shows text instead of graphics
Detail view is more accessible for
screen readers.
Otherwise, it is personal preference.
Two Ways to Save
To Save the program file to access
later in the OCR program, choose File
> Save
– This saves your work file.
You save your converted file during
the last phase of the processing.
Production Tips
Work with dual monitors
– Check your computer and video card
Stretching an OCR program across two
monitors is a HUGE time-saver!
Learn to use keyboard shortcuts.
– They save tons of time!