Text recognition from scanned documents using Optical Character Recognition (OCR) software. Automate data entry or create searchable PDF files from scanned text documents.

ABBYY Cloud OCR SDK

ABBYY® Cloud OCR SDK is a web-based document processing service that will enhance your enterprise software systems, SaaS platforms, or your mobile apps with the ability to convert documents and utilize textual information from scans, PDFs, document images, smartphone photos, or screenshots.

Combining ABBYY’s latest AI-based technologies for information extraction with the highly scalable processing power of the Microsoft® Azure® computing infrastructure, this secure and reliable ABBYY cloud service can be easily integrated into your application via a REST API—empowering it to precisely convert virtually any number of pages within the shortest amount of time.

Why are the prices of OCR applications so different?

OCR software ranges in price from freeware all the way up to tens of thousands of dollars. What explains the difference between these applications? Here’s the breakdown:

  • OCR Freeware uses the SimpleOCR or Tesseract engines and provide limited scanning and output format capabilities. Recognition quality is generally poor except for the highest quality document images.
  • PDF OCR Converters provide good quality OCR engines like ABBYY, IRIS and OmniPage, but limit the output to searchable PDF files. These cost less than $100.
  • Standard OCR applications range from $100-$200 and provide full OCR capabilities including converting scans to Word, Excel, HTML and other editable formats.
  • Corporate OCR applications add advanced features like automated hotfolder processing, concurrent licensing and other features useful for business applications. Pricing for these is $200-$500.
  • OCR Servers provide scalable, enterprise OCR services for processing very high volumes of documents or providing OCR capabilities to users throughout the organization. Prices start around $1,500 and go up based on processing volume.
  • Enterprise Data Capture and Forms Processing applications are used to capture structured data from complex documents like healthcare claim forms and invoices that include things like tables, handwriting, checkboxes, and movable zones. These solutions can cost anywhere from around $1,000 to hundreds of thousands of dollars depending on the document volume and complexity of the project.

Can OCR be trained for specific fonts?

OCR training was once a critical part of the conversion process. After a document was read, the operator would review the results to correct mistaken characters and these corrections would be used to train the engine so the next time you read a similar document the results are improved.

Modern OCR applications no longer rely on user training for accuracy unless you have very non-standard fonts. These engines have had decades of development and billions of samples used to train their algorithms. In most cases, the introduction of user training will only diminish the results for any documents that are different than the ones being trained.

The training functions still exist for these edge cases, but they are no longer an integral part of the OCR process.

Training in modern OCR is more likely to refer to enterprise data capture applications that use AI-based learning algorithms to find the locations of data points on documents with various different formats, such as invoices.

Knowledge Base

The SimpleOCR Knowledge Base contains frequently asked questions and answers, technical guides and general information on a broad range of optical character recognition, handprint recognition, data capture, PDF OCR, AP invoice scanning and zone OCR applications.

Contact Us for FREE Consultation on Your OCR Project

Kofax OmniPage – Ultimate

Kofax OmniPage Ultimate has several unique features that make it stand out for a variety of applications. Some of these include auto-redaction, SharePoint integration, automatic filing with barcodes, PDF auto-bookmarking, form data collection and MFP support. Most of these new features are not available in the Standard edition.

Kofax OmniPage – Standard

Kofax OmniPage Standard converts paper, picture, and PDF files into editable documents to save you considerable time and money by eliminating retyping. Your documents look just like the original – complete with text, tables, and graphics. OmniPage uses superior character accuracy to precisely format your documents so you can easily make changes.

IRIS ReadIRIS Corporate 17 for PC

The Corporate edition of ReadIRIS adds support for files over 50 pages, business card recognition, as well as automatic processing of hot folders.

IRIS ReadIRIS Pro 17 for PC

Affordable OCR software for business and home users. ReadIRIS Pro provides a very accurate OCR recognition rate at a low cost, but still has some of the advanced features that higher priced professional OCR software includes. The main limitation is that the Pro version is limited to documents under 50 pages.

ABBYY FineReader PDF 15 Corporate

ABBYY FineReader PDF 15 Corporate is an all-in-one business toolset for working with PDFs and document digitization. With FineReader PDF employees can work with both digitally created and scanned paper documents to fulfill various document-related tasks in the digital workplace effortlessly. ABBYY FineReader PDF 15 Corporate allows you to view, edit, search, comment and collaborate, sign and protect PDFs or compare document versions in different file formats to identify differences efficiently. Thanks to the seamlessly integrated AI-based OCR technology with FineReader you can also extract information from a PDF or convert the entire document to Word, Excel® for further editing. Document conversion can also be automated to prepare multiple documents for further processing.

ABBYY FineReader PDF 15 Standard

FineReader PDF 15 Standard is a PDF software application for working with PDF documents and scans. Powered by ABBYY’s AI-based OCR technology it allows you to convert and edit not only digital PDF documents, but also scanned paper documents with the same ease-of-use. With FineReader PDF you can view, edit, search, comment, sign, protect, extract text from PDFs and convert documents into Word, Excel® for further editing.

 

Convert Scanned Image to Text Document

The primary purpose of Optical Character Recognition is to quickly and automatically convert scanned images of machine-printed (typed) text – which to a computer are no more meaningful a collection of pixels than any other image, such as a landscape photo – into actual text data that you can search through and modify.

OCR Software comes in many different types, which vary in price range based on their features, speed, and accuracy. One of the main qualities that OCR producers are using to differentiate their products is volume of the documents OCR will allow you to process. That may be a bit counter intuitive but features that are needed to process hundreds, thousands or millions pages a year are rather different ones.

In case of several hundreds of pages (receipts, checks, medical, tax or legal forms, personal memorabilia)  you need to scan for personal use you would need light, highly versatile, easy to use, not expensive software that will convert images just to text. It may not have automation features, and processing data further will be done manually by you. Thou it is not too hard since volume of documents is not very large and you can treat each of them individually.

Small business users usually process thousands of pages a year and require some automation features. Images need to be converted not just to text, but also to spreadsheets to be processed further. Once the system is set up it is assumed that it will run without much of the interference, and people in charge of document processing would be able to do that with certain ease.

Larger companies processing millions of documents require much larger levels of automation when each small, fine tuned feature would save thousands of work hours in a long run. Multiple machines will be processing documents […]

Go to Top