Forms that have been filled out by hand, with data that needs to be collected and entered into a database, can be processed automatically with OCR and ICR technology using forms processing software.

Creating forms optimized for handprint recognition

Handprint recognition applications can provide dramatically different results in terms of accuracy depending on whether the form is designed with intelligent character recognition (ICR) in mind.

Forms Processing applications like ABBYY FlexiCapture have a built-in form design tool with ICR-optimized field layout elements and rules that validate whether your form uses best practices for recognition. These forms can be automatically converted to recognition templates for scanning for data capture. This saves you dozens of hours of trial and error during the design process and even more in data entry once the filled in forms are collected.

Best practice recommendations for ICR and OCR forms include:

  • Plenty of space between form elements and labels, at least 0.5cm / 0.25in
  • Use drop out colors for form backgrounds when possible
  • Hand printed characters should be constrained with boxes or combs to force filler to write legible, separated, printed characters
  • Use check boxes instead of handprint when possible since these are nearly 100% accurate
  • Use numeric codes instead of alphanumeric text when possible to reduce the number of possible characters and increase accuracy
  • Use validation rules to check against possible values and flag data with incorrect values
  • Check box fields can be used to verify the presence of signatures
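As a rough illustration of the validation-rule recommendation above, the following Python sketch checks a recognized numeric code against a list of possible values and flags anything that needs review. The code list and the function name are hypothetical and are not part of any forms processing product.

```python
import re

# Hypothetical set of valid codes; a real form would load these
# from a lookup table or database.
VALID_DEPARTMENT_CODES = {"101", "102", "205", "310"}


def validate_code(raw_value, valid_codes=VALID_DEPARTMENT_CODES):
    """Return (cleaned_value, needs_review) for a recognized numeric code."""
    cleaned = raw_value.strip()
    # A numeric-only field should contain digits and nothing else.
    if not re.fullmatch(r"\d+", cleaned):
        return cleaned, True
    # Flag values that are well formed but not among the possible values.
    return cleaned, cleaned not in valid_codes
```

A value such as "2O5" (letter O instead of zero) is flagged by the digit check, while "999" is flagged because it is not in the list.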

Using ABBYY Vantage Document Skills

Processing Your First Documents with Vantage

Learn how easy it is to get started with Vantage – upload your documents and Vantage will take care of the rest.

 

How to Create and Train a Vantage Document Skill

Learn how to use the Vantage Skill Designer to create and train a new Document Skill with just a few sample documents.

 

How to Create and Train a Classification Skill in ABBYY Vantage

Learn how to use the Vantage Skill Designer to train a new Classification Skill. You need just a few samples of each document class.

 

 

How to Automate a Complete Workflow, by Creating a Vantage Process Skill

 

 

How to Edit a Document Skill

Learn how to adapt already existing skills to your specific documents and business requirements.

 

 

How to perform the first authentication in Vantage Swagger UI?

To get a first access token, perform the initial authentication using the default client. You do not need to enter a password or client ID because the initial authentication is preconfigured. Open a Swagger page (EU link or US link) and click Authorize.

Select all scopes, and click Authorize again.

A password needs to be specified only for a custom client. A custom client can be created after the initial authentication.

References

EU Help: Getting a Tenant Identifier or US Help: Getting a Tenant Identifier

EU Help: Creating a Client or US Help: Creating a Client

Learn more at ABBYY […]

Reading Handprint, Checkmarks, and Forms with FlexiCapture and Vantage

ICR – Intelligent Character Recognition


  • Intelligent Character Recognition (ICR) is an extension of optical character recognition (OCR) technology. While OCR is designed to extract machine-printed characters, ICR retrieves information provided as hand-printed characters.
  • ICR can extract hand-printed characters that are separated and written as individual characters in areas or zones. These zones need to be specified as fixed fields of a machine-readable form or, alternatively, detected automatically.

Example of a form containing hand-printed characters:

icr-form-illu.png

Important note: ICR is not able to extract texts in “cursive handwriting” as in this example:

old-handwriting-illu.png

  • In most cases, the ICR technology is linked to Field Level / Zonal Recognition and forms processing.
  • To enhance the ICR recognition accuracy, it is recommended to use meta data, for example regular expressions, dictionaries or database lookups.
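The use of dictionaries and regular expressions to improve ICR results can be sketched in Python as follows. This is a minimal illustration using only the standard library; the dictionary contents, the five-digit pattern, and the function names are assumptions chosen for the example, not part of any ABBYY SDK.

```python
import difflib
import re

# Hypothetical dictionary of values the field is allowed to contain.
STATE_DICTIONARY = ["TEXAS", "UTAH", "OHIO", "IOWA"]

# Hypothetical field format: exactly five digits.
ZIP_PATTERN = re.compile(r"\d{5}")


def correct_with_dictionary(recognized, dictionary, cutoff=0.7):
    """Return the closest dictionary entry, or None if nothing is similar enough."""
    matches = difflib.get_close_matches(
        recognized.upper(), dictionary, n=1, cutoff=cutoff
    )
    return matches[0] if matches else None


def is_valid_zip(recognized):
    """Check the recognized text against the expected field format."""
    return ZIP_PATTERN.fullmatch(recognized) is not None
```

A misread such as "TEXA5" is mapped back to "TEXAS", while a value that fails the pattern check can be routed to manual verification.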

ICR in ABBYY SDKs

The following ABBYY SDKs and products support ICR:

  • FineReader Engine
    Since version 12 Release 3, ICR is also included in the Linux version, and since version 12 Release 4 it is also included in the Mac version of FineReader Engine (in earlier versions, ICR was supported only in the Windows version).
  • FlexiCapture SDK – this SDK is designed for forms processing and data extraction; ICR and template matching for fixed forms are part of the default feature set. In addition, ABBYY offers this technology as a product in the form of the FlexiCapture platform.
  • Cloud OCR SDK – the ABBYY OCR service, allows reading zones that contain hand-printed, separated characters. This online OCR service […]

How to configure a Batch Splitting step to split on a blank value

In PaperVision Capture, a batch splitting step can be configured to split on one or more conditions. In some cases it may be desirable to split a batch based on a blank value within an index field. This can be achieved by using a String Comparison or a Regular Expression.

The following steps should be used to configure batch splitting using a blank value. Note: These steps assume you will be splitting the batch based on an index field called “ExampleIndexField”. The index field should already exist in the job.

To split the batch on a blank value using the String Comparison type:

  1. Set up the Target Job Configuration.
  2. Add a batch split step.
  3. Add a New Condition.
    • The condition source: Capture Index
    • Choose Capture Index: “ExampleIndexField”
    • Choose Comparison Type: String Comparison
    • Leave the drop down on the equal sign “=” and leave the text box blank.
    • Click Finish
  4. The condition should read (CI.ExampleIndexField = “”)

 

To split the batch on a blank value using the Regular Expression Comparison type:

  1. Set up the Target Job Configuration.
  2. Add a batch split step.
  3. Add a New Condition
    • The condition source: Capture Index
    • Choose Capture Index: “ExampleIndexField”
    • Choose Comparison Type: Regular Expression
    • Input the Regular Expression which represents any blank space characters: ^\s*$
    • Click Finish
  4. The condition should read (CI.ExampleIndexField RegEx.Match(“^\s*$”))
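The difference between the two comparison types can be illustrated outside PaperVision Capture with a short Python sketch. It uses Python's own regular expression engine, which may differ in details from the one PaperVision Capture uses, and the function names are hypothetical.

```python
import re

# The same pattern as in the steps above: only whitespace, or nothing at all.
BLANK_PATTERN = re.compile(r"^\s*$")


def is_blank_by_string_comparison(value):
    """Equivalent of comparing the index value to an empty text box."""
    return value == ""


def is_blank_by_regex(value):
    """Equivalent of matching the index value against ^\\s*$."""
    return BLANK_PATTERN.match(value) is not None
```

In this sketch the string comparison treats only a truly empty value as blank, while the regular expression also treats a value made up of spaces or tabs as blank.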

How to use the [CURRENTDATETIME] tag in WorkFlow Pre-Conditions

The [CURRENTDATETIME] tag represents the current system time. It can be used to set up WorkFlow pre-conditions and to reference dates within Records Retention jobs.

When setting up Records Retention policies and WorkFlow definitions, it may be helpful to be able to reference the current system time to determine if documents should be selected.  The [CURRENTDATETIME] is the current system time of the automation server when it runs the specified operation.  This means the value is always changing.  Date ranges can also be used, for example:

[CURRENTDATETIME+1Y] = Current Date/Time plus 1 year

Example:

If you want to bring documents into a WorkFlow 90 days after a specified date index field, set up the WorkFlow pre-condition so the date field’s From range is [CURRENTDATETIME-50Y] and the To range is [CURRENTDATETIME-90D].  This means that every time the WorkFlow checks for new documents, it uses the time the operation runs as the CURRENTDATETIME value.  Any document where the date field falls in the range of the current system time minus 50 years to the current system time minus 90 days will be brought into the WorkFlow.  If a document is added with today’s date, it will not enter the WorkFlow until 90 days have passed.
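The range logic in this example can be sketched in Python to show which dates qualify. This is only an illustration of the arithmetic: the function names are hypothetical, and the inclusive bounds and the handling of February 29 are assumptions rather than documented product behavior.

```python
from datetime import datetime, timedelta


def minus_years(moment, years):
    """Subtract whole calendar years, falling back to Feb 28 for Feb 29."""
    try:
        return moment.replace(year=moment.year - years)
    except ValueError:
        # Feb 29 does not exist in the target year (assumed fallback).
        return moment.replace(year=moment.year - years, day=28)


def enters_workflow(index_date, now):
    """Check whether a date index value falls inside the pre-condition range."""
    range_from = minus_years(now, 50)     # [CURRENTDATETIME-50Y]
    range_to = now - timedelta(days=90)   # [CURRENTDATETIME-90D]
    # Inclusive bounds are an assumption made for this sketch.
    return range_from <= index_date <= range_to
```

Because `now` is re-evaluated on every check, a document dated today fails the test at first and passes it once the system time has moved 90 days past the index date.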

Applications

When you scan a document that has text or numeric data on it, you are able to read and understand what is written in the scanned image. However, to a computer, the resulting image file is just as meaningless an assortment of pixels as a landscape photo. In order to transform this information into an editable format that you can search through, copy, and modify without retyping it manually, you will need Optical Character Recognition (OCR) software.

There is a wide variety of OCR software available. While all of these packages share the ability to convert images of machine-printed (not handwritten) text or numbers into an editable format, they often differ in features, accuracy, price, and language options.

You can find the various types of OCR software with a description of each below.

Users within a single department, working from home or who have a small business can simply scan their documents to a folder that is shared to everyone. In this “ad-hoc” scenario you only need some basic document scanning software to simplify and bring consistency to your filing system.

If you want to move to the next level, there are Desktop Document Management options that provide an all-in-one means for capture, storage, search and retrieval of documents. Additionally, they provide security, advanced capabilities and ease of use above that of the ad-hoc methods.

And let’s not forget cloud-based options that alleviate the need to maintain storage servers or keep software up to date.

Need a simple, no frills OCR solution without spending hundreds of dollars on a professional software package? Look no further. There is a no cost, donation optional, OCR freeware solution for […]

2022-06-21T12:06:06-04:00

ABBYY OCR

ABBYY is one of the leading OCR (Optical Character Recognition) companies in the world. They offer a large variety of document capture and automation products, starting with FineReader Pro for individuals and small businesses and FineReader Corporate. If you need to process many thousands or millions of pages, ABBYY has FineReader Server for full-text OCR and FlexiCapture for OCR data capture. Many companies use these products for their flexibility and scalability, and there is always a way to customize ABBYY OCR products to fit your automation needs.

ABBYY FineReader OCR software helps individuals turn scans of paper documents, PDF files, and digital photographs into searchable and editable formats. Unmatched text recognition accuracy and document conversion capabilities virtually eliminate retyping and reformatting. Intuitive use and one-click automated conversion tasks let you do more with this OCR software in fewer steps. Up to 190 languages are supported for text recognition and document conversion, a record on the OCR/PDF software market.

ABBYY FlexiCapture is a powerful data capture and document processing solution. It is designed to transform streams of documents of any structure and complexity into business-ready data.  Solid recognition technologies, automatic document classification, and a highly scalable and customizable architecture allow it to help companies and organizations of any size to streamline their business processes, increase efficiency and reduce costs.

ABBYY FineReader Server is powerful server-based OCR software for automated document capture and PDF conversion. Designed for mid- to high-volume batch processing, it enables organizations and scanning service providers to establish cost-efficient processes for converting paper, as well as TIFF, JPEG, and PDF image documents into electronic files suitable for full-text search and long-term digital archiving.

ScanStore and SimpleSoftware are highly experienced integrators of ABBYY […]

Batch OCR Software

Batch OCR for Full-Text Conversion & Searchable PDF

The primary purpose of Optical Character Recognition is to quickly and automatically convert scanned images of machine-printed (typed) text into actual text data that you can search through and modify.

Batch OCR software allows for the conversion of multiple files at once, usually through a hot folder or watched email inbox method that converts any files added to a particular folder.
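The hot folder method can be sketched in a few lines of Python. This is a simplified illustration, not how any particular product implements it: the `placeholder_convert` function stands in for a real OCR engine call, and all names and folder layouts are hypothetical.

```python
import shutil
from pathlib import Path


def placeholder_convert(source, output_dir):
    """Stand-in for a real OCR call; it only writes a marker text file."""
    target = output_dir / (source.stem + ".txt")
    target.write_text(f"OCR output for {source.name} would go here\n")
    return target


def process_hot_folder(hot_folder, output_dir, processed_dir,
                       convert=placeholder_convert):
    """Convert every file currently in the hot folder, then move it away."""
    hot_folder = Path(hot_folder)
    output_dir = Path(output_dir)
    processed_dir = Path(processed_dir)
    output_dir.mkdir(parents=True, exist_ok=True)
    processed_dir.mkdir(parents=True, exist_ok=True)

    handled = []
    for source in sorted(hot_folder.iterdir()):
        if not source.is_file():
            continue
        convert(source, output_dir)
        # Moving the original out prevents it from being converted twice.
        shutil.move(str(source), str(processed_dir / source.name))
        handled.append(source.name)
    return handled
```

Calling `process_hot_folder` repeatedly, for example in a loop with a short sleep between passes, gives the watched-folder behavior: any file dropped into the folder is picked up on the next pass.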

The ability to watch a hot folder and automatically convert documents is included in the complete versions of desktop OCR products, like FineReader Corporate, OmniPage Ultimate or ReadIRIS Corporate.

While automatic processing is available in these applications, they are not designed for true server-based processing since the application has to be running on the user’s desktop. OCR servers are designed for unattended batch OCR processing and high-volume applications that require multiple CPUs and processing workflows.

Those applications are all designed for traditional, full-page OCR conversions to text, Word, Excel, or searchable PDF documents.

Batch OCR for Data Capture

Forms Processing OCR Data Capture systems are designed to read specific data points from documents and output structured data like CSV, XML, JSON or SQL databases. SimpleIndex, FlexiCapture and PaperVision Capture all offer batch zone OCR as well as advanced features like AI-based training, invoice processing and line items.

OCR Experts At Your Service

Our OCR experts can help you find the batch OCR software that is right for your project, as well as providing remote installation, setup, training and support that’s not available for most desktop OCR applications. We can also help with enterprise implementations, custom API integrations, […]

Forms Processing

What is ICR, Survey & Forms Processing?

ICR stands for Intelligent Character Recognition and is the technology that allows software to interpret hand printed text on scanned images.

Forms Processing Software uses ICR technology to automate data entry tasks involving hand-filled surveys, applications and forms. It provides interfaces for scanning, recognition, data verification and export, as well as management and monitoring tools to track large volumes of documents and data through the workflow.

Forms Processing also includes OCR (Optical Character Recognition) technology to recognize machine printed text, and OMR (Optical Mark Recognition) for check boxes and multiple choice bubbles.

It is also possible to use these applications to automate data collection from PDF forms, Word documents, Excel spreadsheets, and other formats used to fill out forms electronically. Many include the ability to publish forms as paper, fillable PDF and web pages simultaneously to distribute and collect data from multiple sources into one dataset.

Who can benefit from forms processing software?

Any organization that collects data on paper-based forms, surveys or applications on a regular basis can get a very high return on investment by automating the data entry with forms processing software.

You do need to have a significant number of forms to justify the expense, at least a hundred forms per month or more depending on how much data is being captured. If the data entry task can be done in under 100 man-hours then it is not a good candidate for automation with ICR software.

Organizations that have many separate departments that collect data on forms can share the budget for forms processing software by re-using it for other projects. Your current project may not be big enough to justify the expense, but when combined with one or two others it would be.

How much do […]

SimpleOCR | OCR Software Experts


Document Scanners & Scanner Parts

Accurate OCR starts with quality images. Efficient OCR starts with fast scanning. Find Document Scanners built for OCR at ScanStore.

Our Team of OCR experts is here to help! SimpleOCR is not just Freeware, we have every kind of OCR solution from PDF Converters to Enterprise Data Capture, OCR Servers and Handprint Recognition for Forms and Surveys. Live chat with an OCR specialist now or Contact Us for a consultation on your OCR project.

SimpleOCR is the popular freeware OCR Software with hundreds of thousands of users worldwide. SimpleOCR is also a royalty-free OCR SDK for developers to use in their custom applications.

SimpleIndex is OCR built for business, offering powerful batch scanning, OCR server, and data capture features with a simple user interface and affordable licensing.

If you like free stuff, freeware versions of our SimpleView Document Viewer (with Tesseract OCR), SimpleCoversheet Bar Code Printer, and SimpleExport CSV to XML Converter are also available.

If you have a scanner and want to avoid retyping your documents, SimpleOCR is the fast, free way to do it. The SimpleOCR freeware is 100% free and not limited in any way. Anyone can use SimpleOCR for free: home users, educational institutions, even corporate users.

If your documents have multi-column layouts, non-standard fonts, tables, poor quality or digital camera images, you will not have much success with applications based on free and open source engines like SimpleOCR and Tesseract. You will need a commercial OCR application to get an accurate read. Our OCR Guide compares desktop and […]

Simple Software

SimpleIndex can bring speed and efficiency to your scanning or doc filing no matter the process. Even if all you are doing is hand keying a few basic details about a document, breaking those details into individual indexes and adding tools like drop down choice lists, automatic orientation, and blank page deletion ensure a smoother, more consistent process.

Automation

Here’s where things start to get interesting. From basic tasks like splitting individual documents within a stack of pages by spotting a blank page, a specific mark, or a barcode separator to capturing index data directly from the page or looking up additional details about a document in a database, SimpleIndex has a host of powerful tools to tame your piles of paper or drives full of digital files. Let’s look at a few.

OCR

Optical Character Recognition is the ability to take a scan, which is merely a picture of a page, and turn it into words that the computer can understand and use to index your files. SimpleIndex leverages the power of ABBYY FineReader, recognized as one of the best OCR engines on the market, to accurately capture names, dates, important numbers, document types, and other details about your file. Some products have you set a box and capture whatever information happens to fall in that zone. SimpleIndex takes it further with Dynamic Zone OCR to enable you to set an oversized zone that allows for shifting of the pages between scans, but still captures just the data you need by matching against templates, lists, or even Regular Expressions (RegEx). You can also skip the zones entirely and use the full text of a page to find matches for your index data.
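The idea of capturing only the matching value from an oversized zone can be illustrated with a small Python sketch. It is not SimpleIndex's implementation: the date format, the pattern, and the function name are assumptions made for the example, and the zone text is presumed to have already been produced by an OCR engine.

```python
import re

# Hypothetical index format: a date written as MM/DD/YYYY.
DATE_PATTERN = re.compile(r"\b\d{1,2}/\d{1,2}/\d{4}\b")


def extract_index_value(zone_text, pattern=DATE_PATTERN):
    """Return the first value in the zone text that matches the pattern."""
    match = pattern.search(zone_text)
    return match.group(0) if match else None
```

Because the pattern picks the value out of whatever text falls inside the zone, the zone can be drawn larger than the field itself and still tolerate pages that shift between scans.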

Barcodes

HealthCare Claims Forms Processing

Automating Medical Forms with OCR

Perform your claims processing faster, with fewer errors and less effort, using OCR.

Once the form is scanned, our software:

  • Electronically isolates and records information provided in the different fields
  • Transfers and fills that information
  • Checks for errors and flags those errors for easy verification.
  • Automatically generates the proper organization and format of the form to meet requirements
  • Accurately submits and stores your claim

By expediting claim processing and increasing accuracy, you can decrease the overall time it takes to process claims to ensure faster payments.

The open architecture and technology allow for flexibility to customize and enhance system operations to specifically fit your organization’s unique requirements and easily meet changing healthcare industry standards.

In coordination with our team of experienced professionals, we can provide the critical expertise and effective ongoing support that you need to succeed in the ever-changing healthcare industry.

Other functions and benefits include:

  • Index forms for instant searching and viewing
  • Reduce the risk of combining redundant member information
  • Customized to specifically fit your company’s requirements
  • Improve exception handling
  • Maintains both professional and institutional claim types with all required information
  • Reduce errors
  • Increase accuracy, productivity, and profitability
  • Predefined templates for fast implementation

The forms CMS 1500 (formerly HCFA 1500) and UB-04 are the most common medical forms used to submit a claim. The form CMS 1500 is the universal claim form used by non-institutional healthcare providers, such as private practices, to bill specific covered services. The form UB-04 is very similar to the CMS 1500, but it is used by institutional healthcare providers, such as hospitals, to submit claims. […]

Handprint Recognition Guide

What is ICR, Handprint Recognition?

ICR stands for Intelligent Character Recognition and is the technology that allows software to interpret hand printed text on scanned images.

Forms Processing Software uses ICR technology to automate data entry tasks involving hand-filled surveys, applications and forms. It provides interfaces for scanning, recognition, data verification and export, as well as management and monitoring tools to track large volumes of documents and data through the workflow.

Forms Processing also includes OCR (Optical Character Recognition) technology to recognize machine printed text, and OMR (Optical Mark Recognition) for check boxes and multiple choice bubbles.

Traditional forms processing relies on constrained handwriting, where boxes on the form force the filler to write with separated, printed block characters. Modern AI technology has dramatically improved the ability to recognize unconstrained handwriting and cursive script. Hand printed notes, free-form comment blocks, non-segmented fields, historic documents, and more can now be converted to text with acceptable accuracy where these were impossible just a few years ago.

Who can benefit from handwritten recognition software?

Any organization that collects data on paper-based forms, surveys or applications on a regular basis can get a very high return on investment by automating the data entry with forms processing software.

You do need to have a significant number of forms to justify the expense, at least a hundred forms per month or more depending on how much data is being captured. If the data entry task can be done in under 25 working hours then it is probably not a good candidate for automation with ICR software.

Organizations that have many separate departments that collect data on forms can share the budget for forms processing software by re-using it for other projects. Your current project may not be big enough to justify […]

OCR Data Capture

What is OCR Data Capture?

OCR stands for Optical Character Recognition and is the technology that allows software to interpret text on scanned images. When this technology is applied to automating business data entry processes it’s referred to as OCR Data Capture.

Many are familiar with popular desktop OCR applications designed to convert scanned images to editable documents. When this process is applied to specific areas of the document containing data fields it’s called zone OCR. But OCR data capture software is more than just simple zone OCR. Modern applications use some or all of these technologies:

Enterprise data capture systems provide interfaces for scanning, recognition, data verification and export, as well as management and monitoring tools to track large volumes of documents and data through the workflow.

Who can benefit from OCR data capture software?

Any organization that collects data from paper documents, or electronic files like PDF and Office documents, can get a very high return on investment by automating the data entry with OCR data capture software.

You do need to have a significant number of documents to […]

OCR Consulting Services

OCR Experts for Any Project

Our unique team of OCR experts is equipped to help out with OCR projects of any size or complexity. We have support specialists that can remotely configure desktop solutions in a matter of minutes and expert systems integrators with years of programming, database design, and robotic process automation experience.

Desktop OCR

Use our online store to order desktop OCR applications and our staff will be happy to answer your setup questions via email or web chat.

Remote configuration and training services using GotoMeeting are available for a low hourly rate.

Let Us OCR That For You

Got a one-time conversion and don’t want to hassle with software? Upload your scanned document to us and we’ll send back the converted files. An optional verification service corrects recognition errors and layout issues for a low hourly rate.

Data processing for forms, reports, directories, and other documents is also available with output to CSV, Excel, XML, JSON, SQL, etc.

Contact us and, if possible, provide a sample, the total page count, the desired output, and whether you want us to correct the results after OCR. We’ll reply with a quote right away. Prices start at $50 for up to 1,000 pages.

Batch Scanning & OCR Servers

Automate document scanning and digital document archival processes using zone OCR, barcode recognition, database integration and other technologies.

Small business systems and single document workflows can be set up remotely via GotoMeeting, usually in just a few hours. Chat now if we’re online or leave a message to schedule a consultation.

Data Capture and Forms Processing

Advanced data extraction solutions that can turn the most complex documents into structured data ready […]

ABBYY FlexiCapture On-Premise

ABBYY FlexiCapture On-Premise – Distributed – Perpetual License PPY 50K Pages

ABBYY FlexiCapture is a powerful data capture and document processing solution from a world-leading technology vendor. It is designed to transform streams of documents of any structure and complexity into business-ready data. And its award-winning recognition technologies, automatic document classification, plus a highly scalable and customizable architecture, mean that it can help companies and organizations of any size to streamline their business processes, increase efficiency and reduce costs.

ABBYY FlexiCapture Cloud


ABBYY FlexiCapture Cloud delivers ABBYY’s advanced data capture platform capabilities via REST API and web interfaces. ABBYY FlexiCapture Cloud customers can rapidly configure and deliver their Content IQ solution, taking advantage of our cloud services to automate and accelerate their document-driven processes. The advanced machine learning and AI in the platform improve classification and data extraction results, enabling core processes to support better, smarter, faster decisions.

FlexiCapture Cloud enables organizations to accelerate digital transformation by complementing their automation systems with new and advanced cognitive capabilities that liberate the intelligence locked in their documents.

PaperVision Capture Forms Magic

PaperVision Capture Forms Magic adds handwriting recognition, forms processing, invoice processing, healthcare claims forms templates, and business rules to the PaperVision Capture high-volume document scanning and data capture platform.

ABBYY Vantage

ABBYY Vantage leverages AI machine learning and a huge library of document “skills” to provide out-of-the-box data capture for all kinds of documents.

Vantage provides a simple way to implement new data capture processes without the need for programmers.

It takes the FlexiCapture platform, hosts it in the cloud, and dramatically simplifies the interface. The thousands of settings you can use with FlexiCapture to build templates are managed by the AI, giving you a simple point and click interface to create new document capture workflows.

The “Skills” library gives you pre-configured capture workflows for hundreds of the most common documents. Simply connect them to your import and export destinations and you are ready to go, saving you hours or even days of development time.

Remark Office OMR Software

Data collection and analysis software for surveys, tests and other plain paper forms. You create your own forms that are scanned with an image scanner or copier. Remark Office OMR product has been used to scan and process billions of forms. Remark gives you the tools you need to get your results quickly. Through years of customer feedback, we’ve carefully designed our products to be user-friendly while providing a rich feature set to satisfy the specific needs of individuals like you.

Remark Test Grading

Remark Test Grading is an easy-to-use solution to quickly grade online and paper tests, saving you time and money. Remark Test Grading Cloud allows busy instructors to quickly create and grade tests in the cloud so they can get more accomplished with less. With just a few clicks of the mouse, instructors can create an online test or a printable test answer sheet to be distributed to their students.

SimpleIndex Cloud OCR

SimpleIndex Cloud OCR adds Amazon AWS Textract OCR to any SimpleIndex workstation or server license.

Textract capabilities include the most accurate OCR and handprint recognition available, automatic form field detection, and accounts payable invoice and receipt processing.

Amazon Textract is only available as an API that requires custom programming to make it work. SimpleIndex turns it into a complete document and data capture application designed for easy batch processing on a workstation or server.

Requires an AWS account. Standard Textract transaction fees will apply.

Grooper Document Processing

Grooper was built from the ground up by BIS, a company with 35 years of continuous experience developing and delivering new technology. Grooper is an intelligent document processing and digital data integration solution that empowers organizations to extract meaningful information from paper/electronic documents and other forms of unstructured data.

The platform combines patented and sophisticated image processing, capture technology, machine learning, natural language processing, and optical character recognition to enrich and embed human comprehension into data. By tackling tough challenges that other systems cannot resolve, Grooper has become the foundation for many industry-first solutions in healthcare, financial services, oil and gas, education, and government.

  • Single platform
  • Patented OCR
  • Image processing
  • Machine learning
  • Natural language processing
  • Zero code
  • Zero templates
  • Open architecture
