The most comprehensive OCR SDK for software developers
Integrate AI-powered OCR features into your applications.
ABBYY FineReader Engine is a software development kit that lets developers build applications that extract text from paper documents, images, or screen content. This AI-powered OCR SDK gives your application text recognition, PDF conversion, and data capture functions, so it can convert scans into searchable PDF, Word, or Excel documents and access data in photos or screenshots.
Comprehensive set of recognition technologies
With the OCR toolkit integrated, applications can extract machine-printed text in over 200 languages, as well as hand-printed text, optical marks, and barcode values.
AI, ML and other advanced technologies provide outstanding recognition accuracy for multi-language documents and deliver searchable and editable documents that reflect their originals.
Powerful PDF processing tools
Versatile APIs allow processing many PDF types and converting scanned documents, TIFFs, JPEGs or image-only PDFs into different searchable PDF and PDF/A files.
Multi-core CPUs, Cloud and Virtual Machines support
Support for parallel document processing on multi-core CPUs, together with deployment in cloud and virtual environments, enables fast, flexible, and scalable processing.
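As a rough illustration of processing pages in parallel, the Python sketch below fans a batch of page images out to a worker pool. The function recognize_page is a hypothetical placeholder, not a FineReader Engine call; a real integration would invoke the SDK at that point and follow ABBYY's own guidance on threading.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import List, Optional
import os


def recognize_page(image_path: str) -> str:
    """Hypothetical stand-in for an OCR call; returns placeholder text."""
    return f"text of {os.path.basename(image_path)}"


def recognize_batch(image_paths: List[str],
                    max_workers: Optional[int] = None) -> List[str]:
    """Recognize pages concurrently; results are returned in input order."""
    workers = max_workers or os.cpu_count() or 1
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map preserves the order of image_paths in its results
        return list(pool.map(recognize_page, image_paths))


if __name__ == "__main__":
    for text in recognize_batch(["scan_001.tif", "scan_002.tif", "scan_003.tif"]):
        print(text)
```

Defaulting the pool size to the number of CPU cores is an assumption made for the sketch; the right setting depends on the workload and on the license in use.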
The ABBYY FineReader Engine software development kit offers advanced features for developers creating applications for Windows, Linux, and Mac OS platforms.
Advanced image pre-processing functions
Image pre-processing increases recognition accuracy by optimizing the image for OCR. Even low-quality images can yield good OCR results after de-skewing, rotation, distortion correction, text line straightening, page splitting, adaptive binarization, ISO noise reduction, and other image correction steps.
Recognition profiles for fast implementation
Predefined processing profiles contain pre-set parameters tuned for common recognition tasks. They can be used instead of manually fine-tuning the recognition system, which speeds up SDK integration.
Ready-made business card and MRZ recognition
Dedicated APIs such as the BCR API and the MRZ API recognize business cards and Machine Readable Zones (MRZ) in ID documents, passports, or visas. They provide access to individual field values with minimal programming effort.
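Once individual MRZ field values are available, an application may want to sanity-check them. The Python sketch below computes the check digit defined for machine readable travel documents in ICAO Doc 9303 (weights 7, 3, 1 repeated; digits count as themselves, A to Z as 10 to 35, and the filler '<' as 0). It is independent of FineReader Engine, and the function names are illustrative; the engine may already perform equivalent validation.

```python
def mrz_char_value(ch: str) -> int:
    """Map one MRZ character to its numeric value per ICAO Doc 9303."""
    if "0" <= ch <= "9":
        return int(ch)
    if "A" <= ch <= "Z":
        return ord(ch) - ord("A") + 10
    if ch == "<":
        return 0
    raise ValueError(f"invalid MRZ character: {ch!r}")


def mrz_check_digit(field: str) -> int:
    """Weighted sum with repeating weights 7, 3, 1, taken modulo 10."""
    weights = (7, 3, 1)
    total = sum(mrz_char_value(c) * weights[i % 3] for i, c in enumerate(field))
    return total % 10


def field_is_valid(field: str, check_digit: str) -> bool:
    """Compare a recognized field against its recognized check digit."""
    return check_digit.isdigit() and mrz_check_digit(field) == int(check_digit)


if __name__ == "__main__":
    # Values from the ICAO specimen passport
    print(mrz_check_digit("L898902C3"))   # 6
    print(field_is_valid("740812", "2"))  # True
```

A mismatch between the computed and recognized digit is a useful signal that a field should be routed to manual verification.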
Document classification with Machine Learning and NLP
Advanced classification algorithms in FineReader Engine leverage machine learning, OCR and natural language processing technologies, and enable applications to automatically categorize documents into different classes.
User interface components
Developers can use ABBYY’s ActiveX-based visual components to easily integrate a user interface into a variety of applications such as ERP, ECM, or DMS systems, so that users can validate documents, access the recognition results, and edit text directly in the application.
Sample code library
A comprehensive set of code samples shows how to use the OCR SDK, illustrates the different FineReader Engine technologies, and can be reused in development projects. Each library section contains code snippets accompanied by step-by-step instructions for the relevant task.
Detailed SDK documentation and support
Detailed documentation helps to streamline integration. Developers have access to additional information through ABBYY’s Knowledge Base, Developers’ Forum and Technology Portal. In addition to these resources, ABBYY technical support is available to you during your free trial.
Compare documents function
The new ‘Compare Documents’ function compares two versions of the same document and detects differences in their content. The documents can be in different formats, e.g. Microsoft Word, PDF, TIFF, JPEG, or any other supported input format.
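To give a sense of what comparing two versions involves once both have been converted to text, here is a minimal Python sketch using the standard library's difflib module. It is not ABBYY's comparison algorithm, and the function name and sample strings are made up for illustration.

```python
import difflib
from typing import List


def text_differences(old_text: str, new_text: str) -> List[str]:
    """Return a unified diff between two text versions of a document."""
    diff = difflib.unified_diff(
        old_text.splitlines(),
        new_text.splitlines(),
        fromfile="version_1",
        tofile="version_2",
        lineterm="",  # inputs have no trailing newlines, so add none
    )
    return list(diff)


if __name__ == "__main__":
    old = "Total: 100 EUR\nDue: 1 May"
    new = "Total: 120 EUR\nDue: 1 May"
    for line in text_differences(old, new):
        print(line)
```

Lines starting with '-' appear only in the first version and lines starting with '+' only in the second; identical inputs produce an empty result.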
FineReader’s on-premise OCR SDK lets you integrate OCR directly into your desktop or server applications.
Licensing is based on the number of developers for the base SDK, and on annual page counts for the perpetual license. Each optional component increases the per-page cost slightly.
Optional components include:
- Language pack adds Chinese, Japanese, Korean, Thai, Vietnamese, Gothic Fraktur, Arabic, Farsi and Hebrew languages
- Handprint recognition (ICR) in 126 languages
- Zone OCR (counts as 1/5 page per field)
- Check mark recognition
- ID Card recognition
- Business card recognition
- Barcode processing
- Machine learning document classification
- Document comparison module
- Data export formats: TXT, RTF, DOCX, XLSX, PPTX, PDF (with a text layer under the original image as well as ‘text and images’ format), PDF/A-1a, XML, ALTO, vCard, CSV
- Business process outsourcing support for service bureaus
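Because Zone OCR is counted at 1/5 of a page per field, it can help to estimate how much of an annual page allowance a workload would use. The Python sketch below does that arithmetic. It assumes that full pages and zone fields draw from the same page count and that no rounding is applied, neither of which is stated above, so treat the result as an estimate and confirm the details with the license terms.

```python
from fractions import Fraction

# From the list above: one Zone OCR field counts as 1/5 of a page
ZONE_FIELD_COST = Fraction(1, 5)


def pages_consumed(full_pages: int, zone_fields: int) -> Fraction:
    """Estimate page usage for full-page OCR plus Zone OCR fields.

    Assumes both draw from one shared page count with no rounding.
    """
    if full_pages < 0 or zone_fields < 0:
        raise ValueError("counts must be non-negative")
    return full_pages + zone_fields * ZONE_FIELD_COST


if __name__ == "__main__":
    # 1,000 full pages plus 2,500 zone fields (2,500 / 5 = 500 pages)
    print(pages_consumed(1000, 2500))  # 1500
```

Using exact fractions avoids floating-point drift when many small field counts are summed over a year.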