OCR SDK Technology

Home / LEADTOOLS Products / OCR SDK Technology

OCR SDK Technology

LEADTOOLS provides fast and highly accurate Optical Character Recognition SDK technology for .NET (C# & VB), C/C++, iOS, macOS, Linux, Java, and web developers. Leverage the high-level LEADTOOLS OCR toolkit to rapidly develop robust, scalable, and high-performance recognition and document processing applications that extract text from scanned documents and convert images to text-searchable formats such as PDF, PDF/A, DOC, DOCX, XML, and XPS.

The advanced OCR SDK technology in LEADTOOLS is multi-faceted and can be used as a standalone feature as well as the driving force behind more advanced technologies such as forms recognition, check recognition and document conversion. On its own, programmers can write as few as three lines of code to convert an image to text-searchable documents.

With extensive support for more than forty character sets, programmers can expand their customer base by providing the same solution for many languages, including English, Spanish, French, German, Japanese, Chinese, and Arabic.

LEADTOOLS Product Comparison

Free Live Online Demo – Click here to see LEADTOOLS OCR in action

Overview of LEADTOOLS OCR SDK Technology

Speed and Reliability

Fast, accurate, and reliable optical character recognition for use in any application and environment
- Large volume document batch processing
- Single and multi-page documents
- Tablet and mobile devices
- Web and Cloud applications
Utilise multiple cores for unparalleled performance

Versatility

Fully featured SDK
- High-level classes for one-shot recognition from image to final document
- Low-level functionality for full customisation
Supports multiple text recognition engines
- OCR for machine printed text
- ICR for hand written text
- MICR for check processing
- MRZ & MRP for passport numbers
Fully configurable recognition engine
- Narrow down possible results with character and numeric filters
- Multiple voting techniques for enhanced accuracy
- Trace and progress callbacks
- Enable/disable font characteristics for fast text only recognition
Comprehensive results reporting
- Character location, size and baseline
- Character attributes (end of word, end of line, end of paragraph, etc.)
- Font properties (monospace, proportional, serif, sans-serif, bold, italic, underline, strikethrough)
- Confidence values
- Obtain the recognised words directly for a zone or page without saving to an external document
Integrates seamlessly with LEADTOOLS Forms Recognition and Processing
Implement large-scale distributed OCR applications using LEADTOOLS Distributed Computing SDK

Cross-Platform

Native, full-featured Optical Character Recognition libraries for many platforms
- Windows desktops, servers, and Phones
- iOS and macOS
- Android
- Web Services
Native mobile libraries run on the device without need to call external services
Utilise camera preview on phones and tablets for real-time text recognition and processing
Specialised image processing and recognition for deformations unique to mobile devices
- 3D Perspective Deskew
- Keystone Correction
- Fixed-focus camera optimisations
Programming interfaces closely resemble each other, making it easy to port an application to multiple platforms

Accuracy

Spell checking dictionary support
Automatically detect, segment, and recognize multiple languages on the same document
Full-page analysis and Zonal recognition
- Automatic table area segmentation
- Automatic OMR area segmentation
- Automatic vertical text segmentation with its orientation angle (i.e. 90, 270 or 0)
- Automatic segmentation of vertical text within horizontal text pages
Automatic document cleanup
- Omni-directional noise removal
- Undither text
- Dot matrix correction
- Option to remove lines from tables
Automatic document preprocessing
- Deskew of scanned document
- Detect and correct the orientation of the document (flipped or reversed) with multi or single-page modes
- Remove borders
- Split pages
Unique colour and bitonal image recognition for scanned documents and pictures can detect text regardless of foreground/background colours
Output searchable text document formats such as PDF, PDF/A, DOC, DOCX, XML, XPS, and more, maintaining the original look and feel
- Text with detected font characteristics (font-family name, style, size, bold, italic, underline, strikeout, slope angle, etc.)
- Location
- Tables reconstruction
- Layout
- Graphics

TIFF to Searchable PDF in Three Lines of Code

The LEADTOOLS OCR SDK provides a high-level programming interface that allows developers to create complex recognition applications in record time. For example, with the AutoRecognizeManager, developers can convert any of 150+ image formats into a text-searchable format such as PDF or DOC in as little as three lines of code!

OCR Languages

Recognise text from more than 40 languages and character sets.

LEADTOOLS OCR Engines

The LEADTOOLS OCR SDK contains the following recognition engines:

LEAD OCR Engine

Developed in-house with native libraries for Windows x86/x64, .NET (C# & VB), Linux, iOS, macOS, and Android.

Benefits of In-House Development:

New features and enhancements to speed and accuracy are continuously added
Customer feedback and feature requests are used to enhance and expand the engine with solutions to real-world applications

OmniPage OCR Engine

Licenses the Nuance Omnipage recognition engine and includes native libraries for Windows x86/x64 and .NET.

Interchangeable High-level Interface

LEADTOOLS features a high-level interface that abstracts and simplifies the use of multiple recognition engines with only a single line of code.

// Use LEAD engine
IOcrEngine ocrEngine = OcrEngineManager.CreateEngine(OcrEngineType.LEAD false);
// Use OmniPage engine
IOcrEngine ocrEngine = OcrEngineManager.CreateEngine(OcrEngineType.OmniPage, false);

TIFF to Searchable PDF in Three Lines of Code

IOcrEngine ocrEngine = OcrEngineManager.CreateEngine(OcrEngineType.LEAD, false);
ocrEngine.Startup(null, null, null, null);
ocrEngine.AutoRecognizeManager.Run(@"C:\in.tif", @"C:\out.pdf", 
    DocumentFormat.Pdf, null, null);

LEADTOOLS ICR Module – OmniPage Engine

The LEADTOOLS ICR Module – OmniPage Engine is an add-on product for integrating intelligent character recognition (ICR) and optical character recognition (OCR) technology into applications and provides everything programmers need to develop robust, high-performance, scalable image-recognition solutions. LEADTOOLS ICR Module – OmniPage Engine includes the OmniPage Engine and seamlessly combines with LEADTOOLS SDKs in the Document and Medical product lines.

The OmniPage OCR engine is a 3rd-party engine packaged and licensed for use within LEADTOOLS. It is an interchangeable alternative to the in-house developed LEAD OCR engine. LEADTOOLS ICR Module – OmniPage Engine is the only LEADTOOLS product to provide ICR.

LEADTOOLS OCR Module – OmniPage Engine with Additional Languages

LEADTOOLS OCR Module – OmniPage Engine with Additional Languages is an add-on product for integrating Arabic, Chinese, Japanese, and Korean optical character recognition (OCR) technology into applications and provides everything programmers need to develop robust, high performance, scalable image recognition solutions. LEADTOOLS OCR Module – OmniPage Engine with Additional Languages seamlessly integrates with LEADTOOLS SDKs in the Document and Medical product families.

The OmniPage OCR engine is a 3rd-party OCR engine packaged and licensed for use within LEADTOOLS and is an interchangeable alternative to the in-house developed LEAD OCR engine. LEADTOOLS OCR Module – OmniPage Engine with Additional Languages extends the OmniPage OCR engine to add Arabic and Asian language character set support. LEADTOOLS OCR Module – OmniPage Engine with Additional Languages is one OCR product that provides support for Asian languages within the Document Imaging Family. The LEAD OCR engine, which is included with LEADTOOLS Recognition Imaging and LEADTOOLS Document Imaging Suite products also provides recognition of Asian language character sets.