IronOcr 2022.1.0

Prefix Reserved

There is a newer version of this package available.
See the version list below for details.

dotnet add package IronOcr --version 2022.1.0

NuGet\Install-Package IronOcr -Version 2022.1.0

This command is intended to be used within the Package Manager Console in Visual Studio, as it uses the NuGet module's version of Install-Package.

<PackageReference Include="IronOcr" Version="2022.1.0" />

For projects that support PackageReference, copy this XML node into the project file to reference the package.

paket add IronOcr --version 2022.1.0

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

#r "nuget: IronOcr, 2022.1.0"

#r directive can be used in F# Interactive and Polyglot Notebooks. Copy this into the interactive tool or source code of the script to reference the package.

// Install IronOcr as a Cake Addin
#addin nuget:?package=IronOcr&version=2022.1.0

// Install IronOcr as a Cake Tool
#tool nuget:?package=IronOcr&version=2022.1.0

The NuGet Team does not provide support for this client. Please contact its maintainers for support.

Nuget Downloads

IronOCR is an advanced OCR (Optical Character Recognition) library for C# and .NET

IronOCR provides Tesseract OCR on Mac, Windows, Linux, Azure and Docker for:

.Net Framework 4.0 +
.Net Standard 2.0 +
.Net Core 2.0 +
.Net 5
Mono for MacOS and Linux
Xamarin for MacOS

IronOCR reads Text, Barcodes & QR from all major image and PDF formats using the latest Tesseract 5 engine. This library adds OCR functionality to Desktop, Console and Web applications in minutes.

IronOCR's Unique Features

Pure .Net OCR API
All OCR tasks run locally (no SAAS)
125 languages
Barcode & QR Code reading
Corrects low quality, noisy and distorted scans
Performance tuned above and beyond any other known build of Tesseract OCR.
Reads PDFs and multi-page TIFFs
Can save any OCR Scan to a searchable PDF document or XHTML

Data Output Options Include

Output Plain Text, Barcode Data and an OCR Result class containing paragraphs, lines, words, and characters.

International Language Support

125 Languages supported including Arabic, Chinese, English, Finnish, French, German, Hebrew, Italian, Japanese, Korean, Portuguese, Russian, Spanish... Custom language packs can also be created.

Licensing & Support available

For code examples, documentation & more visit http://ironsoftware.com/csharp/ocr/ Email: developers@ironsoftware.com

Get Started Code Example

string Result = new IronOcr.IronTesseract().Read("scan.pdf").Text;

Why C# developers choose IronOCR over Vanilla Tesseract:

Achieve 99.8%+ OCR accuracy without using external web services.
Includes for Tesseract 5 , 4 and 3 Engines out of the box.
Blazing Speed and MultiThreading
MVC, WebApp, Desktop, Console & Server Application compatible
No Exes or C++ code to work with
Full PDF OCR support
To perform OCR an almost any Image file or PDF
Full .Net Core, Standard and FrameWork support
Deploy on Windows, Mac, Linux, Azure, Docker, Lambda, AWS
Read barcodes and QR codes
Export OCR as to XHTML
Export OCR to searchable PDF documents
Multithreading support
125 international languages all managed via Nuget or OcrData files
Extract Images, Coordinates, Statistics and Fonts. Not just text.
Can be used to redistribute Tesseract OCR inside commercial & proprietary applications.
Supports:

IronOCR shines when working with real world images and imperfect documents such as photographs, or scans of low resolution which may have digital noise or imperfections. Other free OCR libraries for the .NET platform such other .Net Tesseract APIs and web services do not perform so well on these real world use cases.

OCR with Tesseract 5 - Start Coding in C#

These code examples below shows how easy it is to read text from an image using C# or VB .NET.

Configurable Hello World

using IronOcr;

var Ocr = new IronTesseract();
using (var Input = new OcrInput()){
    Input.AddImage("images/sample.jpeg")
    //... you can add any number of images
    var Result = Ocr.Read(Input);
    Console.WriteLine(Result.Text);
}

C# PDF OCR

The same approach can similarly be used to extract text from any PDF document.

var Ocr = new IronTesseract();
using (var input = new OcrInput())
{

    input.AddPdf("example.pdf","password");
    // We can also select specific PDF page numnbers to OCR
        
    var Result = Ocr.Read(input);

    Console.WriteLine(Result.Text);
    Console.WriteLine($"{Result.Pages.Count()} Pages");
    // 1 page for every page of the PDF
}

C# OCR Working Code Examples

Product	Compatible and additional computed target framework versions.
.NET	net5.0 was computed. net5.0-windows was computed. net6.0 was computed. net6.0-android was computed. net6.0-ios was computed. net6.0-maccatalyst was computed. net6.0-macos was computed. net6.0-tvos was computed. net6.0-windows was computed. net7.0 was computed. net7.0-android was computed. net7.0-ios was computed. net7.0-maccatalyst was computed. net7.0-macos was computed. net7.0-tvos was computed. net7.0-windows was computed. net8.0 was computed. net8.0-android was computed. net8.0-browser was computed. net8.0-ios was computed. net8.0-maccatalyst was computed. net8.0-macos was computed. net8.0-tvos was computed. net8.0-windows was computed.
.NET Core	netcoreapp2.0 was computed. netcoreapp2.1 was computed. netcoreapp2.2 was computed. netcoreapp3.0 was computed. netcoreapp3.1 was computed.
.NET Standard	netstandard2.0 is compatible. netstandard2.1 was computed.
.NET Framework	net40 is compatible. net403 was computed. net45 was computed. net451 was computed. net452 was computed. net46 was computed. net461 was computed. net462 was computed. net463 was computed. net47 was computed. net471 was computed. net472 was computed. net48 was computed. net481 was computed.
MonoAndroid	monoandroid was computed.
MonoMac	monomac was computed.
MonoTouch	monotouch was computed.
Tizen	tizen40 was computed. tizen60 was computed.
Xamarin.iOS	xamarinios was computed.
Xamarin.Mac	xamarinmac was computed.
Xamarin.TVOS	xamarintvos was computed.
Xamarin.WatchOS	xamarinwatchos was computed.

Compatible target framework(s)

Included target framework(s) (in package)

Learn more about Target Frameworks and .NET Standard.

.NETFramework 4.0
- No dependencies.
.NETStandard 2.0
- System.Drawing.Common (>= 5.0.3)

NuGet packages (132)

Showing the top 5 NuGet packages that depend on IronOcr:

Package	Downloads
IronOcr.Languages.German The IronOCR engine adds OCR (Optical Character Recognition) functionality to Web, Desktop, and Console applications. IronOCR reads Barcode and QR codes. Ocr Dictionaries in this package: * German * GermanBest * GermanFast * GermanFraktur ==================================== Deutschsprachige OCR in C# & .NET. Optimierte C# Tesseract 5 OCR in einer eigenständigen .NET OCR-API. Konvertiert Scannerdokumente, Bilder und PDF in Text. C# & VB Beispiele: https://ironsoftware.com/csharp/ocr/languages/ ==================================== This package installs IronOCR and also German support including: * German (also known as Deutsch) OCR for screenshots, cameras, images files, tiffs and PDFs in .NET * Custom OCR that can significantly out-perform Tesseract CLI on real world documents * Can read scans with distortion, skewing, low resolution & contrast, and digital noise * Also supports Tesseract 3, 4 and 5 in German * Support for 125 total international languages available Additional Features Include: * Barcode & QR Reading * Output of searchable, search-engine indexable PDF documents * Inspect fonts, headings, paragraphs, lines, words, and characters as structured data Supports: * .NET Framework (4.5+) * .NET Core (2.0+) * .NET Standard (2.0+) Works on: * Windows * MacOS * Linux * Docker * Azure and other Cloud hosting platforms * Web, Console, WinForms, WPF and Services Reads: - Images - TIFFS - PDFs - Screenshots - Scans - Barcodes - QR codes Commercial support available. Email: developers@ironsoftware.com C# & VB Examples: https://ironsoftware.com/csharp/ocr/languages/	101.9K
IronOcr.Languages.Japanese Japanese Language pack for the IronOCR C# and VB.Net OCR library. Reads Japanese language text from images and PDFs in .NET. Ocr Dictionaries in this package: * JapaneseAlphabet * JapaneseAlphabetBest * JapaneseAlphabetFast * JapaneseVerticalAlphabet * JapaneseVerticalAlphabetBest * JapaneseVerticalAlphabetFast * Japanese * JapaneseBest * JapaneseFast * JapaneseVertical * JapaneseVerticalBest * JapaneseVerticalFast This package installs IronOCR and also Japanese support including: * Japanese (also known as 日本語 (にほんご)) OCR for screenshots, cameras, images files, tiffs and PDFs. * Custom OCR that significantly outperforms Tesseract on real world documents. * Can read scans with distortion, skewing, low resolution & contrast, and digital noise. * Also supports Tesseract 3, 4 and 5 in Japanese. * Support for 122 other languages also available Additional Features Include: * Barcode & QR Reading * Output of searchable, search-engine indexable PDF documents * Inspect fonts, headings, paragraphs, lines, words, and characters as structured data Supports: * .NET Framework(4.5 +) * .NET CORE(2.0 +) * .NET Standard(2.0 +) Works on: Windows MacOS * Linux * Docker * Azure and other Cloud hosting platforms * Web, Console, WinForms, WPF and Services Reads: * Images * TIFFS * PDFs * Screenshots * Camera Input * Scans * Barcodes * QR codes This package also installs: https://www.nuget.org/packages/IronOcr/ For product and licensing support please email us at developers@ironsoftware.com ====== C# と .NET での日本語 OCR スタンドアロン .NET OCR API で最適化された C# Tesseract 5 OCR。スキャナーのドキュメント、画像、PDF をテキストに変換します。 C# と VB の例: https://ironsoftware.com/csharp/ocr/languages/Japanese/	87.6K
IronOcr.Languages.Spanish The IronOCR engine adds OCR (Optical Character Recognition) functionality to Web, Desktop, and Console applications. IronOCR reads Barcode and QR codes. Ocr Dictionaries in this package: * Spanish * SpanishBest * SpanishFast * SpanishOld * SpanishOldBest * SpanishOldFast ==================================== OCR en español en C# y .NET. C# Tesseract 5 OCR optimizado en una API de .NET OCR independiente. Convierte documentos, imágenes y PDF del escáner en texto. Ejemplos de C# y VB: https://ironsoftware.com/csharp/ocr/languages/ ==================================== This package installs IronOCR and also Spanish support including: * Spanish (also known as Español, Castellano) OCR for screenshots, cameras, images files, tiffs and PDFs in .NET * Custom OCR that can significantly out-perform Tesseract CLI on real world documents * Can read scans with distortion, skewing, low resolution & contrast, and digital noise * Also supports Tesseract 3, 4 and 5 in Spanish * Support for 125 total international languages available Additional Features Include: * Barcode & QR Reading * Output of searchable, search-engine indexable PDF documents * Inspect fonts, headings, paragraphs, lines, words, and characters as structured data Supports: * .NET Framework (4.5+) * .NET Core (2.0+) * .NET Standard (2.0+) Works on: * Windows * MacOS * Linux * Docker * Azure and other Cloud hosting platforms * Web, Console, WinForms, WPF and Services Reads: - Images - TIFFS - PDFs - Screenshots - Scans - Barcodes - QR codes Commercial support available. Email: developers@ironsoftware.com C# & VB Examples: https://ironsoftware.com/csharp/ocr/languages/	82.8K
IronOcr.Languages.French The IronOCR engine adds OCR (Optical Character Recognition) functionality to Web, Desktop, and Console applications. IronOCR reads Barcode and QR codes. Ocr Dictionaries in this package: * French * FrenchBest * FrenchFast ==================================== OCR de langue française en C# & .NET. OCR C# Tesseract 5 optimisé dans une API OCR .NET autonome. Convertit les documents du scanner, les images et les PDF en texte. Exemples C# et VB: https://ironsoftware.com/csharp/ocr/languages/ ==================================== This package installs IronOCR and also French support including: * French (also known as Français, Langue Française) OCR for screenshots, cameras, images files, tiffs and PDFs in .NET * Custom OCR that can significantly out-perform Tesseract CLI on real world documents * Can read scans with distortion, skewing, low resolution & contrast, and digital noise * Also supports Tesseract 3, 4 and 5 in French * Support for 125 total international languages available Additional Features Include: * Barcode & QR Reading * Output of searchable, search-engine indexable PDF documents * Inspect fonts, headings, paragraphs, lines, words, and characters as structured data Supports: * .NET Framework (4.5+) * .NET Core (2.0+) * .NET Standard (2.0+) Works on: * Windows * MacOS * Linux * Docker * Azure and other Cloud hosting platforms * Web, Console, WinForms, WPF and Services Reads: - Images - TIFFS - PDFs - Screenshots - Scans - Barcodes - QR codes Commercial support available. Email: developers@ironsoftware.com C# & VB Examples: https://ironsoftware.com/csharp/ocr/languages/	71.1K
IronOcr.Languages.Arabic The IronOCR engine adds OCR (Optical Character Recognition) functionality to Web, Desktop, and Console applications. IronOCR reads Barcode and QR codes. Ocr Dictionaries in this package: * Arabic * ArabicBest * ArabicFast * ArabicAlphabet * ArabicAlphabetBest * ArabicAlphabetFast ==================================== OCR للغة العربية في C# & .NET. محسن C# Tesseract 5 OCR في .NET OCR API مستقل. يحول مستندات الماسح الضوئي والصور و PDF إلى نص. أمثلة على C# و VB: https://ironsoftware.com/csharp/ocr/languages/ ==================================== This package installs IronOCR and also Arabic support including: * Arabic (also known as العربية) OCR for screenshots, cameras, images files, tiffs and PDFs in .NET * Custom OCR that can significantly out-perform Tesseract CLI on real world documents * Can read scans with distortion, skewing, low resolution & contrast, and digital noise * Also supports Tesseract 3, 4 and 5 in Arabic * Support for 125 total international languages available Additional Features Include: * Barcode & QR Reading * Output of searchable, search-engine indexable PDF documents * Inspect fonts, headings, paragraphs, lines, words, and characters as structured data Supports: * .NET Framework (4.5+) * .NET Core (2.0+) * .NET Standard (2.0+) Works on: * Windows * MacOS * Linux * Docker * Azure and other Cloud hosting platforms * Web, Console, WinForms, WPF and Services Reads: - Images - TIFFS - PDFs - Screenshots - Scans - Barcodes - QR codes Commercial support available. Email: developers@ironsoftware.com C# & VB Examples: https://ironsoftware.com/csharp/ocr/languages/	70.4K

GitHub repositories

This package is not used by any popular GitHub repositories.

Version	Downloads	Last updated
2024.11.3	1,082	11/11/2024
2024.10.3	9,289	10/3/2024
2024.9.6	8,639	9/9/2024
2024.8.21	16,084	7/31/2024
2024.7.13	10,307	7/10/2024
2024.6.3	17,390	6/4/2024
2024.5.25	13,088	4/29/2024
2024.4.6	13,979	4/5/2024
2024.3.4	24,155	3/8/2024
2024.2.41	28,698	1/29/2024
2024.1.17	14,108	12/29/2023
2023.12.34	116,028	11/27/2023
2023.11.35	13,197	10/27/2023
2023.10.9	13,229	9/26/2023
2023.9.4	7,377	9/8/2023
2023.8.34	40,708	8/1/2023
2023.7.28	54,399	7/3/2023
2023.6.6	49,830	5/29/2023
2023.5.35	21,925	5/1/2023
2023.4.13	19,662	4/6/2023
2023.3.2	34,461	3/1/2023
2023.1.11644	24,693	1/18/2023
2022.12.10830	32,085	12/5/2022
2022.11.10109	70,031	10/26/2022
2022.10.9390	15,890	9/27/2022
2022.8.8198	50,480	8/18/2022
2022.8.7804	80,034	7/26/2022
2022.3.0	191,417	3/10/2022
2022.1.0	54,480	1/17/2022
2021.12.0	22,225	12/21/2021
2021.11.0	760,020	10/29/2021
2021.9.0	27,822	8/24/2021
2021.6.0	33,675	6/24/2021
2021.2.1	41,293	2/24/2021
2020.12.2	32,266	12/14/2020
2020.11.2	147,588	11/13/2020
4.4.0	305,776	6/21/2018
4.3.0.1	37,207	4/9/2018
4.2.2.51	5,658	1/22/2018
4.2.2.1	4,537	12/1/2017
4.2.1.5	5,732	9/9/2017
4.1.1	9,841	8/4/2017
4.0.10	3,125	1/12/2017
4.0.9	2,772	12/20/2016

* Improved: Multithreading & Performance
* Improved: PDF OCR performance
* Feature: OCR Progress Tracking
* Improved: Orientation and Script Detection (OSD)
* Improved: Deskew Methods
* Improved: Secuirty & Performance

Total 2.5M

Current version 54.5K

Per day average 881