- Overview
- Document Understanding Process
- Quickstart tutorials
- Framework components
- Digitization overview
- Digitization related activities
- OCR engines
- ML packages
- Overview
- Document Understanding - ML package
- DocumentClassifier - ML package
- ML packages with OCR capabilities
- 1040 - ML package
- 1040 Schedule C - ML package
- 1040 Schedule D - ML package
- 1040 Schedule E - ML package
- 4506T - ML package
- 990 - ML Package - Preview
- ACORD125 - ML package
- ACORD126 - ML package
- ACORD131 - ML package
- ACORD140 - ML package
- ACORD25 - ML package
- Bank Statements - ML package
- Bills Of Lading - ML package
- Certificate of Incorporation - ML package
- Certificate of Origin - ML package
- Checks - ML package
- Children Product Certificate - ML package
- CMS 1500 - ML package
- EU Declaration of Conformity - ML package
- Financial Statements - ML package
- FM1003 - ML package
- I9 - ML package
- ID Cards - ML package
- Invoices - ML package
- Invoices Australia - ML package
- Invoices China - ML package
- Invoices India - ML package
- Invoices Japan - ML package
- Invoices Shipping - ML package
- Packing Lists - ML package
- Passports - ML package
- Payslips - ML package
- Purchase Orders - ML package
- Receipts - ML package
- Remittance Advices - ML package
- UB04 - ML package
- Utility Bills - ML package
- Vehicle Titles - ML package
- W2 - ML package
- W9 - ML package
- Other Out-of-the-box ML Packages
- Public Endpoints
- Hardware requirements
- Pipelines
- Document Manager
- OCR services
- Deep Learning
- Document Understanding deployed in Automation Suite
- Install and use
- First run experience
- Deploy UiPathDocumentOCR
- Deploy an out-of-the-box ML package
- Offline bundles 2023.10.12+patch1
- Offline bundles 2023.10.12
- Offline bundles 2023.10.11
- Offline bundles 2023.10.10
- Offline bundles 2023.10.9
- Offline bundles 2023.10.8
- Offline bundles 2023.10.7+patch1
- Offline bundles 2023.10.7
- Offline bundles 2023.10.6
- Offline bundles 2023.10.5
- Offline bundles 2023.10.4
- Offline bundles 2023.10.3
- Offline bundles 2023.10.2
- Offline bundles 2023.10.1
- Offline bundles 2023.10.0
- Use Document Manager
- Use the Framework
- Document Understanding deployed in AI Center standalone
- Licensing
- Activities
- UiPath.Abbyy.Activities
- UiPath.AbbyyEmbedded.Activities
- UiPath.DocumentProcessing.Contracts
- UiPath.DocumentUnderstanding.ML.Activities
- UiPath.DocumentUnderstanding.OCR.LocalServer.Activities
- UiPath.IntelligentOCR.Activities
- UiPath.OCR.Activities
- UiPath.OCR.Contracts
- UiPath.OmniPage.Activities
- UiPath.PDF.Activities

Document Understanding user guide
Digitization related activities
Framework components
Digitize Document
Digitizes a document, extracting its Document Object Model (DOM) and text and storing them in their corresponding variable types. More details here.
OCR engines
UiPath Extended Languages OCR
Extracts a string and its information from an indicated UI element or image by using the OCR engine. Visit UiPath Extended Languages OCR for more information.
OCR for Chinese, Japanese, Korean
The UiPath Chinese, Japanese, Korean OCR will be deprecated starting with January 2025. We recommend using the UiPath Extended Languages OCR instead. Check the deprecation timeline for more information about upcoming deprecations and removals.
Extracts a string and its information from an indicated UI element or image by using the OCR engine. Visit OCR for Chinese, Japanese, Korean for more information.
UiPath Document OCR
Extracts a string and associated information about the textual content of document images. More details here.
OmniPage OCR
Extracts a string and its information from an indicated UI element or image using OmniPage OCR Engine. More details here.
Google Cloud Vision OCR
Extracts a string and its information from an indicated UI element or image using the Google Cloud OCR engine. More details here.
Microsoft Azure Computer Vision OCR
Extracts a string and its information from an indicated UI element or image by using the Microsoft Azure Computer Vision OCR engine. More details here.
Microsoft OCR
Extracts a string and its information from the provided image. Microsoft OCR activity uses the Windows 10 built-in OCR, if available, otherwise, it resumes to the default MODI OCR Engine. More details here.
Tesseract OCR
Extracts a string and its information from an indicated UI element or image using Tesseract OCR Engine. More details here.