UiPath Documentation
document-understanding
2023.10
false
UiPath logo, featuring letters U and I in white

Document Understanding user guide

Last updated Apr 6, 2026

ML Packages Offline Installation

Getting started

Depending on the models you want to use, you need the following:

  • For models 2022.10 and newer:
    • Download the needed Document UnderstandingTM bundle. Here are the links for all the available bundles. The du bundle contains information about all models included into a specific version. For example, the dusemistructured-2023.10.0.tar.gz contains information about all out-of-the-box pre-trained ML Packages included in the 2023.10.0 version.
  • For models 2022.4 and older (python37duv3 and python37duv4):
    • All ML Packages are provided as a .zip file which is uploaded directly as a Custom Package in AI Center. To download the models, contact your Account Manager, CSM, or Support to receive a download link per package.
    • Download the needed Document Understanding bundle. Here are the links for all the available bundles.

Install the offline bundle

Offline installations are requiring that the downloaded du bundle to be renamed in the command line into du-ondemand.tar.gz. For instance, if you downloaded the du bundle named dusemistructured-2023.10.0.tar.gz, at installation time you need to rename it as du-ondemand.tar.gz.

  1. For Windows machines, directly download through the bundle link and rename the file to du-ondemand.tar.gz

  2. For Linux machines, from the machine having access to the internet, download the needed bundle following the below command:

    wget -O ~/<bundle-name.tar.gz> 'bundle-link'
    wget -O ~/<bundle-name.tar.gz> 'bundle-link'
    

    Here's an example of how to download the needed bundle for Linux:

    wget -O ~/du-ondemand.tar.gz 'https://download.uipath.com/automation-suite/2023.10.0/dusemistructured-2023.10.0.tar.gz'
    wget -O ~/du-ondemand.tar.gz 'https://download.uipath.com/automation-suite/2023.10.0/dusemistructured-2023.10.0.tar.gz'
    
  3. Copy the following bundle to the /uipath/tmp folder on the main machine of the cluster (where the install took place):

    scp ~/<bundle-name.tar.gz> <username>@<node dns>:/uipath/tmp/
    scp ~/<bundle-name.tar.gz> <username>@<node dns>:/uipath/tmp/
    
  4. Connect to this main machine and load the bundle:

    ./configureUiPathAS.sh registry upload --optional-offline-bundle "/uipath/tmp/du.tar.gz" --offline-tmp-folder "/uipath/tmp"
    ./configureUiPathAS.sh registry upload --optional-offline-bundle "/uipath/tmp/du.tar.gz" --offline-tmp-folder "/uipath/tmp"
    

Upload the model to AI Center

After downloading and installing the models, follow the steps described in the ML packages offline installation page from the AI Center User Guide to upload them to AI Center. Both ML package zip and metadata json files are needed for this procedure.

Form Extractor and Intelligent Keyword Classifier

Access Form Extractor and Intelligent Keyword Classifier, with the following public URL:

  • <FQDN>/du_/svc/formextractor
  • <FQDN>/du_/svc/intelligentkeywords
Note:

When using a public URL, replace the <FQDN> placeholder with the actual environment information.For example <FQDN>/du_/svc/formextractor becomes https://servicefabricserver.domain.com/du_/svc/formextractor when used in a workflow.

Upload a Document Understanding™ bundle to an external docker registry

Follow these steps to upload a Document Understanding bundle to your external docker registry:

  • Pull the desired DU image from the UiPath® registry hosted on registry.uipath.com
  • Rename the image host as per your docker registry name.
  • Push the image to external docker registry

Pull the desired Document Understanding images

Pull the images from UiPath® registry by running these commands:

docker pull <uipath_registry_server>/<image_name>
docker pull <uipath_registry_server>/<image_name>

Here's an example of how to pull the image of the UiPathDocumentOCR bundle from a registry called registry.uipath.com:

docker pull registry.uipath.com/aicenter/du-doc-ocr:v23.10.0
docker pull registry.uipath.com/aicenter/du-doc-ocr:v23.10.0

Rename the image host

Rename the image host by running the following command:

docker tag <uipath_registry_server>/<image_name> <your_registry_server>/<image_name>
docker tag <uipath_registry_server>/<image_name> <your_registry_server>/<image_name>

Here's an example of how to rename the image of the UiPathDocumentOCR bundle from a registry called registry.uipath.com to a registry called registory.mycompany.com:

docker tag registry.uipath.com/aicenter/du-doc-ocr:v23.4.0-rc70 registory.mycompany.com/aicenter/du-doc-ocr:v23.10.0
docker tag registry.uipath.com/aicenter/du-doc-ocr:v23.4.0-rc70 registory.mycompany.com/aicenter/du-doc-ocr:v23.10.0

Push the image to external docker registry

Push the image to your external docker registry by running the following command:

docker push <your_registry_server>/<image_name>
docker push <your_registry_server>/<image_name>

Here's an example of how to push the image of the UiPathDocumentOCR bundle to an external docker registry:

docker push registory.mycompany.com/aicenter/du-doc-ocr:v23.10.0
docker push registory.mycompany.com/aicenter/du-doc-ocr:v23.10.0

Images for each Document Understanding bundle

2023.10.12+patch1

Document Understanding BundleImage
OCR for Chinese, Japanese, Koreandu/uipath-ocr-cjk:v23.10-11.21-rc02 du/du-cjk-ocr-proxy:v23.10-11.21-rc02
Extended Languages OCRdu/du-extended-ocr-proxy:v23.10-11.21-rc02 du/uipath-ocr-extended:v23.10-11.21-rc02 du/du-extended-ocr-reporting:v23.10-11.21-rc02

2023.10.12

Document Understanding BundleImage
OCR for Chinese, Japanese, Koreandu/uipath-ocr-cjk:v23.10-10.17-rc02 du/du-cjk-ocr-proxy:v23.10-10.17-rc02
Extended Languages OCRdu/du-extended-ocr-proxy:v23.10-10.31-rc06 du/uipath-ocr-extended:v23.10-10.31-rc06 du/du-extended-ocr-reporting:v23.10-10.31-rc06

2023.10.11

Document Understanding bundleImage
UiPathDocumentOCRaicenter/du-doc-ocr:v23.10-08.25-rc02
UiPathDocumentOCR_CPUaicenter/du-doc-ocr-cpu:v23.10-08.25-rc02
OCR for Chinese, Japanese, Koreandu/uipath-ocr-cjk:v23.10-8.26-rc02 du/du-cjk-ocr-proxy:v23.10-8.26-rc02
Extended Languages OCRdu/du-extended-ocr-proxy:v23.10-8.26-rc02 du/uipath-ocr-extended:v23.10-8.26-rc02 du/du-extended-ocr-reporting:v23.10-8.26-rc02
DocumentClassifieraicenter/du-ml-document-type-text-classifier:v23.10-08.25-rc02
Out of the box pre-trained ML packagesaicenter/du-semistructured:v23.10-08.25-rc02

2023.10.10

Document Understanding bundleImage
UiPathDocumentOCRaicenter/du-doc-ocr:v23.10-06.05-rc05
UiPathDocumentOCR_CPUaicenter/du-doc-ocr-cpu:v23.10-06.05-rc05
OCR for Chinese, Japanese, Koreandu/uipath-ocr-cjk:v23.10-6.05-rc02 du/du-cjk-ocr-proxy:v23.10-6.05-rc02
Extended Languages OCRdu/du-extended-ocr-proxy:v23.10-6.05-rc02 du/uipath-ocr-extended:v23.10-6.05-rc02 du/du-extended-ocr-reporting:v23.10-6.05-rc02
DocumentClassifieraicenter/du-ml-document-type-text-classifier:v23.10-06.05-rc05
Out of the box pre-trained ML packagesaicenter/du-semistructured:v23.10-06.05-rc05

2023.10.9

Document Understanding bundleImage
UiPathDocumentOCRaicenter/du-doc-ocr:v23.10-03.14-rc02
UiPathDocumentOCR_CPUaicenter/du-doc-ocr-cpu:v23.10-03.14-rc02
OCR for Chinese, Japanese, Koreandu/uipath-ocr-cjk:v23.10-4.01-rc06 du/du-cjk-ocr-proxy:v23.10-4.01-rc06
Extended Languages OCRdu/du-extended-ocr-proxy:v23.10-4.01-rc06 du/uipath-ocr-extended:v23.10-4.01-rc06 du/du-extended-ocr-reporting:v23.10-4.01-rc06
DocumentClassifieraicenter/du-ml-document-type-text-classifier:v23.10-03.14-rc02
Out of the box pre-trained ML packagesaicenter/du-semistructured:v23.10-03.14-rc02

2023.10.8

Document Understanding BundleImage
UiPathDocumentOCRaicenter/du-doc-ocr:v23.10-01.28-rc07
UiPathDocumentOCR_CPUaicenter/du-doc-ocr-cpu:v23.10-01.28-rc07
OCR for Chinese, Japanese, Koreandu/uipath-ocr-cjk:vision-3.2-20241211.1 du/du-cjk-ocr-proxy:v23.10-02.11-rc01
DocumentClassifieraicenter/du-ml-document-type-text-classifier:v23.10-01.28-rc07
Out-of-the-box Pre-trained ML Packagesaicenter/du-semistructured:v23.10-01.28-rc07

2023.10.7+patch1

Document Understanding BundleImage
OCR for Chinese, Japanese, Koreandu/uipath-ocr-cjk:vision-3.2-20241211.1 du/du-cjk-ocr-proxy:v23.10-12.01-rc01

2023.10.7

Document Understanding BundleImage
OCR for Chinese, Japanese, Koreandu/du-cjk-ocr-proxy:v23.10-12.01-rc01

2023.10.6

Document Understanding BundleImage
UiPathDocumentOCRaicenter/du-doc-ocr:v23.10-09.25-rc08
UiPathDocumentOCR_CPUaicenter/du-doc-ocr-cpu:v23.10-09.25-rc08
OCR for Chinese, Japanese, Koreandu/du-cjk-ocr-proxy:v23.10-10.15-rc03
DocumentClassifieraicenter/du-ml-document-type-text-classifier:v23.10-09.25-rc08
Out-of-the-box Pre-trained ML Packagesaicenter/du-semistructured:v23.10-09.25-rc08

2023.10.5

Document Understanding BundleImage
UiPathDocumentOCRaicenter/du-doc-ocr:v23.10-rc28
UiPathDocumentOCR_CPUaicenter/du-doc-ocr-cpu:v23.10-rc28
OCR for Chinese, Japanese, Koreandu/du-cjk-ocr-proxy:v23.10-rc58
DocumentClassifieraicenter/du-ml-document-type-text-classifier:v23.10-rc28
Out-of-the-box Pre-trained ML Packagesaicenter/du-semistructured:v23.10-rc28

2023.10.4

Document Understanding BundleImage
UiPathDocumentOCRaicenter/du-doc-ocr:v23.10.4-rc125
UiPathDocumentOCR_CPUaicenter/du-doc-ocr-cpu:v23.10.4-rc125
OCR for Chinese, Japanese, Koreandu/du-cjk-ocr-proxy:v23.10.4-rc45
DocumentClassifieraicenter/du-ml-document-type-text-classifier:v23.10.4-rc125
Out-of-the-box Pre-trained ML Packagesaicenter/du-semistructured:v23.10.4-rc125

2023.10.3

Document Understanding BundleImage
UiPathDocumentOCRaicenter/du-doc-ocr:v23.10.3-rc04
OCR for Chinese, Japanese, Koreandu/du-cjk-ocr-proxy:v23.10.3-rc20
DocumentClassifieraicenter/du-ml-document-type-text-classifier:v23.10.3-rc04
Out-of-the-box Pre-trained ML Packagesaicenter/du-semistructured:v23.10.3-rc04

2023.10.2

Document Understanding BundleImage
UiPathDocumentOCRaicenter/du-doc-ocr:v23.10.2-rc22
UiPathDocumentOCR_CPUaicenter/du-doc-ocr-cpu:v23.10.2-rc22
OCR for Chinese, Japanese, Koreandu/du-cjk-ocr-proxy:v23.10.2-rc13
DocumentClassifieraicenter/du-ml-document-type-text-classifier:v23.10.2-rc42
Out-of-the-box Pre-trained ML Packagesaicenter/du-semistructured:v23.10.2-rc22

2023.10.1

Document Understanding BundleImage
OCR for Chinese, Japanese, Koreandu/du-cjk-ocr-proxy:v23.10.1-rc19

2023.10.0

Document Understanding BundleImage
UiPathDocumentOCRaicenter/du-doc-ocr:v23.10.0-rc68
UiPathDocumentOCR_CPUaicenter/du-doc-ocr-cpu:v23.10.0-rc68
OCR for Chinese, Japanese, Koreandu/du-cjk-ocr-proxy:v23.10.0-rc82
Out-of-the-box Pre-trained ML Packagesaicenter/du-semistructured:v23.10.0-rc68

Was this page helpful?

Connect

Need help? Support

Want to learn? UiPath Academy

Have questions? UiPath Forum

Stay updated