Document Understanding user guide
You decide when your models train. Once you have made enough annotations or classifications, you start a training run by selecting the Start Training button. Training does not begin in the background on its own.
Where the button is
There is one Start Training button per trainable model.
Classifiers:
- Legacy Classifier: in the classifier's Model Training status pill, shown in the Recommendations area on the Build page.
- Helix Classifier: in the classifier's Model Training status pill, on the Split & Classify page.
Extractors: on each document type's annotation page, in the header bar (top-right area).
In addition, the Model Training status pill hosts a Start Training action. The pill appears next to each trainable model or document type across the application, which means you can start an extractor training without navigating to its annotation page.
The button is not shown on the Build homepage as a standalone control, but the status pill on each document-type card does include the action.
How to start a training
- Navigate to the model you want to train. Options:
  - Open the Split & Classify page (Helix Classifier only).
  - Open the classifier's annotation flow (Legacy Classifier).
  - Open the annotation page for the document type (extractor).
  - Or, find the Model Training status pill for that model (for example, on the Build homepage or Measure overview) and use its Start Training action.
- Check the changes counter next to the Start Training button or inside the pill. This shows how many annotations or classifications have accumulated since the last training.
- Select Start Training. The status changes to Queued.
- The system picks up the queued training and starts the run. This may take a few minutes.
- When the run completes, the status changes to Trained and shows the updated score, last training date, duration, and base model version.
Button states
The button's enabled state depends on what has changed since the last training and whether a training is already in flight.
| State | Button | Popover message |
|---|---|---|
| Enough changes | Enabled | No popover. Select to queue a training. |
| Below the minimum changes threshold | Disabled | "At least N changes are needed before a new training can be started." |
| No changes since the last training | Disabled | "No changes have been made since the last training." |
| Training queued or in progress | Disabled | "A training is already queued or in progress for this model." |
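
The table above can be read as a small decision function. The following is a minimal sketch for illustration only, not the product's implementation: the function name, the `ButtonState` type, and the `min_changes` parameter (the "N" in the popover) are hypothetical, and the order in which the conditions are checked is an assumption, because the table does not state a precedence.

```python
from typing import NamedTuple, Optional


class ButtonState(NamedTuple):
    enabled: bool
    popover: Optional[str]  # None means no popover is shown


def start_training_button(changes: int, min_changes: int, training_active: bool) -> ButtonState:
    """Map the inputs from the table to a button state (illustrative only)."""
    # Assumption: a queued or running training is checked before the change counts.
    if training_active:
        return ButtonState(False, "A training is already queued or in progress for this model.")
    if changes == 0:
        return ButtonState(False, "No changes have been made since the last training.")
    if changes < min_changes:
        return ButtonState(
            False,
            f"At least {min_changes} changes are needed before a new training can be started.",
        )
    # Enough changes: the button is enabled and no popover is shown.
    return ButtonState(True, None)
```

For example, `start_training_button(3, 10, False)` yields a disabled button with the "At least 10 changes" message, while `start_training_button(10, 10, False)` yields an enabled button.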
What counts as a change
Each annotation or classification modification counts as one change. For example, annotating a field on a document and classifying a page each count as one change. The changes counter resets after a successful training run. Changes to the document type schema or the base model count as major changes and bypass the minimum changes threshold.
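
The counting rules in this section can be sketched as a small tracker. This is an illustrative model under stated assumptions, not the product's code: the `ChangeTracker` class and its method names are hypothetical, `min_changes` stands in for the threshold and is assumed to be positive, and it is assumed that a major change does not itself increment the counter and that the major-change flag clears after a successful training.

```python
from dataclasses import dataclass


@dataclass
class ChangeTracker:
    min_changes: int            # hypothetical threshold, assumed positive
    changes: int = 0            # annotations/classifications since the last training
    major_change: bool = False  # schema or base model changed since the last training

    def record_change(self) -> None:
        # Each annotation or classification modification counts as one change.
        self.changes += 1

    def record_major_change(self) -> None:
        # Assumption: a major change sets a flag rather than incrementing the counter.
        self.major_change = True

    def can_start_training(self) -> bool:
        # Major changes bypass the minimum changes threshold.
        return self.major_change or self.changes >= self.min_changes

    def on_training_succeeded(self) -> None:
        # The counter resets after a successful run (flag reset is an assumption).
        self.changes = 0
        self.major_change = False
```

With `min_changes=3`, a single recorded change is not enough to train, but recording a schema change afterwards makes `can_start_training()` return `True`.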
Status pills
Status pills appear next to each trainable model or document type across the application. The pill reflects the current state of that model's training, and (where applicable) exposes the Start Training action.
| State | What you see |
|---|---|
| Not yet trained | Changes counter. Start Training is enabled once the threshold is reached. |
| Queued | Message: "Training is being prepared and will start automatically. This may take a few minutes." |
| Training in progress | In-flight indicator. Start Training is not shown. |
| Failed | Error message, warning icon, and a Retry button to re-queue the training. |
| Trained | Last training date, duration, and base model version used. Changes counter since the last run. Start Training is enabled if enough new changes exist. |
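
One way to picture how a pill moves between these states is a small state machine. The sketch below is an assumption-laden illustration, not the product's implementation: the `PillState` enum, the `TRANSITIONS` map, and the `advance` helper are hypothetical names, and the set of allowed transitions is inferred from this page (for example, whether a queued training can fail before it starts is not stated here and is not modeled).

```python
from enum import Enum


class PillState(Enum):
    NOT_TRAINED = "Not yet trained"
    QUEUED = "Queued"
    IN_PROGRESS = "Training in progress"
    FAILED = "Failed"
    TRAINED = "Trained"


# Allowed transitions, inferred from the table above (an assumption).
TRANSITIONS = {
    PillState.NOT_TRAINED: {PillState.QUEUED},                   # Start Training
    PillState.QUEUED: {PillState.IN_PROGRESS},                   # system picks up the run
    PillState.IN_PROGRESS: {PillState.TRAINED, PillState.FAILED},
    PillState.FAILED: {PillState.QUEUED},                        # Retry re-queues the training
    PillState.TRAINED: {PillState.QUEUED},                       # Start Training after new changes
}


def advance(current: PillState, target: PillState) -> PillState:
    """Return the target state if the transition is allowed, else raise ValueError."""
    if target not in TRANSITIONS[current]:
        raise ValueError(f"Cannot go from {current.value!r} to {target.value!r}")
    return target
```

A typical successful run would pass through `NOT_TRAINED`, `QUEUED`, `IN_PROGRESS`, and `TRAINED` in that order; a failed run returns to `QUEUED` only through Retry.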
Recommendations and warnings
- When a large number of changes have accumulated, the status pill shows a warning icon to nudge you to start a new training.
- When you change the document type schema, the change is not included in the trained model until you start a new training. A warning is shown.
- When the base model version changes, the existing trained model is not re-aligned automatically. A warning is shown, and you start a new training when ready.
Exceptions
- Zip import. When you import a zip into a project, a training is queued automatically. You do not need to select Start Training.
- One training at a time. You cannot queue a second training for the same model while one is already queued or running. Wait for it to finish (or fail) before starting another.
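
The one-training-at-a-time rule amounts to a per-model guard. The following is a minimal sketch of that idea only, not how the service enforces it: `TrainingQueue`, `start_training`, `finish`, and the model identifiers are all hypothetical.

```python
class TrainingQueue:
    """Tracks which models have a training queued or running (illustrative only)."""

    def __init__(self) -> None:
        self._active: set[str] = set()

    def start_training(self, model_id: str) -> bool:
        # Reject a second training for a model that is already queued or running.
        if model_id in self._active:
            return False
        self._active.add(model_id)
        return True

    def finish(self, model_id: str) -> None:
        # Called when the run completes or fails; the model can then be queued again.
        self._active.discard(model_id)
```

In this sketch the guard is keyed by model, so a training for one model does not block a training for another.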