List View in the Documents window

This view displays the documents in the selected document set in a flat structure, where each document is stand-alone, without any visible relationship to the other documents.

The content displayed here depends on the columns selected on the Choose Details window as well as the selected document set.

Name and Description

Document Set Type Visibility

Filename

Displays the file name for each document in the selected document subset. If any changes are made to a document that require saving, an asterisk (*) is displayed beside the filename.

All document sets.

This column cannot be hidden in the Choose Details window.

Use

Displays whether or not a document in training set is being used. This can be either the Include for Training icon or the Exclude from Training icon.

Classification Set

Extraction Set

Table Extraction Set

Trained

Displays whether or not the document was used for dynamic training. If a document was not used for dynamic training, even if it was used for a different type of extraction online learning, the field displays No. If a document has been excluded from training or not yet trained, the field is blank.

Classification Set

Extraction Set

Table Extraction Set

Layout ID

Displays an integer that indicates how a document relates to the other documents in the Extraction Set based on its layout. Documents with the same layout have the same Layout ID.

For example, invoices from the same vendor will have the same Layout ID. Invoices from other vendors will have different Layout ID values.

Classification Set

Extraction Set

Table Extraction Set

Conflicts

Displays a list of fields where a conflict is present.

Classification Set

Extraction Set

Table Extraction Set

Validation Information

Displays whether a document has validation information or not.

Classification Set

Extraction Set

Table Extraction Set

Classification Result

Displays the classification result when one or more documents are manually classified. This value can differ from the Assigned Class if there are problems with your classification settings or the assigned class is incorrect.

All document sets.

If you want to hide this column, remove columns from the List View.

Confidence

Displays the classification confidence for each classified document in the selected document subset.

All

All document sets.

If you want to hide this column, remove columns from the List View.

Assigned Class

Displays the assigned classification result for each document in the selected document subset. This value is only populated if a document is successfully classified or if you manually assign the document to a class.

All document sets.

This column cannot be hidden in the Choose Details window.

Fallback Reason

The reason a fallback recognition engine was used.

This column is visible in the following document sets when you add the column to the List View only.

Classification Set

Extraction Set

Table Extraction Set

Test Set

Benchmark Set

Fallback Engine Name

The name of the fallback recognition engine.

This column is visible in the following document sets when you add the column to the List View only.

Classification Set

Extraction Set

Table Extraction Set

Test Set

Benchmark Set

Resolution (DPI)

The resolution of a document in DPI.

If you are using a project created in a version of Tungsten TotalAgility earlier than 8.1.0. Any document sets loaded in an earlier version do not have the necessary information to display the resolution. As a result, the value for this column is set to -. Click on each document to load this information. Alternately, close the document set and open it again by selecting Source files. This forces the XDoc to be created from scratch.

Classification Set

Extraction Set

Table Extraction Set

Test Set *

Benchmark Set *

* Visible only when you add the column to the List View.

Dimensions (px)

The dimensions for a document in pixels.

If you are using a project created in a version of Tungsten TotalAgility earlier than 8.1.0. Any document sets loaded in an earlier version do not have the necessary information to display the dimensions. As a result, the value for this column is set to -. Click on each document to load this information. Alternately, close the document set and open it again by selecting Source files. This forces the XDoc to be created from scratch.

Classification Set

Extraction Set

Table Extraction Set

Test Set *

Benchmark Set *

* Visible only when you add the column to the List View.

Color Depth

This indicates the color saturation of the documents in a document set.

The following are the possible values for this column:

  • N/A - Not applicable because the document has no color depth information.

  • 1 - The image is black and white.

  • 8 - The image is grayscale.

  • 16|24|32|64 - The bit value of a color image.

  • - - No color depth value is currently loaded.

Text documents Text document icon, PDF documents PDF icon, and protected documents Protected document icon do not have a color depth. These are marked with N/A.

For projects created in a version of Tungsten TotalAgility earlier than 8.1.0, document sets do not have the necessary information to display the color depth. As a result, the color depth value is set to -. This data is also not available for old data sets that already have XDoc values when they are added to a new project.

If you want to see the color depth values for your document set, click on each document with a - to load this information. Alternately, close the document set and open it again by selecting Source files. This forces the XDoc to be created from scratch.

If you click on a document that does not have color depth information, that information is displayed and the document is marked as modified. If you want to see the color depth data the next time the document set is opened, save the changes to the document set.

This column is visible in the following document sets only when you add the column to the List View.

Classification Set

Extraction Set

Table Extraction Set

Test Set

Benchmark Set

In addition to these columns, there is a shortcut menu that contains many settings that can also be performed via the Documents or Process Ribbon tabs. The following settings may be available only using the document shortcut menu, depending on the selected document set:

Shortcut Menu Icon

Shortcut Menu Name and Description

Show Document icon

Show Document

Opens the selected document in the Document Viewer window.

Train for Classification icon

Add to Training Set of Selected Class (Classification)

Adds the selected documents to the Classification Set, assigns the selected class in the Project Tree, and confirms the documents.

Train for Extraction icon

Add to Training Set of Selected Class (Extraction)

Opens the Edit Document window so you can add the selected documents to the Extraction Set.

Add to Training Set of Selected Class (Table Extraction) icon

Add to Training Set of Selected Class (Table Extraction)

Adds the selected documents to the Table Extraction Set for the selected class.

When adding documents to this training set recognition and table detection are performed on-demand for any documents that are missing OCR or table data.

Assign Class icon

Assign Class

Select a class from the list for the selected document or documents.

Rename icon

Rename

Renames the selected XDoc.

After renaming an XDoc your document set is saved automatically and any other changes are saved as well. If you rename an XDoc in a training set, you must retrain that document set in order for the change to take effect. You cannot have more than one XDoc with the same name.

Delete Document icon

Delete

Removes the selected documents from their document subset.

Test - Recognize Selection icon

Recognize

This setting provides a list of available recognition engines so you can perform recognition for the selected document or documents.

Detect Tables icon

Detect Tables

Detects all tables on a document and stores that information in the XDoc.

Test - Classify Selection icon

Classify

Performs classification on the selected documents.

Test - Extract Selection icon

Extract

Extracts the data from the selected documents using the field and locator definition of the selected class.

Test - Process Selection icon

Process

Performs classification and extraction on the selected documents.

Convert to Black & White icon

Convert to Black & White

This setting is disabled for benchmark document sets and protected projects.

Converts the selected documents to black and white. Once complete, this conversion cannot be undone. If the selected documents include protected documents or .txt files, these are skipped during conversion.

PDF documents are converted to bitonal TIFFs and bitonal black and white documents are smaller than the original source files. Because of this document sets with black and white documents are significantly smaller in size.

A slight reduction in quality occurs during conversion. However, the reduction in size outweighs the loss in quality.

For the best results, Train your project before converting the training documents to bitonal format. This ensures that any quality lost during conversion does not negatively affect the training results.

Similarly, ensure that all configuration and testing is complete before converting any Test Sets. This ensures that you are using the best quality documents to configure and test your extraction results.

Clear Document Data icon

Clear Data

Removes the classification and extraction data for the selected documents.

Load Document Data icon

Load Data

Reads the XDoc and loads the classification and extraction data for the selected document or documents. This data is required in order for sorting, filtering, and benchmarks to work.

Resolve Conflict icon

Resolve Conflicts

Displays the Resolve Conflicts window where you can compare conflicting documents and resolve the conflict by correcting field data.

This setting is available only when there is at least one conflict to resolve.

Open in XDoc Browser icon

Open in XDoc Browser

Opens the selected document in Tungsten TotalAgility - XDoc Browser

Open in XDoc Browser icon

Open in XDoc Browser

Opens the selected document in Tungsten TotalAgility - XDoc Browser

Related topics: