Hierarchy View in the Documents window

This view displays the documents in the selected document set in a hierarchical structure. Each document is represented as an XDocument (.xdc) and is shown with a document icon. If you expand a document, all of its pages are displayed, each with a page icon. This view is available for Test Sets and Benchmark Sets only.

If a column is not relevant or has no data, the hyphen character (-) is displayed.

In addition, the following columns are displayed and are populated with data for each document.

Filename

Displays the file name for each document in the selected document subset. If any changes are made to a document that require saving, an asterisk (*) is displayed beside the filename.

Classification Result

Displays the classification result when one or more documents are manually classified. This value can differ from the Assigned Class if there are problems with your classification settings or the assigned class is incorrect.

Confidence

Displays the classification confidence for each classified document in the selected document subset.

Assigned Class

Displays the assigned classification result for each document in the selected document subset. This value is only populated if a document is successfully classified or if you manually assign the document to a class.

Fallback Reason

The reason a fallback recognition engine was used.

Fallback Engine Name

The name of the fallback recognition engine.

Resolution (DPI)

The resolution of a document in DPI.

If you are using a project created in a version of Tungsten TotalAgility earlier than 8.1.0, any document sets loaded in that earlier version do not have the information needed to display the resolution. As a result, the value for this column is set to -. Click each document to load this information. Alternatively, close the document set and open it again by selecting Source files. This forces the XDoc to be created from scratch.

Dimensions (px)

The dimensions for a document in pixels.

If you are using a project created in a version of Tungsten TotalAgility earlier than 8.1.0, any document sets loaded in that earlier version do not have the information needed to display the dimensions. As a result, the value for this column is set to -. Click each document to load this information. Alternatively, close the document set and open it again by selecting Source files. This forces the XDoc to be created from scratch.
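
Taken together, the Resolution (DPI) and Dimensions (px) columns give the physical size of a scanned page: the pixel count divided by the DPI is the size in inches. The following Python sketch illustrates that arithmetic only. The function name physical_size_inches is hypothetical and is not part of the product.

```python
def physical_size_inches(width_px: int, height_px: int, dpi: int) -> tuple[float, float]:
    """Return (width, height) in inches for a page of the given pixel size and DPI.

    Hypothetical helper for illustration; it only restates pixels / DPI.
    """
    if dpi <= 0:
        raise ValueError("dpi must be a positive number")
    return width_px / dpi, height_px / dpi


# A 2550 x 3300 px page scanned at 300 DPI is a Letter-size page.
width_in, height_in = physical_size_inches(2550, 3300, 300)
print(width_in, height_in)  # 8.5 11.0
```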

Color Depth

This indicates the color depth (bit depth) of the documents in a document set.

The following are the possible values for this column:

  • N/A - Not applicable because the document has no color depth information.

  • 1 - The image is black and white.

  • 8 - The image is grayscale.

  • 16, 24, 32, or 64 - The bit depth of a color image.

  • - - No color depth value is currently loaded.

Text documents, PDF documents, and protected documents do not have a color depth. These are marked with N/A.
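
The possible Color Depth values can be read as a simple lookup. The following Python sketch restates the list above. The function name describe_color_depth is hypothetical, and the code does not call any product API.

```python
def describe_color_depth(value: str) -> str:
    """Translate a Color Depth column value into a short description.

    Hypothetical helper; the accepted values mirror the list in this topic.
    """
    if value == "N/A":
        return "not applicable (no color depth information)"
    if value == "-":
        return "no color depth value currently loaded"
    bits = int(value)  # raises ValueError for anything that is not a number
    if bits == 1:
        return "black and white"
    if bits == 8:
        return "grayscale"
    if bits in (16, 24, 32, 64):
        return f"{bits}-bit color"
    raise ValueError(f"unexpected color depth: {value!r}")


for column_value in ("N/A", "1", "8", "24", "-"):
    print(column_value, "->", describe_color_depth(column_value))
```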

For projects created in a version of Tungsten TotalAgility earlier than 8.1.0, document sets do not have the necessary information to display the color depth. As a result, the color depth value is set to -. This data is also not available for old data sets that already have XDoc values when they are added to a new project.

To see the color depth values for your document set, click each document that shows a - to load this information. Alternatively, close the document set and open it again by selecting Source files. This forces the XDoc to be created from scratch.

If you click on a document that does not have color depth information, that information is displayed and the document is marked as modified. If you want to see the color depth data the next time the document set is opened, save the changes to the document set.

In addition to these columns, a shortcut menu provides many settings that are also available on the Documents or Process Ribbon tabs. The following settings may be available only using the document shortcut menu, depending on the selected document set:

Shortcut Menu Icon

Shortcut Menu Name and Description

Show Document icon

Show Document

Opens the selected document in the Document Viewer window.

Train for Classification icon

Add to Training Set of Selected Class (Classification)

Adds the selected documents to the Classification Set, assigns the selected class in the Project Tree, and confirms the documents.

Train for Extraction icon

Add to Training Set of Selected Class (Extraction)

Opens the Edit Document window so you can add the selected documents to the Extraction Set.

Add to Training Set of Selected Class (Table Extraction) icon

Add to Training Set of Selected Class (Table Extraction)

Adds the selected documents to the Table Extraction Set for the selected class.

When adding documents to this training set, recognition and table detection are performed on demand for any documents that are missing OCR or table data.

Edit Document icon

Edit

This setting provides a list of available editing operations to test the selected document. This setting is available in the Hierarchy view only and is not available for PDF documents.

The following editing operations are available:
Cut icon Cut

Removes the selected documents and saves them to the clipboard so you can paste them later.

Paste icon Paste

Pastes the cut or copied documents.

Copy icon Copy

Copies the selected documents so they can be pasted elsewhere.

Delete icon Delete

Deletes the selected documents.

Add Document Before

Adds a new document before the currently selected document.

Add Page

Adds a new page to the end of the currently selected document.

Merge

Merges the selected documents.

Normalize Source File Structure

Normalizes the source files by converting any image source files to TIFF images. Text and PDF files cannot be normalized.

Split

The current document is split into two separate documents. The selected page is the first page of the new document.
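
The Split operation can be pictured as slicing a list of pages at the selected page, so that the selected page becomes the first page of the new document. The following Python sketch models only that rule. The function name split_document is hypothetical, and rejecting a split at the first page is an assumption, not documented product behavior.

```python
def split_document(pages: list[str], selected_index: int) -> tuple[list[str], list[str]]:
    """Split a list of pages into two documents at the selected page.

    Hypothetical model: the selected page becomes the first page of the
    second document. Splitting at index 0 is rejected here (an assumption),
    because it would leave the first document empty.
    """
    if not 0 < selected_index < len(pages):
        raise ValueError("selected page must be the second page or later")
    return pages[:selected_index], pages[selected_index:]


original = ["page1", "page2", "page3", "page4"]
first, second = split_document(original, 2)
print(first)   # ['page1', 'page2']
print(second)  # ['page3', 'page4']
```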

Assign Class icon

Assign Class

Select a class from the list for the selected document or documents.

Test - Recognize Selection icon

Recognize

This setting provides a list of available recognition engines so you can perform recognition for the selected document or documents.

Detect Tables icon

Detect Tables

Detects all tables on a document and stores that information in the XDoc.

Test - Classify Selection icon

Classify

Performs classification on the selected documents.

Test - Extract Selection icon

Extract

Extracts the data from the selected documents using the field and locator definition of the selected class.

Test - Process Selection icon

Process

Performs classification and extraction on the selected documents.

Convert to Black & White icon

Convert to Black & White

This setting is disabled for benchmark document sets and protected projects.

Converts the selected documents to black and white. Once complete, this conversion cannot be undone. If the selected documents include protected documents or .txt files, these are skipped during conversion.

PDF documents are converted to bitonal TIFFs. Bitonal black and white documents are smaller than the original source files, so document sets with black and white documents are significantly smaller in size.

A slight reduction in quality occurs during conversion. However, the reduction in size outweighs the loss in quality.

For the best results, train your project before converting the training documents to bitonal format. This ensures that any quality lost during conversion does not negatively affect the training results.

Similarly, ensure that all configuration and testing are complete before converting any Test Sets. This ensures that you are using the best quality documents to configure and test your extraction results.
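
The size reduction from Convert to Black & White follows from the bit depth: an uncompressed raster needs width x height x bits per pixel. The following Python sketch compares a 24-bit color page with a 1-bit bitonal page. It is a rough illustration only: the function name uncompressed_bytes is hypothetical, and real TIFF files are usually compressed, so actual file sizes and savings will differ.

```python
def uncompressed_bytes(width_px: int, height_px: int, bits_per_pixel: int) -> int:
    """Return the raw pixel data size in bytes, rounded up to a whole byte.

    Hypothetical helper; ignores file headers, row padding, and compression.
    """
    total_bits = width_px * height_px * bits_per_pixel
    return (total_bits + 7) // 8


# A Letter-size page scanned at 300 DPI is 2550 x 3300 pixels.
color = uncompressed_bytes(2550, 3300, 24)
bitonal = uncompressed_bytes(2550, 3300, 1)
print(color)             # 25245000
print(bitonal)           # 1051875
print(color // bitonal)  # 24
```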

Clear Document Data icon

Clear Data

Removes the classification and extraction data for the selected documents.

Load Document Data icon

Load Data

Reads the XDoc and loads the classification and extraction data for the selected document or documents. This data is required in order for sorting, filtering, and benchmarks to work.

Resolve Conflict icon

Resolve Conflicts

Displays the Resolve Conflicts window where you can compare conflicting documents and resolve the conflict by correcting field data.

This setting is available only when there is at least one conflict to resolve.

Open in XDoc Browser icon

Open in XDoc Browser

Opens the selected document in Tungsten TotalAgility - XDoc Browser.

In addition, when you expand a document and select a page, the following settings are available from the page shortcut menu, depending on what is selected:

Shortcut Menu Icon

Shortcut Menu Name and Description

Show Document icon

Show Document

Opens the selected document in the Document Viewer window.

Edit Document icon

Edit

This setting provides a list of available editing operations to test the selected page. This setting is available in the Hierarchy view only and is not available for PDF documents.

The following batch editing settings are available:

Cut icon Cut

Removes the selected pages and saves them to the clipboard so you can paste them later.

Paste icon Paste

Pastes the cut or copied pages to the selected document.

Copy icon Copy

Copies the selected pages so they can be pasted elsewhere.

Delete icon Delete

Deletes the selected pages.

Create Document

Creates a new document with the selected pages after the current document.

Add Page(s) Before

Enables you to select image files in Windows Explorer to add to the document before the currently selected page.

Split

The current document is split into two separate documents. The selected page is the first page of the new document.

Rotate Left

Rotates the page 90° to the left.

Rotate Right

Rotates the page 90° to the right.

Open in XDoc Browser icon

Open in XDoc Browser

Opens the selected document in Tungsten TotalAgility - XDoc Browser.
