Open Text Corporation

Open Text Capture Center

The Open Text Capture Center is a platform for the development of applications for document classification and data extraction. It can be used to tackle all kinds of input management tasks, regardless of whether the volume of documents is high or low, the workflow is straightforward or complex, the processing is central or distributed, or if the documents to be processed are structured or unstructured. The Capture Suite is typically applied when developing solutions like:

Open Text Capture Center provides the application developer with highest possible flexibility to the extend of easy integration of proprietary components on the module- and workflow level. Open Text Capture Center has been developed based on the Microsoft .net 3.0 framework. Simple applications can be quickly implemented by using the application-designer. The development of complex applications can be done directly in the .net framework while the developer can utilize a vast array of readily available components.

The SOA oriented system concept of Open Text Capture Center allows the implememntation of centralized and decentralized capturing solutions. By increasing processing capacities and adding additional scan- and validation clients the solutions are scalable without requiring any modification of the application itself.

Architektur Open Text Capture Document Reader
DOKUStar Capture Suite (DCS) architecture

The core elements of the Open Text Capture CenterDOKuStar Dispatch, Open Text Capture Document Reader and Open Text Capture Document Validation – ensure reliable, seamless interaction when classifying documents, extracting data and processing documents, when entering data manually and when inspecting and correcting automatically recognized data.

Open Text Capture Document Reader is responsible not just for automatically classifying documents and extracting data.

Open Text Capture Document Validation provides the basis for all interactive processing steps when processing documents, entering data manually, as well as inspecting and correcting automatically recognized data.

DOKuStar Dispatch is the workflow system of the Open Text Capture Center, with which all process steps are guided and controlled. The capture workflow, DOKuStar Dispatch, is made up of three parts:

The Workflow Engine manages the status of all jobs in a database. Each job has a status and is assigned to a queue, where it waits for the next task. With the Workflow Designer, the sequence is determined graphically and interactively. The individual processing steps are defined and configured. Editable rules determine the conditions under which individual processing steps follow each another.

The administration tool is used to check the status of the capture solution during production. On the one hand, the resources are monitored, ie all servers and the processes running on them. These servers can be distributed in the local network or between different sites. On the other hand, all active jobs are checked, eg which document is in which status. The administrator can intervene in the sequences and, for example, reset the processing steps of documents manually.

Many requirements - one solution

The Open Text Capture Center fulfills the necessary requirements of a Capture Workflow: flexibility, power and ease-of-use.The Open Text Capture Center fulfils the necessary requirements of a Capture Workflow: flexibility, power and ease-of-use.

Powerful programming interfaces give almost absolute control; services as components of workflows create flexibility for any imaginable distribution scenario. The core of the Workflow Engine is database-based and has already been in productive use for many years. Pre-defined scenarios and program wizards ensure that the solution is quickly ready for use.

The capabilities of the Open Text Capture Center include:

Application-specific modules: Open Text Capture Document Reader and Open Text Capture Document Validation feature various programming interfaces with which their behavior can be adapted to the specific application. Open Text Capture Center enables fully independent modules to be inserted into the entry workflow and supports the configuration and administration of these modules. Via plug-in interfaces, the Suite’s interactive tools can be expanded to meet application-specific requirements.

Flexible document structure: The Open Text Capture Center supports a three-level structure of the objects to be converted, ensuring flexibility and freedom when designing the document structure as well as easy administration, including with high document volumes.

Import and export: To import the documents to be processed, either the importing program can be used or an application-specific import can be carried out using the import interfaces. The results data are stored as XML documents. They can be transferred to subsequent systems in one export step.

Forward-looking architecture (SOA / .Net Platform) The Open Text Capture Center has been created according to the latest software architecture principles and can be easily integrated into existing and future complete systems (keyword: SOA) – thus protecting your investments.

The system’s components are loosely coupled. Replacing individual components is simple and the dependencies within the system are minimized. XML and other standards facilitate communication with the surrounding IT and safeguard future scalability. Open Text Capture Center is integrated into Microsoft’s .Net architecture. Using various interfaces, existing modules can be adapted and entirely new modules and processing steps can be added to the system. If necessary, the program interfaces can be adjusted to create specially configured administration interfaces.

Related Documents

English Open Text Capture Center Brochure (English - PDF)