Open Text Corporation

Open Text Capture Center

Document Capture System

The Open Text Capture Center is a platform for the development of applications for document classification and data extraction. It can be used to tackle all kinds of input management tasks, regardless of whether the volume of documents is high or low, the workflow is straightforward or complex, the processing is central or distributed, or if the documents to be processed are structured or unstructured. The Open Text Capture Center is typically applied when developing solutions like:

The Open Text Capture Center provides the application developer with highest possible flexibility to the extend of easy integration of proprietary components on the module- and workflow level. Open Text Capture Center has been developed based on the Microsoft .Net 3.0 framework. Simple applications can be quickly implemented by using the application-designer. The development of complex applications can be done directly in the .Net framework while the developer can utilize a vast array of readily available components.

The SOA oriented system concept of Open Text Capture Center allows the implementation of centralized and decentralized capturing solutions. By increasing processing capacities and adding additional scan- and validation clients the solutions are scalable without requiring any modification of the application itself.


Open Text Capture Document Reader

The core elements of the Open Text Capture CenterDispatch, Document Reader and Document Validation – ensure reliable, seamless interaction when classifying documents, extracting data and processing documents, when entering data manually and when inspecting and correcting automatically recognized data.

Document Reader is responsible not just for automatically classifying documents and extracting data.

Document Validation provides the basis for all interactive processing steps when processing documents, entering data manually, as well as inspecting and correcting automatically recognized data.

Dispatch is the capture workflow of the Open Text Capture Center, with which all process steps are guided and controlled.

Dispatch, is made up of three parts: Workflow Engine, Workflow Designer and Administration Tool.

The Workflow Engine manages the status of all jobs in a database. Each job has a status and is assigned to a queue, where it waits for the next task.

With the Workflow Designer, the sequence is determined graphically and interactively. The individual processing steps are defined and configured. Editable rules determine the conditions under which individual processing steps follow each another.

The Administration Tool is used to check the status of the capture solution during production. On the one hand, the resources are monitored, ie all servers and the processes running on them. These servers can be distributed in the local network or between different sites. On the other hand, all active jobs are checked, eg which document is in which status. The administrator can intervene in the sequences and, for example, reset the processing steps of documents manually.

Many requirements - one solution

The Open Text Capture Center fulfils the necessary requirements of a capture workflow: flexibility, power and ease-of-use.

Powerful programming interfaces give almost absolute control; services as components of workflows create flexibility for any imaginable distribution scenario. The core of the Workflow Engine is database-based and has already been in productive use for many years. Pre-defined scenarios and program wizards ensure that the solution is quickly ready for use.

The capabilities of the Open Text Capture Center include:

Related Documents

English Open Text Capture Center Brochure (English - PDF)