The tool will generate text of images and documents without embedded text. Several features use this generated text such as:
•Keyword searching
•Productions (exporting a separate .txt file for each document)
•Enabling AI Search
To access the functions:
•Right-click on the filter tree and select "Extract Text..."
•Right-click one or more selected documents and select "Extract text..."
In the dialog choose the option needed for the data set:
•"Make production images searchable" - Create text-searchable PDFs for export either in a production or as individual documents (see Exporting Documents)
•"Generate text files for production export" - Generate/extract the text as a separate file. This option is necessary if generating a production requiring text files to be included.
•"Enable AI Searching" - Turn this on to generate AI Search indexes
When data sets involve languages other than English, the OCR engine can be optimized to recognize those languages.
To monitor the progress of OCR go to the Process tab and select View Jobs.
Note: When adding new data to a matter, DWR automatically generates a keyword index of the newly added documents. Any document without indexable text will be automatically OCR'ed and the results from that OCR will be added to the Index.