Softi Free OCR for English, French, Italian, German, Spanish, Dutch, supporting TIFF images


Softi Free OCR is a scan and OCR program which uses the Windows compiled Tesseract free ocr engine also known as a Tesseract GUI. It supports multi-page tiff’s, fax documents as well as most image types including compressed Tiff’s which the Tesseract engine on its own cannot read.

The Tesseract free OCR engine is an open source product released by Google. It was developed at Hewlett Packard Laboratories between 1985 and 1995. In 1995 it was one of the top 3 performers at the OCR accuracy contest organized by University of Nevada in Las Vegas. The Tesseract engine code is now maintained by Google.

Tesseract OCR engine requires images at a resolution of 200 dpi or greater and as such it is not suited for reading PC screen shots which are only about 72dpi although there have been made some enhancements achieving improved accuracy from low quality image sources.

Manual zoning allows you to select an area to process. This helps to increase the accuracy by eliminating borders, pictures etc. Also this makes the software useable to OCR documents which contain columns. To select an area just draw a box on the image with the mouse using the left button.