Mail Archives: cygwin/2010/05/10/07:29:21

Message-Id: <announce.4BE7E151.90603@x-ray.at>
Date: Mon, 10 May 2010 12:34:57 +0200
From: Reini Urban <rurban AT x-ray DOT at>
To: cygwin AT cygwin DOT com
Subject: [ANNOUNCEMENT] New package: tesseract-ocr-2.04-1 et al

tesseract-ocr, a command-line OCR package, has been added to the Cygwin
distribution.

The Tesseract OCR engine was originally developed at HP between 1985 and 
1995. It was open-sourced by HP and UNLV in 2005, and Google has led
further development.
The Tesseract OCR engine was one of the top 3 engines in the 1995 UNLV 
Accuracy test. Between 1995 and 2006 it had little work done on it, but 
it is probably one of the most accurate open source OCR engines 
available. It will read a binary, grey or color image and output text.

Homepage: http://code.google.com/p/tesseract-ocr/

Notes:
* Built with libtiff; nevertheless, it accepts only certain
   TIFF image formats. The convert tool from the ImageMagick
   package, with its -depth option, is my friend. I use:
   convert <any> -depth 8 <any.tif>
* I haven't tried http://code.google.com/p/ocropus/
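
As a rough illustration of the conversion note above, here is a minimal Python sketch that builds the two commands: the convert call with -depth 8, then a tesseract call on the resulting TIFF. The helper names (build_commands, run_ocr), the "-8bit.tif" suffix and the file names are hypothetical. The tesseract invocation assumes the usual "tesseract <image> <outputbase> -l <lang>" form, and both programs are assumed to be on the PATH.

```python
import subprocess
from pathlib import Path


def build_commands(image, lang="eng"):
    """Return (convert_cmd, ocr_cmd) as argument lists for one input image.

    Hypothetical helper: the "-8bit.tif" naming is only an illustration.
    """
    src = Path(image)
    # Intermediate TIFF written by ImageMagick, e.g. scan.png -> scan-8bit.tif
    tif = src.with_name(src.stem + "-8bit.tif")
    # Output base name; tesseract is assumed to append ".txt" itself
    outbase = src.with_suffix("")
    convert_cmd = ["convert", str(src), "-depth", "8", str(tif)]
    ocr_cmd = ["tesseract", str(tif), str(outbase), "-l", lang]
    return convert_cmd, ocr_cmd


def run_ocr(image, lang="eng"):
    """Run both steps in order; raises CalledProcessError if either fails."""
    for cmd in build_commands(image, lang):
        subprocess.run(cmd, check=True)
```

For example, run_ocr("scan.png", "deu") would first write scan-8bit.tif and then ask tesseract for German text, provided tesseract-ocr-deu is installed.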

Packages:
tesseract-ocr
tesseract-ocr-devel

And the following language packages, as in Debian:
tesseract-ocr-eng (default)
tesseract-ocr-deu
tesseract-ocr-deu-f (deutsch fraktur)
tesseract-ocr-fra
tesseract-ocr-ita
tesseract-ocr-nld
tesseract-ocr-por
tesseract-ocr-spa
tesseract-ocr-vie


If you have questions or comments, please send them to
the Cygwin mailing list at: cygwin AT cygwin DOT com .
I'll answer only there and I don't answer private mails.

