Message-Id: | <announce.4BE7E151.90603@x-ray.at> |
Date: | Mon, 10 May 2010 12:34:57 +0200 |
From: | Reini Urban <rurban AT x-ray DOT at> |
User-Agent: | Mozilla/5.0 (Windows; U; Windows NT 5.1; de; rv:1.9.1.9) Gecko/20100317 SeaMonkey/2.0.4 |
MIME-Version: | 1.0 |
To: | cygwin AT cygwin DOT com |
Subject: | [ANNOUNCEMENT] New package: tesseract-ocr-2.04-1 et al |
Reply-To: | cygwin AT cygwin DOT com |
X-IsSubscribed: | yes |
Mailing-List: | contact cygwin-help AT cygwin DOT com; run by ezmlm |
List-Id: | <cygwin.cygwin.com> |
List-Subscribe: | <mailto:cygwin-subscribe AT cygwin DOT com> |
List-Archive: | <http://sourceware.org/ml/cygwin/> |
List-Post: | <mailto:cygwin AT cygwin DOT com> |
List-Help: | <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs> |
Sender: | cygwin-owner AT cygwin DOT com |
Mail-Followup-To: | cygwin AT cygwin DOT com |
Delivered-To: | mailing list cygwin AT cygwin DOT com |

tesseract-ocr, a command line OCR package, has been added to the Cygwin
distribution.

The Tesseract OCR engine was originally developed at HP between 1985 and
1995. It was open-sourced by HP and UNLV in 2005, and Google has led
further development. The Tesseract OCR engine was one of the top 3 engines
in the 1995 UNLV Accuracy test. Between 1995 and 2006 it had little work
done on it, but it is probably one of the most accurate open source OCR
engines available. It will read a binary, grey or color image and output
text.

Homepage: http://code.google.com/p/tesseract-ocr/

Notes:
* Built with libtiff; nevertheless, it only accepts certain TIFF image
  formats. convert with -depth from the ImageMagick package is my friend.
  I use
    convert <any> -depth 8 <any.tif>
* I haven't tried http://code.google.com/p/ocropus/

Packages:
  tesseract-ocr
  tesseract-ocr-devel

And the following languages, as in Debian:
  tesseract-ocr-eng (default)
  tesseract-ocr-deu
  tesseract-ocr-deu-f (deutsch fraktur)
  tesseract-ocr-fra
  tesseract-ocr-ita
  tesseract-ocr-nld
  tesseract-ocr-por
  tesseract-ocr-spa
  tesseract-ocr-vie

If you have questions or comments, please send them to the Cygwin mailing
list at: cygwin AT cygwin DOT com . I'll answer only there and I don't
answer private mails.

      *** CYGWIN-ANNOUNCE UNSUBSCRIBE INFO ***

If you want to unsubscribe from the cygwin-announce mailing list, look at
the "List-Unsubscribe: " tag in the email header of this message. Send
email to the address specified there. It will be in the format:

cygwin-announce-unsubscribe-you=yourdomain DOT com AT cygwin DOT com

If you need more information on unsubscribing, start reading here:

http://sources.redhat.com/lists.html#unsubscribe-simple

Please read *all* of the information on unsubscribing that is available
starting at this URL.

--
Problem reports:       http://cygwin.com/problems.html
FAQ:                   http://cygwin.com/faq/
Documentation:         http://cygwin.com/docs.html
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple