delorie.com/archives/browse.cgi | search |
X-Recipient: | archive-cygwin AT delorie DOT com |
X-SWARE-Spam-Status: | No, hits=-2.1 required=5.0 tests=AWL,BAYES_00,SPF_PASS |
X-Spam-Check-By: | sourceware.org |
Date: | Wed, 4 Mar 2009 09:56:49 -0800 |
From: | Gary Johnson <garyjohn AT spocom DOT com> |
To: | cygwin AT cygwin DOT com |
Subject: | Re: pdftk and apropos - general questions |
Message-ID: | <20090304175648.GA5388@KCJs-Computer> |
Mail-Followup-To: | cygwin AT cygwin DOT com |
References: | <BLU113-W74226535EC192149C5AEABEA60 AT phx DOT gbl> <49AE9494 DOT 1000804 AT veritech DOT com> <BLU113-W51FC38A48F454394262F2CBEA70 AT phx DOT gbl> |
Mime-Version: | 1.0 |
In-Reply-To: | <BLU113-W51FC38A48F454394262F2CBEA70@phx.gbl> |
User-Agent: | Mutt/1.4.2.2i |
X-IsSubscribed: | yes |
Mailing-List: | contact cygwin-help AT cygwin DOT com; run by ezmlm |
List-Id: | <cygwin.cygwin.com> |
List-Unsubscribe: | <mailto:cygwin-unsubscribe-archive-cygwin=delorie DOT com AT cygwin DOT com> |
List-Subscribe: | <mailto:cygwin-subscribe AT cygwin DOT com> |
List-Archive: | <http://sourceware.org/ml/cygwin/> |
List-Post: | <mailto:cygwin AT cygwin DOT com> |
List-Help: | <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs> |
Sender: | cygwin-owner AT cygwin DOT com |
Mail-Followup-To: | cygwin AT cygwin DOT com |
Delivered-To: | mailing list cygwin AT cygwin DOT com |
Note-from-DJ: | This may be spam |
On 2009-03-04, Mike Marchywka wrote: > > Mike Marchywka wrote: > >> I've had a persistent problem getting apropos to work > >> as it never finds anything appropriate. Is there > >> something I need to do to make this work? > >> > > After each setup session, you need to run, /usr/sbin/makewhatis -u. > > > Thanks but I did get that far after earlier hints and you list > below is about what I ended up with too. One problem > I ran into was trying to extract sensical text from the > IRS instructions. I have that problem with the printed versions. > I used the pdftotext utility IIRC from > > http://www.foolabs.com/xpdf/download.html > > and it didn't seem to be able to separate multi-column text > automatically ( with sed and awk I got what I needed but what > a mess). Did you use the -layout option to pdftotext? It makes a huge difference on the documents I've converted, but they've all been single column. Regards, Gary -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/
webmaster | delorie software privacy |
Copyright © 2019 by DJ Delorie | Updated Jul 2019 |