X-Recipient: archive-cygwin AT delorie DOT com X-SWARE-Spam-Status: No, hits=-2.1 required=5.0 tests=AWL,BAYES_00,SPF_PASS X-Spam-Check-By: sourceware.org Date: Wed, 4 Mar 2009 09:56:49 -0800 From: Gary Johnson To: cygwin AT cygwin DOT com Subject: Re: pdftk and apropos - general questions Message-ID: <20090304175648.GA5388@KCJs-Computer> Mail-Followup-To: cygwin AT cygwin DOT com References: <49AE9494 DOT 1000804 AT veritech DOT com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.4.2.2i X-IsSubscribed: yes Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm Precedence: bulk List-Id: List-Unsubscribe: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner AT cygwin DOT com Mail-Followup-To: cygwin AT cygwin DOT com Delivered-To: mailing list cygwin AT cygwin DOT com Note-from-DJ: This may be spam On 2009-03-04, Mike Marchywka wrote: > > Mike Marchywka wrote: > >> I've had a persistent problem getting apropos to work > >> as it never finds anything appropriate. Is there > >> something I need to do to make this work? > >> > > After each setup session, you need to run, /usr/sbin/makewhatis -u. > > > Thanks but I did get that far after earlier hints and you list > below is about what I ended up with too. One problem > I ran into was trying to extract sensical text from the > IRS instructions. I have that problem with the printed versions. > I used the pdftotext utility IIRC from > > http://www.foolabs.com/xpdf/download.html > > and it didn't seem to be able to separate multi-column text > automatically ( with sed and awk I got what I needed but what > a mess). Did you use the -layout option to pdftotext? It makes a huge difference on the documents I've converted, but they've all been single column. Regards, Gary -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/