Mail Archives: cygwin/2009/03/04/10:36:05

Message-ID: <BLU113-W51FC38A48F454394262F2CBEA70@phx.gbl>
From: Mike Marchywka <marchywka AT hotmail DOT com>
To: <cygwin AT cygwin DOT com>
Subject: RE: pdftk and apropos - general questions
Date: Wed, 4 Mar 2009 10:35:51 -0500
In-Reply-To: <49AE9494.1000804@veritech.com>
References: <BLU113-W74226535EC192149C5AEABEA60 AT phx DOT gbl> <49AE9494 DOT 1000804 AT veritech DOT com>



----------------------------------------
> Date: Wed, 4 Mar 2009 09:47:48 -0500
> From:
> To: cygwin AT cygwin DOT com
> Subject: Re: pdftk and apropos - general questions
>
> Mike Marchywka wrote:
>> I've had a persistent problem getting apropos to work
>> as it never finds anything appropriate. Is there
>> something I need to do to make this work?
>>
> After each setup session, you need to run, /usr/sbin/makewhatis -u.


Thanks, but I did get that far after earlier hints, and your list
below is about what I ended up with too. One problem
I ran into was trying to extract sensible text from the
IRS instructions. I used the pdftotext utility, IIRC
from

http://www.foolabs.com/xpdf/download.html

and it didn't seem to be able to separate multi-column text
automatically (with sed and awk I got what I needed, but what
a mess). Is there a toolkit, as source or a compiled program,
that I could use to diagnose or fix this? I'd also like to be able
to fill out forms programmatically. I would love to print
out a filled-in 1040 form, but I'm not going to buy software to
do this or type it into a GUI.
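
For illustration only, here is a minimal Python sketch of the kind of
column separation described above. It is not the sed/awk that was actually
used. It assumes the text was extracted with the page layout preserved
(for example with pdftotext's -layout option), that the page has exactly two
columns, and that a run of at least two character positions is blank on
every line between them. The names find_gutter and split_columns are
hypothetical.

```python
def find_gutter(lines, min_width=2):
    """Return (start, end) of the widest run of character columns that is
    blank on every line, ignoring the left margin, or None if none exists."""
    width = max((len(line) for line in lines), default=0)
    padded = [line.ljust(width) for line in lines]
    blank = [all(p[i] == " " for p in padded) for i in range(width)]
    best = None
    i = 0
    while i < width:
        if not blank[i]:
            i += 1
            continue
        j = i
        while j < width and blank[j]:
            j += 1
        # Skip runs touching the page edges and runs too narrow to be a gutter.
        if i > 0 and j < width and j - i >= min_width:
            if best is None or (j - i) > (best[1] - best[0]):
                best = (i, j)
        i = j
    return best


def split_columns(text, min_width=2):
    """Reorder two-column, layout-preserved text so that the left column
    comes first and the right column follows it."""
    lines = [line.rstrip() for line in text.splitlines()]
    gutter = find_gutter([line for line in lines if line], min_width)
    if gutter is None:
        return "\n".join(lines)  # no gutter found: treat as single-column
    start, end = gutter
    left = [line[:start].rstrip() for line in lines]
    right = [line[end:].rstrip() for line in lines]
    return "\n".join([s for s in left if s] + [s for s in right if s])


# Hypothetical sample: two short columns separated by a blank gutter.
sample = "\n".join([
    "Line one of".ljust(16) + "Second column",
    "the left col".ljust(16) + "starts here",
    "ends.".ljust(16) + "and ends.",
])
print(split_columns(sample))
```

A sketch like this breaks down on pages where a heading or table spans both
columns, since no character position is then blank on every line; such pages
would need to be split into bands first.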

I'm going on a bit of a crusade about proprietary
or limited-support formats for public documents.
You'd be amazed how many public filings that should
contain information are in a format like a scanned PDF, from
which little usable information can be extracted. The FCC
even seems to accept locked PDF submissions...


[At this point, people concerned about top-posting should
be exploding over ghosting, or posting about text which is gone. LOL]


>
>


