delorie.com/archives/browse.cgi   search  
Mail Archives: cygwin/2008/08/29/08:21:33

X-Recipient: archive-cygwin AT delorie DOT com
X-Spam-Check-By: sourceware.org
Message-ID: <6910a60808290520n2c1747afg455abbd411f7690c@mail.gmail.com>
Date: Fri, 29 Aug 2008 14:20:29 +0200
From: "Reini Urban" <rurban AT x-ray DOT at>
To: cygwin AT cygwin DOT com
Subject: Re: html parser location:
In-Reply-To: <48B7E49D.7050005@byu.net>
MIME-Version: 1.0
References: <20080829054916 DOT 5C79147808F AT ws1-5 DOT us4 DOT outblaze DOT com> <48B7E49D DOT 7050005 AT byu DOT net>
X-Google-Sender-Auth: c2cfc265b472cf23
X-IsSubscribed: yes
Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm
List-Id: <cygwin.cygwin.com>
List-Unsubscribe: <mailto:cygwin-unsubscribe-archive-cygwin=delorie DOT com AT cygwin DOT com>
List-Subscribe: <mailto:cygwin-subscribe AT cygwin DOT com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin AT cygwin DOT com>
List-Help: <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner AT cygwin DOT com
Mail-Followup-To: cygwin AT cygwin DOT com
Delivered-To: mailing list cygwin AT cygwin DOT com

> According to Tamirat Tesfaye on 8/28/2008 11:49 PM:
>> Dear Eric, I am working on an application that parses the content of
>> HTML remotely and populate it in database. I have used  perl for this.
>> However the cygwin was saying that it was unable to locate the HTML
>> Content Parser  files. I have tried to reinstall the cygwin from several
>> mirrors to check its problem. However I was in vain. Would you please
>> lend a hand on this matter.
>
> I don't know why you picked me to send your private mail to, but it was
> the wrong choice.  Redirecting to the list.  And consider following the
> directions at http://cygwin.com/problems.html next time.

The cygwin list can also not help.
You need a perl module and perl help from the locations recommended in
the perl docus.

$ perldoc perlfaq
is a good start.

cygwin perl does only come with the core modules plus some additional modules
so that CPAN works out of the box.
If it does not come with "HTML Content Parser files", you have to install
them by your own via cpan.

There are several modules for parsing HTML.
Search them at http://search.cpan.org/
http://search.cpan.org/search?query=HTML+Content+Parser&mode=all
HTML::Parser  e.g. is often used.

And as it turns out HTML-Parser-3.56, XML-Parser-2.36, XML-SAX-0.16
and XML-LibXML-1.66 are already included with cygwin perl.
Other modules should be installed via cpan.
-- 
Reini Urban
http://phpwiki.org/ http://murbreak.at/

--
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple
Problem reports:       http://cygwin.com/problems.html
Documentation:         http://cygwin.com/docs.html
FAQ:                   http://cygwin.com/faq/

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019