X-Recipient: archive-cygwin AT delorie DOT com X-Spam-Check-By: sourceware.org Message-ID: <6910a60808290520n2c1747afg455abbd411f7690c@mail.gmail.com> Date: Fri, 29 Aug 2008 14:20:29 +0200 From: "Reini Urban" To: cygwin AT cygwin DOT com Subject: Re: html parser location: In-Reply-To: <48B7E49D.7050005@byu.net> MIME-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Content-Disposition: inline References: <20080829054916 DOT 5C79147808F AT ws1-5 DOT us4 DOT outblaze DOT com> <48B7E49D DOT 7050005 AT byu DOT net> X-Google-Sender-Auth: c2cfc265b472cf23 X-IsSubscribed: yes Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm Precedence: bulk List-Id: List-Unsubscribe: List-Subscribe: List-Archive: List-Post: List-Help: , Sender: cygwin-owner AT cygwin DOT com Mail-Followup-To: cygwin AT cygwin DOT com Delivered-To: mailing list cygwin AT cygwin DOT com > According to Tamirat Tesfaye on 8/28/2008 11:49 PM: >> Dear Eric, I am working on an application that parses the content of >> HTML remotely and populate it in database. I have used perl for this. >> However the cygwin was saying that it was unable to locate the HTML >> Content Parser files. I have tried to reinstall the cygwin from several >> mirrors to check its problem. However I was in vain. Would you please >> lend a hand on this matter. > > I don't know why you picked me to send your private mail to, but it was > the wrong choice. Redirecting to the list. And consider following the > directions at http://cygwin.com/problems.html next time. The cygwin list can also not help. You need a perl module and perl help from the locations recommended in the perl docus. $ perldoc perlfaq is a good start. cygwin perl does only come with the core modules plus some additional modules so that CPAN works out of the box. If it does not come with "HTML Content Parser files", you have to install them by your own via cpan. There are several modules for parsing HTML. Search them at http://search.cpan.org/ http://search.cpan.org/search?query=HTML+Content+Parser&mode=all HTML::Parser e.g. is often used. And as it turns out HTML-Parser-3.56, XML-Parser-2.36, XML-SAX-0.16 and XML-LibXML-1.66 are already included with cygwin perl. Other modules should be installed via cpan. -- Reini Urban http://phpwiki.org/ http://murbreak.at/ -- Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple Problem reports: http://cygwin.com/problems.html Documentation: http://cygwin.com/docs.html FAQ: http://cygwin.com/faq/