X-Recipient: archive-cygwin@delorie.com
X-SWARE-Spam-Status: No, hits=-2.0 required=5.0 	tests=BAYES_00,SARE_MSGID_LONG40,SPF_PASS
X-Spam-Check-By: sourceware.org
MIME-Version: 1.0
In-Reply-To: <1f3387ff0902050655pfa755c3w443e0f642ab3059c@mail.gmail.com>
References: <1f3387ff0902050655pfa755c3w443e0f642ab3059c@mail.gmail.com>
Date: Thu, 5 Feb 2009 16:46:27 +0100
Message-ID: <6910a60902050746g753e7a1dn45b9da83ce7d005e@mail.gmail.com>
Subject: Re: How to get HTML page having embedded javascript
From: Reini Urban <rurban@x-ray.at>
To: cygwin@cygwin.com
Content-Type: text/plain; charset=ISO-8859-1
Content-Transfer-Encoding: quoted-printable
X-IsSubscribed: yes
Mailing-List: contact cygwin-help@cygwin.com; run by ezmlm
Precedence: bulk
List-Id: <cygwin.cygwin.com>
List-Unsubscribe: <mailto:cygwin-unsubscribe-archive-cygwin=delorie.com@cygwin.com>
List-Subscribe: <mailto:cygwin-subscribe@cygwin.com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin@cygwin.com>
List-Help: <mailto:cygwin-help@cygwin.com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner@cygwin.com
Mail-Followup-To: cygwin@cygwin.com
Delivered-To: mailing list cygwin@cygwin.com

2009/2/5 TAJTHY Tam=E1s:
> I have a problem. I'd like to get HTML pages, but not their plain
> sources. If it has an embedded javascript and it generates HTML code I
> need the resutling HTML code. Now I just run a perl script which
> launches a firefox and I copy the resulting page to the clipboard. But
> this is not too nice solution as I can not detect when firefox
> finished downloading and processing the page.
>
> Is there a library which can do this? Can anyone give some help, how
> can solve this?

The libraries are called gecko and webkit.

You need to evaluate the js which leads to the final html, as
interpreted client-side
in the browser.
This is done by dom manipulation from the original html, but by hooking
into the layout renderer you should be able to get at some sort of final la=
yout.

It looks like a nice project for the next two years or so. Maybe GSOC
sponsors it,
because Google already has such emulators.

This has nothing to do with cygwin, ask this at some web list.
--=20
Reini Urban
http://phpwiki.org/              http://murbreak.at/

--
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple
Problem reports:       http://cygwin.com/problems.html
Documentation:         http://cygwin.com/docs.html
FAQ:                   http://cygwin.com/faq/

