delorie.com/archives/browse.cgi   search  
Mail Archives: cygwin/2009/02/05/10:47:23

X-Recipient: archive-cygwin AT delorie DOT com
X-SWARE-Spam-Status: No, hits=-2.0 required=5.0 tests=BAYES_00,SARE_MSGID_LONG40,SPF_PASS
X-Spam-Check-By: sourceware.org
MIME-Version: 1.0
In-Reply-To: <1f3387ff0902050655pfa755c3w443e0f642ab3059c@mail.gmail.com>
References: <1f3387ff0902050655pfa755c3w443e0f642ab3059c AT mail DOT gmail DOT com>
Date: Thu, 5 Feb 2009 16:46:27 +0100
Message-ID: <6910a60902050746g753e7a1dn45b9da83ce7d005e@mail.gmail.com>
Subject: Re: How to get HTML page having embedded javascript
From: Reini Urban <rurban AT x-ray DOT at>
To: cygwin AT cygwin DOT com
X-IsSubscribed: yes
Mailing-List: contact cygwin-help AT cygwin DOT com; run by ezmlm
List-Id: <cygwin.cygwin.com>
List-Unsubscribe: <mailto:cygwin-unsubscribe-archive-cygwin=delorie DOT com AT cygwin DOT com>
List-Subscribe: <mailto:cygwin-subscribe AT cygwin DOT com>
List-Archive: <http://sourceware.org/ml/cygwin/>
List-Post: <mailto:cygwin AT cygwin DOT com>
List-Help: <mailto:cygwin-help AT cygwin DOT com>, <http://sourceware.org/ml/#faqs>
Sender: cygwin-owner AT cygwin DOT com
Mail-Followup-To: cygwin AT cygwin DOT com
Delivered-To: mailing list cygwin AT cygwin DOT com

2009/2/5 TAJTHY Tam=E1s:
> I have a problem. I'd like to get HTML pages, but not their plain
> sources. If it has an embedded javascript and it generates HTML code I
> need the resutling HTML code. Now I just run a perl script which
> launches a firefox and I copy the resulting page to the clipboard. But
> this is not too nice solution as I can not detect when firefox
> finished downloading and processing the page.
>
> Is there a library which can do this? Can anyone give some help, how
> can solve this?

The libraries are called gecko and webkit.

You need to evaluate the js which leads to the final html, as
interpreted client-side
in the browser.
This is done by dom manipulation from the original html, but by hooking
into the layout renderer you should be able to get at some sort of final la=
yout.

It looks like a nice project for the next two years or so. Maybe GSOC
sponsors it,
because Google already has such emulators.

This has nothing to do with cygwin, ask this at some web list.
--=20
Reini Urban
http://phpwiki.org/              http://murbreak.at/

--
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple
Problem reports:       http://cygwin.com/problems.html
Documentation:         http://cygwin.com/docs.html
FAQ:                   http://cygwin.com/faq/

- Raw text -


  webmaster     delorie software   privacy  
  Copyright © 2019   by DJ Delorie     Updated Jul 2019