> #3
> ==========
> WebPagePortlet
> --------------
> Like FileServerPortlet, but will rewrite <A HREFs, so that they point back
> relative to the original website. <IMG SRCs will be rewritten, so that they
> will be requested from the local cache.
>
> An extension to the diskcache has to be made, to allow cached entries to be
> directly requested by using http get's.
>
> Reason: Integration of separately maintained (i.e. other host, other
> country, etc.) departmental pages into a company's intranet/portal.
>

I have a 'WebPagePortlet' that I use here.
It isn't completely functional, i.e. it can't handle every website out
there, but it does handle basic relative links for images, hrefs, etc.
It uses the OpenXML HTML parser.
I'm not sure about the licensing on OpenXML (http://www.openxml.org/), and I
don't want to introduce another jar file unless it's really needed.
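
To make the "basic relative links" case concrete, here is a minimal sketch of
the idea. It is not the code from my portlet: it uses a plain regex instead of
the OpenXML parser, only looks at double-quoted href/src attributes, and
resolves them against the origin page with java.net.URI. The class name
LinkRewriter and the example.com URLs are made up for illustration, and
redirecting IMG SRCs to the local cache is not covered.

```java
import java.net.URI;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, for illustration only.
final class LinkRewriter {

    // Matches href="..." or src="..." (case-insensitive, double quotes only).
    private static final Pattern LINK_ATTR =
        Pattern.compile("(?i)\\b(href|src)\\s*=\\s*\"([^\"]*)\"");

    // Rewrites every matched attribute value to an absolute URL,
    // resolved against the URL the page was fetched from.
    static String rewrite(String html, String baseUrl) {
        URI base = URI.create(baseUrl);
        Matcher m = LINK_ATTR.matcher(html);
        StringBuffer out = new StringBuffer();
        while (m.find()) {
            String value = m.group(2);
            String resolved;
            try {
                // Absolute and opaque URIs (http:, mailto:) come back unchanged.
                resolved = base.resolve(value.trim()).toString();
            } catch (IllegalArgumentException e) {
                // Leave values that are not valid URIs untouched.
                resolved = value;
            }
            String attr = m.group(1) + "=\"" + resolved + "\"";
            m.appendReplacement(out, Matcher.quoteReplacement(attr));
        }
        m.appendTail(out);
        return out.toString();
    }

    public static void main(String[] args) {
        String page = "<a href=\"docs/a.html\">doc</a> <IMG SRC=\"../img/logo.gif\">";
        System.out.println(rewrite(page, "http://example.com/dept/index.html"));
    }
}
```

Anything beyond this (unquoted attributes, links built in JavaScript, url()
references in CSS, BASE tags) falls through, which is why a real parser is
needed for the general case.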

The problem with this is that there are a lot of special cases you have to
code for, such as namespace collisions in JavaScript, CSS, etc.
It's not a trivial problem. I don't think we should put out a solution
unless it handles most of these cases.
Marcus Schwarz documented the problems in detail in an email on this list.
Search the list archive; there were discussions on this back in October or so:
http://www.mail-archive.com/[email protected]/


