Hi folks,

Nowadays, most web pages are rather complex, and I wonder how you handle them. I have tried to download and convert them simply with Plucker, but unfortunately the results did not always satisfy me.

Is there something like a "code cleanup" tool or anything similar? How do you clean up HTML code? I'd be very glad if someone could post the scripts they are using, because I can't believe that cleaning up the code is done manually! :-0

Here are two examples of the kind of web pages I mean:

http://www.galileocomputing.de/openbook/unix_guru/
http://www.galileocomputing.de/openbook/ubuntu/

I think it is pretty laborious to edit every HTML document by hand just to get the main content out of the middle of the page.
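
To make the goal concrete, here is a minimal sketch of the kind of script I am after, using only Python's standard html.parser module. It assumes the main content sits inside a single element with a known id. The id "content" is only a guess and is not taken from the pages above, and the names MainContentExtractor and extract_main are made up for illustration.

```python
from html.parser import HTMLParser

# Void elements never get a closing tag, so they must not change the depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class MainContentExtractor(HTMLParser):
    """Keep only the markup inside the first element with a given id."""

    def __init__(self, target_id):
        # convert_charrefs=False keeps entities such as &amp; unchanged.
        super().__init__(convert_charrefs=False)
        self.target_id = target_id
        self.depth = 0      # nesting depth inside the target element
        self.done = False   # True once the target element has been closed
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if self.done:
            return
        if self.depth == 0:
            # Still outside: start collecting when the target id shows up.
            if tag not in VOID_TAGS and dict(attrs).get("id") == self.target_id:
                self.depth = 1
            return
        self.parts.append(self.get_starttag_text())
        if tag not in VOID_TAGS:
            self.depth += 1

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags like <br/> are copied without touching the depth.
        if self.depth > 0:
            self.parts.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if self.depth == 0 or tag in VOID_TAGS:
            return
        self.depth -= 1
        if self.depth == 0:
            self.done = True    # closing tag of the target element itself
        else:
            self.parts.append(f"</{tag}>")

    def handle_data(self, data):
        if self.depth > 0:
            self.parts.append(data)

    def handle_entityref(self, name):
        if self.depth > 0:
            self.parts.append(f"&{name};")

    def handle_charref(self, name):
        if self.depth > 0:
            self.parts.append(f"&#{name};")


def extract_main(html, target_id="content"):
    """Return the inner HTML of the element whose id equals target_id."""
    parser = MainContentExtractor(target_id)
    parser.feed(html)
    parser.close()
    return "".join(parser.parts)
```

Calling extract_main(page_source, "content") would then return only the inner markup of that element, which could be saved to a file and handed to Plucker. This relies on reasonably well-nested HTML; pages with badly unbalanced tags would need a more forgiving parser.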

Marius
_______________________________________________
plucker-list mailing list
[email protected]
http://lists.rubberchicken.org/mailman/listinfo/plucker-list
