Hi,
I just released JPluck 0.8.5. This release has a lot of additions and enhancements, the highlights of which are:

- Added JTidy parser. JTidy is specifically geared towards fixing invalid HTML, so it might do a better job than the default NekoHTML parser. You can select the HTML parser on a per-Document basis.
- Added Document-specific URL inclusion and exclusion patterns. (See below for more info.)
- Added option to change the default Document settings.
- Added customizable image conversion. You can set the maximum size of an image, use linear filtering for resizing (higher quality) and change the image brightness and contrast.
- Added option to set the default link color.
- Changes to the layout and organization of the GUI.

http://jpluck.sourceforge.net/download.html

Document-specific URL inclusion and exclusion is a particularly useful addition. One of my news sites is BBC News. I always skip the Business, Health and Education sections. By adding three URL exclusion patterns, ".*/business/.*", ".*/health/.*" and ".*/education/.*", I exclude these sections from the document. This results in fewer pages to download and a smaller PDB size. (True enough, you can also add exclusion patterns to the global Rule Filter file, but adding patterns to the Document itself is much clearer and more convenient.)

For those unfamiliar with it: JPluck is a Java-based toolkit for creating Plucker documents. It includes the JPluck X GUI for managing and downloading sites. JPluck is an alternative to the Python spider and Plucker Desktop. Its main advantages over the existing Plucker tools are that it downloads sites much faster and that it supports scheduling for HotSync to the SD card.

JPluck requires Java 1.4 and runs under Windows 98/ME/NT/2000/XP, Linux, Mac OS X, and Solaris. (For Mac OS X you need the JDK 1.4.1 Developer Preview.)

Let me know what you think.

Thanks
-Laurens

---
http://jpluck.sourceforge.net/
Open-source Plucker document creation toolkit.
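
P.S. For anyone unsure how exclusion patterns like the ones above behave, here is a minimal sketch using plain java.util.regex. It is not JPluck's actual code: the class name UrlFilterSketch, the isExcluded helper and the sample URLs are made up for illustration, and it assumes a pattern has to match the entire URL, which is why each pattern starts and ends with ".*".

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical illustration only; not part of the JPluck API.
public class UrlFilterSketch {

    // The three exclusion patterns from the BBC News example.
    private static final String[] EXCLUSIONS = {
        ".*/business/.*",
        ".*/health/.*",
        ".*/education/.*"
    };

    private static final List<Pattern> COMPILED = new ArrayList<Pattern>();

    static {
        for (String regex : EXCLUSIONS) {
            COMPILED.add(Pattern.compile(regex));
        }
    }

    // Assumption: a URL is excluded when any pattern matches the whole URL.
    public static boolean isExcluded(String url) {
        for (Pattern pattern : COMPILED) {
            if (pattern.matcher(url).matches()) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        // Illustrative URLs, not taken from a real crawl.
        String[] urls = {
            "http://news.bbc.co.uk/2/hi/business/default.stm",
            "http://news.bbc.co.uk/2/hi/technology/default.stm"
        };
        for (String url : urls) {
            System.out.println((isExcluded(url) ? "skip  " : "fetch ") + url);
        }
    }
}
```

The "/" on both sides of the section name keeps a pattern from matching a page that merely has "business" somewhere else in its address. Also note that each pattern begins with ".*"; a pattern that begins with a bare "*" is not a valid Java regular expression.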

