Hi,

I just released JPluck 0.8.5. This release has a lot of additions and
enhancements, the highlights of which are:

- Added JTidy parser. JTidy is specifically geared towards fixing invalid
HTML, so it might do a better job than the default NekoHTML parser. You can
select the HTML parser on a per-Document basis.
- Added Document-specific URL inclusion and exclusion patterns.  (See below
for more info.)
- Added option to change the default Document settings.
- Added customizable image conversion. You can set the maximum size of an
image, use linear filtering for resizing (higher quality) and change the
image brightness and contrast.
- Added option to set the default link color.
- Changes to the layout and organization of the GUI.

http://jpluck.sourceforge.net/download.html

Document-specific URL inclusion and exclusion is a particularly useful
addition. One of my news sites is BBC News. I always skip the Business,
Health and Education sections. By adding three URL exclusion patterns
".*/business/.*", ".*/health/.*" and "*./education/.*" I exclude these
sections from the document. This results in less pages to download and a
smaller PDB size. (True enough, you can also add exclusion patterns to the
global Rule Filter file, but adding patterns to the Document itself is much
clearer and more convenient.)

For those unfamiliar with it: JPluck is Java-based toolkit for creating
Plucker documents. It includes the JPluck X GUI for managing and downloading
sites. JPluck is an alternative to the Python spider and Plucker Desktop.
Its main advantages over the existing Plucker tools are that it downloads
sites much faster and that it supports scheduling for HotSync to the SD
card.

JPluck requires Java 1.4 and runs under Windows 98/ME/NT/2000/XP, Linux, Mac
OS X, and Solaris. (For Mac OS X you need the JDK 1.4.1 Developer Preview.)


Let me know what you think.


Thanks
-Laurens
---
http://jpluck.sourceforge.net/
Open-source Plucker document creation toolkit.

_______________________________________________
plucker-list mailing list
[EMAIL PROTECTED]
http://lists.rubberchicken.org/mailman/listinfo/plucker-list

Reply via email to