Hi David,
The latest Nutch release candidate (1.1,
http://svn.apache.org/repos/asf/lucene/nutch/tags/1.1) includes the tika-parser
plugin, which provides a JpegParser (see here: http://bit.ly/b0zRX8) that
will hopefully suit your needs.
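For the thumbnail step itself, here is a rough sketch using only the JDK
(java.awt and BufferedImage). The class and method names are made up for
illustration and are not part of Nutch or Tika, and the scaling policy (fit
within a square, never upscale) is just an assumption about what you want:

```java
import java.awt.Graphics2D;
import java.awt.RenderingHints;
import java.awt.image.BufferedImage;

// Hypothetical helper, not a Nutch or Tika class.
public class ThumbnailSketch {

    /**
     * Returns a copy of src scaled to fit inside a maxSide x maxSide box,
     * preserving the aspect ratio. Images already smaller than the box
     * are copied at their original size.
     */
    public static BufferedImage thumbnail(BufferedImage src, int maxSide) {
        int w = src.getWidth();
        int h = src.getHeight();

        // Scale factor for the longer side; capped at 1.0 so we never upscale.
        double scale = Math.min(1.0, (double) maxSide / Math.max(w, h));
        int tw = Math.max(1, (int) Math.round(w * scale));
        int th = Math.max(1, (int) Math.round(h * scale));

        // TYPE_INT_RGB has no alpha channel, which suits JPEG output.
        BufferedImage out = new BufferedImage(tw, th, BufferedImage.TYPE_INT_RGB);
        Graphics2D g = out.createGraphics();
        try {
            g.setRenderingHint(RenderingHints.KEY_INTERPOLATION,
                               RenderingHints.VALUE_INTERPOLATION_BILINEAR);
            g.drawImage(src, 0, 0, tw, th, null);
        } finally {
            g.dispose();
        }
        return out;
    }
}
```

You would load the fetched bytes with javax.imageio.ImageIO.read(...) and save
the result with ImageIO.write(thumb, "jpg", file). Where to hook this into the
crawl is a separate question.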
Let me know what you think.
Cheers,
Chris
On 4/10/10 6:
Hello.
I'm working on a school task, which is to modify Nutch so that it can
identify and download JPEGs, create a thumbnail of each, and index the URLs
of these JPEGs along with the other crawl results, so that the web interface
can show images as well.
At the start