-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Hello.
Im working on a school task, wich is to modify nutch to be able to identify, and download jpegs , creaty a thumbnail , and index the url of this jpegs with the other crawl result so that the web interface can show images as well. At the start i found that ParserNotFound.java can do the trick for me. I modified the constructor so that it matches the url-s end to a pattern, and if it ends to jpeg it creates a file with the name of the md5sum of the url and writes the url in it to a directory found in my filesystem. Well.. this is ugly, i wanted to add the working directory to the parsernotfound.java , but i couldnt. And to move forward with my work, i need to know how to make my own jpeg parser as first task. After that i would like to index my result somehow :) So.. my question.. how can i add my jpeg parser? Or, how can i add a new parser to the nutch system? Thanks for your awnsers. -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.10 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/ iQEcBAEBAgAGBQJLwIObAAoJEIJu8h6i9aAHb6AH/jegl+oqvUg8nJCJo1p/IuVx KuWthxGn0S+qDMfXrYb+AIRpmuj2YAWQwEE9Lhw2ftSJwFqH4gf4VwmDJq8CDTto BDX+/lOOI7ZVtKzNmDgaN2nwX0gwn0PJgKTV8BGkUbVy3McfisQ/9v9UBzhjj7f7 DTvsZN2yNyv9PUls9GSqXw9czFsuKB7PLGnssqB6a8DTgFeoLT2F8e0B9q2Tht92 eAZV2awEnnH/wNTIjfwO00YXNdvNcGANiFzz0v4CoMekSEigoRBSemtYhsYCOppo S0OUy8SCT4A2B6sWADIQjMKgnWuLm53dkHl9D91p0zMpnCTcq5u3hjLnxgq69L8= =M7VY -----END PGP SIGNATURE-----