(E.G. Nutch define one url == one index document.)
Why can't we create a document for every image that is found? Then it is as if we will have a parse-image plugin just like we have a parse-html and parse-pdf plugin, with the only difference that it will be run after all the pages in the segment have been fetched? Rgrds, Thomas
