> (E.G. Nutch define one url == one index document.) Why can't we create a document for every image that is found?
Then it is as if we will have a parse-image plugin just like we have a parse-html and parse-pdf plugin, with the only difference that it will be run after all the pages in the segment have been fetched? Rgrds, Thomas _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
