[
https://issues.apache.org/jira/browse/NUTCH-296?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13102085#comment-13102085
]
Lewis John McGibbney commented on NUTCH-296:
--------------------------------------------
Having had a look at this, it is not appropriate for inclusion in current Nutch
implementations and would have suited a JSP based web application e.g.
Nutch-1.2.
I'm going to reclose the issue at this point in time, should we get another web
application up and running at least there has been some recent correspondence
and the code is available should anyone wish to pursue the issue further.
> Image Search
> ------------
>
> Key: NUTCH-296
> URL: https://issues.apache.org/jira/browse/NUTCH-296
> Project: Nutch
> Issue Type: New Feature
> Reporter: Thomas Delnoij
> Assignee: Lewis John McGibbney
> Priority: Minor
>
> Per the discussion in the Nutch-User mailing list, there is a wish for an
> "Image Search" add-on component that will index images.
> Must have:
> - retrieve outlinks to image files from fetched pages
> - generate thumbnails from images
> - thumbnails are stored in the segments as ImageWritable that contains the
> compressed binary data and some meta data
> Should have:
> - implemented as hadoop map reduce job
> - should be seperate from main Nutch codeline as it breaks general Nutch
> logic of one url == one index document.
> Could have:
> - store the original image in the segments
> Would like to have:
> - search interface for image index
> - parameterizable thumbnail generation (width, height, quality)
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira