Hello,

It looks like I will have some spare time next month, so I may work on writing this image indexing plugin. Is there a similar plugin whose code I could reuse or use as a model?
Thanks.
Alex.

-----Original Message-----
From: Andrzej Bialecki <[email protected]>
To: user <[email protected]>
Sent: Wed, Mar 9, 2011 12:24 am
Subject: Re: will nutch-2 be able to index image files

On 3/8/11 10:50 PM, [email protected] wrote:
> I meant to extract image title, src link and alt from <img> tags and not store image files. For a keyword search it must display the link, which automatically displays the image itself in the search page.
> Not sure what you mean by image content-based retrieval? Do image files have tags like mp3 ones?

Yes, for example http://en.wikipedia.org/wiki/Exchangeable_image_file_format

> Must a parse plugin be written in both cases?

Yes - most data is already available either in the DOM tree, or can be obtained from a Tika image parser, it just needs to be wrapped in a plugin.

--
Best regards,
Andrzej Bialecki
Information Retrieval, Semantic Web
Embedded Unix, System Integration
http://www.sigram.com
Contact: info at sigram dot com
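
For reference, the "data is already in the DOM tree" part could look roughly like the sketch below. It uses only the JDK's org.w3c.dom API and is not tied to any Nutch extension point; the class name ImgTagExtractor, the ImageInfo holder and the parseXhtml helper are hypothetical names made up for this illustration. It assumes attribute names arrive in lower case, and it matches element names case-insensitively because HTML parsers differ in how they normalize them. A real plugin would wrap something like extract() and add the values to the indexed document.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

/** Hypothetical helper: collects src, alt and title from img elements in a DOM tree. */
public class ImgTagExtractor {

    /** Hypothetical value holder for one image reference. */
    public static final class ImageInfo {
        public final String src;
        public final String alt;
        public final String title;

        ImageInfo(String src, String alt, String title) {
            this.src = src;
            this.alt = alt;
            this.title = title;
        }

        @Override
        public String toString() {
            return "src=" + src + ", alt=" + alt + ", title=" + title;
        }
    }

    /** Walks every element and keeps those named "img" (any case) that have a src. */
    public static List<ImageInfo> extract(Document doc) {
        List<ImageInfo> images = new ArrayList<>();
        NodeList all = doc.getElementsByTagName("*"); // "*" matches all elements
        for (int i = 0; i < all.getLength(); i++) {
            Element el = (Element) all.item(i);
            if (!"img".equalsIgnoreCase(el.getNodeName())) {
                continue;
            }
            String src = el.getAttribute("src"); // empty string when the attribute is absent
            if (src.isEmpty()) {
                continue; // nothing to link to, so skip it
            }
            images.add(new ImageInfo(src, el.getAttribute("alt"), el.getAttribute("title")));
        }
        return images;
    }

    /** Demo-only parser for well-formed XHTML; a crawler would already have a DOM. */
    public static Document parseXhtml(String xhtml) {
        try {
            return DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xhtml)));
        } catch (Exception e) {
            throw new IllegalArgumentException("Could not parse input as XML", e);
        }
    }

    public static void main(String[] args) {
        Document doc = parseXhtml(
                "<html><body><img src=\"cat.png\" alt=\"A cat\" title=\"Cat\"/></body></html>");
        for (ImageInfo info : extract(doc)) {
            System.out.println(info);
        }
    }
}
```

Only the link and the text attributes are kept, not the image bytes, which matches the keyword-search use case described in the quoted message. Embedded metadata such as EXIF would have to come from an image parser instead and is not covered here.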

