On Fri, Sep 03, 2004 at 01:54:45PM -0500, Steven C. Perkins wrote:
> Hello:
> 
> I am a librarian who is interested in resource discovery and retrieval.  I 
> would like to know if NUTCH can be made to search within parts/sections of 
> a document.

Yes, it can be made such. You (or someone) need to write
new or modify existing index and query plugins for it.

I am re-directing your message to [EMAIL PROTECTED]

John

> 
> Lets say I want to search within the <head></head> section of a document 
> for all DC.title tags or any other schema.  I want to combine that with 
> searching only on pages from a specific domain and which have a specific 
> word or words in the <title></title> section.  I can do some of this in 
> some search engines and some in others, but not all in the same search 
> engine.  I have been meaning to set up HARVEST and try it.  Can NUTCH do 
> this easily?
> 
> I am interested in this since I want to propose some metatag schemas that 
> would make it easier to find academic or professional papers on the web or 
> allow for the creation of metadata repositories for discipline specific 
> searches.
> 
> Thanks for your time.
> 
> Regards,
> 
> Steven C. Perkins, JD MLL
> Coordinator of Reference Services
> U of Houston Libraries
> [EMAIL PROTECTED]
> [EMAIL PROTECTED]
> 
> 
> 
> 
> -------------------------------------------------------
> This SF.Net email is sponsored by BEA Weblogic Workshop
> FREE Java Enterprise J2EE developer tools!
> Get your free copy of BEA WebLogic Workshop 8.1 today.
> http://ads.osdn.com/?ad_id=5047&alloc_id=10808&op=click
> _______________________________________________
> Nutch-general mailing list
> [EMAIL PROTECTED]
> https://lists.sourceforge.net/lists/listinfo/nutch-general
> 
__________________________________________
http://www.neasys.com - A Good Place to Be
Come to visit us today!


-------------------------------------------------------
This SF.Net email is sponsored by BEA Weblogic Workshop
FREE Java Enterprise J2EE developer tools!
Get your free copy of BEA WebLogic Workshop 8.1 today.
http://ads.osdn.com/?ad_id=5047&alloc_id=10808&op=click
_______________________________________________
Nutch-developers mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to