Enabled plugins that implement IndexingFilter are run for each file to 
generate the fields to index. The enabled plugins are listed in the 
plugin.includes property in conf/nutch-default.xml or conf/nutch-site.xml.
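
As a rough sketch of how to check this, the snippet below reads the 
plugin.includes value from a Nutch-style configuration file and tests which 
plugin ids it would enable. The sample XML, its property value, and the 
candidate plugin ids are illustrative assumptions, not your installation's 
actual settings. The helper names are hypothetical, and treating the value as 
a regular expression that must match the whole plugin id is my understanding 
of how Nutch applies it.

```python
import re
import xml.etree.ElementTree as ET

# Illustrative config fragment (assumption): a real nutch-site.xml will differ.
SAMPLE_CONF = """<configuration>
  <property>
    <name>plugin.includes</name>
    <value>protocol-http|urlfilter-regex|parse-(text|html)|index-basic|query-(basic|site|url)</value>
  </property>
</configuration>
"""


def plugin_includes(xml_text):
    """Return the plugin.includes value from a Hadoop/Nutch-style config, or None."""
    root = ET.fromstring(xml_text)
    for prop in root.iter("property"):
        if prop.findtext("name", "").strip() == "plugin.includes":
            return prop.findtext("value", "").strip()
    return None


def matching_plugins(pattern, candidates):
    """Return the candidate plugin ids that the pattern matches in full."""
    regex = re.compile(pattern)
    return [c for c in candidates if regex.fullmatch(c)]


if __name__ == "__main__":
    pattern = plugin_includes(SAMPLE_CONF)
    # Hypothetical plugin ids to test against the pattern.
    ids = ["index-basic", "index-more", "parse-html", "parse-pdf"]
    print(matching_plugins(pattern, ids))
```

To check a real installation, read conf/nutch-site.xml first and fall back to 
conf/nutch-default.xml if the property is not set there, since site settings 
override the defaults.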

The index fields themselves are described at 
http://wiki.apache.org/nutch/IndexStructure.


Kai_testing Middleton wrote:
> Not sure ... this is kind of an off-the-cuff reply, but Luke might give you 
> that information (google for apache luke).
>
> ----- Original Message ----
> From: Daniel Clark <[EMAIL PROTECTED]>
> To: [EMAIL PROTECTED]
> Sent: Tuesday, July 17, 2007 3:22:26 PM
> Subject: IndexFilter
>
> Which indexFilter plugin does Nutch use out-of-the-box?  Or how do I find
> out?  I'm trying to figure out how the following fields are being indexed.
>
> anchor
> boost
> content
> digest
> host
> segment
> site
> title
> tstamp
> url

_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
