enabled plugins that implement IndexingFilter are run for each file to
generate the fields to index. enabled plugins can be found in
conf/nutch-default.xml or conf/nutch-site.xml.
You can look at http://wiki.apache.org/nutch/IndexStructure.
Kai_testing Middleton wrote:
Not sure ... this is kind of an off-the-cuff reply, but Luke might give you
that information (google for apache luke).
----- Original Message ----
From: Daniel Clark <[EMAIL PROTECTED]>
To: [email protected]
Sent: Tuesday, July 17, 2007 3:22:26 PM
Subject: IndexFilter
Which indexFilter plugin does Nutch use out-of-the-box? Or how do I find
out? I'm trying to figure out how the following fields are being indexed.
anchor
boost
content
digest
host
segment
site
title
tstamp
url
____________________________________________________________________________________
Moody friends. Drama queens. Your life? Nope! - their life, your story. Play
Sims Stories at Yahoo! Games.
http://sims.yahoo.com/