If you look into the BasicIndexingFilter.java plugin source you will
see that this is where those default fields get indexed. So, you can
either create a new plugin that is configurable for the properties you
want to index, or remove this plugin. Here is the snippet of code
that is in the filter:
if (host != null) {
// add host as un-stored, indexed and tokenized
doc.add(new Field("host", host, Field.Store.NO,
Field.Index.TOKENIZED));
// add site as un-stored, indexed and un-tokenized
doc.add(new Field("site", host, Field.Store.NO,
Field.Index.UN_TOKENIZED));
}
// url is both stored and indexed, so it's both searchable and returned
doc.add(new Field("url", url.toString(), Field.Store.YES,
Field.Index.TOKENIZED));
// content is indexed, so that it's searchable, but not stored in index
doc.add(new Field("content", parse.getText(), Field.Store.NO,
Field.Index.TOKENIZED));
// anchors are indexed, so they're searchable, but not stored in index
try {
String[] anchors = (inlinks != null ? inlinks.getAnchors()
: new String[0]);
for (int i = 0; i < anchors.length; i++) {
doc.add(new Field("anchor", anchors[i],
Field.Store.NO, Field.Index.TOKENIZED));
}
} catch (IOException ioe) {
if (LOG.isWarnEnabled()) {
LOG.warn("BasicIndexingFilter: can't get anchors for "
+ url.toString());
}
}
On 4/3/07, Ratnesh,V2Solutions India
<[EMAIL PROTECTED]> wrote:
exactly offcourse ,
I want this only, Do you have any solution for this??
looking forwards for your reply
Thnx
Siddharth Jonathan wrote:
>
> Do you mean how do you get rid of some of the fields that are indexed by
> default? eg. content, anchor text etc.
>
> Jonathan
> On 4/2/07, Ratnesh,V2Solutions India
> <[EMAIL PROTECTED]>
> wrote:
>>
>>
>> Hi,
>> I have written a plugin , which finds no. of Object tags in a html and
>> corresponding urls.
>> I am storing "objects" as fields and page url as values.
>>
>> And finally interested in seeing the search realted with "objects"
>> indexed
>> fields not those which is already stored as indexed fields.
>>
>> So how shall I delete those index fields which is already stored????
>>
>> Looking forward towards your reply(Valuable
>> inputs).........................
>>
>> Thnx to Nutch Community
>> --
>> View this message in context:
>>
http://www.nabble.com/How-to-delete-already-stored-indexed-fields----tf3504164.html#a9786377
>> Sent from the Nutch - User mailing list archive at Nabble.com.
>>
>>
>
>
--
View this message in context:
http://www.nabble.com/How-to-delete-already-stored-indexed-fields----tf3504164.html#a9803792
Sent from the Nutch - User mailing list archive at Nabble.com.
--
"Conscious decisions by concious minds are what make reality real"