I'm sorry I partially missed the point. I don't know how the internals of the indexing work, maybe someone else here can give an explanation?

[EMAIL PROTECTED] wrote:

But if I do that, then the other fields wouldn't get indexed, would they? What if I wanted to 
search for the keywords "sapphire" (which might only appear in the general description 
for the merchant), and "beenie baby" (which might appear in the content on one of thier 
pages). Wouldn't both need to be indexed?

What you're suggesting would only allow me to have a search engine for my 
pages, correct? and I think what I'm asking is can I use Nutch as a search 
engine for my pages in addition to some additional meta data about the content 
-- and if so, how? Would I have to tag each of the pages with the meta data? 
because that seems like a lot of redundancy... i.e. if I index 50 pages for 
merchant www.xyzcollectibles.com they're all going to have the same name, 
general description, etc... in thier metadata.

Add an unique identifier to the document and use a separate external database.

Reply via email to