Greetings Solr folk,
How can I instruct the extract request handler to ignore metadata/headers
etc. when it constructs the content of the document I send to it?
For example, I created an MS Word document containing just the word
SEARCHWORD and nothing else. However, when I ship this doc to my
.
-- Jack Krupansky
-----Original Message-----
From: Joseph Hagerty
Sent: Wednesday, May 02, 2012 9:56 AM
To: solr-user@lucene.apache.org
Subject: ExtractRH: How to strip metadata
Greetings Solr folk,
How can I instruct the extract request handler to ignore metadata/headers
etc. when
Sent: Wednesday, May 02, 2012 11:10 AM
To: solr-user@lucene.apache.org
Subject: Re: ExtractRH: How to strip metadata
I do not. I commented out all of the copyFields provided in the default
schema.xml that ships with 3.5. My schema is rather minimal. Here is my
fields block, if this helps:
fields
field name=cust