ExtractRH: How to strip metadata

2012-05-02 Thread Joseph Hagerty
Greetings Solr folk, How can I instruct the extract request handler to ignore metadata/headers etc. when it constructs the content of the document I send to it? For example, I created an MS Word document containing just the word SEARCHWORD and nothing else. However, when I ship this doc to my

Re: ExtractRH: How to strip metadata

2012-05-02 Thread Jack Krupansky
: ExtractRH: How to strip metadata Greetings Solr folk, How can I instruct the extract request handler to ignore metadata/headers etc. when it constructs the content of the document I send to it? For example, I created an MS Word document containing just the word SEARCHWORD and nothing else. However, when

Re: ExtractRH: How to strip metadata

2012-05-02 Thread Joseph Hagerty
. -- Jack Krupansky
-----Original Message-----
From: Joseph Hagerty
Sent: Wednesday, May 02, 2012 9:56 AM
To: solr-user@lucene.apache.org
Subject: ExtractRH: How to strip metadata
Greetings Solr folk, How can I instruct the extract request handler to ignore metadata/headers etc. when

Re: ExtractRH: How to strip metadata

2012-05-02 Thread Jack Krupansky
: Wednesday, May 02, 2012 11:10 AM
To: solr-user@lucene.apache.org
Subject: Re: ExtractRH: How to strip metadata
I do not. I commented out all of the copyFields provided in the default schema.xml that ships with 3.5. My schema is rather minimal. Here is my fields block, if this helps: <fields> <field

Re: ExtractRH: How to strip metadata

2012-05-02 Thread Joseph Hagerty
11:10 AM
To: solr-user@lucene.apache.org
Subject: Re: ExtractRH: How to strip metadata
I do not. I commented out all of the copyFields provided in the default schema.xml that ships with 3.5. My schema is rather minimal. Here is my fields block, if this helps: <fields> <field name=cust