Re: Can ExtractingRequestHandler ignore documents metadata

2011-05-11 Thread Grant Ingersoll
You can map the attributes to the ignore field.  Alternatively, override the 
SolrContentHandler's newMethod() method to skip adding them.  Come to think of 
it, I'll put up a quick patch that breaks that out a bit more and makes it 
easier to override.  Longer term, a patch to exclude metadata would be good.


On May 9, 2011, at 9:47 AM, Tod wrote:

> I'm indexing content from a CMS' database of metadata.  The client would 
> prefer that Solr exclude the properties (metadata) of any documents being 
> indexed.  Is there a way to tell Tika to only index a document's text and not 
> its properties?
> 
> Thanks - Tod

--
Grant Ingersoll
http://www.lucenerevolution.org



Can ExtractingRequestHandler ignore documents metadata

2011-05-09 Thread Tod
I'm indexing content from a CMS' database of metadata.  The client would 
prefer that Solr exclude the properties (metadata) of any documents 
being indexed.  Is there a way to tell Tika to only index a document's 
text and not its properties?


Thanks - Tod