[Solr Wiki] Update of "ExtractingRequestHandler" by Pet erWolanin

Apache Wiki Tue, 27 Oct 2009 16:37:36 -0700

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Solr Wiki" for change 
notification.


The "ExtractingRequestHandler" page has been changed by PeterWolanin.
http://wiki.apache.org/solr/ExtractingRequestHandler?action=diff&rev1=49&rev2=50

--------------------------------------------------

   * xpath=<XPath expression> - When extracting, only return Tika XHTML content 
that satisfies the XPath expression.  See 
http://lucene.apache.org/tika/documentation.html for details on the format of 
Tika XHTML.  See also TikaExtractOnlyExampleOutput.
   * lowernames=true|false - Map all field names to lowercase with underscores. 
 For example, Content-Type would be mapped to content_type.
  
+ If extractOnly is true, additional input parameters:
+ 
+  * extractFormat=xml|text - Default is xml.  Controls the serialization 
format of the extract content.  xml format is actually XHTML, like passing the 
-x command to the tika command line application, while text is like the -t 
command.  See [[https://issues.apache.org/jira/browse/SOLR-1274|SOLR-1274]].
+ 
  
  == Order of field operations ==
   1. fields are generated by Tika or passed in as literals via 
{{{literal.fieldname=value}}}

[Solr Wiki] Update of "ExtractingRequestHandler" by Pet erWolanin

Reply via email to