[ 
https://issues.apache.org/jira/browse/TIKA-3172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17180244#comment-17180244
 ] 

Tilman Hausherr commented on TIKA-3172:
---------------------------------------

Valid field names with the change:

ocrStrategy, ocrImageFormatName, extractBookmarksText, averageCharTolerance, 
extractAcroFormContent, extractInlineImages, ocrDPI, extractFontNames, 
extractMarkedContent, enableAutoSpace, ocrImageScale, ifXFAExtractOnlyXFA, 
extractInlineImageMetadataOnly, allowExtractionForAccessibility, 
catchIntermediateExceptions, spacingTolerance, ocrImageQuality, 
extractUniqueInlineImagesOnly, setKCMS, initializableProblemHander, 
ocrImageType, extractActions, detectAngles, maxMainMemoryBytes, sortByPosition

> PDF Parser configuration enable auto space using tika config file
> -----------------------------------------------------------------
>
>                 Key: TIKA-3172
>                 URL: https://issues.apache.org/jira/browse/TIKA-3172
>             Project: Tika
>          Issue Type: Wish
>          Components: parser
>    Affects Versions: 1.24.1
>            Reporter: Akash
>            Priority: Major
>
> Need information on how to set enableAutoSpace using tika config file.
> {code:java}
> /<properties>
>   <parsers>
>     <parser class="org.apache.tika.parser.DefaultParser">
>       <parser-exclude class="org.apache.tika.parser.pdf.PDFParser"/>
>     </parser>
>     <parser class="org.apache.tika.parser.pdf.PDFParser">
>       <params>
>         <param name="enableAutoSpace" type="bool">false</param>
>       </params>
>     </parser>
>   </parsers>
> </properties>/ 
> {code}
> Above configuration is not working.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to