[
https://issues.apache.org/jira/browse/SOLR-10350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Waleed Raza updated SOLR-10350:
-------------------------------
Comment: was deleted
(was: Aby bata dega to kia mar jayega)
> By posting documents by post.jar i saw that it uses
> org.apache.tika.parser.txt.TXTParser" how can i change the parse that it also
> extract text from images which are inside pdf and also separate images like
> jpg
> -----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
>
> Key: SOLR-10350
> URL: https://issues.apache.org/jira/browse/SOLR-10350
> Project: Solr
> Issue Type: Bug
> Security Level: Public(Default Security Level. Issues are Public)
> Components: Schema and Analysis
> Affects Versions: 6.4.1
> Reporter: Waleed Raza
>
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]