SV: Tesseract language

2018-10-22 Thread Martin Frank Hansen (MHQ)
Hi Erick, Thanks for the help! I will take a look at it. Martin Frank Hansen, Senior Data Analyst, Data, IM & Analytics, Lautrupparken 40-42, DK-2750 Ballerup E-mail m...@kmd.dk Web www.kmd.dk Mobile +4525571418 -Original message- From: Erick Erickson Sent: 21 October 2018

SV: Tesseract language

2018-10-22 Thread Martin Frank Hansen (MHQ)
Hi Gus, Thank you so much! I will definitely take a look at it during the day. Martin Frank Hansen, -Original message- From: Gus Heck Sent: 22 October 2018 00:06 To: solr-user@lucene.apache.org Subject: Re: Tesseract language Hi Martin, I wrote a framework

SV: Tesseract language

2018-10-21 Thread Martin Frank Hansen (MHQ)
Hi Alex, Thanks again for your reply, much appreciated. Martin Frank Hansen, Senior Data Analyst, Data, IM & Analytics, Lautrupparken 40-42, DK-2750 Ballerup E-mail m...@kmd.dk Web www.kmd.dk Mobile +4525571418 -Original message- From: Alexandre Rafalovitch Sent: 21 October

SV: Tesseract language

2018-10-21 Thread Martin Frank Hansen (MHQ)
Hi Alexandre, Thanks for your reply. Yes, right now it is just for testing the possibilities of Solr and Tesseract. I will take a look at the Tika documentation to see if I can make it work. You said that DIH is not recommended for production usage; what is the recommended method(s) to upload

SV: Tesseract language

2018-10-21 Thread Martin Frank Hansen (MHQ)
Hi again, Is there anyone who has experience using Tesseract’s OCR module within Solr? The files I am trying to read into Solr are Danish TIFF documents. Martin Frank Hansen, Senior Data Analyst, Data, IM & Analytics, Lautrupparken 40-42, DK-2750