Re: [tesseract-ocr] Train Tesseract 4.0 on Windows 8

2018-04-19 Thread ShreeDevi Kumar
tesstrain.sh is a bashshell  script. You don't need python for it.

try the following: (give the correct path)

bash ./tesstrain.sh



ShreeDevi

भजन - कीर्तन - आरती @ http://bhajans.ramparivar.com

On Thu, Apr 19, 2018 at 8:01 PM,  wrote:

> I have installed the lastest tesseract 4.0 binary from UB Mannheim, along
> with python, Git & Java on my Windows 8 64bit.
> I am trying to run the "tesstrain.sh" script, but an erro message appears,
> any help?
>
>
> 
>
> --
> You received this message because you are subscribed to the Google Groups
> "tesseract-ocr" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to tesseract-ocr+unsubscr...@googlegroups.com.
> To post to this group, send email to tesseract-ocr@googlegroups.com.
> Visit this group at https://groups.google.com/group/tesseract-ocr.
> To view this discussion on the web visit https://groups.google.com/d/
> msgid/tesseract-ocr/8a9b6f88-2770-423f-b566-54846e9e2586%
> 40googlegroups.com
> 
> .
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/CAG2NduUy52m8VnxfaCedhMqtLGsWhuJLHnKBN_Yf_qCVCxQeiQ%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


[tesseract-ocr] Train Tesseract 4.0 on Windows 8

2018-04-19 Thread crytoy
I have installed the lastest tesseract 4.0 binary from UB Mannheim, along 
with python, Git & Java on my Windows 8 64bit.
I am trying to run the "tesstrain.sh" script, but an erro message appears, 
any help?



-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/8a9b6f88-2770-423f-b566-54846e9e2586%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


[tesseract-ocr] Re: tesseract not working properly

2018-04-19 Thread Atul Singh
What are the settings that you are using.

On Wednesday, April 18, 2018 at 1:42:26 AM UTC+5:30, Vibhakar wrote:
>
> im trying to OCR this image 
>
>
> 
> and the output is :- 
> —mmanudthMNaGrDcI1f0ithWTOEs+qua*mxen
>
> what should i do ??
>
>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/58f3c17e-96cb-49bb-9078-6422322e63d4%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


[tesseract-ocr] Re: How can I know whichever file format types Tesseract will recognize and able to process them ?

2018-04-19 Thread abdu
Thanks.. appreciated so much

On Thursday, 19 April 2018 12:16:54 UTC+7, abdu wrote:
>
> How do we get information for the file types in that Tesseract would 
> capable of processing ?
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/bcbe648e-6f77-407c-b999-3f216fcec624%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


[tesseract-ocr] Re: Page Separator

2018-04-19 Thread CK
Thanks a lot.  I can see the file names in the hocr file!  

On Wednesday, April 18, 2018 at 2:51:31 PM UTC-5, Ewan Mellor wrote:
>
> If you use the hOCR output, it puts the image name in the  tag at 
> the top of the output.
>
> The plain text doesn't have any way to include the title in the output as 
> far as I can see; you'll need to modify Tesseract to do that.  Check 
> out TessHOcrRenderer::BeginDocumentHandler.  You'll want to make a 
> similar TessTextRenderer::BeginDocumentHandler.
>
> HTH,
>
> Ewan.
>
> On Wednesday, 18 April 2018 10:49:28 UTC-7, CK wrote:
>>
>> Hello,
>>
>> From the command line I call a list.txt for my input images.  I would 
>> like to be able know which image output I am observing in the output text.  
>> I wonder if it is possible to use the input file name instead of simple 
>> text for the page_separator?
>>
>> include_page_breaks=1 -c page_separator="[PAGE SEPARATOR]"
>>
>> Version 3.04
>>
>> Thanks
>>
>

-- 
You received this message because you are subscribed to the Google Groups 
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to tesseract-ocr+unsubscr...@googlegroups.com.
To post to this group, send email to tesseract-ocr@googlegroups.com.
Visit this group at https://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/tesseract-ocr/582bd31c-63a0-4a4c-a5b6-69609a932ff4%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.