when i check my end of line marker with emacs it says that it is Ctrl-J as
follows:
position: 80 of 82 (96%), column: 6
character: C-j (displayed as C-j) (codepoint 10, #o12, #xa)
preferred charset: ascii (ASCII (ISO646 IRV))
code point in charset: 0x0A
syntax: which means: whitespace
to input: type "C-x 8 RET HEX-CODEPOINT" or "C-x 8 RET NAME"
buffer code: #x0A
file code: #x0A (encoded by coding system undecided-dos)
display: by this font (glyph code)
uniscribe:-outline-Courier
New-normal-normal-normal-mono-13-*-*-*-c-*-iso8859-1 (#x03)
Character code properties: customize what to show
name: <control>
old-name: LINE FEED (LF)
general-category: Cc (Other, Control)
decomposition: (10) ('
')
On Monday, June 2, 2014 11:44:55 PM UTC-7, zdenop wrote:
> Can you check end-of-line marker[1] of prov.txt?
>
> [1]
> https://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3#Requirements_for_text_input_files
>
> Zdenko
>
>
> On Tue, Jun 3, 2014 at 7:25 AM, Glen Rubin <[email protected]
> <javascript:>> wrote:
>
>> Actually, I am mistaken...it is still not working.
>>
>> My command is:
>>
>> tesseract image23.png image23g prov.txt
>>
>> result is:
>>
>> Could not open file, C:\Program
>> Files\Tesseract-OCR\tessdata/eng.user-words.txt
>>
>>
>> On Monday, June 2, 2014 1:54:09 PM UTC-7, zdenop wrote:
>>
>>> Can you please provide exact information what command you used, exact
>>> error message???
>>>
>>> Zdenko
>>>
>>>
>>> On Mon, Jun 2, 2014 at 10:03 PM, Glen Rubin <[email protected]> wrote:
>>>
>>>> I am running Teseract on windows. When I try running from the
>>>> commandline and specifying a config file, tesseract will give me an error
>>>> message saying that it cannot open the user-words file. Weirder still it
>>>> looks like the eng.user-words file has been renamed somehow to just eng??
>>>>
>>>> --
>>>> You received this message because you are subscribed to the Google
>>>> Groups "tesseract-ocr" group.
>>>> To unsubscribe from this group and stop receiving emails from it, send
>>>> an email to [email protected].
>>>> To post to this group, send email to [email protected].
>>>>
>>>> Visit this group at http://groups.google.com/group/tesseract-ocr.
>>>> To view this discussion on the web visit https://groups.google.com/d/
>>>> msgid/tesseract-ocr/325cb025-cadd-48e6-a3c7-ff558fbe11db%
>>>> 40googlegroups.com
>>>> <https://groups.google.com/d/msgid/tesseract-ocr/325cb025-cadd-48e6-a3c7-ff558fbe11db%40googlegroups.com?utm_medium=email&utm_source=footer>
>>>> .
>>>> For more options, visit https://groups.google.com/d/optout.
>>>>
>>>
>>> --
>> You received this message because you are subscribed to the Google Groups
>> "tesseract-ocr" group.
>> To unsubscribe from this group and stop receiving emails from it, send an
>> email to [email protected] <javascript:>.
>> To post to this group, send email to [email protected]
>> <javascript:>.
>> Visit this group at http://groups.google.com/group/tesseract-ocr.
>> To view this discussion on the web visit
>> https://groups.google.com/d/msgid/tesseract-ocr/a0968077-d497-4548-ba66-7f6a617047ba%40googlegroups.com
>>
>> <https://groups.google.com/d/msgid/tesseract-ocr/a0968077-d497-4548-ba66-7f6a617047ba%40googlegroups.com?utm_medium=email&utm_source=footer>
>> .
>>
>> For more options, visit https://groups.google.com/d/optout.
>>
>
>
--
You received this message because you are subscribed to the Google Groups
"tesseract-ocr" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to [email protected].
To post to this group, send email to [email protected].
Visit this group at http://groups.google.com/group/tesseract-ocr.
To view this discussion on the web visit
https://groups.google.com/d/msgid/tesseract-ocr/5bb0844f-72eb-463b-8b4f-05d1a0592142%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.