And in case you thought it referred to the ethnic term, here is what a
DAWG is http://en.wikipedia.org/wiki/Directed_acyclic_word_graph
:-)
--Sven

On Thu, Jun 7, 2012 at 7:50 AM, zdenko podobny <[email protected]> wrote:
> http://tesseract-ocr.googlecode.com/svn/trunk/doc/combine_tessdata.1.html#_components:
>
> lang.punc-dawg
>
> (Optional) A dawg made from punctuation patterns found around words. The
> "word" part is replaced by a single space.
>
> lang.number-dawg
>
> (Optional) A dawg made from tokens which originally contained digits. Each
> digit is replaced by a space character.
>
>
> --
> Zdenko
>
> On Thu, Jun 7, 2012 at 12:48 PM, Nick White <[email protected]> wrote:
>>
>> Does anybody have any clue as to what number-dawg and punc-dawg are
>> supposed to contain? There is no information on them in the
>> TrainingTesseract3 wiki page, and I couldn't find anything anywhere
>> else. I looked briefly at the dawg2wordlist for other trainings, but
>> it didn't reveal anything as obvious as I had hoped.
>>
>> Nick
>>
>> --
>> You received this message because you are subscribed to the Google
>> Groups "tesseract-ocr" group.
>> To post to this group, send email to [email protected]
>> To unsubscribe from this group, send email to
>> [email protected]
>> For more options, visit this group at
>> http://groups.google.com/group/tesseract-ocr?hl=en
>
>
> --
> You received this message because you are subscribed to the Google
> Groups "tesseract-ocr" group.
> To post to this group, send email to [email protected]
> To unsubscribe from this group, send email to
> [email protected]
> For more options, visit this group at
> http://groups.google.com/group/tesseract-ocr?hl=en



-- 
``All that is gold does not glitter,
  not all those who wander are lost;
the old that is strong does not wither,
  deep roots are not reached by the frost.
>From the ashes a fire shall be woken,
  a light from the shadows shall spring;
renewed shall be blade that was broken,
  the crownless again shall be king.”

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Reply via email to