For spaces, put quotes around the name.

Sven

On Sunday, January 13, 2013, gold snake wrote:
> Thanks, the problem is fixed now: the font name in font_properties and the
> [lang].[fontname].exp[num] name on the command line must be the same.
>
> But one thing I can't understand: is the fontname a real font name, or
> just a label? If it is a real font name, does the program actually use it?
> And if my font name has a space in the middle, what should I do? A font
> name like: <My Font>.
>
> Many thanks...
>
> On Monday, January 14, 2013 at 2:55:49 AM UTC+8, zdenop wrote:
>>
>> On Sun, Jan 13, 2013 at 6:06 PM, zdenko podobny <[email protected]> wrote:
>>
>>> If you want help, then make sure you read the documentation [1], follow it
>>> closely, and search the forum/issues. Making multiple posts (forum + issues)
>>> will not help you.
>>>
>>> Just from reading your post it is clear that you did not follow the wiki in
>>> at least these cases:
>>>
>>> - Name of input files. If the documentation states it should be
>>> "[lang].[fontname].exp[num].tif", why do you use
>>> "[lang].[fontname].[num].tif"?
>>> - font_properties: it is not according to the documentation.
>>>
>>> If you want to run training for a non-Latin-based language, make sure you
>>> are able to run it for English first. There are some reported problems with
>>> LTR training,
>>>
>>
>> Oops, it should be RTL training...
>>
>>
>>> so it will help you to eliminate problems with not following the
>>> documentation and possible problems with a non-Latin-based language.
>>>
>>> [1] https://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3
>>>
>>> Zdenko
>>>
>>>
>>> On Sun, Jan 13, 2013 at 3:57 AM, gold snake <[email protected]> wrote:
>>>
>>>> help~~~~
>>>>
>>>> On Saturday, January 12, 2013 at 4:15:09 PM UTC+8, gold snake wrote:
>>>>
>>>>> *The error output is:*
>>>>> D:\Little\Tesseract-OCR\build>shapeclustering -F font_properties -U unicharset -O oybab.unicharset oybab.A.0.tr
>>>>> Reading oybab.A.0.tr ...
>>>>> Font id = -1/0, class id = 1/2 on sample 0
>>>>> font_id >= 0 && font_id < font_id_map_.SparseSize():Error:Assert failed:in file ..\..\classify\trainingsampleset.cpp, line 622
>>>>>
>>>>> *This is my font_properties file content:*
>>>>> TheFont 0 0 0 0 0
>>>>>
>>>>> *This is the command-line output when I make the .tr files:*
>>>>> D:\Little\Tesseract-OCR\build>tesseract oybab.A.0.tif oybab.A.0 nobatch box.train
>>>>> Tesseract Open Source OCR Engine v3.02 with Leptonica
>>>>> TIFFReadDirectory: Warning, TIFFstream: wrong data type 7 for "RichTIFFIPTC"; tag ignored.
>>>>> TIFFReadDirectory: Warning, TIFFstream: unknown field with tag 37724 (0x935c) encountered.
>>>>> TIFFReadDirectory: Warning, TIFFstream: wrong data type 7 for "RichTIFFIPTC"; tag ignored.
>>>>> TIFFReadDirectory: Warning, TIFFstream: unknown field with tag 37724 (0x935c) encountered.
>>>>> TIFFReadDirectory: Warning, TIFFstream: wrong data type 7 for "RichTIFFIPTC"; tag ignored.
>>>>> TIFFReadDirectory: Warning, TIFFstream: unknown field with tag 37724 (0x935c) encountered.
>>>>> row xheight=120.333, but median xheight = 83.5
>>>>> row xheight=46.6667, but median xheight = 83.5
>>>>> APPLY_BOXES: boxfile line 3/卅 ((312,53),(385,204)): FAILURE! Couldn't find a matching blob
>>>>> APPLY_BOXES:
>>>>> Boxes read from boxfile: 4
>>>>> Boxes failed resegmentation: 1
>>>>> APPLY_BOXES: Unlabelled word at :Bounding box=(312,53)->(369,122)
>>>>> Found 3 good blobs.
>>>>> 1 remaining unlabelled words deleted.
>>>>>
>>>>> *This is my box file content:*
>>>>> ئ 18 48 142 227 0
>>>>> ئ 173 43 218 223 0
>>>>> ئ 254 39 274 228 0
>>>>> ئ 312 53 385 204 0
>>>>>
>>>>> *PS: my language is similar to Arabic; it is written right to left.
>>>>> So what is the problem? Please help. Thanks so much...*
>>>>>
>>>> --
>>>> You received this message because you are subscribed to the Google
>>>> Groups "tesseract-ocr" group.
>>>> To post to this group, send email to [email protected]
>>>> To unsubscribe from this group, send email to
>>>> tesseract-oc...@googlegroups.com
>>>> For more options, visit this group at
>>>> http://groups.google.com/group/tesseract-ocr?hl=en
>>>>
>>>
>>

--
"All that is gold does not glitter,
not all those who wander are lost;
the old that is strong does not wither,
deep roots are not reached by the frost.
From the ashes a fire shall be woken,
a light from the shadows shall spring;
renewed shall be blade that was broken,
the crownless again shall be king."
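
The mismatch discussed in this thread (a font_properties file listing `TheFont` while the training file was named `oybab.A.0.tr`) can be caught before running the training tools. Below is a minimal sketch in Python, not part of Tesseract itself. The helper names `fonts_in_properties` and `check_training_names` are hypothetical, and the sketch assumes that the font name is the first whitespace-separated field of each font_properties line and that training files follow the `[lang].[fontname].exp[num].<ext>` pattern quoted above. Because it splits on whitespace, a font name containing a space would need different handling.

```python
import re

# File name pattern quoted in the thread: [lang].[fontname].exp[num].<ext>
# The accepted extensions here are an assumption made for this sketch.
NAME_RE = re.compile(
    r"^(?P<lang>[^.]+)\.(?P<font>[^.]+)\.exp(?P<num>\d+)\.(?:tif|tiff|box|tr)$"
)


def fonts_in_properties(text):
    """Return the font names found in font_properties content.

    Assumes the font name is the first whitespace-separated field of
    every non-empty line, e.g. "TheFont 0 0 0 0 0".
    """
    return {line.split()[0] for line in text.splitlines() if line.strip()}


def check_training_names(filenames, font_properties_text):
    """Return a list of (filename, problem) pairs; empty if all names agree."""
    known = fonts_in_properties(font_properties_text)
    problems = []
    for name in filenames:
        match = NAME_RE.match(name)
        if match is None:
            problems.append(
                (name, "does not match [lang].[fontname].exp[num].<ext>")
            )
        elif match.group("font") not in known:
            problems.append(
                (name, f"font {match.group('font')!r} not listed in font_properties")
            )
    return problems


if __name__ == "__main__":
    props = "TheFont 0 0 0 0 0\n"
    for item in check_training_names(
        ["oybab.A.0.tr", "oybab.TheFont.exp0.tif"], props
    ):
        print(item)
```

Running such a check over the file names before `shapeclustering` would flag both problems raised earlier in the thread: a name missing the `exp` part, and a font name that has no font_properties entry.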

