On 24 May 2010 17:41, Lars Aronsson <[email protected]> wrote: > Peter Alberti wrote: >>>> >>>> I've trained tesseract r319 (3.0) to support Danish texts written in >>>> fraktur. It is not >>>> perfect but good enough that I hope it may be useful to others. > > Jimmy O'Regan wrote: >> >> With the current SVN version, you can use combine_tessdata -e >> [trainingdata file] [files to extract] to extract the components you >> want, and combine_tessdata [path to files] to make a new trainingdata >> file. > > I tried to compile the current version (svn -r354 up), but failed: > > svshowim.cpp: In function ‘void sv_show_sub_image(IMAGE*, inT32, inT32, > inT32, inT32, ScrollView*, inT32, inT32)’: > svshowim.cpp:37: error: no matching function for call to > ‘ScrollView::Image(Pix*&, inT32&, int)’ > ../viewer/scrollview.h:266: note: candidates are: void > ScrollView::Image(const char*, int, int) > > Versions 340, 351, 352, 353 also failed in the same place. >
Looks like a pair of missing casts - have you opened an issue? > I was successful in compiling -r319, but then combine_tessdata > doesn't have all these flags. > > ---- > > Still, I'm not very interested in running the program, but to > understand the data. Is there no documentation for the format? > Should we write some? > > Or is that something you keep internally at Google? > Err... I have no affiliation with Google. I periodically read the commits to projects I'm interested in, because I find that it's often the only documentation available. > > -- > Lars Aronsson ([email protected]) > Aronsson Datateknik - http://aronsson.se > > > -- > You received this message because you are subscribed to the Google Groups > "tesseract-ocr" group. > To post to this group, send email to [email protected]. > To unsubscribe from this group, send email to > [email protected]. > For more options, visit this group at > http://groups.google.com/group/tesseract-ocr?hl=en. > > -- <Leftmost> jimregan, that's because deep inside you, you are evil. <Leftmost> Also not-so-deep inside you. -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en.

