Looks like it was looking for pffmtable file created by mftraining command, which probably had failed. Did you have the required mow.font_properties file in the directory?
On May 23, 6:20 am, Mow <[email protected]> wrote: > Hi! > > I'm getting errors with your script. I'm on win7 x32. > When it has to execute your script line: > > N. 60: Invoke-Expression ".\mftraining -F $langDir\ > $lang.font_properties -U $langDir\$lang.unicharset $trFiles" > > It crashes mftraining. I'm using 2 .tiff files, with 3 boxes in each. > > Here's the log: > > ////////////////////////////////// > LOG ////////////////////////////////// > === Generating Tesseract language data for language: mow === > ** Your training images should be in "C:\Program Files\Tesseract-OCR > \images" directory. > Make Box Files > ** Processing image: mow.font1.exp1.tif > 0 > .\tesseract images\mow.font1.exp1.tif images\mow.font1.exp1 -l mow > batch.nochop makebox > tesseract.exe : Tesseract Open Source OCR Engine with Leptonica > At line:1 char:12 > + .\tesseract <<<< images\mow.font1.exp1.tif images\mow.font1.exp1 -l > mow batch.nochop makebox > + CategoryInfo : NotSpecified: (Tesseract Open ... with > Leptonica:String) [], RemoteException > + FullyQualifiedErrorId : NativeCommandError > > Number of found pages: 1. > Using substitute bounding box at (0,7)->(28,16) > > ** Processing image: mow.font1.exp2.tif > 1 > .\tesseract images\mow.font1.exp2.tif images\mow.font1.exp2 -l mow > batch.nochop makebox > tesseract.exe : Tesseract Open Source OCR Engine with Leptonica > At line:1 char:12 > + .\tesseract <<<< images\mow.font1.exp2.tif images\mow.font1.exp2 -l > mow batch.nochop makebox > + CategoryInfo : NotSpecified: (Tesseract Open ... with > Leptonica:String) [], RemoteException > + FullyQualifiedErrorId : NativeCommandError > > Number of found pages: 1. > Using substitute bounding box at (0,7)->(28,16) > > ** Processing image: mow.font1.exp3.tif > 2 > .\tesseract images\mow.font1.exp3.tif images\mow.font1.exp3 -l mow > batch.nochop makebox > tesseract.exe : Tesseract Open Source OCR Engine with Leptonica > At line:1 char:12 > + .\tesseract <<<< images\mow.font1.exp3.tif images\mow.font1.exp3 -l > mow batch.nochop makebox > + CategoryInfo : NotSpecified: (Tesseract Open ... with > Leptonica:String) [], RemoteException > + FullyQualifiedErrorId : NativeCommandError > > Number of found pages: 1. > Using substitute bounding box at (0,7)->(28,16) > > ** Box files should be edited before continuing. ** > Generate .tr Files > tesseract.exe : Number of found pages: 1. > At line:1 char:12 > + .\tesseract <<<< images\mow.font1.exp1.tif images\mow.font1.exp1 > nobatch box.train > + CategoryInfo : NotSpecified: (Number of found pages: > 1.:String) [], RemoteException > + FullyQualifiedErrorId : NativeCommandError > > 0 > tesseract.exe : Number of found pages: 1. > At line:1 char:12 > + .\tesseract <<<< images\mow.font1.exp2.tif images\mow.font1.exp2 > nobatch box.train > + CategoryInfo : NotSpecified: (Number of found pages: > 1.:String) [], RemoteException > + FullyQualifiedErrorId : NativeCommandError > > 1 > tesseract.exe : Number of found pages: 1. > At line:1 char:12 > + .\tesseract <<<< images\mow.font1.exp3.tif images\mow.font1.exp3 > nobatch box.train > + CategoryInfo : NotSpecified: (Number of found pages: > 1.:String) [], RemoteException > + FullyQualifiedErrorId : NativeCommandError > > 2 > Compute the Character Set > Extracting unicharset from images\mow.font1.exp1.box > Extracting unicharset from images\mow.font1.exp2.box > Extracting unicharset from images\mow.font1.exp3.box > Wrote unicharset file images/unicharset. > Clustering > Move-Item : Cannot find path 'C:\Program Files\Tesseract-OCR > \pffmtable' because it does not exist. > At C:\Program Files\Tesseract-OCR\auto1.ps1:64 char:10 > + move-item <<<< -force -path pffmtable -destination $langDir\ > $lang.pffmtable > + CategoryInfo : ObjectNotFound: (C:\Program Files > \Tesseract-OCR\pffmtable:String) [Move-Item], ItemNotFoundException > + FullyQualifiedErrorId : > PathNotFound,Microsoft.PowerShell.Commands.MoveItemCommand > > Reading images\mow.font1.exp1.tr ... > Reading images\mow.font1.exp2.tr ... > Reading images\mow.font1.exp3.tr ... > Clustering ... > > Writing normproto ... > Dictionary Data > Reading word list from 'images\mow.frequent_words_list.txt' > Reducing Trie to SquishedDawg > Writing squished DAWG to 'images\mow.freq-dawg' > Reading word list from 'images\mow.words_list.txt' > Reducing Trie to SquishedDawg > Writing squished DAWG to 'images\mow.word-dawg' > The last file (unicharambigs) -- this is to be manually edited > Putting it all together > Combining tessdata files > combine_tessdata.exe : TessdataManager combined tesseract data files. > At line:1 char:19 > + .\combine_tessdata <<<< images\mow. > + CategoryInfo : NotSpecified: (TessdataManager...act > data files.:String) [], RemoteException > + FullyQualifiedErrorId : NativeCommandError > > Offset for type 0 is -1 > Offset for type 1 is 84 > Offset for type 2 is 145 > Offset for type 3 is 148 > Offset for type 4 is 110740 > Offset for type 5 is 110760 > Offset for type 6 is -1 > Offset for type 7 is 111062 > Offset for type 8 is -1 > Offset for type 9 is 111072 > > Hi! > > I'm getting errors with your script. I'm on win7 x32. > When it has to execute your script line: > > N. 60: Invoke-Expression ".\mftraining -F $langDir\ > $lang.font_properties -U $langDir\$lang.unicharset $trFiles" > > It crashes mftraining. I'm using 2 .tiff files, with 3 boxes in each. > > Here's the log: > > ////////////////////////////////// > LOG ////////////////////////////////// -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To post to this group, send email to [email protected] To unsubscribe from this group, send email to [email protected] For more options, visit this group at http://groups.google.com/group/tesseract-ocr?hl=en

