Hi!

I'm getting errors with your script. I'm on win7 x32.
When it has to execute your script line:

N. 60: Invoke-Expression ".\mftraining -F $langDir\
$lang.font_properties -U $langDir\$lang.unicharset $trFiles"

It crashes mftraining. I'm using 2 .tiff files, with 3 boxes in each.

Here's the log:

//////////////////////////////////
LOG   //////////////////////////////////
=== Generating Tesseract language data for language: mow ===
** Your training images should be in "C:\Program Files\Tesseract-OCR
\images" directory.
Make Box Files
** Processing image: mow.font1.exp1.tif
0
.\tesseract images\mow.font1.exp1.tif images\mow.font1.exp1 -l mow
batch.nochop makebox
tesseract.exe : Tesseract Open Source OCR Engine with Leptonica
At line:1 char:12
+ .\tesseract <<<<  images\mow.font1.exp1.tif images\mow.font1.exp1 -l
mow batch.nochop makebox
    + CategoryInfo          : NotSpecified: (Tesseract Open ... with
Leptonica:String) [], RemoteException
    + FullyQualifiedErrorId : NativeCommandError

Number of found pages: 1.
Using substitute bounding box at (0,7)->(28,16)

** Processing image: mow.font1.exp2.tif
1
.\tesseract images\mow.font1.exp2.tif images\mow.font1.exp2 -l mow
batch.nochop makebox
tesseract.exe : Tesseract Open Source OCR Engine with Leptonica
At line:1 char:12
+ .\tesseract <<<<  images\mow.font1.exp2.tif images\mow.font1.exp2 -l
mow batch.nochop makebox
    + CategoryInfo          : NotSpecified: (Tesseract Open ... with
Leptonica:String) [], RemoteException
    + FullyQualifiedErrorId : NativeCommandError

Number of found pages: 1.
Using substitute bounding box at (0,7)->(28,16)

** Processing image: mow.font1.exp3.tif
2
.\tesseract images\mow.font1.exp3.tif images\mow.font1.exp3 -l mow
batch.nochop makebox
tesseract.exe : Tesseract Open Source OCR Engine with Leptonica
At line:1 char:12
+ .\tesseract <<<<  images\mow.font1.exp3.tif images\mow.font1.exp3 -l
mow batch.nochop makebox
    + CategoryInfo          : NotSpecified: (Tesseract Open ... with
Leptonica:String) [], RemoteException
    + FullyQualifiedErrorId : NativeCommandError

Number of found pages: 1.
Using substitute bounding box at (0,7)->(28,16)

** Box files should be edited before continuing. **
Generate .tr Files
tesseract.exe : Number of found pages: 1.
At line:1 char:12
+ .\tesseract <<<<  images\mow.font1.exp1.tif images\mow.font1.exp1
nobatch box.train
    + CategoryInfo          : NotSpecified: (Number of found pages:
1.:String) [], RemoteException
    + FullyQualifiedErrorId : NativeCommandError

0
tesseract.exe : Number of found pages: 1.
At line:1 char:12
+ .\tesseract <<<<  images\mow.font1.exp2.tif images\mow.font1.exp2
nobatch box.train
    + CategoryInfo          : NotSpecified: (Number of found pages:
1.:String) [], RemoteException
    + FullyQualifiedErrorId : NativeCommandError

1
tesseract.exe : Number of found pages: 1.
At line:1 char:12
+ .\tesseract <<<<  images\mow.font1.exp3.tif images\mow.font1.exp3
nobatch box.train
    + CategoryInfo          : NotSpecified: (Number of found pages:
1.:String) [], RemoteException
    + FullyQualifiedErrorId : NativeCommandError

2
Compute the Character Set
Extracting unicharset from images\mow.font1.exp1.box
Extracting unicharset from images\mow.font1.exp2.box
Extracting unicharset from images\mow.font1.exp3.box
Wrote unicharset file images/unicharset.
Clustering
Move-Item : Cannot find path 'C:\Program Files\Tesseract-OCR
\pffmtable' because it does not exist.
At C:\Program Files\Tesseract-OCR\auto1.ps1:64 char:10
+ move-item <<<<  -force -path pffmtable -destination $langDir\
$lang.pffmtable
    + CategoryInfo          : ObjectNotFound: (C:\Program Files
\Tesseract-OCR\pffmtable:String) [Move-Item], ItemNotFoundException
    + FullyQualifiedErrorId :
PathNotFound,Microsoft.PowerShell.Commands.MoveItemCommand

Reading images\mow.font1.exp1.tr ...
Reading images\mow.font1.exp2.tr ...
Reading images\mow.font1.exp3.tr ...
Clustering ...

Writing normproto ...
Dictionary Data
Reading word list from 'images\mow.frequent_words_list.txt'
Reducing Trie to SquishedDawg
Writing squished DAWG to 'images\mow.freq-dawg'
Reading word list from 'images\mow.words_list.txt'
Reducing Trie to SquishedDawg
Writing squished DAWG to 'images\mow.word-dawg'
The last file (unicharambigs) -- this is to be manually edited
Putting it all together
Combining tessdata files
combine_tessdata.exe : TessdataManager combined tesseract data files.
At line:1 char:19
+ .\combine_tessdata <<<<  images\mow.
    + CategoryInfo          : NotSpecified: (TessdataManager...act
data files.:String) [], RemoteException
    + FullyQualifiedErrorId : NativeCommandError

Offset for type 0 is -1
Offset for type 1 is 84
Offset for type 2 is 145
Offset for type 3 is 148
Offset for type 4 is 110740
Offset for type 5 is 110760
Offset for type 6 is -1
Offset for type 7 is 111062
Offset for type 8 is -1
Offset for type 9 is 111072

Hi!

I'm getting errors with your script. I'm on win7 x32.
When it has to execute your script line:

N. 60: Invoke-Expression ".\mftraining -F $langDir\
$lang.font_properties -U $langDir\$lang.unicharset $trFiles"

It crashes mftraining. I'm using 2 .tiff files, with 3 boxes in each.

Here's the log:

//////////////////////////////////
LOG   //////////////////////////////////

-- 
You received this message because you are subscribed to the Google
Groups "tesseract-ocr" group.
To post to this group, send email to [email protected]
To unsubscribe from this group, send email to
[email protected]
For more options, visit this group at
http://groups.google.com/group/tesseract-ocr?hl=en

Reply via email to